参数名称	默认值	用法
hive.metastore.uris	-	Hive元数据存储的URI。
hive.metastore.client.socket.timeout	600	Hive元数据客户端套接字超时时间。
hive.metastore.warehouse.dir	/user/hive/warehouse	Hive数据仓库目录。
hive.warehouse.subdir.inherit.perms	true	子目录是否继承权限。
hive.auto.convert.join	true	自动转换连接类型的Join操作。
hive.auto.convert.join.noconditionaltask.size	10000000	自动转换连接类型的Join操作时条件不满足的最大数据量。
hive.optimize.bucketmapjoin.sortedmerge	false	是否优化Bucket Map Join的Sorted Merge。
hive.smbjoin.cache.rows	10000	SMB Join操作缓存的行数。
hive.server2.logging.operation.enabled	false	是否启用Hive Server2日志记录操作。
hive.server2.logging.operation.log.location	${system:java.io.tmpdir}/ ${system:user.name} /operation_logs	Hive Server2操作日志的存储位置。
mapred.reduce.tasks	-	MapReduce作业的Reduce任务数。
hive.exec.reducers.bytes.per.reducer	67108864	每个Reduce任务的数据量。
hive.exec.copyfile.maxsize	33554432	最大允许复制文件的大小。
hive.exec.reducers.max	-1	同时运行的最大Reduce任务数。
hive.vectorized.groupby.checkinterval	100000	Vectorized Group By操作的检查间隔。
hive.vectorized.groupby.flush.percent	0.1	Vectorized Group By操作的Flush比例。
hive.compute.query.using.stats	true	是否使用统计信息来优化查询计划。
hive.vectorized.execution.enabled	false	是否启用向量化执行引擎。
hive.vectorized.execution.reduce.enabled	false	是否在Reduce阶段启用向量化执行。
hive.vectorized.use.vectorized.input.format	false	是否使用向量化输入格式。
hive.vectorized.use.checked.expressions	false	是否使用检查表达式的向量化执行。
hive.vectorized.use.vector.serde.deserialize	false	是否使用向量化序列化和反序列化。
hive.vectorized.adaptor.usage.mode	off	向量化适配器的使用模式。
hive.vectorized.input.format.excludes	-	排除的向量化输入格式列表。
hive.merge.mapfiles	true	是否合并Map输出的小文件。
hive.merge.mapredfiles	false	是否合并MapReduce输出的小文件。
hive.cbo.enable	false	是否启用CBO优化。
hive.fetch.task.conversion	none	Fetch任务转换级别。
hive.fetch.task.conversion.threshold	-1	触发Fetch任务转换的数据量阈值。
hive.limit.pushdown.memory.usage	0.1	Limit操作的内存使用百分比。
hive.merge.sparkfiles	false	是否合并Spark任务输出的小文件。
hive.merge.smallfiles.avgsize	-1	合并小文件时的平均大小。
hive.merge.size.per.task	-1	每个任务合并的数据量。
hive.optimize.reducededuplication	true	是否启用重复消除优化。
hive.optimize.reducededuplication.min.reducer	4	最小Reduce任务数以启用重复消除优化。
hive.map.aggr	false	是否启用Map端聚合。
hive.map.aggr.hash.percentmemory	0.5	Map端聚合的哈希表内存比例。
hive.optimize.sort.dynamic.partition	false	是否优化动态分区排序。
hive.execution.engine	mr	Hive执行引擎类型。
spark.executor.memory	1g	Spark Executor的内存大小。
spark.driver.memory	1g	Spark Driver的内存大小。
spark.executor.cores	1	每个Spark Executor的核心数。
spark.yarn.driver.memoryOverhead	384	Spark Driver的内存Overhead。
spark.yarn.executor.memoryOverhead	384	Spark Executor的内存Overhead。
spark.dynamicAllocation.enabled	false	是否启用动态资源分配。
spark.dynamicAllocation.initialExecutors	-1	动态资源分配的初始Executor数量。
spark.dynamicAllocation.minExecutors	-1	动态资源分配的最小Executor数量。
spark.dynamicAllocation.maxExecutors	-1	动态资源分配的最大Executor数量。
hive.metastore.execute.setugi	false	是否在Hive元数据存储中执行setugi操作。
hive.support.concurrency	true	是否支持并发操作。
hive.zookeeper.quorum	-	ZooKeeper服务器列表。
hive.zookeeper.client.port	-	ZooKeeper客户端端口号。
hive.zookeeper.namespace	default	Hive使用的ZooKeeper命名空间。
hive.cluster.delegation.token.store.class	org.apache.hadoop.hive .thrift.MemoryTokenStore	集群委派令牌存储类。
hive.server2.enable.doAs	false	是否启用Hive Server2用户代理模式。
hive.metastore.sasl.enabled	false	是否启用Hive元数据存储的SASL认证。
hive.server2.authentication	NONE	Hive Server2的认证方式。
hive.metastore.kerberos.principal	-	Hive元数据存储的Kerberos主体名称。
hive.server2.authentication.kerberos.principal	-	Hive Server2的Kerberos主体名称。
spark.shuffle.service.enabled	true	是否启用Spark Shuffle服务。
hive.strict.checks.orderby.no.limit	true	是否在没有Limit操作的OrderBy语句中执行严格检查。
hive.strict.checks.no.partition.filter	true	是否在没有分区过滤条件的查询中执行严格检查。
hive.strict.checks.type.safety	true	是否执行严格的类型安全性检查。
hive.strict.checks.cartesian.product	false	是否执行严格的笛卡尔积检查。
hive.strict.checks.bucketing	true	是否执行严格的桶排序检查。

Hive配置文件Hive-site.xml参数说明用途

wang2leee

Hive配置文件hive-site.xml中参数说明和用法

文章目录

参数说明

参数示例

具体用途：

所有评论(0)

wang2leee