Basic Hive Query Operations (Part 1)

Level 1: where operation
Level 2: group by operation
Level 3: join operation

半爿天穹 · Published 2023-05-27 23:43:45

Level 1: where operation

Task description

This level's task: use where and like to write the query specified in the programming requirements below.

name	age
`bob`	`22`
`cindy`	`27`
`herry`	`26`
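
The name/age rows above look like sample data for illustrating where and like. A minimal sketch of how the two combine, assuming a hypothetical scratch table named `people` (the table name and the filter values are illustrative, not part of the original exercise) and a Hive version that supports `insert into ... values` (0.14 or later):

```sql
-- Hypothetical scratch table holding the three sample rows above.
create table if not exists people (
  name string,
  age  int
);

insert into table people values ('bob', 22), ('cindy', 27), ('herry', 26);

-- where keeps only the rows for which the condition is true.
-- like matches a string pattern: % stands for any run of characters
-- (including none) and _ stands for exactly one character.
select name, age
from people
where name like '%y'   -- names ending in y: cindy and herry
  and age > 26;        -- of those two, only cindy is older than 26
```

On a freshly created table this should return only the cindy row. Note that like comparisons in Hive are case-sensitive, so a pattern such as '%hive%' does not match Hive; wrapping the column in lower() is one way around that if the data mixes cases.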

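Programming requirements

In the editor on the right, complete the SQL to query the company name and work experience of the positions whose responsibilities mention hive and whose salary is greater than 8000. (Database: db1; table: table1)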

table1 schema:

INFO	TYPE
`eduLevel_name`	`String`
`company_name`	`String`
`jobName`	`String`
`salary`	`int`
`city_code`	`int`
`responsibility`	`String`
`workingExp`	`String`

Sample of the local data file:
本科,北京联通支付有限公司,大数据开发工程师,10000,530,熟练使用hive等,1-3年
专科,北京联科数创科技有限公司,大数据分析师,8000,530,熟练使用MySQL等数据库,1-3年
本科,湖南智湘赢播网络技术有限公司,大数据开发工程师,16000,749,熟练使用spark等,3-5年

Test description

The platform will test the code you write:

Expected output:
1-3年    北京联通支付有限公司
1-3年    深圳市德科信息技术有限公司广州分公司

Code:

----------Do not modify----------
create database if not exists db1;
use db1;

create table if not exists table1(
eduLevel_name string comment '学历',
company_name string comment '公司名',
jobName string comment '职位名称',
salary int comment '薪资',
city_code int comment '城市编码',
responsibility string comment '岗位职责',
workingExp string comment '工作经验'
)
row format delimited fields terminated by ','
lines terminated by '\n'
stored as textfile;
truncate table table1;
load data local inpath '/root/aaa.txt' into table table1;
----------Do not modify----------

----------Begin----------
-- Keep rows whose responsibility mentions hive and whose salary exceeds 8000
select workingExp, company_name
from table1
where responsibility like '%hive%' and salary > 8000;
----------End----------

Level 2: group by operation

Task description

This level's task: compute the average salary for each work-experience bracket.

city	salary	job
长沙	`7000`	大数据开发
北京	`10000`	大数据开发
广州	`11000`	大数据开发
长沙	`7000`	大数据开发
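
The city/salary/job rows above look like sample data for illustrating group by. A minimal sketch of grouping and then filtering on the aggregate, assuming a hypothetical scratch table named `jobs` (the table name and the 8000 threshold are illustrative, not part of the original exercise) and a Hive version that supports `insert into ... values`:

```sql
-- Hypothetical scratch table holding the four sample rows above.
create table if not exists jobs (
  city   string,
  salary int,
  job    string
);

insert into table jobs values
  ('长沙', 7000,  '大数据开发'),
  ('北京', 10000, '大数据开发'),
  ('广州', 11000, '大数据开发'),
  ('长沙', 7000,  '大数据开发');

-- group by collapses the rows of each city into one group,
-- avg(salary) is computed once per group,
-- and having filters the groups after aggregation.
select city, avg(salary) as avg_salary
from jobs
group by city
having avg(salary) > 8000;
```

With only these four rows, the two 长沙 rows average 7000 and are dropped by having, leaving 北京 and 广州. The design point is the split between the two filters: where runs on individual rows before grouping and cannot reference an aggregate, while having runs on the grouped result.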

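Programming requirements

In the editor on the right, complete the SQL to compute the average salary for each work-experience bracket and keep only the brackets whose average salary is greater than 10000. (Database: db1; table: table1)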

table1 schema:

INFO	TYPE
`eduLevel_name`	`String`
`company_name`	`String`
`jobName`	`String`
`salary`	`int`
`city_code`	`int`
`responsibility`	`String`
`workingExp`	`String`

Sample of the local data file:
本科,北京联通支付有限公司,大数据开发工程师,10000,530,熟练使用hive等,1-3年
专科,北京联科数创科技有限公司,大数据分析师,8000,530,熟练使用MySQL等数据库,1-3年
本科,湖南智湘赢播网络技术有限公司,大数据开发工程师,16000,749,熟练使用spark等,3-5年

Test description

The platform will test the code you write:

Expected output:
17000.0    3-5年
20000.0    5-10年

Code:

----------Do not modify----------
create database if not exists db1;
use db1;


create table if not exists table1(
eduLevel_name string comment '学历',
company_name string comment '公司名',
jobName string comment '职位名称',
salary int comment '薪资',
city_code int comment '城市编码',
responsibility string comment '岗位职责',
workingExp string comment '工作经验'
)
row format delimited fields terminated by ','
lines terminated by '\n'
stored as textfile;
truncate table table1;
load data local inpath '/root/t1.txt' into table table1;
----------Do not modify----------

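----------Begin----------
-- Average salary per work-experience bracket, keeping only brackets whose average exceeds 10000
select avg(salary), workingExp
from table1
group by workingExp
having avg(salary) > 10000;
----------End----------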

Level 3: join operation

Task description

This level's task: use a join to compute the average salary for each city name.

id	name
`1`	`bob`
`2`	`lily`
`3`	`herry`

cid	score
`1`	`80`
`2`	`90`
`5`	`60`
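
The two small tables above (id/name and cid/score) look like sample data for illustrating joins. A minimal sketch of how inner, left outer, and right outer joins treat unmatched rows, assuming hypothetical scratch tables named `students` and `scores` (both names are illustrative, not part of the original exercise) and a Hive version that supports `insert into ... values`:

```sql
-- Hypothetical scratch tables holding the sample rows above.
create table if not exists students (id int, name string);
create table if not exists scores (cid int, score int);

insert into table students values (1, 'bob'), (2, 'lily'), (3, 'herry');
insert into table scores values (1, 80), (2, 90), (5, 60);

-- Inner join: only ids present in both tables (1 and 2) survive.
select s.name, c.score
from students s
join scores c on s.id = c.cid;

-- Left outer join: every student is kept; herry has no score row,
-- so score is NULL for herry.
select s.name, c.score
from students s
left outer join scores c on s.id = c.cid;

-- Right outer join: every score row is kept; cid 5 has no student,
-- so name is NULL for that row.
select s.name, c.score
from students s
right outer join scores c on s.id = c.cid;
```

The choice of join direction decides which table's rows are guaranteed to appear in the result. Because avg() skips NULL values and returns NULL for a group that contains nothing else, an outer join followed by group by still lists a key that has no matches, with a NULL average.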

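Programming requirements

In the editor on the right, complete the SQL to compute the average salary for every city name in table2. (Database: db1; tables: table1 and table2)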

table1 schema:

INFO	TYPE
`eduLevel_name`	`String`
`company_name`	`String`
`jobName`	`String`
`salary`	`int`
`city_code`	`int`
`responsibility`	`String`
`workingExp`	`String`

Sample of table1's local data file:
本科,北京联通支付有限公司,大数据开发工程师,10000,530,熟练使用hive等,1-3年
专科,北京联科数创科技有限公司,大数据分析师,8000,530,熟练使用MySQL等数据库,1-3年
本科,湖南智湘赢播网络技术有限公司,大数据开发工程师,16000,749,熟练使用spark等,3-5年

table2 schema:

INFO	TYPE
`city_code`	`int`
`city_name`	`String`

Sample of table2's local data file:
538,上海
653,杭州
749,长沙
763,广州

Test description

The platform will test the code you write:

Expected output:
8000.0     上海
9000.0     北京
NULL       天津
12000.0    广州
7500.0     杭州
10000.0    深圳
12000.0    长沙

Code:

----------Do not modify----------
create database if not exists db1;
use db1;

create table if not exists table1(
eduLevel_name string comment '学历',
company_name string comment '公司名',
jobName string comment '职位名称',
salary int comment '薪资',
city_code int comment '城市编码',
responsibility string comment '岗位职责',
workingExp string comment '工作经验'
)
row format delimited fields terminated by ','
lines terminated by '\n'
stored as textfile;
truncate table table1;
load data local inpath '/root/t2.txt' into table table1;

create table if not exists table2(
city_code int comment '城市编码',
city_name string comment '城市名'
)
row format delimited fields terminated by ','
lines terminated by '\n'
stored as textfile;
truncate table table2;
load data local inpath '/root/t22.txt' into table table2;
----------Do not modify----------

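----------Begin----------
-- Right outer join keeps every city in table2; a city with no rows in table1 gets a NULL average
select avg(table1.salary), table2.city_name
from table1
right outer join table2 on table1.city_code = table2.city_code
group by table2.city_name;
----------End----------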
