1 PREHOOK: query: explain select * from src a join src b on a.key=b.key limit 20
3 POSTHOOK: query: explain select * from src a join src b on a.key=b.key limit 20
6 Stage-1 is a root stage
7 Stage-0 depends on stages: Stage-1
15 Statistics: Num rows: 500 Data size: 5312 Basic stats: COMPLETE Column stats: NONE
17 predicate: key is not null (type: boolean)
18 Statistics: Num rows: 500 Data size: 5312 Basic stats: COMPLETE Column stats: NONE
20 expressions: key (type: string), value (type: string)
21 outputColumnNames: _col0, _col1
22 Statistics: Num rows: 500 Data size: 5312 Basic stats: COMPLETE Column stats: NONE
23 Reduce Output Operator
24 key expressions: _col0 (type: string)
26 Map-reduce partition columns: _col0 (type: string)
27 Statistics: Num rows: 500 Data size: 5312 Basic stats: COMPLETE Column stats: NONE
28 value expressions: _col1 (type: string)
31 Statistics: Num rows: 500 Data size: 5312 Basic stats: COMPLETE Column stats: NONE
33 predicate: key is not null (type: boolean)
34 Statistics: Num rows: 500 Data size: 5312 Basic stats: COMPLETE Column stats: NONE
36 expressions: key (type: string), value (type: string)
37 outputColumnNames: _col0, _col1
38 Statistics: Num rows: 500 Data size: 5312 Basic stats: COMPLETE Column stats: NONE
39 Reduce Output Operator
40 key expressions: _col0 (type: string)
42 Map-reduce partition columns: _col0 (type: string)
43 Statistics: Num rows: 500 Data size: 5312 Basic stats: COMPLETE Column stats: NONE
44 value expressions: _col1 (type: string)
50 0 _col0 (type: string)
51 1 _col0 (type: string)
52 outputColumnNames: _col0, _col1, _col2, _col3
53 Statistics: Num rows: 550 Data size: 5843 Basic stats: COMPLETE Column stats: NONE
56 Statistics: Num rows: 20 Data size: 200 Basic stats: COMPLETE Column stats: NONE
59 Statistics: Num rows: 20 Data size: 200 Basic stats: COMPLETE Column stats: NONE
61 input format: org.apache.hadoop.mapred.SequenceFileInputFormat
62 output format: org.apache.hadoop.hive.ql.io.HiveSequenceFileOutputFormat
63 serde: org.apache.hadoop.hive.serde2.lazy.LazySimpleSerDe
71 PREHOOK: query: explain select value, sum(key) as sum from src group by value having sum > 100 limit 20
73 POSTHOOK: query: explain select value, sum(key) as sum from src group by value having sum > 100 limit 20
76 Stage-1 is a root stage
77 Stage-0 depends on stages: Stage-1
85 Statistics: Num rows: 500 Data size: 5312 Basic stats: COMPLETE Column stats: NONE
87 expressions: key (type: string), value (type: string)
88 outputColumnNames: key, value
89 Statistics: Num rows: 500 Data size: 5312 Basic stats: COMPLETE Column stats: NONE
91 aggregations: sum(key)
92 keys: value (type: string)
94 outputColumnNames: _col0, _col1
95 Statistics: Num rows: 500 Data size: 5312 Basic stats: COMPLETE Column stats: NONE
96 Reduce Output Operator
97 key expressions: _col0 (type: string)
99 Map-reduce partition columns: _col0 (type: string)
100 Statistics: Num rows: 500 Data size: 5312 Basic stats: COMPLETE Column stats: NONE
101 value expressions: _col1 (type: double)
102 Reduce Operator Tree:
104 aggregations: sum(VALUE._col0)
105 keys: KEY._col0 (type: string)
107 outputColumnNames: _col0, _col1
108 Statistics: Num rows: 250 Data size: 2656 Basic stats: COMPLETE Column stats: NONE
110 predicate: (_col1 > 100.0D) (type: boolean)
111 Statistics: Num rows: 83 Data size: 881 Basic stats: COMPLETE Column stats: NONE
114 Statistics: Num rows: 20 Data size: 200 Basic stats: COMPLETE Column stats: NONE
117 Statistics: Num rows: 20 Data size: 200 Basic stats: COMPLETE Column stats: NONE
119 input format: org.apache.hadoop.mapred.SequenceFileInputFormat
120 output format: org.apache.hadoop.hive.ql.io.HiveSequenceFileOutputFormat
121 serde: org.apache.hadoop.hive.serde2.lazy.LazySimpleSerDe
129 PREHOOK: query: explain select key, L.* from (select * from src order by key) a lateral view explode(array(value, value)) L as v limit 10
131 POSTHOOK: query: explain select key, L.* from (select * from src order by key) a lateral view explode(array(value, value)) L as v limit 10
132 POSTHOOK: type: QUERY
134 Stage-1 is a root stage
135 Stage-0 depends on stages: Stage-1
143 Statistics: Num rows: 500 Data size: 5312 Basic stats: COMPLETE Column stats: NONE
145 expressions: key (type: string), value (type: string)
146 outputColumnNames: _col0, _col1
147 Statistics: Num rows: 500 Data size: 5312 Basic stats: COMPLETE Column stats: NONE
149 Statistics: Num rows: 500 Data size: 5312 Basic stats: COMPLETE Column stats: NONE
151 expressions: _col0 (type: string)
152 outputColumnNames: _col0
153 Statistics: Num rows: 500 Data size: 5312 Basic stats: COMPLETE Column stats: NONE
154 Lateral View Join Operator
155 outputColumnNames: _col0, _col2
156 Statistics: Num rows: 1000 Data size: 10624 Basic stats: COMPLETE Column stats: NONE
158 expressions: _col0 (type: string), _col2 (type: string)
159 outputColumnNames: _col0, _col1
160 Statistics: Num rows: 1000 Data size: 10624 Basic stats: COMPLETE Column stats: NONE
163 Statistics: Num rows: 10 Data size: 100 Basic stats: COMPLETE Column stats: NONE
166 Statistics: Num rows: 10 Data size: 100 Basic stats: COMPLETE Column stats: NONE
168 input format: org.apache.hadoop.mapred.SequenceFileInputFormat
169 output format: org.apache.hadoop.hive.ql.io.HiveSequenceFileOutputFormat
170 serde: org.apache.hadoop.hive.serde2.lazy.LazySimpleSerDe
172 expressions: array(_col1,_col1) (type: array<string>)
173 outputColumnNames: _col0
174 Statistics: Num rows: 500 Data size: 5312 Basic stats: COMPLETE Column stats: NONE
176 Statistics: Num rows: 500 Data size: 5312 Basic stats: COMPLETE Column stats: NONE
177 function name: explode
178 Lateral View Join Operator
179 outputColumnNames: _col0, _col2
180 Statistics: Num rows: 1000 Data size: 10624 Basic stats: COMPLETE Column stats: NONE
182 expressions: _col0 (type: string), _col2 (type: string)
183 outputColumnNames: _col0, _col1
184 Statistics: Num rows: 1000 Data size: 10624 Basic stats: COMPLETE Column stats: NONE
187 Statistics: Num rows: 10 Data size: 100 Basic stats: COMPLETE Column stats: NONE
190 Statistics: Num rows: 10 Data size: 100 Basic stats: COMPLETE Column stats: NONE
192 input format: org.apache.hadoop.mapred.SequenceFileInputFormat
193 output format: org.apache.hadoop.hive.ql.io.HiveSequenceFileOutputFormat
194 serde: org.apache.hadoop.hive.serde2.lazy.LazySimpleSerDe
202 PREHOOK: query: CREATE TABLE dest_2(key STRING, c1 INT)
203 PREHOOK: type: CREATETABLE
204 PREHOOK: Output: database:default
205 PREHOOK: Output: default@dest_2
206 POSTHOOK: query: CREATE TABLE dest_2(key STRING, c1 INT)
207 POSTHOOK: type: CREATETABLE
208 POSTHOOK: Output: database:default
209 POSTHOOK: Output: default@dest_2
210 PREHOOK: query: CREATE TABLE dest_3(key STRING, c1 INT)
211 PREHOOK: type: CREATETABLE
212 PREHOOK: Output: database:default
213 PREHOOK: Output: default@dest_3
214 POSTHOOK: query: CREATE TABLE dest_3(key STRING, c1 INT)
215 POSTHOOK: type: CREATETABLE
216 POSTHOOK: Output: database:default
217 POSTHOOK: Output: default@dest_3
218 PREHOOK: query: EXPLAIN FROM src
219 INSERT OVERWRITE TABLE dest_2 SELECT value, sum(key) GROUP BY value
220 INSERT OVERWRITE TABLE dest_3 SELECT value, sum(key) GROUP BY value limit 20
222 POSTHOOK: query: EXPLAIN FROM src
223 INSERT OVERWRITE TABLE dest_2 SELECT value, sum(key) GROUP BY value
224 INSERT OVERWRITE TABLE dest_3 SELECT value, sum(key) GROUP BY value limit 20
225 POSTHOOK: type: QUERY
227 Stage-2 is a root stage
228 Stage-0 depends on stages: Stage-2
229 Stage-3 depends on stages: Stage-0, Stage-4, Stage-7
230 Stage-4 depends on stages: Stage-2
231 Stage-6 depends on stages: Stage-1, Stage-4, Stage-7
232 Stage-5 depends on stages: Stage-2
233 Stage-1 depends on stages: Stage-5
234 Stage-7 depends on stages: Stage-5
242 Statistics: Num rows: 500 Data size: 5312 Basic stats: COMPLETE Column stats: NONE
244 expressions: key (type: string), value (type: string)
245 outputColumnNames: key, value
246 Statistics: Num rows: 500 Data size: 5312 Basic stats: COMPLETE Column stats: NONE
247 Reduce Output Operator
248 key expressions: value (type: string)
250 Map-reduce partition columns: value (type: string)
251 Statistics: Num rows: 500 Data size: 5312 Basic stats: COMPLETE Column stats: NONE
252 value expressions: key (type: string)
253 Reduce Operator Tree:
255 Statistics: Num rows: 500 Data size: 5312 Basic stats: COMPLETE Column stats: NONE
257 aggregations: sum(VALUE._col0)
258 keys: KEY._col0 (type: string)
260 outputColumnNames: _col0, _col1
261 Statistics: Num rows: 250 Data size: 2656 Basic stats: COMPLETE Column stats: NONE
263 expressions: _col0 (type: string), UDFToInteger(_col1) (type: int)
264 outputColumnNames: _col0, _col1
265 Statistics: Num rows: 250 Data size: 2656 Basic stats: COMPLETE Column stats: NONE
268 Statistics: Num rows: 250 Data size: 2656 Basic stats: COMPLETE Column stats: NONE
270 input format: org.apache.hadoop.mapred.TextInputFormat
271 output format: org.apache.hadoop.hive.ql.io.HiveIgnoreKeyTextOutputFormat
272 serde: org.apache.hadoop.hive.serde2.lazy.LazySimpleSerDe
275 expressions: _col0 (type: string), _col1 (type: int)
276 outputColumnNames: key, c1
277 Statistics: Num rows: 250 Data size: 2656 Basic stats: COMPLETE Column stats: NONE
279 aggregations: compute_stats(key, 'hll'), compute_stats(c1, 'hll')
281 outputColumnNames: _col0, _col1
282 Statistics: Num rows: 1 Data size: 864 Basic stats: COMPLETE Column stats: NONE
286 input format: org.apache.hadoop.mapred.SequenceFileInputFormat
287 output format: org.apache.hadoop.hive.ql.io.HiveSequenceFileOutputFormat
288 serde: org.apache.hadoop.hive.serde2.lazybinary.LazyBinarySerDe
290 aggregations: sum(VALUE._col0)
291 keys: KEY._col0 (type: string)
293 outputColumnNames: _col0, _col1
294 Statistics: Num rows: 250 Data size: 2656 Basic stats: COMPLETE Column stats: NONE
297 Statistics: Num rows: 20 Data size: 200 Basic stats: COMPLETE Column stats: NONE
301 input format: org.apache.hadoop.mapred.SequenceFileInputFormat
302 output format: org.apache.hadoop.hive.ql.io.HiveSequenceFileOutputFormat
303 serde: org.apache.hadoop.hive.serde2.lazybinary.LazyBinarySerDe
310 input format: org.apache.hadoop.mapred.TextInputFormat
311 output format: org.apache.hadoop.hive.ql.io.HiveIgnoreKeyTextOutputFormat
312 serde: org.apache.hadoop.hive.serde2.lazy.LazySimpleSerDe
320 Column Types: string, int
321 Table: default.dest_2
327 Reduce Output Operator
329 Statistics: Num rows: 1 Data size: 864 Basic stats: COMPLETE Column stats: NONE
330 value expressions: _col0 (type: struct<columntype:string,maxlength:bigint,sumlength:bigint,count:bigint,countnulls:bigint,bitvector:binary>), _col1 (type: struct<columntype:string,min:bigint,max:bigint,countnulls:bigint,bitvector:binary>)
331 Reduce Operator Tree:
333 aggregations: compute_stats(VALUE._col0), compute_stats(VALUE._col1)
335 outputColumnNames: _col0, _col1
336 Statistics: Num rows: 1 Data size: 880 Basic stats: COMPLETE Column stats: NONE
339 Statistics: Num rows: 1 Data size: 880 Basic stats: COMPLETE Column stats: NONE
341 input format: org.apache.hadoop.mapred.SequenceFileInputFormat
342 output format: org.apache.hadoop.hive.ql.io.HiveSequenceFileOutputFormat
343 serde: org.apache.hadoop.hive.serde2.lazy.LazySimpleSerDe
350 Column Types: string, int
351 Table: default.dest_3
357 Reduce Output Operator
359 Statistics: Num rows: 20 Data size: 200 Basic stats: COMPLETE Column stats: NONE
360 value expressions: _col0 (type: string), _col1 (type: double)
361 Reduce Operator Tree:
363 expressions: VALUE._col0 (type: string), VALUE._col1 (type: double)
364 outputColumnNames: _col0, _col1
365 Statistics: Num rows: 20 Data size: 200 Basic stats: COMPLETE Column stats: NONE
368 Statistics: Num rows: 20 Data size: 200 Basic stats: COMPLETE Column stats: NONE
370 expressions: _col0 (type: string), UDFToInteger(_col1) (type: int)
371 outputColumnNames: _col0, _col1
372 Statistics: Num rows: 20 Data size: 200 Basic stats: COMPLETE Column stats: NONE
375 Statistics: Num rows: 20 Data size: 200 Basic stats: COMPLETE Column stats: NONE
377 input format: org.apache.hadoop.mapred.TextInputFormat
378 output format: org.apache.hadoop.hive.ql.io.HiveIgnoreKeyTextOutputFormat
379 serde: org.apache.hadoop.hive.serde2.lazy.LazySimpleSerDe
382 expressions: _col0 (type: string), _col1 (type: int)
383 outputColumnNames: key, c1
384 Statistics: Num rows: 20 Data size: 200 Basic stats: COMPLETE Column stats: NONE
386 aggregations: compute_stats(key, 'hll'), compute_stats(c1, 'hll')
388 outputColumnNames: _col0, _col1
389 Statistics: Num rows: 1 Data size: 864 Basic stats: COMPLETE Column stats: NONE
393 input format: org.apache.hadoop.mapred.SequenceFileInputFormat
394 output format: org.apache.hadoop.hive.ql.io.HiveSequenceFileOutputFormat
395 serde: org.apache.hadoop.hive.serde2.lazybinary.LazyBinarySerDe
402 input format: org.apache.hadoop.mapred.TextInputFormat
403 output format: org.apache.hadoop.hive.ql.io.HiveIgnoreKeyTextOutputFormat
404 serde: org.apache.hadoop.hive.serde2.lazy.LazySimpleSerDe
411 Reduce Output Operator
413 Statistics: Num rows: 1 Data size: 864 Basic stats: COMPLETE Column stats: NONE
414 value expressions: _col0 (type: struct<columntype:string,maxlength:bigint,sumlength:bigint,count:bigint,countnulls:bigint,bitvector:binary>), _col1 (type: struct<columntype:string,min:bigint,max:bigint,countnulls:bigint,bitvector:binary>)
415 Reduce Operator Tree:
417 aggregations: compute_stats(VALUE._col0), compute_stats(VALUE._col1)
419 outputColumnNames: _col0, _col1
420 Statistics: Num rows: 1 Data size: 880 Basic stats: COMPLETE Column stats: NONE
423 Statistics: Num rows: 1 Data size: 880 Basic stats: COMPLETE Column stats: NONE
425 input format: org.apache.hadoop.mapred.SequenceFileInputFormat
426 output format: org.apache.hadoop.hive.ql.io.HiveSequenceFileOutputFormat
427 serde: org.apache.hadoop.hive.serde2.lazy.LazySimpleSerDe