Hive on Tez: intermediate output files

Hi,

Suppose I have a Hive query that runs as a Map phase followed by a Reduce phase.

Is there a way to get the output file from the Map phase before it is processed by the Reduce phase?

That would let me understand which phase does what, and then optimize the query.

Thanks.

1 REPLY

I'm not aware of a way to do that. However, you might have more luck using EXPLAIN to understand what the individual vertices of the query do. With "hive.tez.exec.print.summary=true" set, you can see a summary of the number of records that flow between vertices. The Tez View also has some visualizations of this data.
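
As a rough sketch of how the two suggestions above might look in a Hive session: the table `sales` and its columns are made up purely for illustration, so substitute your own query. The setting name is the one quoted above; the exact layout of the plan and the summary depends on your Hive version.

```sql
-- Ask Hive to print a per-vertex summary once a Tez query finishes
-- (setting name as given in the reply above).
SET hive.tez.exec.print.summary=true;

-- Show the plan without running the query: it lists the Map and Reducer
-- vertices and the operators each one executes.
-- NOTE: `sales`, `region` and `order_count` are hypothetical placeholders.
EXPLAIN
SELECT region, COUNT(*) AS order_count
FROM sales
GROUP BY region;

-- Run the same query for real; with the setting above, the summary
-- is printed after the job completes.
SELECT region, COUNT(*) AS order_count
FROM sales
GROUP BY region;
```

Reading the plan next to the summary should show which vertex does the grouping and roughly how many records pass from the Map side to the Reduce side, which is usually enough to decide where to optimize even without the intermediate files.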
