Support Questions

Find answers, ask questions, and share your expertise
Announcements
Check out our newest addition to the community, the Cloudera Data Analytics (CDA) group hub.

What are the semantics of Tez logs?

Hi, I am trying to get query information like the query text, Username, session Id after running Hive queries with Tez. I know that logs get generated in /ats/done directory.

Running on Azure:

To be specific: When I run a query like "select * from table1" which has fewer data results in a single "entitylog" file with "daggroup" keyword and a single "summarylog" as shown in the below image.


109935-screen-shot-2019-07-16-at-121532-pm.png

Whereas complex queries with joins or functions result in multiple "entitylog" files with keyword "daggroup" in them. And multiple "summarylog" files.

109904-screen-shot-2019-07-16-at-121525-pm.png


Running on AWS:

For the same queries I see multiple and single "entitylog" files with keyword "dag" in them and the same for "summarylog" files.
109924-screen-shot-2019-07-16-at-121546-pm.png


Is there any rhyme or reason for the log file names? When would we get a single or multiple "entitylog" or "summarylogs"?Are there any docs from where we could refer to or would be helpful if you could point to the exact piece of code where these get generated.

1 REPLY 1

Take a Tour of the Community
Don't have an account?
Your experience may be limited. Sign in to explore more.