Support Questions
Find answers, ask questions, and share your expertise
Announcements
Alert: Welcome to the Unified Cloudera Community. Former HCC members be sure to read and learn how to activate your account here.

Java ORC tool, buckets and Amazon Athena

Highlighted

Java ORC tool, buckets and Amazon Athena

New Contributor

Dear friends,


I am using Java ORC tool (https://orc.apache.org/docs/java-tools.html) to convert JSON records into ORC files. I am querying the ORC files on Amazon Athena. It is working like a charm.

My question: can I use the same Java ORC tool to convert JSON records into a bucketed ORC table?

If not, should I use Apache Hive to perform the conversion from JSON records into a bucketed ORC table?

I want to see if bucketed ORC files increase the performance of my queries on Amazon Athena.

I look forward to hearing from you.


All the best,

Alex.

Don't have an account?
Coming from Hortonworks? Activate your account here