PutHiveQL Processor function in NIFI

Hi,

I noticed that the only configurable part of the PutHiveQL processor is Hive Database Connection Pooling Service Name, and this needs to be set up as a controller service. So I'm wondering where to specify the actual INSERT statement or query that will do the job. Does this imply that another processor has to precede it, one that generates insert queries specific to where and how we want the data in Hive, and feeds them to this processor as flowfiles?

1 ACCEPTED SOLUTION

Sam, the statements should be in the flowfile content/payload. Please see https://nifi.apache.org/docs/nifi-docs/components/org.apache.nifi.processors.hive.PutHiveQL/index.ht...

Also, it's not limited to INSERT statements; any Hive DDL will be executed by this processor as well. This becomes very useful for creating new partitions in a table, etc.
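
To make the point concrete, here is a minimal sketch of the kind of statement text an upstream step could place in the flowfile content before it reaches PutHiveQL. It is only an illustration, not NiFi code: the helper names, the `web_logs` table, and the `dt` partition column are all hypothetical, and in a real flow the text would typically be produced by whatever upstream processor you choose (for example ReplaceText or a scripted processor).

```python
# Sketch only: builds HiveQL statement text as plain strings.
# Table and column names below are hypothetical examples.

def quote(value):
    """Render a Python value as a simple HiveQL literal."""
    if isinstance(value, (int, float)):
        return str(value)
    # Escape single quotes so the string literal stays well-formed.
    return "'" + str(value).replace("'", "\\'") + "'"


def insert_statement(table, values):
    """Build an INSERT statement for a single row of values."""
    row = ", ".join(quote(v) for v in values)
    return f"INSERT INTO TABLE {table} VALUES ({row})"


def add_partition_statement(table, partition):
    """Build a DDL statement that adds a partition if it is missing."""
    spec = ", ".join(f"{col}={quote(val)}" for col, val in partition.items())
    return f"ALTER TABLE {table} ADD IF NOT EXISTS PARTITION ({spec})"


if __name__ == "__main__":
    # Each printed line is what one flowfile's content could look like.
    print(add_partition_statement("web_logs", {"dt": "2016-05-01"}))
    print(insert_statement("web_logs", ["GET /index.html", 200]))
```

Either string, written as the flowfile content, is what PutHiveQL would then execute against the configured Hive connection pool.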

@Andrew Grande

Thanks for the clarification.
