Member since 12-01-2015
13 Posts, 2 Kudos Received, 0 Solutions
02-09-2021 05:52 AM
@medloh That is the correct solution here. The filename is always stored in a FlowFile attribute named "filename", and the UpdateAttribute processor is the easiest way to manipulate FlowFile attributes. You can combine other attributes, static text, and even subjectless functions like "now()" or "nextInt()" to create a dynamic filename for each FlowFile. See the Expression Language Guide: https://nifi.apache.org/docs/nifi-docs/html/expression-language-guide.html
Hope this helps, Matt
05-02-2018 02:16 PM
Thanks Romainr, you pointed us in the right direction. I was setting it in several incorrect places in CM Impala and Hue. The place that finally worked was:
Hue --> Configuration --> Hue Server Advanced Configuration Snippet (Safety Valve) for hue_safety_valve_server.ini
[impala]
query_timeout_s=86400
02-05-2018 04:18 PM
Gzip decompression will definitely use more CPU than Snappy decompression, so I'd usually expect Gzip to give you worse performance. The exceptions are queries limited by disk I/O (in which case smaller files are better) and queries that aren't limited by scan performance in the first place.
11-28-2017 05:00 PM (1 Kudo)
It's generally safe to use name-based resolution by default. Performance should be about the same. I agree name-based resolution may be a better choice because it's more intuitive. Index-based and name-based resolution have different tradeoffs in terms of which schema-evolution operations are allowed. For example, with index-based resolution you can safely rename a column in your table schema. With name-based resolution you can safely add or drop columns in the middle of your table schema, whereas with index-based resolution you can generally only add new columns at the end. So it's really all about tradeoffs.
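To make the tradeoff concrete, here is a toy Python sketch of the two resolution strategies. It is only an illustration under simplifying assumptions, not how any real engine implements column resolution: the helper names resolve_by_index and resolve_by_name and the sample columns are hypothetical, and a data file is modeled as a plain list of column names plus one row of values.

```python
# Toy model: a data file written under the ORIGINAL table schema (id, name).
file_columns = ["id", "name"]
file_row = [1, "alice"]


def resolve_by_index(table_columns, row):
    """Match table column i to file column i; columns past the end read as None."""
    return {
        col: (row[i] if i < len(row) else None)
        for i, col in enumerate(table_columns)
    }


def resolve_by_name(table_columns, file_cols, row):
    """Match table columns to file columns by name; unknown names read as None."""
    by_name = dict(zip(file_cols, row))
    return {col: by_name.get(col) for col in table_columns}


# Case 1: rename "name" to "full_name" in the table schema.
renamed = ["id", "full_name"]
print(resolve_by_index(renamed, file_row))               # old data still readable
print(resolve_by_name(renamed, file_columns, file_row))  # renamed column reads as None

# Case 2: add a new column "age" in the MIDDLE of the table schema.
added_middle = ["id", "age", "name"]
print(resolve_by_index(added_middle, file_row))               # values shift into the wrong columns
print(resolve_by_name(added_middle, file_columns, file_row))  # old columns still line up

# Case 3: add the new column at the END instead.
added_end = ["id", "name", "age"]
print(resolve_by_index(added_end, file_row))  # safe: the new column simply reads as None
```

In this sketch a rename only survives index-based resolution, while an insert in the middle only survives name-based resolution, which matches the tradeoff described above.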