Member since
09-23-2015
88
Posts
109
Kudos Received
1
Solution
My Accepted Solutions
Title | Views | Posted |
---|---|---|
6792 | 08-24-2016 09:13 PM |
10-19-2015
10:29 PM
1 Kudo
We've been using the Tez Persistent Queues (Interactive Query Tuning) to optimize Tez Queue performance. Separately we now have Capacity Scheduler "Fair Sharing" policies, which allows separate jobs on the same queue to execute with evenly shared resources. How should we reconcile the two for optimal Hive configuration? E.g. should we configure Persistent queues when queries are submitted as one user "Hive" and use Fair Sharing policies if the job is submitted by a variety of users? Any guidance on whether or not we should use the two in combination will help.
... View more
Labels:
- Labels:
-
Apache Hive
10-15-2015
01:56 AM
Is there a way I can configure nifi, not to pull same files over? It appears to pull the same files more than once if some files in the directory were modified. I.e. could you explain the full algorithm behavior on how GetSFTP tracks which files it has downloaded? And how does this behave in case NiFi process or the server restarts?
... View more
Labels:
- Labels:
-
Apache NiFi
-
Cloudera DataFlow (CDF)
10-11-2015
12:15 PM
1 Kudo
In your opinion, what is the best consolidated resource for performance tuning an HBase instance? Are the HBase guides the best reference we have for performance tuning or are there others which are more comprehensive?
http://hbase.apache.org/book.html#performance http://hbase.apache.org/book.html#important_configurations
... View more
Labels:
- Labels:
-
Apache HBase
10-08-2015
04:34 PM
Good point. In particular, we should add the user to sudoers list in case their chosen processors access privilege ports or system resources.
... View more
10-08-2015
03:29 PM
1 Kudo
NiFi can be deployed to run as any non-root user. By default NiFi runs as user "nifi", but this can be modified. For instructions, https://nifi.apache.org/docs/nifi-docs/html/administration-guide.html then go to 'Admin Guide' then go to 'Bootstrap Properties’. In there it talks about 'run.as'
... View more
Labels:
10-08-2015
03:26 PM
Which tools do you see as most popular for customers who wish to manage large number of NiFi worker/slave instances across many separate servers? E.g. do you see people using Puppet, pdcp, or other tools?
... View more
Labels:
- Labels:
-
Apache NiFi
10-07-2015
06:55 PM
Customer would like to know if they are able to switch the keys which are stored in the KMS without re-encrypting HDFS data? I believe this may also be referred to as the EEK (Encrypted Encryption Key)? Documentation here
... View more
Labels:
- Labels:
-
Apache Ranger
10-07-2015
01:13 PM
3 Kudos
Atlas UI allows us to search for entities with SQL like syntax, for example 'table where name = "sales_fact"'. Is this SQL framework based on any existing libraries within Gremlin? Or was this custom built for Atlas? Trying to understand if there is documentation on the proper SQL syntax for these searches? Doc Reference: http://docs.hortonworks.com/HDPDocuments/HDP2/HDP-2.3.2/bk_data_governance/content/section_metadata_store_mgmt_with_atlas_ui.html
... View more
Labels:
- Labels:
-
Apache Atlas
09-30-2015
08:07 PM
What is the reason for increasing the heap? Are you seeing OOM errors due to too many users?
... View more
09-30-2015
03:58 PM
2 Kudos
Storm platform assumes that all messages will be processed successfully. This implies that any messages marked as "failed" were also replayed and successfully processed. I don't believe that failed and replayed messages are identified in a unique way by the platform. Documentation reference "This means the (failed) message is not actually taken off the queue yet, but instead placed in a "pending" state waiting for acknowledgement that the message is completed. While in the pending state, a message will not be sent to other consumers of the queue."
... View more
- « Previous
- Next »