About dkaiser

dkaiser · ‎08-14-2017

@Muji, is this single setting "use.hive.interactive.mode=true" a new feature of the Hive View 2.0 as of HDP-2.6? Do you know when this became available? Your answer worked immediately for me on 2.6.1. I think the Hive View 2.0 version is "2.0.0".

dkaiser · ‎04-26-2016

I would look at the alternate answers to this question, which are simpler and allow the package manager to gracefully downgrade the snappy library without breaking dependencies or unnecessarily removing other packages.

dkaiser · ‎04-26-2016

I would never remove and reinstall a package (which is the more complex answer listed as the 'Best Answer' above) when a simple downgrade or upgrade will work, as the RPM package manager will retain the dependencies and prevent unnecessary uninstallation of related packages. Kirk's answer here looks legitimately easier and better.

dkaiser · ‎12-09-2015

Vinod, this is a great FAQ article. Is this the "latest" certification? Will this be limited to HDP 2.2 and HDP 2.3? How do we maintain this info in this post so it stays current over the years as multiple certifications are done over many versions?

dkaiser · ‎12-07-2015

I am working with a lot of geospatial data and big data sets that need analysis and interpretation. Can you point me to best practices on doing this.

dkaiser · ‎09-29-2015

The best-practice is to avoid the use of active Anti-Virus (AV) systems that monitor access to the underlying disk systems being used for metadata storage by the following processes: Apache Hadoop HDFS Namenode HDFS Datanode YARN Resource Manager YARN Node Manager Apache Accumulo Apache Flume Apache HBase Apache Kafka Apache ZooKeeper These processes store data structures only, and there is nothing stored by these processes that is executable by the underlying OS. As these processes can be very active, potentially performing continuous writes against large files, the best performance requires direct, unimpeded access to the underlying filesystem, and any AV system that traps filesystem calls will have a negative impact on Hadoop system performance. Some sites choose to implement AV "scans" that run periodically (like a weekly scan) on clients, gateway and "edge node" systems where users & developers connect and run local processes. These scans do not interfere with cluster performance, but are important to safeguard the edge-connected systems that are the main clients of the cluster.

dkaiser · ‎09-25-2015

Some helpful ideas to keep content organized in the community forums. Topic Tags : Pay attention to misspelled tags and avoid conjoined tags. The ability to group forum posts together by a topic is diminished when there are multiple, differently-spelled tags for the same topic. Having all similar posts tagged in common helps with grouping and ranking of content. Tags will add value for integration with the Hortonworks Gallery and other public sites, so having clean consistent usage of topic tags, especially for standard HDP components, will really help things look good. As an example: If some posts are tagged as 'NiFi' and some as 'dataflow' but they are referring to the same thing, then it will be hard for the AnswerHub system to rank the one overall topic by popularity, or for users to click on a tag and see all relevant questions. Best-practices: When posting a question, idea, or article, scan the topics page and search for the proper tag as it may exist before you create a new one. If you see multiple tags with the same meaning, consolidate to the correct tag. If you see a topic that is misspelled, you can edit and correct that tag. Use each topic in an individual tag. Having a conjoined tag such as [ambari with kerberos] or [nifi dataflow] is not useful compared to the standard of using separate individual tags, like [Ambari], [kerberos] or [nifi], [dataflow]. Comments vs. Replies : Pay attention to the difference between a comment and a reply. A comment to a question is used if requesting clarification or validation of the question. Comments cannot be accepted as a valid answer by the requester. Comments cannot be liked or shared by anyone, and do not contribute to the overall 'Number of replies' count for the post. A reply is used to provide an answer. Replies can be accepted as a valid answer, liked, rewarded and shared, so there is much more value to the community if your answer is placed into a proper reply. If you reply and it is accepted, it helps to increase your reputation as well as the value score of the question. Best-practices: If you are providing any intrinsic value such as a technical answer, solution architecture, suggested design, alternative idea, etc. you should use the reply function so your information provides context. If you see that someone has provided 'comment-level' text in a reply, click on the gear-menu and convert the reply into a comment by selecting the menu option 'Convert Answer to Comment' (limited to track moderators)

dkaiser · ‎09-25-2015

@sraghavan@hortonworks.com I think the point of tags should be to separate the various components/keywords. Your tag 'nifi for rabitmq and couchbase' isn't really a single tag. I have re-tagged with 3 separate tags as separate words: Nifi rabbitmq and couchbase. Thanks.

dkaiser · ‎09-25-2015

Is it possible use a single Zookeeper quorum for managing the HA state of more than one HDFS-HA? If so: Would there be any scaling limits with this approach? A practical upper limit on number of Namenodes that can work across a single Zookeeper quorum? What steps are required to configure the multiple HDFS Namenodes to use the single Zookeeper quorum? What impact would this have on upgrades and maintenance? Does Hortonworks support provide support for Namenodes or Zookeeper implemented in this configuration? In general, are there reasons why this would be not recommended?

Online	Offline
Last Visited	‎10-02-2020 02:06 PM

Member Since	‎09-23-2015 08:20 PM
Last Visited	‎10-02-2020 02:06 PM
Posts	14
Kudos received	21

Cloudera Community

Re: Are there any recommendations or best practice...

Re: Hive LLAP in Hive View Ambari

Re: HDP-2.3.4.0-3485 upgrade failed due to HDP-UTI...

Re: HDP-2.3.4.0-3485 upgrade failed due to HDP-UTI...

Re: HDP with Isilon: Certified and ready for any H...

What is the best way to perform geospatial analysi...

Re: Are there any recommendations or best practice...

Best Practices for using HCC

Re: Nifi for RabbitMQ and Couchbase

Can one Zookeeper quorum support multiple HDFS ins...