Member since 08-16-2016
642 Posts
131 Kudos Received
68 Solutions
My Accepted Solutions
| Title | Views | Posted |
|---|---|---|
| | 3973 | 10-13-2017 09:42 PM |
| | 7463 | 09-14-2017 11:15 AM |
| | 3792 | 09-13-2017 10:35 PM |
| | 6027 | 09-13-2017 10:25 PM |
| | 6595 | 09-13-2017 10:05 PM |
06-29-2017 06:43 PM
This usually means that another adaptor is being picked up by the test. For me it was the loopback and it doesn't have a speed or mode, so the health test fails. Use ethtool to examine your adaptors and find the one that doesn't have a speed. Add a regex to exclude it in Network Interface Collection Exclusion Regex under the Host configuration screen. My regex for the loopback adaptor (lo) is ^lo$
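To sanity-check an exclusion pattern before pasting it into the config field, you can try it against your interface names first. This is only a sketch: the interface names in the sample list are made up, and it assumes the pattern behaves the same under Python's `re` module as it does in Cloudera Manager (for a fully anchored pattern like `^lo$` that should hold).

```python
import re

# The exclusion pattern from the post: matches only an interface named exactly "lo".
EXCLUSION_REGEX = re.compile(r"^lo$")


def excluded(interface_names, pattern=EXCLUSION_REGEX):
    """Return the interface names that the pattern would exclude."""
    return [name for name in interface_names if pattern.search(name)]


# Hypothetical interface names, for illustration only.
sample = ["lo", "eth0", "lo0", "wlo1", "bond0"]
print(excluded(sample))  # ['lo']
```

The anchors matter: without `^` and `$`, a bare `lo` would also exclude names like `wlo1`.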
06-29-2017 06:03 PM
You would need to add the copied directory as a DataNode data directory (dfs.datanode.data.dir). Even then, I don't know if the NN will treat them as the same blocks, since a different DN will now be listing them in its block report. Typically, if a DN reports a block that doesn't match what the NN has, the NN tells it to delete the block. The safe approach is to recommission the old node, change the replication factor, and then decommission it again.
06-28-2017 07:53 PM
@csguna No, I mean the YARN gateway, HDFS gateway, Hive gateway, etc. Each of these installs the binaries and libraries, sets the env vars, and deploys the client configuration files for its service.
06-27-2017 07:43 AM
They should be under /etc/<service>/conf, where service can be hadoop, hive, yarn, etc.
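As a quick illustration of that path convention, here is a small sketch that builds the directory for a few services. The helper name `client_conf_dir` is made up for this example; only the /etc/<service>/conf layout comes from the answer above.

```python
from pathlib import PurePosixPath


def client_conf_dir(service):
    """Build /etc/<service>/conf for a given service name (hypothetical helper)."""
    return PurePosixPath("/etc") / service / "conf"


for service in ("hadoop", "hive", "yarn"):
    print(client_conf_dir(service))
```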
06-27-2017 07:19 AM
I don't know of any hard limits. There are practical limitations, though: a table with 10k+ partitions will likely fail on operations that touch all partitions, such as 'drop table'. That is generally the soft cap on partitions per table. For the full cluster, the backend RDBMS hosting the metastore will dictate this somewhat. Again, there is no hard limit. I have seen some clusters with nearly 10 million partitions across all tables. Granted, HMS, HS2, and CatalogD were not stable there due to the large partition count. A single large query, a set of them, or full table scans would bring them down each time. Your HMS heap will also need to be large. Hive does now have settings to prevent full partition grabs or to limit the partition count per query. The Hive community is moving HMS to be backed by HBase to address the scalability of partitions, tables, and databases.
06-26-2017 09:56 PM
1 Kudo
Did you include all existing nodes in the new racks? That is the only thing I can think of. If you missed one, then it would be considered decommissioned and its blocks would be reported as missing or under-replicated until they are replicated to other nodes. You are correct, a block would report as mis-replicated after the topology change if its two replicas were in the same rack. I have seen the replication issue pop up before. I don't know what the resolution ended up being, but it is critical to remember that the replication factor is a client-side setting, so if a client is still using 3 as the repl factor then that data will have 3 replicas for each block.
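To make the mis-replication point concrete, here is a rough sketch of the rack check. It assumes the default HDFS placement rule that a block with two or more replicas should span at least two racks whenever the cluster has more than one rack. The function name and rack paths are made up for illustration; this is not the NameNode's actual code.

```python
def is_mis_replicated(replica_racks, cluster_rack_count):
    """Return True if the replicas span fewer racks than the assumed policy requires.

    replica_racks: rack path for each replica of one block, e.g. ["/rack1", "/rack1"].
    cluster_rack_count: number of racks defined in the topology.
    """
    # Assumed rule: need min(2, number of replicas, racks in the cluster) distinct racks.
    required = min(2, len(replica_racks), cluster_rack_count)
    return len(set(replica_racks)) < required


# Two replicas that landed in the same rack after a topology change.
print(is_mis_replicated(["/rack1", "/rack1"], cluster_rack_count=2))  # True
# Same two replicas, but spread across both racks.
print(is_mis_replicated(["/rack1", "/rack2"], cluster_rack_count=2))  # False
```

With a single-rack topology the check never fires, which is why the warning only shows up once racks are introduced.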
06-26-2017 06:28 PM
I just copied it over from a cluster node.
06-26-2017 05:31 AM
Yes you do.
06-26-2017 03:24 AM
Please share the HW and SW specs and the results. I am quite interested. As pointed out, both could sway the results, as even Impala's defaults are anemic. Also, I want to point out that Kudu is a storage engine, Impala is an in-memory query engine, and Parquet is a file format. So what you are really comparing is Impala+Kudu vs. Impala+HDFS. You should be using the same file format for both to make it a direct comparison. Also, I don't view Kudu as the inherently faster option. Yes, it is written in C++, which can be faster than Java, and it is, I believe, less of an abstraction. Anyway, my point is that Kudu is great for some things and HDFS is great for others. It isn't a this-or-that choice based on performance, at least in my opinion.
06-26-2017 02:22 AM
For what it is worth, I just did this and it worked.
1. set up the CDH 5 repo
2. installed hadoop-client with my package manager
3. updated the configs manually (scp or CM API)
4. ???
5. profit