About pcoates

pcoates · ‎01-11-2016

I have a requirement to periodically restart all cluster nodes at the machine level. Assume I've done an FSCK before starting to confirm that all blocks are fully replicated. Question is, as I restart each node in turn, will the NameNode notice that any block on that node is under-replicated and put those blocks on the replication queue? If this does happen, will it automatically remove those blocks when the data node comes back online and reports it's blocks to the NN? Note, this is a hardware restart, so the Ambari rolling restart doesn't do the job.

pcoates · ‎01-06-2016

Within a cluster we have no trouble executing commands agains an HA NameNode using the NameServiceID. But it doesn't work when doing discp from one cluster to another because the clusters are unaware of each other's mapping of nodes to NameServiceID. How does one do this?

pcoates · ‎01-04-2016

We have two use cases--one is the normal slight imbalance that can creep up gradually and the other is when we add new nodes. Ten new nodes can be 100TB+ to move around--it can take a very long time with normal dfs.network.bandwidth.persecond setting. What's a good strategy? Is it reasonable to use chron to reset the value during off hours? What's the best practice? Also, does rebalancing defer to normal processing dynamically?

pcoates · ‎12-09-2015

The documentation seems to suggest that the normal mode of use would be to have one reconstituted replica sitting around and that reconstituting an encoded block would be done only if this isn't the case. Keeping a block by default would eliminate most of the space savings because the data would expand from 1.6 to 2.6 times the raw file size. Why not have a policy that for leaves a single size copy for a limited time after a block is used? A "working set" as it were, so if you've used a block in the last X hours the decoded block won't be deleted.

pcoates · ‎12-08-2015

The admins want to know why every service has its own account ID, and is there any harm is using the same account for all? The cluster will be tightly secured. What is the best practice?

pcoates · ‎12-06-2015

Hadoop has long stressed moving the code to the data, both because it's faster to move the code than to move the data, and more importantly because the network is a limited shared resource that can easily be swamped. Erasure coding would seem to require that a large proportion of the data must move across the network because the contents of a single block will reside on multiple nodes. This would presumably apply not just the ToR switch, but the shared network as well, if the ability to tolerate the loss of a rack is preserved. Is this true and how are these principles reconciled?

pcoates · ‎11-23-2015

Your inode article is a great addition to David's answer. I'm puzzled though that any machine would run out of inodes before running out of disk space---it would require a strange configuration of the file system, wouldn't it? Was someone trying to save on inode allocation by assuming the average file would be larger? I can't think of any other reason to stray from the defaults. Any idea why?

pcoates · ‎11-21-2015

Thanks Neeraj and Deepesh---that's what I needed to know.

pcoates · ‎11-20-2015

Thanks Ancil. I'm still curious about what can be done from inside Hadoop. The federation of queries is particularly interesting becauese you don't always want to import the data into HDFS.

pcoates · ‎11-20-2015

Thanks for the reply. Yes, I read that page--the problem is trying to confirm whether the version of the connector in that tarball that this leads to, which seems to be for HDP 2.3 works with 2.2.4. Can't seem to locate one specifically for 2.2.4.

Online	Offline
Last Visited	‎03-30-2016 02:57 PM

Member Since	‎10-06-2015 02:10 PM
Last Visited	‎03-30-2016 02:57 PM
Posts	45
Kudos received	48

Cloudera Community

How does restarting a data node affect block repli...

How to use Name Service ID between to Clusters

What are the best practices for HDFS rebalancing?

Re: How will Erasure Coding affect the principle o...

Why are there 21 separate service accounts?

How will Erasure Coding affect the principle of da...

Re: How many files is too many on a modern HDP clu...

Re: Connecting Teradata to HDP 2.2.4

Re: Access modes for teradata beyond Sqoop ingesti...

Re: Connecting Teradata to HDP 2.2.4