Member since
10-18-2017
13
Posts
2
Kudos Received
0
Solutions
04-15-2019
08:32 AM
Thanks @Lester Martin I keep in mind the balancer admin command. I solve the issue simply by removing a very huge file created by a data scientist executing a very huge request on hive. The temporary files located at /tmp/hive/[user] seems to be not replicated (i'am not sure of that).
... View more
04-12-2019
08:34 AM
Hi all, I have 6 datanodes on my hortonworks cluster (HDP 2.6.3) and one of them is 91% full. The others are "only" 65% full. Il don't understand why the replication is not homogeneous and how i can fix it ? I check the file system and the same difference is observed : On a safe node : # pwd
/grid1/hadoop/hdfs/data/current/BP-332877091-10.136.82.11-1500650625087/current/finalized
# du -h . --summarize
1.9T . On the unsafe node : # pwd
/grid1/hadoop/hdfs/data/current/BP-332877091-10.136.82.11-1500650625087/current/finalized
# du -h . --summarize
2.7T . Same things on each DataNode directories. Thanks for your help. Mathieu
... View more
Labels:
- Labels:
-
Apache Hadoop
11-08-2017
07:44 AM
Thank's a lot for your response @kgautam. That's what I thought but now I have arguments to convince.
... View more
10-18-2017
08:57 AM
Hello, My company and I are new users of hortonworks. We plan to use oozie as scheduler but for most of us, we found strange to put scripts and librairies into the HDFS. The main role of the HDFS isn't to store data and only data ? As oozie's clients can be installed on several nodes of the cluster, I think it's pertinent to get these librairies somewhere accessible by all nodes. So the HDFS is in fact the best place (and files are replicated). Can someone tell me why oozie made this choice ? Greathfully Mathieu
... View more
Labels:
- Labels:
-
Apache Hadoop
-
Apache Oozie