Member since
06-20-2016
12
Posts
4
Kudos Received
1
Solution
My Accepted Solutions
Title | Views | Posted |
---|---|---|
722 | 07-06-2016 03:08 PM |
02-20-2017
11:19 PM
1 Kudo
I am writing a java map reduce job that reads through a variety of files and directories. I have no idea of knowing ahead of time if the data storage type is plain text, orc, avro, what have you. Is there some way to determine the storage format of a file programmatically? I haven't found anything in any of the file system apis.
... View more
Labels:
- Labels:
-
Apache Hadoop
12-23-2016
11:39 PM
1 Kudo
Great suggestion! I may try that first.
... View more
12-23-2016
01:52 PM
If you check out the Jira tickets the issue revolves around needing to execute from the secure cluster and read from the insecure cluster. https://issues.apache.org/jira/browse/HDFS-7037
... View more
12-22-2016
08:14 PM
1 Kudo
I have an old CDH (unsecured) cluster (5.4.3) that I am trying to migrate a directory of data from to a new HDP (2.4.x) cluster. Webhdfs has been disabled on the source (unsecured) cluster.
So far I have tried distcp, however due to the mismatched versions, and the secure/unsecure issues, hdfs:// > hdfs:// does not work, hftp > hdfs does not work and webhdfs:// > webhdfs:// cannot work. (HDFS-7037 HDFS-6776)
I also tried sqooping the files from Hive(unsecured cluster) to HDFS (secured cluster) but version differences again prevent that avenue. (HIVE-6050) Outside of writing a script to execute on the unsecured cluster, and pushing data through the rest api on the secured cluster's version of webhdfs I can't think of a faster way.
Before I do that I figured I'd ask if anyone else had a better idea?
... View more
Labels:
- Labels:
-
Apache Hadoop
07-06-2016
03:08 PM
Check the /etc/hosts file. All the nodes in the cluster should have ip addresses to every other node in the cluster.
... View more
07-05-2016
10:14 PM
1 Kudo
You can do a table sample. Select * from bucketed_user tablesample(bucket 1 out of 2 on state) where country = AU;
... View more
06-20-2016
12:56 PM
Oh no. I meant demographic information. Things like industries that hold certifications, or the gender percentages. I wouldn't expect that granular level of detail.
... View more
06-20-2016
12:39 PM
Hello, I'm curious if there is any data available on those who hold HDPCD certifications.
... View more
Labels:
- Labels:
-
Certification