Member since: 06-07-2016
Posts: 923
Kudos Received: 322
Solutions: 115
My Accepted Solutions
| Title | Views | Posted |
|---|---|---|
| | 4101 | 10-18-2017 10:19 PM |
| | 4345 | 10-18-2017 09:51 PM |
| | 14851 | 09-21-2017 01:35 PM |
| | 1841 | 08-04-2017 02:00 PM |
| | 2424 | 07-31-2017 03:02 PM |
02-01-2017
11:10 PM
@samuel sayag What is this script element doing in your hbase-site.xml and hive-site.xml? Can you please remove it and try again?
02-01-2017
02:24 PM
In production, you would have "edge nodes" where the client programs are installed and talk to the cluster. But even if you put data on the local file system of a data node and then copy it into HDFS, that will not prevent data distribution. The client file lives in the local file system (XFS, ext4), which is separate from HDFS (not entirely, but as far as your question is concerned). Standard practice is to use an edge node, not the name node.
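The edge-node workflow described above can be sketched as follows. The hostname and paths here are placeholders, and the sketch assumes the HDFS client is configured on the edge node:

```shell
# Hypothetical workflow: land the file on an edge node, then load it into HDFS.
# "edge01" and all paths are placeholders; adjust for your cluster.
scp data.csv user@edge01:/tmp/data.csv
ssh user@edge01 'hdfs dfs -put /tmp/data.csv /user/me/data.csv'
# Once put into HDFS, the file is split into blocks and replicated
# across data nodes regardless of which machine issued the put.
```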
01-31-2017
08:44 PM
Fair enough. See the new answer by @bpreachuk. I was assuming you were looking for free tools, but if you can get Syncsort, or if you already have it, that's the easiest way to do this.
01-31-2017
08:20 PM
@samuel sayag The error you are getting is: "Unable to set watcher on znode (/hbase/hbaseid)". Is your ZooKeeper running? If yes, please share your hbase-site.xml.
01-31-2017
05:01 PM
1 Kudo
@Joby Johny You can use Cloudbreak to set up an HDP cluster on AWS if HDC does not have everything you need. http://sequenceiq.com/cloudbreak-docs/release-1.6.1/aws/
01-31-2017
04:55 PM
@Prasanna G PuTTY is an SSH client, not an HDFS client. Once you SSH into your sandbox, you can run hdfs commands, because the sandbox is where HDFS (including its shell commands) is installed.
This is similar to the fact that you cannot run "ls /some/directory" from PuTTY before you SSH into the box.
01-31-2017
07:13 AM
@Saurabh
What is the value of the property dfs.namenode.accesstime.precision? Check the docs here: https://hadoop.apache.org/docs/r2.6.1/hadoop-project-dist/hadoop-hdfs/hdfs-default.xml Use FileStatus.getAccessTime() to get the last access time. The result depends on the precision set above: if it is set to zero, access times are not maintained at all; if it is left at the default of one hour, you get access times accurate to within one hour; if you have set your own precision, you get whatever you set. https://hadoop.apache.org/docs/r2.7.1/api/org/apache/hadoop/fs/FileStatus.html
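The precision semantics can be illustrated with a small sketch. This is not Hadoop code; the function name and values are hypothetical, modeling only the behavior described above (precision 0 disables tracking, otherwise the stored time is refreshed only when it is more than one precision-window old):

```python
def recorded_access_time(actual_access_ms, last_recorded_ms, precision_ms):
    """Model of the access time the NameNode would keep after a read.

    precision_ms == 0 disables access-time tracking entirely;
    otherwise the stored time is refreshed only when the previously
    recorded time is more than precision_ms in the past.
    """
    if precision_ms == 0:
        return 0  # access times are not maintained
    if actual_access_ms - last_recorded_ms > precision_ms:
        return actual_access_ms  # stale enough: update the stored time
    return last_recorded_ms      # within the window: keep the old value

HOUR_MS = 60 * 60 * 1000  # the default precision (one hour)

# A read 30 minutes after the last recorded access keeps the old value...
print(recorded_access_time(30 * 60 * 1000, 0, HOUR_MS))   # 0
# ...but a read two hours later updates it.
print(recorded_access_time(2 * HOUR_MS, 0, HOUR_MS))      # 7200000
```

So with the default setting, two reads an hour apart can report the same access time; lowering the precision trades NameNode overhead for fresher values.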
01-31-2017
05:08 AM
@karthick baskaran You can use the following project; it uses JRecord to do the conversion: https://github.com/tmalaska/CopybookInputFormat You can then use Spark to read your EBCDIC files from Hadoop and convert them to ASCII using the above library.
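For the character-conversion step alone, here is a minimal sketch independent of JRecord and Spark. It assumes the text fields use code page 037 (the common US/Canada EBCDIC code page); mainframe files may use a different code page, and packed-decimal (COMP-3) fields still need copybook-aware handling like JRecord provides:

```python
# Simulate a record read from an EBCDIC file by encoding a known string.
ebcdic_bytes = "HELLO".encode("cp037")
print(ebcdic_bytes.hex())                # c8c5d3d3d6 -- EBCDIC, not ASCII

# Decoding with the matching code page recovers the text.
ascii_text = ebcdic_bytes.decode("cp037")
print(ascii_text)                        # HELLO
```

This only covers plain text fields; binary and packed fields in a copybook layout are why a library such as JRecord is the practical choice.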
01-31-2017
03:00 AM
@Prasanna G All documentation for the sandbox is right here, which I think you are aware of: http://hortonworks.com/hadoop-tutorial/learning-the-ropes-of-the-hortonworks-sandbox/ As for which file system, I have never verified it, but I would expect a standard Linux file system such as ext4. Just type the "mount" command without any parameters and it will show you the mounted file systems and their types.
01-31-2017
02:15 AM
1 Kudo
@Prasanna G I think you are copying into your local file system and looking in HDFS. Check your local tmp folder. Also, your full command is not visible, but I am assuming it is something like this:
pscp -P 2222 C:\Users\prgovind\Downloads\f.txt root@localhost:///tmp
Is that right? You first need to copy the file to the /tmp folder on your sandbox and then push it into HDFS. Try the following:
pscp -P 2222 C:\Users\prgovind\Downloads\f.txt root@sandbox:/tmp
ssh root@sandbox
hdfs dfs -put /tmp/f.txt /user/praskutti