Support Questions

Find answers, ask questions, and share your expertise
Announcements
Celebrating as our community reaches 100,000 members! Thank you!

Lad Source Files to HDFS from My local Machine

avatar
Rising Star

I've download the cloudera-quickstart-vm-5.7.0-0-virtualbox Virtual Machine to do my Big Data Project. In my PC i've two Zips files (2GB each on) that contains my source data (there are a lot of txt files). I need to upload this files to HDFS in Virtual Machine, however I getting some troubles When I try to copy/drag the txt files to Virtual Machine.

 

I was thinking loading the files directly on HDFS (no use Sqoop, for example), my question is:
Is there a way that can I load the Source Data from my Local PC to HDFS? Java? Sqoop? There a lot of Txt Files...

Thanks!

1 ACCEPTED SOLUTION

avatar
Guru
The VirtualBox Guest additions are installed in the VM which should enable
drag & drop of files, but perhaps it's having issues with the size of the
files? SSH should also be running so scp is another option, as is a Shared
Folder. You'll need to get the file to be visible from the VM's filesystem,
perhaps unzip them at that point, and then you can use 'hadoop fs
-copyFromLocal' to put them in HDFS.

View solution in original post

1 REPLY 1

avatar
Guru
The VirtualBox Guest additions are installed in the VM which should enable
drag & drop of files, but perhaps it's having issues with the size of the
files? SSH should also be running so scp is another option, as is a Shared
Folder. You'll need to get the file to be visible from the VM's filesystem,
perhaps unzip them at that point, and then you can use 'hadoop fs
-copyFromLocal' to put them in HDFS.