
Tips for better performance in cloudera-quickstart-vm-5.7.0-0-virtualbox

Hi,

 

I'm doing a small Big Data project using Hadoop on the cloudera-quickstart-vm-5.7.0-0-virtualbox VM.

I have a 22 GB file in HDFS. When I try to run a job in Pig, such as:

A = LOAD '/user/cloudera/file.csv';
DUMP A;

The job stays in status: Running for a long time. Is there any configuration I need to change to process all this data?

Thanks