I'm trying to load a dataset (280GB) using the Phoenix csv bulk load tool on a HDInsight Hbase cluster. The job fails with the following error:
18/02/23 06:09:10 INFO mapreduce.Job: Task Id : attempt_1519326441231_0004_m_000067_0, Status : FAILEDError: Java heap spaceContainer killed by the ApplicationMaster.Container killed on request. Exit code is 143Container exited with a non-zero exit code 143
Here's my cluster configuration:
8 cores, 56 GB RAM, 1.5TB HDD
4 cores, 28GB, 1.5TB HDD
Can anyone please help me troubleshoot this issue?