Support Questions

Find answers, ask questions, and share your expertise
Announcements
Celebrating as our community reaches 100,000 members! Thank you!

Lab 3 - Pig Risk Factor Analysis: Application remains in running status

avatar
Rising Star

Hi,

Could you help me to run a pig script with successful status.

As mentioned in the Lab 3 exercise, geolocation table was created in hive and the pig script was created with only load transformation.

a = LOAD 'geolocation' using org.apache.hive.hcatalog.pig.HCatLoader();

DUMP a LIMIT 1;

Tried executing the script(not on Tez) arguments with -useHCatalog.

In the ResourceManagerUI the status show succeeded with Cantainers allocated 2. But, in the PIGUI the status is still RUNNING. It neither errors or return a result tab.

What might be the issue? How to check if the metadata exist in HCatalog? WebHCat server is in started status. Where should I look for Heap memory?

Thanks In advance for the support.

hive.png

service.png

pig-script.png

pig-status.png

resourcemanagerui.png

1 ACCEPTED SOLUTION

avatar
Master Mentor

Package is incorrect its USING org.apache.hive.hcatalog.pig.HCatLoader();

It was fixed in future release of sandbox use tez and make sure to use the HcatLoader I specified.

View solution in original post

9 REPLIES 9

avatar
Master Mentor
@Revathy Mourouguessane

1) Check MapReduce --> what are those alerts? Make sure that HDFS, Yarn and MapReduce services are up

2) It looks like your pig job did finish but check the pig view log

3)hive.png - You can see geolocation table , thats your hive table

avatar
Master Mentor

@Revathy Mourouguessane heap memory

You can click HDFS and in filter type heap

2287-screen-shot-2016-02-20-at-75259-am.png

avatar
Rising Star

Thank you.

I have tried with MapReduce2 and Tez2. The MapReduce2 is set to green. The heap size is same as in the attachement.

job-1456016544519-0006-logs.txt

job-1456016544519-0007-logs.txt

It errors with Could not resolve org.apache.hcatalog.pig.HCatLoader. In the arguments -useHCatalog is set.

ERROR org.apache.pig.PigServer - exception during parsing: Error during parsing. Could not resolve org.apache.hcatalog.pig.HCatLoader using imports: [, java.lang., org.apache.pig.builtin., org.apache.pig.impl.builtin.] Failed to parse: Pig script failed to parse: <file script.pig, line 1, column 29> pig script failed to validate: org.apache.pig.backend.executionengine.ExecException: ERROR 1070: Could not resolve org.apache.hcatalog.pig.HCatLoader using imports: [, java.lang., org.apache.pig.builtin., org.apache.pig.impl.builtin.] at org.apache.pig.parser.QueryParser

avatar
Master Mentor

@Revathy Mourouguessane Please see this

http://stackoverflow.com/questions/29990236/flag-usehcatalog-not-working

See this

I was facing the issue in HDP 2.3 and Pig 0.15 .

Package name for HCatLoader() class is different in Hortonworks distribution.

The following worked for me

USING org.apache.hive.hcatalog.pig.HCatLoader()

instead of USING org.apache.hcatalog.pig.HCatLoader();

avatar
Master Mentor

Make sure mapreduce is up if youre not executing using Tez. It is preferred if you use tez though as it is faster.

avatar
Rising Star

I tried both and get org.apache.hcatalog.pig.HCatLoader. The arguments of script is set as -useHCatalog. Thanks!!!

avatar
Master Mentor

Package is incorrect its USING org.apache.hive.hcatalog.pig.HCatLoader();

It was fixed in future release of sandbox use tez and make sure to use the HcatLoader I specified.

avatar
Rising Star

Thank you.

It is working fine now.

avatar
Rising Star

Thanks. Its working fine now.