<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: AvroStorage with mapreduce and java.lang.RuntimeException: could not instantiate in Archives of Support Questions (Read Only)</title>
    <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/AvroStorage-with-mapreduce-and-java-lang-RuntimeException/m-p/118042#M16919</link>
    <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/1997/lenovomi.html" nodeid="1997"&gt;@John Smith&lt;/A&gt; the code work-around works, I was running in tez mode by the way.&lt;/P&gt;&lt;PRE&gt;outputSet = foreach outputSet generate $0 as (name:chararray) , $1 as (customerId:chararray), $2 as (VIN:chararray) , $3 as (Birthdate:chararray), $4 as (Mileage:chararray) ,$5 as (Fuel_Consumption:chararray);

store outputSet into 'avroout2' using AvroStorage();&lt;/PRE&gt;&lt;P&gt;Input(s):
Successfully read 100 records (15099 bytes) from: "/user/root/Test-20160129-1401822-lake.avro"
Successfully read 100 records (12703 bytes) from: "/user/root/Test-20160129-1401822-ttp.avro"
Output(s):
Successfully stored 100 records (7703 bytes) in: "hdfs://sandbox.hortonworks.com:8020/user/root/avroout2"
grunt&amp;gt; 2016-01-29 18:04:19,978 [main] INFO  org.apache.pig.Main - Pig script completed in 1 minute, 52 seconds and 249 milliseconds (112249 ms)
2016-01-29 18:04:19,978 [main] INFO  org.apache.pig.backend.hadoop.executionengine.tez.TezLauncher - Shutting down thread pool
2016-01-29 18:04:20,008 [Thread-1] ERROR org.apache.pig.impl.io.FileLocalizer - java.io.IOException: Filesystem closed
2016-01-29 18:04:20,025 [Thread-23] INFO  org.apache.pig.backend.hadoop.executionengine.tez.TezSessionManager - Shutting down Tez session org.apache.tez.client.TezClient@2c8b16b6
2016-01-29 18:04:20,025 [Thread-23] INFO  org.apache.tez.client.TezClient - Shutting down Tez Session, sessionName=PigLatin:DefaultJobName, applicationId=application_1454090472993_0001
[root@sandbox pig-upload]# hdfs dfs -ls avroout2
Found 2 items
-rw-r--r--   3 root hdfs          0 2016-01-29 18:03 avroout2/_SUCCESS
-rw-r--r--   3 root hdfs       7703 2016-01-29 18:03 avroout2/part-v003-o000-r-00000.avro
[root@sandbox pig-upload]# hdfs dfs -cat avroout2/part-v003-o000-r-00000.avro | less&lt;/P&gt;</description>
    <pubDate>Sat, 30 Jan 2016 02:06:17 GMT</pubDate>
    <dc:creator>aervits</dc:creator>
    <dc:date>2016-01-30T02:06:17Z</dc:date>
    <item>
      <title>AvroStorage with mapreduce and java.lang.RuntimeException: could not instantiate</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/AvroStorage-with-mapreduce-and-java-lang-RuntimeException/m-p/118004#M16881</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;I have simple Pig script. Im trying to load avro file or directory that contains avro file using AvroStorage in Mapreduce mode. I tried almost all the combinations (hdfs://, / , hdfs://ip:port/file ... ) but nothing works.&lt;/P&gt;&lt;P&gt;Using command below&lt;/P&gt;&lt;PRE&gt;set = load '/spool-dir/CustomerData-20160128-1501807/' USING org.apache.pig.piggybank.storage.avro.AvroStorage ();&lt;/PRE&gt;&lt;P&gt;I got error:&lt;/P&gt;&lt;P&gt;2016-01-29 00:10:08,439 [main] ERROR org.apache.pig.tools.grunt.Grunt - ERROR 1066: Unable to open iterator for alias sensitiveSet. Backend error : java.lang.RuntimeException: could not instantiate 'org.apache.pig.piggybank.storage.avro.AvroStorage' with arguments 'null'
2016-01-29 00:10:08,439 [main] WARN  org.apache.pig.tools.grunt.Grunt - There is no log file to write to.
2016-01-29 00:10:08,439 [main] ERROR org.apache.pig.tools.grunt.Grunt - org.apache.pig.impl.logicalLayer.FrontendException: ERROR 1066: Unable to open iterator for alias sensitiveSet. Backend error : java.lang.RuntimeException: could not instantiate 'org.apache.pig.piggybank.storage.avro.AvroStorage' with arguments 'null'
  at org.apache.pig.PigServer.openIterator(PigServer.java:925)
  at org.apache.pig.tools.grunt.GruntParser.processDump(GruntParser.java:754)&lt;/P&gt;&lt;P&gt;or using command with argument&lt;/P&gt;&lt;PRE&gt;set = load '/spool-dir/CustomerData-20160128-1501807/' USING org.apache.pig.piggybank.storage.avro.AvroStorage('no_schema_check');&lt;/PRE&gt;&lt;P&gt;2016-01-29 00:25:02,767 [main] ERROR org.apache.pig.tools.grunt.Grunt - org.apache.pig.impl.logicalLayer.FrontendException: ERROR 1066: Unable to open iterator for alias sensitiveSet. Backend error :&lt;/P&gt;&lt;P&gt;java.lang.RuntimeException: could not instantiate 'org.apache.pig.piggybank.storage.avro.AvroStorage' with arguments&lt;/P&gt;&lt;P&gt;'[no_schema_check]' at org.apache.pig.PigServer.openIterator(PigServer.java:925&lt;/P&gt;&lt;P&gt;My samples are almost identical with the ones on the &lt;A href="https://cwiki.apache.org/confluence/display/PIG/AvroStorage"&gt;avrostorage documentation&lt;/A&gt;, but i really cant see where the problem is.&lt;/P&gt;&lt;P&gt;The problem is partially described on &lt;A href="https://stackoverflow.com/questions/35072885/avrostorage-with-mapreduce-and-java-lang-runtimeexception-could-not-instantiate"&gt;stackexchange &lt;/A&gt; also.&lt;/P&gt;&lt;P&gt;Thank you&lt;/P&gt;</description>
      <pubDate>Fri, 29 Jan 2016 11:32:11 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/AvroStorage-with-mapreduce-and-java-lang-RuntimeException/m-p/118004#M16881</guid>
      <dc:creator>lenovomi</dc:creator>
      <dc:date>2016-01-29T11:32:11Z</dc:date>
    </item>
    <item>
      <title>Re: AvroStorage with mapreduce and java.lang.RuntimeException: could not instantiate</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/AvroStorage-with-mapreduce-and-java-lang-RuntimeException/m-p/118005#M16882</link>
      <description>&lt;A rel="user" href="https://community.cloudera.com/users/1997/lenovomi.html" nodeid="1997"&gt;@John Smith&lt;/A&gt;&lt;P&gt;firstly set is a reserved word, change set to another alias, you can also refer to avro simply by AvroStorage no need to write out full package name. If all else fails, add register piggybank.jar command.&lt;/P&gt;</description>
      <pubDate>Fri, 29 Jan 2016 12:01:48 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/AvroStorage-with-mapreduce-and-java-lang-RuntimeException/m-p/118005#M16882</guid>
      <dc:creator>aervits</dc:creator>
      <dc:date>2016-01-29T12:01:48Z</dc:date>
    </item>
    <item>
      <title>Re: AvroStorage with mapreduce and java.lang.RuntimeException: could not instantiate</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/AvroStorage-with-mapreduce-and-java-lang-RuntimeException/m-p/118006#M16883</link>
      <description>&lt;P&gt;Hi, sorry the set was my mistypo here:&lt;/P&gt;&lt;P&gt;outSet = load 'hdfs:///CustomerData-20160128-1501807.avro' USING AvroStorage();&lt;/P&gt;&lt;P&gt;This command works, which is ODD, because whats the different when you call it as AvroStorage() or using full package path &lt;/P&gt;&lt;PRE&gt;org.apache.pig.piggybank.storage.avro.AvroStorage()&lt;/PRE&gt;</description>
      <pubDate>Fri, 29 Jan 2016 17:50:02 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/AvroStorage-with-mapreduce-and-java-lang-RuntimeException/m-p/118006#M16883</guid>
      <dc:creator>lenovomi</dc:creator>
      <dc:date>2016-01-29T17:50:02Z</dc:date>
    </item>
    <item>
      <title>Re: AvroStorage with mapreduce and java.lang.RuntimeException: could not instantiate</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/AvroStorage-with-mapreduce-and-java-lang-RuntimeException/m-p/118007#M16884</link>
      <description>&lt;P&gt;I have another issue with STORE now ....&lt;/P&gt;&lt;PRE&gt;STORE outputSet INTO 'hdfs:///avro-dest/-CustomerData-20160128-1501807'&amp;gt;&amp;gt; USING AvroStorage('no_schema_check', 'schema', '{"type":"record","name":"xxx","fields":[{"name":"name","type":"string","title":"Customer name","description":"non Surrogate Key for joining files on the BDP"}, ....]}');&lt;/PRE&gt;&lt;P&gt;error below:&lt;/P&gt;&lt;PRE&gt;2016-01-29 09:48:42,211 [main] ERROR org.apache.pig.tools.grunt.Grunt - ERROR 1200: Pig script failed to parse:&lt;/PRE&gt;&lt;PRE&gt;&amp;lt;line 20, column 0&amp;gt; pig script failed to validate: java.lang.RuntimeException: could not instantiate 'AvroStorage' with arguments '[no_schema_check, schema, {"type":"record",&lt;/PRE&gt;</description>
      <pubDate>Fri, 29 Jan 2016 17:52:08 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/AvroStorage-with-mapreduce-and-java-lang-RuntimeException/m-p/118007#M16884</guid>
      <dc:creator>lenovomi</dc:creator>
      <dc:date>2016-01-29T17:52:08Z</dc:date>
    </item>
    <item>
      <title>Re: AvroStorage with mapreduce and java.lang.RuntimeException: could not instantiate</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/AvroStorage-with-mapreduce-and-java-lang-RuntimeException/m-p/118008#M16885</link>
      <description>&lt;P&gt;ok so STORE works only with &lt;/P&gt;&lt;PRE&gt;org.apache.pig.piggybank.storage.avro.AvroStorage(.... )&lt;/PRE&gt;&lt;P&gt;But there are still issues while trying to write output file &lt;/P&gt;&lt;PRE&gt;2016-01-29 10:09:28,406 [main] INFO  org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MapReduceLauncher - HadoopJobId: job_1454023575813_0018
2016-01-29 10:09:28,406 [main] INFO  org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MapReduceLauncher - Processing aliases outputSet
2016-01-29 10:09:28,406 [main] INFO  org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MapReduceLauncher - detailed locations: M:  C:  R: outputSet[19,12]
2016-01-29 10:10:03,931 [main] WARN  org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MapReduceLauncher - Ooops! Some job has failed! Specify -stop_on_failure if you want Pig to stop immediately on failure.
2016-01-29 10:10:03,931 [main] INFO  org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MapReduceLauncher - job job_1454023575813_0018 has failed! Stop running all dependent jobs
2016-01-29 10:10:03,931 [main] INFO  org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MapReduceLauncher - 100% complete
2016-01-29 10:10:06,256 [main] INFO  org.apache.hadoop.yarn.client.api.impl.TimelineClientImpl - Timeline service address: &lt;A href="http://sandbox.hortonworks.com:8188/ws/v1/timeline/" target="_blank"&gt;http://sandbox.hortonworks.com:8188/ws/v1/timeline/&lt;/A&gt;
2016-01-29 10:10:06,257 [main] INFO  org.apache.hadoop.yarn.client.RMProxy - Connecting to ResourceManager at sandbox.hortonworks.com/10.0.1.47:8050
2016-01-29 10:10:07,417 [main] INFO  org.apache.hadoop.yarn.client.api.impl.TimelineClientImpl - Timeline service address: &lt;A href="http://sandbox.hortonworks.com:8188/ws/v1/timeline/" target="_blank"&gt;http://sandbox.hortonworks.com:8188/ws/v1/timeline/&lt;/A&gt;
2016-01-29 10:10:07,417 [main] INFO  org.apache.hadoop.yarn.client.RMProxy - Connecting to ResourceManager at sandbox.hortonworks.com/10.0.1.47:8050
2016-01-29 10:10:07,577 [main] ERROR org.apache.pig.tools.pigstats.mapreduce.MRPigStatsUtil - 1 map reduce job(s) failed!
2016-01-29 10:10:07,585 [main] INFO  org.apache.pig.tools.pigstats.mapreduce.SimplePigStats - Script Statistics:

&lt;/PRE&gt;&lt;PRE&gt;Failed Jobs:
JobId  Alias  Feature Message Outputs
job_1454023575813_0018  outputSet  DISTINCT  Message: Job failed!  hdfs:///avro-dest/CustomerData-20160128-1501807,

Output(s) Failed to produce result in "hdfs:///avro-dest/CustomerData-20160128-1501807"
&lt;/PRE&gt;&lt;P&gt;Well i really dont understand whats going on here ... no proper documentation, for me random behavior its really hard to use the tool like that.&lt;/P&gt;</description>
      <pubDate>Fri, 29 Jan 2016 18:16:12 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/AvroStorage-with-mapreduce-and-java-lang-RuntimeException/m-p/118008#M16885</guid>
      <dc:creator>lenovomi</dc:creator>
      <dc:date>2016-01-29T18:16:12Z</dc:date>
    </item>
    <item>
      <title>Re: AvroStorage with mapreduce and java.lang.RuntimeException: could not instantiate</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/AvroStorage-with-mapreduce-and-java-lang-RuntimeException/m-p/118009#M16886</link>
      <description>&lt;P&gt;And now it says ... that i cant read data.. both files are there ... even previous run was Successful with reading the source data... Well im so desperate, this is like working with random turing machine. ;-(&lt;/P&gt;&lt;P&gt;How it can fail to read data .... i can easily dump both relations that read data from those input files.&lt;/P&gt;&lt;PRE&gt;
Input(s): &lt;/PRE&gt;&lt;PRE&gt;Failed to read data from "hdfs:///CustomerData-20160128-1501807.avro" &lt;/PRE&gt;&lt;PRE&gt;Failed to read data from "hdfs:///CustomerData-20160128-1501807.avro"&lt;/PRE&gt;&lt;PRE&gt;
&lt;/PRE&gt;&lt;PRE&gt;Output(s):
Failed to produce result in "hdfs:///CustomerData-20160128-1501807"&lt;/PRE&gt;</description>
      <pubDate>Fri, 29 Jan 2016 18:30:47 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/AvroStorage-with-mapreduce-and-java-lang-RuntimeException/m-p/118009#M16886</guid>
      <dc:creator>lenovomi</dc:creator>
      <dc:date>2016-01-29T18:30:47Z</dc:date>
    </item>
    <item>
      <title>Re: AvroStorage with mapreduce and java.lang.RuntimeException: could not instantiate</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/AvroStorage-with-mapreduce-and-java-lang-RuntimeException/m-p/118010#M16887</link>
      <description>&lt;P&gt;Still failing ;-(&lt;/P&gt;&lt;P&gt;Failed Jobs:&lt;/P&gt;&lt;P&gt;JobId  Alias  Feature Message Outputs &lt;/P&gt;&lt;P&gt;job_1454023575813_0027  outputSet  DISTINCT  Message: Job failed!  /CustomerData-20160128-1501807,&lt;/P&gt;&lt;P&gt;
Input(s): &lt;/P&gt;&lt;P&gt;Successfully read 100 records from: "/CustomerData-20160128-1501807-l.avro" &lt;/P&gt;&lt;P&gt;Successfully read 100 records from: "/CustomerData-20160128-1501807-t.avro"&lt;/P&gt;&lt;P&gt;Output(s):
Failed to produce result in "/avro-dest/CustomerData-20160128-1501807"&lt;/P&gt;</description>
      <pubDate>Fri, 29 Jan 2016 18:54:47 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/AvroStorage-with-mapreduce-and-java-lang-RuntimeException/m-p/118010#M16887</guid>
      <dc:creator>lenovomi</dc:creator>
      <dc:date>2016-01-29T18:54:47Z</dc:date>
    </item>
    <item>
      <title>Re: AvroStorage with mapreduce and java.lang.RuntimeException: could not instantiate</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/AvroStorage-with-mapreduce-and-java-lang-RuntimeException/m-p/118011#M16888</link>
      <description>&lt;P&gt;here is the full log: &lt;A href="http://paste.debian.net/377423/"&gt;log&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Fri, 29 Jan 2016 19:04:14 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/AvroStorage-with-mapreduce-and-java-lang-RuntimeException/m-p/118011#M16888</guid>
      <dc:creator>lenovomi</dc:creator>
      <dc:date>2016-01-29T19:04:14Z</dc:date>
    </item>
    <item>
      <title>Re: AvroStorage with mapreduce and java.lang.RuntimeException: could not instantiate</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/AvroStorage-with-mapreduce-and-java-lang-RuntimeException/m-p/118012#M16889</link>
      <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/1997/lenovomi.html" nodeid="1997"&gt;@John Smith&lt;/A&gt; AvroStorage may have different package now though I confirmed with javadoc and I was the same as yours, it may be packaged differently in HDP, classpath may differ, don't know for sure. Please accept this answer.&lt;/P&gt;</description>
      <pubDate>Fri, 29 Jan 2016 20:26:03 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/AvroStorage-with-mapreduce-and-java-lang-RuntimeException/m-p/118012#M16889</guid>
      <dc:creator>aervits</dc:creator>
      <dc:date>2016-01-29T20:26:03Z</dc:date>
    </item>
    <item>
      <title>Re: AvroStorage with mapreduce and java.lang.RuntimeException: could not instantiate</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/AvroStorage-with-mapreduce-and-java-lang-RuntimeException/m-p/118013#M16890</link>
      <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/1997/lenovomi.html" nodeid="1997"&gt;@John Smith&lt;/A&gt; I'll review and let you know. &lt;/P&gt;</description>
      <pubDate>Fri, 29 Jan 2016 20:28:48 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/AvroStorage-with-mapreduce-and-java-lang-RuntimeException/m-p/118013#M16890</guid>
      <dc:creator>aervits</dc:creator>
      <dc:date>2016-01-29T20:28:48Z</dc:date>
    </item>
    <item>
      <title>Re: AvroStorage with mapreduce and java.lang.RuntimeException: could not instantiate</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/AvroStorage-with-mapreduce-and-java-lang-RuntimeException/m-p/118014#M16891</link>
      <description>&lt;P&gt;Ok, waiting on your results! Thank you&lt;/P&gt;</description>
      <pubDate>Fri, 29 Jan 2016 20:42:14 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/AvroStorage-with-mapreduce-and-java-lang-RuntimeException/m-p/118014#M16891</guid>
      <dc:creator>lenovomi</dc:creator>
      <dc:date>2016-01-29T20:42:14Z</dc:date>
    </item>
    <item>
      <title>Re: AvroStorage with mapreduce and java.lang.RuntimeException: could not instantiate</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/AvroStorage-with-mapreduce-and-java-lang-RuntimeException/m-p/118015#M16892</link>
      <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/393/aervits.html" nodeid="393"&gt;@Artem Ervits&lt;/A&gt; here you can find one more output - &amp;gt; sources were read successfully but output failed, &lt;/P&gt;&lt;P&gt;&lt;A href="http://paste.debian.net/377433/" target="_blank"&gt;http://paste.debian.net/377433/&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Fri, 29 Jan 2016 20:45:24 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/AvroStorage-with-mapreduce-and-java-lang-RuntimeException/m-p/118015#M16892</guid>
      <dc:creator>lenovomi</dc:creator>
      <dc:date>2016-01-29T20:45:24Z</dc:date>
    </item>
    <item>
      <title>Re: AvroStorage with mapreduce and java.lang.RuntimeException: could not instantiate</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/AvroStorage-with-mapreduce-and-java-lang-RuntimeException/m-p/118016#M16893</link>
      <description>&lt;P&gt;it is failing to write output avro file but log says:&lt;/P&gt;&lt;P&gt;web-log says&lt;/P&gt;&lt;P&gt;Application Overview&lt;/P&gt;&lt;P&gt;
User:    hdfs&lt;/P&gt;&lt;P&gt;
Name:    PigLatin:pigMerger.pig&lt;/P&gt;&lt;P&gt;
Application Type:    MAPREDUCE&lt;/P&gt;&lt;P&gt;
Application Tags: &lt;/P&gt;&lt;P&gt;
YarnApplicationState:    FINISHED&lt;/P&gt;&lt;P&gt;
Queue:    default&lt;/P&gt;&lt;P&gt;
FinalStatus Reported by AM:    SUCCEEDED&lt;/P&gt;&lt;P&gt;
Started:    Fri Jan 29 12:59:25 +0000 2016&lt;/P&gt;&lt;P&gt;
Elapsed:    4mins, 29sec&lt;/P&gt;&lt;P&gt;
Tracking URL:    History&lt;/P&gt;&lt;P&gt;
Log Aggregation Status    SUCCEEDED&lt;/P&gt;&lt;P&gt;
Diagnostics:   &lt;/P&gt;</description>
      <pubDate>Fri, 29 Jan 2016 21:52:25 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/AvroStorage-with-mapreduce-and-java-lang-RuntimeException/m-p/118016#M16893</guid>
      <dc:creator>lenovomi</dc:creator>
      <dc:date>2016-01-29T21:52:25Z</dc:date>
    </item>
    <item>
      <title>Re: AvroStorage with mapreduce and java.lang.RuntimeException: could not instantiate</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/AvroStorage-with-mapreduce-and-java-lang-RuntimeException/m-p/118017#M16894</link>
      <description>&lt;P&gt;one more log: &lt;/P&gt;&lt;P&gt;&lt;A href="http://paste.debian.net/377443/"&gt;log&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Fri, 29 Jan 2016 21:55:31 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/AvroStorage-with-mapreduce-and-java-lang-RuntimeException/m-p/118017#M16894</guid>
      <dc:creator>lenovomi</dc:creator>
      <dc:date>2016-01-29T21:55:31Z</dc:date>
    </item>
    <item>
      <title>Re: AvroStorage with mapreduce and java.lang.RuntimeException: could not instantiate</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/AvroStorage-with-mapreduce-and-java-lang-RuntimeException/m-p/118018#M16895</link>
      <description>&lt;P&gt;can you paste sample dataset and pig script, I'll try to reproduce sometime today on my machine. It's hard to see the issue from the logs. &lt;A rel="user" href="https://community.cloudera.com/users/1997/lenovomi.html" nodeid="1997"&gt;@John Smith&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Fri, 29 Jan 2016 21:56:47 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/AvroStorage-with-mapreduce-and-java-lang-RuntimeException/m-p/118018#M16895</guid>
      <dc:creator>aervits</dc:creator>
      <dc:date>2016-01-29T21:56:47Z</dc:date>
    </item>
    <item>
      <title>Re: AvroStorage with mapreduce and java.lang.RuntimeException: could not instantiate</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/AvroStorage-with-mapreduce-and-java-lang-RuntimeException/m-p/118019#M16896</link>
      <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/393/aervits.html" nodeid="393"&gt;@Artem Ervits &lt;/A&gt;source files and pig script are included &lt;A href="https://drive.google.com/file/d/0B6RZ_9vVuTEcd01aWm9zczNUUWc/view?usp=sharing"&gt;Data&lt;/A&gt; , thanks&lt;/P&gt;</description>
      <pubDate>Fri, 29 Jan 2016 22:21:39 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/AvroStorage-with-mapreduce-and-java-lang-RuntimeException/m-p/118019#M16896</guid>
      <dc:creator>lenovomi</dc:creator>
      <dc:date>2016-01-29T22:21:39Z</dc:date>
    </item>
    <item>
      <title>Re: AvroStorage with mapreduce and java.lang.RuntimeException: could not instantiate</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/AvroStorage-with-mapreduce-and-java-lang-RuntimeException/m-p/118020#M16897</link>
      <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/1997/lenovomi.html" nodeid="1997"&gt;@John Smith&lt;/A&gt; I highly recommend you develop your scripts in Pig Grunt shell. This is what happened with your script as I was trying to execute it one by one.&lt;/P&gt;&lt;PRE&gt;grunt&amp;gt; sensitiveSet = load '/user/root/Test-20160129-1401822-ttp.avro' using AvroStorage();
grunt&amp;gt; outputSet = join sensitiveSet by Row_ID, nonSensSet by Row_ID;
grunt&amp;gt; outputSet = distinct outputSet;
grunt&amp;gt; outputSet = foreach outputSet generate nonSensSet::name,nonSensSet::customerId,sensitiveSet::VIN,sensitiveSet::Birthdate,nonSensSet::Mileage,nonSensSet::Fuel_Consumption;
2016-01-29 14:41:59,228 [main] ERROR org.apache.pig.tools.grunt.Grunt - ERROR 1025:
&amp;lt;line 5, column 79&amp;gt; Invalid field projection. Projected field [sensitiveSet::VIN] does not exist in schema: sensitiveSet::Row_ID:long,sensitiveSet::name:chararray,sensitiveSet::customerId:chararray,sensitiveSet::Mileage:chararray,sensitiveSet::Fuel_Consumption:chararray,nonSensSet::Row_ID:long,nonSensSet::name:chararray,nonSensSet::customerId:chararray,nonSensSet::Mileage:chararray,nonSensSet::Fuel_Consumption:chararray.
Details at logfile: /root/pig-upload/pig_1454078371113.log


grunt&amp;gt; describe sensitiveSet;
sensitiveSet: {Row_ID: long,name: chararray,customerId: chararray,Mileage: chararray,Fuel_Consumption: chararray}
&lt;/PRE&gt;</description>
      <pubDate>Fri, 29 Jan 2016 22:44:05 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/AvroStorage-with-mapreduce-and-java-lang-RuntimeException/m-p/118020#M16897</guid>
      <dc:creator>aervits</dc:creator>
      <dc:date>2016-01-29T22:44:05Z</dc:date>
    </item>
    <item>
      <title>Re: AvroStorage with mapreduce and java.lang.RuntimeException: could not instantiate</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/AvroStorage-with-mapreduce-and-java-lang-RuntimeException/m-p/118021#M16898</link>
      <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/1997/lenovomi.html" nodeid="1997"&gt;@John Smith&lt;/A&gt; birthdate also doesn't exist&lt;/P&gt;&lt;PRE&gt;&amp;lt;line 5, column 79&amp;gt; Invalid field projection. Projected field [sensitiveSet::Birthdate] does not exist in schema: sensitiveSet::Row_ID:long,sensitiveSet::name:chararray,sensitiveSet::customerId:chararray,sensitiveSet::Mileage:chararray,sensitiveSet::Fuel_Consumption:chararray,nonSensSet::Row_ID:long,nonSensSet::name:chararray,nonSensSet::customerId:chararray,nonSensSet::Mileage:chararray,nonSensSet::Fuel_Consumption:chararray.
Details at logfile: /root/pig-upload/pig_1454078371113.log

-- this works

grunt&amp;gt; outputSet = foreach outputSet generate nonSensSet::name,nonSensSet::customerId,nonSensSet::Mileage,nonSensSet::Fuel_Consumption;&lt;/PRE&gt;</description>
      <pubDate>Fri, 29 Jan 2016 22:45:48 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/AvroStorage-with-mapreduce-and-java-lang-RuntimeException/m-p/118021#M16898</guid>
      <dc:creator>aervits</dc:creator>
      <dc:date>2016-01-29T22:45:48Z</dc:date>
    </item>
    <item>
      <title>Re: AvroStorage with mapreduce and java.lang.RuntimeException: could not instantiate</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/AvroStorage-with-mapreduce-and-java-lang-RuntimeException/m-p/118022#M16899</link>
      <description>&lt;A rel="user" href="https://community.cloudera.com/users/1997/lenovomi.html" nodeid="1997"&gt;@John Smith&lt;/A&gt;&lt;P&gt;ok it was a problem with me, I copied the same file twice. nevermind that issue, I'm still looking. &lt;/P&gt;</description>
      <pubDate>Fri, 29 Jan 2016 23:06:16 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/AvroStorage-with-mapreduce-and-java-lang-RuntimeException/m-p/118022#M16899</guid>
      <dc:creator>aervits</dc:creator>
      <dc:date>2016-01-29T23:06:16Z</dc:date>
    </item>
    <item>
      <title>Re: AvroStorage with mapreduce and java.lang.RuntimeException: could not instantiate</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/AvroStorage-with-mapreduce-and-java-lang-RuntimeException/m-p/118023#M16900</link>
      <description>&lt;P&gt;thats strange... it works for me.&lt;/P&gt;&lt;PRE&gt;grunt&amp;gt; sensitiveSet = load '/t-spool-dir/Test-20160129-1401822-ttp.avro' USING AvroStorage();

grunt&amp;gt; nonSensSet = load '/d-spool-dir/Test-20160129-1401822-lake.avro' USING AvroStorage();

grunt&amp;gt; outputSet = join sensitiveSet by Row_ID, nonSensSet by Row_ID;grunt&amp;gt; outputSet = distinct outputSet;

grunt&amp;gt; outputSet = foreach outputSet generate nonSensSet::name,nonSensSet::customerId,sensitiveSet::VIN,sensitiveSet::Birthdate,nonSensSet::Mileage,nonSensSet::Fuel_Consumption;grunt&amp;gt; 

dump outputSet;&lt;/PRE&gt;&lt;P&gt;
("Kina Buttars",12452346,"WBA32649710927373","1968-08-14",68,10.551)&lt;/P&gt;&lt;P&gt;
("Caren Rodman",18853438,"WBA56064572124841","1987-01-24",96,6.779)&lt;/P&gt;&lt;P&gt;
("Tierra Bork",89673290,"WBA69315467645466","1958-11-22",52,10.109)&lt;/P&gt;&lt;P&gt;
("Thelma Steve",97170856,"WBA73739033913927","1985-12-03",98,5.081)&lt;/P&gt;&lt;P&gt;.....&lt;/P&gt;</description>
      <pubDate>Fri, 29 Jan 2016 23:09:55 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/AvroStorage-with-mapreduce-and-java-lang-RuntimeException/m-p/118023#M16900</guid>
      <dc:creator>lenovomi</dc:creator>
      <dc:date>2016-01-29T23:09:55Z</dc:date>
    </item>
  </channel>
</rss>

