<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: PigStorage in mapreduce mode in Support Questions</title>
    <link>https://community.cloudera.com/t5/Support-Questions/PigStorage-in-mapreduce-mode/m-p/127383#M90098</link>
    <description>&lt;A rel="user" href="https://community.cloudera.com/users/1997/lenovomi.html" nodeid="1997"&gt;@John Smith&lt;/A&gt;&lt;P&gt;Something is wrong with your environment; I was able to execute your statements in both mapred and tez modes.&lt;/P&gt;&lt;PRE&gt;grunt&amp;gt; sourceData = load 'CustomerData.csv' using PigStorage(';') as (nullname: chararray,customerId: chararray,VIN: chararray,Birthdate: chararray,Mileage: chararray,Fuel_Consumption: chararray);
grunt&amp;gt; describe sourceData;
sourceData: {nullname: chararray,customerId: chararray,VIN: chararray,Birthdate: chararray,Mileage: chararray,Fuel_Consumption: chararray}
grunt&amp;gt; b = limit sourceData 5;
grunt&amp;gt; dump b;
2016-02-05 02:43:32,930 [main] INFO  org.apache.pig.tools.pigstats.ScriptState - Pig features used in the script: LIMIT
2016-02-05 02:43:33,105 [main] WARN  org.apache.pig.data.SchemaTupleBackend - SchemaTupleBackend has already been initialized
2016-02-05 02:43:33,106 [main] INFO  org.apache.pig.newplan.logical.optimizer.LogicalPlanOptimizer - {RULES_ENABLED=[AddForEach, ColumnMapKeyPrune, ConstantCalculator, GroupByConstParallelSetter, LimitOptimizer, LoadTypeCastInserter, MergeFilter, MergeForEach, PartitionFilterOptimizer, PredicatePushdownOptimizer, PushDownForEachFlatten, PushUpFilter, SplitFilter, StreamTypeCastInserter]}
2016-02-05 02:43:33,179 [main] INFO  org.apache.hadoop.mapreduce.lib.output.FileOutputCommitter - File Output Committer Algorithm version is 1
2016-02-05 02:43:33,179 [main] INFO  org.apache.hadoop.mapreduce.lib.output.FileOutputCommitter - FileOutputCommitter skip cleanup _temporary folders under output directory:false, ignore cleanup failures: false
2016-02-05 02:43:33,209 [main] INFO  org.apache.pig.data.SchemaTupleBackend - Key [pig.schematuple] was not set... will not generate code.
2016-02-05 02:43:33,256 [main] INFO  org.apache.hadoop.mapreduce.lib.input.FileInputFormat - Total input paths to process : 1
2016-02-05 02:43:33,257 [main] INFO  org.apache.pig.backend.hadoop.executionengine.util.MapRedUtil - Total input paths to process : 1
2016-02-05 02:43:33,305 [main] INFO  org.apache.hadoop.mapreduce.lib.output.FileOutputCommitter - Saved output of task 'attempt__0001_m_000001_1' to hdfs://c6401.ambari.apache.org:8020/tmp/temp2063345867/tmp1865027526/_temporary/0/task__0001_m_000001
2016-02-05 02:43:33,333 [main] WARN  org.apache.pig.data.SchemaTupleBackend - SchemaTupleBackend has already been initialized
2016-02-05 02:43:33,336 [main] INFO  org.apache.hadoop.mapreduce.lib.input.FileInputFormat - Total input paths to process : 1
2016-02-05 02:43:33,336 [main] INFO  org.apache.pig.backend.hadoop.executionengine.util.MapRedUtil - Total input paths to process : 1
("Ronni Engelmann",93117643,"WBA68251082969954","1971-11-15",41,10.26)
("Kina Buttars",12452346,"WBA32649710927373","1968-08-14",68,10.551)
("Caren Rodman",18853438,"WBA56064572124841","1987-01-24",96,6.779)
("Tierra Bork",89673290,"WBA69315467645466","1958-11-22",52,10.109)
("Thelma Steve",97170856,"WBA73739033913927","1985-12-03",98,5.081)
&lt;/PRE&gt;&lt;P&gt;and output of illustrate in mapred mode&lt;/P&gt;&lt;PRE&gt;("Leonel Bullen",50258523,"WBA23530058599244","1986-08-26",27,8.673)
2016-02-05 02:43:47,393 [main] INFO  org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MRCompiler - File concatenation threshold: 100 optimistic? false
2016-02-05 02:43:47,393 [main] INFO  org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MultiQueryOptimizer - MR plan size before optimization: 1
2016-02-05 02:43:47,393 [main] INFO  org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MultiQueryOptimizer - MR plan size after optimization: 1
2016-02-05 02:43:47,394 [main] INFO  org.apache.pig.tools.pigstats.mapreduce.MRScriptState - Pig script settings are added to the job
2016-02-05 02:43:47,401 [main] INFO  org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.JobControlCompiler - mapred.job.reduce.markreset.buffer.percent is not set, set to default 0.3
2016-02-05 02:43:47,452 [main] WARN  org.apache.pig.data.SchemaTupleBackend - SchemaTupleBackend has already been initialized
2016-02-05 02:43:47,459 [main] INFO  org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.PigMapOnly$Map - Aliases being processed per job phase (AliasName[line,offset]): M: sourceData[4,13] C:  R:
2016-02-05 02:43:47,459 [main] INFO  org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MRCompiler - File concatenation threshold: 100 optimistic? false
2016-02-05 02:43:47,470 [main] INFO  org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MultiQueryOptimizer - MR plan size before optimization: 1
2016-02-05 02:43:47,470 [main] INFO  org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MultiQueryOptimizer - MR plan size after optimization: 1
2016-02-05 02:43:47,473 [main] INFO  org.apache.pig.tools.pigstats.mapreduce.MRScriptState - Pig script settings are added to the job
2016-02-05 02:43:47,473 [main] INFO  org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.JobControlCompiler - mapred.job.reduce.markreset.buffer.percent is not set, set to default 0.3
2016-02-05 02:43:47,520 [main] WARN  org.apache.pig.data.SchemaTupleBackend - SchemaTupleBackend has already been initialized
2016-02-05 02:43:47,536 [main] INFO  org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.PigMapOnly$Map - Aliases being processed per job phase (AliasName[line,offset]): M: sourceData[4,13] C:  R:
2016-02-05 02:43:47,538 [main] INFO  org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MRCompiler - File concatenation threshold: 100 optimistic? false
2016-02-05 02:43:47,542 [main] INFO  org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MultiQueryOptimizer - MR plan size before optimization: 1
2016-02-05 02:43:47,542 [main] INFO  org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MultiQueryOptimizer - MR plan size after optimization: 1
2016-02-05 02:43:47,545 [main] INFO  org.apache.pig.tools.pigstats.mapreduce.MRScriptState - Pig script settings are added to the job
2016-02-05 02:43:47,545 [main] INFO  org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.JobControlCompiler - mapred.job.reduce.markreset.buffer.percent is not set, set to default 0.3
2016-02-05 02:43:47,589 [main] WARN  org.apache.pig.data.SchemaTupleBackend - SchemaTupleBackend has already been initialized
2016-02-05 02:43:47,606 [main] INFO  org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.PigMapOnly$Map - Aliases being processed per job phase (AliasName[line,offset]): M: sourceData[4,13] C:  R:
2016-02-05 02:43:47,606 [main] INFO  org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MRCompiler - File concatenation threshold: 100 optimistic? false
2016-02-05 02:43:47,612 [main] INFO  org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MultiQueryOptimizer - MR plan size before optimization: 1
2016-02-05 02:43:47,612 [main] INFO  org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MultiQueryOptimizer - MR plan size after optimization: 1
2016-02-05 02:43:47,613 [main] INFO  org.apache.pig.tools.pigstats.mapreduce.MRScriptState - Pig script settings are added to the job
2016-02-05 02:43:47,613 [main] INFO  org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.JobControlCompiler - mapred.job.reduce.markreset.buffer.percent is not set, set to default 0.3
2016-02-05 02:43:47,665 [main] WARN  org.apache.pig.data.SchemaTupleBackend - SchemaTupleBackend has already been initialized
2016-02-05 02:43:47,668 [main] INFO  org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.PigMapOnly$Map - Aliases being processed per job phase (AliasName[line,offset]): M: sourceData[4,13] C:  R:
-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
| sourceData     | nullname:chararray     | customerId:chararray     | VIN:chararray       | Birthdate:chararray     | Mileage:chararray     | Fuel_Consumption:chararray     |
-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
|                | "Leonel Bullen"        | 50258523                 | "WBA23530058599244" | "1986-08-26"            | 27                    | 8.673                          |
-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
&lt;/PRE&gt;&lt;P&gt;and in tez mode&lt;/P&gt;&lt;PRE&gt;grunt&amp;gt; sourceData = load 'CustomerData.csv' using PigStorage(';') as (nullname: chararray,customerId: chararray,VIN: chararray,Birthdate: chararray,Mileage: chararray,Fuel_Consumption: chararray);
grunt&amp;gt; describe sourceData;
sourceData: {nullname: chararray,customerId: chararray,VIN: chararray,Birthdate: chararray,Mileage: chararray,Fuel_Consumption: chararray}
grunt&amp;gt; b = limit sourceData 5;
grunt&amp;gt; dump b;
2016-02-05 02:46:17,619 [main] INFO  org.apache.pig.tools.pigstats.ScriptState - Pig features used in the script: LIMIT
2016-02-05 02:46:17,698 [main] INFO  org.apache.pig.data.SchemaTupleBackend - Key [pig.schematuple] was not set... will not generate code.
2016-02-05 02:46:17,749 [main] INFO  org.apache.pig.newplan.logical.optimizer.LogicalPlanOptimizer - {RULES_ENABLED=[AddForEach, ColumnMapKeyPrune, ConstantCalculator, GroupByConstParallelSetter, LimitOptimizer, LoadTypeCastInserter, MergeFilter, MergeForEach, PartitionFilterOptimizer, PredicatePushdownOptimizer, PushDownForEachFlatten, PushUpFilter, SplitFilter, StreamTypeCastInserter]}
2016-02-05 02:46:18,039 [main] INFO  org.apache.hadoop.mapreduce.lib.output.FileOutputCommitter - File Output Committer Algorithm version is 1
2016-02-05 02:46:18,039 [main] INFO  org.apache.hadoop.mapreduce.lib.output.FileOutputCommitter - FileOutputCommitter skip cleanup _temporary folders under output directory:false, ignore cleanup failures: false
2016-02-05 02:46:18,143 [main] INFO  org.apache.pig.data.SchemaTupleBackend - Key [pig.schematuple] was not set... will not generate code.
2016-02-05 02:46:18,274 [main] INFO  org.apache.hadoop.mapreduce.lib.input.FileInputFormat - Total input paths to process : 1
2016-02-05 02:46:18,288 [main] INFO  org.apache.pig.backend.hadoop.executionengine.util.MapRedUtil - Total input paths to process : 1
2016-02-05 02:46:18,711 [main] INFO  org.apache.hadoop.mapreduce.lib.output.FileOutputCommitter - Saved output of task 'attempt__0001_m_000001_1' to hdfs://c6401.ambari.apache.org:8020/tmp/temp-785652652/tmp136925164/_temporary/0/task__0001_m_000001
2016-02-05 02:46:18,782 [main] WARN  org.apache.pig.data.SchemaTupleBackend - SchemaTupleBackend has already been initialized
2016-02-05 02:46:18,811 [main] INFO  org.apache.hadoop.mapreduce.lib.input.FileInputFormat - Total input paths to process : 1
2016-02-05 02:46:18,811 [main] INFO  org.apache.pig.backend.hadoop.executionengine.util.MapRedUtil - Total input paths to process : 1
("Ronni Engelmann",93117643,"WBA68251082969954","1971-11-15",41,10.26)
("Kina Buttars",12452346,"WBA32649710927373","1968-08-14",68,10.551)
("Caren Rodman",18853438,"WBA56064572124841","1987-01-24",96,6.779)
("Tierra Bork",89673290,"WBA69315467645466","1958-11-22",52,10.109)
("Thelma Steve",97170856,"WBA73739033913927","1985-12-03",98,5.081)
&lt;/PRE&gt;&lt;P&gt;There is a bug where the illustrate command doesn't work in tez mode yet.&lt;/P&gt;&lt;P&gt;Bottom line: I tested it with 'CustomerData.csv', and also with '/user/root/CustomerData.csv' and 'hdfs://fqdn:8020/user/root/CustomerData.csv'.&lt;/P&gt;</description>
    <pubDate>Fri, 05 Feb 2016 10:49:43 GMT</pubDate>
    <dc:creator>aervits</dc:creator>
    <dc:date>2016-02-05T10:49:43Z</dc:date>
    <item>
      <title>PigStorage in mapreduce mode</title>
      <link>https://community.cloudera.com/t5/Support-Questions/PigStorage-in-mapreduce-mode/m-p/127377#M90092</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;I am trying to execute a Pig script in mapreduce mode. The script is simple:&lt;/P&gt;&lt;PRE&gt;grunt&amp;gt; sourceData = load 'hdfs://sandbox.hortonworks.com:8020/src/CustomerData.csv' using PigStorage(';') as (nullname: chararray,customerId: chararray,VIN: chararray,Birthdate: chararray,Mileage: chararray,Fuel_Consumption: chararray);&lt;/PRE&gt;&lt;P&gt;The file is stored in HDFS:&lt;/P&gt;&lt;PRE&gt;hadoop fs -ls hdfs://sandbox.hortonworks.com:8020/src/CustomerData.csv

-rw-r--r--  3 hdfs hdfs  6828 2016-02-04 23:55 hdfs://sandbox.hortonworks.com:8020/src/CustomerData.csv&lt;/PRE&gt;&lt;P&gt;The error that I got:&lt;/P&gt;&lt;P&gt;Failed Jobs:
JobId  Alias  Feature Message Outputs
job_1454609613558_0003  sourceData  MAP_ONLY  Message: Job failed!  hdfs://sandbox.hortonworks.com:8020/tmp/temp-710368608/tmp-1611282262,&lt;/P&gt;&lt;P&gt;Input(s):
Failed to read data from "hdfs://sandbox.hortonworks.com:8020/src/CustomerData.csv"&lt;/P&gt;&lt;P&gt;Output(s):
Failed to produce result in "hdfs://sandbox.hortonworks.com:8020/tmp/temp-710368608/tmp-1611282262"&lt;/P&gt;&lt;PRE&gt;Pig Stack Trace
---------------
ERROR 1066: Unable to open iterator for alias sourceData

org.apache.pig.impl.logicalLayer.FrontendException: ERROR 1066: Unable to open iterator for alias sourceData
  at org.apache.pig.PigServer.openIterator(PigServer.java:935)
  at org.apache.pig.tools.grunt.GruntParser.processDump(GruntParser.java:754)
  at org.apache.pig.tools.pigscript.parser.PigScriptParser.parse(PigScriptParser.java:376)
  at org.apache.pig.tools.grunt.GruntParser.parseStopOnError(GruntParser.java:230)
  at org.apache.pig.tools.grunt.GruntParser.parseStopOnError(GruntParser.java:205)
  at org.apache.pig.tools.grunt.Grunt.run(Grunt.java:66)
  at org.apache.pig.Main.run(Main.java:565)
  at org.apache.pig.Main.main(Main.java:177)
  at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
  at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
  at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
  at java.lang.reflect.Method.invoke(Method.java:606)
  at org.apache.hadoop.util.RunJar.run(RunJar.java:221)
  at org.apache.hadoop.util.RunJar.main(RunJar.java:136)
Caused by: java.io.IOException: Job terminated with anomalous status FAILED
  at org.apache.pig.PigServer.openIterator(PigServer.java:927)
  ... 13 more&lt;/PRE&gt;</description>
      <pubDate>Fri, 05 Feb 2016 08:13:18 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/PigStorage-in-mapreduce-mode/m-p/127377#M90092</guid>
      <dc:creator>lenovomi</dc:creator>
      <dc:date>2016-02-05T08:13:18Z</dc:date>
    </item>
    <item>
      <title>Re: PigStorage in mapreduce mode</title>
      <link>https://community.cloudera.com/t5/Support-Questions/PigStorage-in-mapreduce-mode/m-p/127378#M90093</link>
      <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/1997/lenovomi.html" nodeid="1997"&gt;@John Smith&lt;/A&gt; &lt;/P&gt;&lt;P&gt;How did you launch Pig Grunt?&lt;/P&gt;&lt;P&gt;On the sandbox you can use tez:&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;pig -x tez&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;You can refer to your dataset like so: &lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;'/src/filename.csv' &lt;/STRONG&gt;You don't need to explicitly set the hdfs scheme. Also, make sure the src directory has permissions for the user you are executing with. Finally, last time I looked at your dataset, I thought the delimiter was a comma and not a semicolon.&lt;/P&gt;</description>
      <pubDate>Fri, 05 Feb 2016 08:23:06 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/PigStorage-in-mapreduce-mode/m-p/127378#M90093</guid>
      <dc:creator>aervits</dc:creator>
      <dc:date>2016-02-05T08:23:06Z</dc:date>
    </item>
    <item>
      <title>Re: PigStorage in mapreduce mode</title>
      <link>https://community.cloudera.com/t5/Support-Questions/PigStorage-in-mapreduce-mode/m-p/127379#M90094</link>
      <description>&lt;P&gt;Needless to say, this is insane.&lt;/P&gt;&lt;P&gt;Yes, I launched grunt with -x mapreduce. I also tried -x tez, but:&lt;/P&gt;&lt;PRE&gt;2016-02-05 00:37:42,172 [main] ERROR org.apache.pig.tools.grunt.Grunt - ERROR 1066: Unable to open iterator for alias sourceData
Details at logfile: /home/hdfs/pig_1454632554431.log&lt;/PRE&gt;&lt;PRE&gt;privileges are correct:

drwxr-xr-x   - hdfs   hdfs            0 2016-02-04 23:55 /src

&lt;/PRE&gt;&lt;P&gt;The delimiter is ;&lt;/P&gt;&lt;P&gt;Any idea?&lt;/P&gt;</description>
      <pubDate>Fri, 05 Feb 2016 08:39:20 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/PigStorage-in-mapreduce-mode/m-p/127379#M90094</guid>
      <dc:creator>lenovomi</dc:creator>
      <dc:date>2016-02-05T08:39:20Z</dc:date>
    </item>
    <item>
      <title>Re: PigStorage in mapreduce mode</title>
      <link>https://community.cloudera.com/t5/Support-Questions/PigStorage-in-mapreduce-mode/m-p/127380#M90095</link>
      <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/1997/lenovomi.html" nodeid="1997"&gt;@John Smith&lt;/A&gt; upload your dataset and I will try it out in a bit.&lt;/P&gt;</description>
      <pubDate>Fri, 05 Feb 2016 08:48:24 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/PigStorage-in-mapreduce-mode/m-p/127380#M90095</guid>
      <dc:creator>aervits</dc:creator>
      <dc:date>2016-02-05T08:48:24Z</dc:date>
    </item>
    <item>
      <title>Re: PigStorage in mapreduce mode</title>
      <link>https://community.cloudera.com/t5/Support-Questions/PigStorage-in-mapreduce-mode/m-p/127381#M90096</link>
      <description>&lt;P&gt;You can find the dataset here:&lt;/P&gt;&lt;P&gt;&lt;A href="https://drive.google.com/file/d/0B6RZ_9vVuTEcTHllU1dIR2VBY1E/view?usp=sharing"&gt;https://drive.google.com/file/d/0B6RZ_9vVuTEcTHllU1dIR2VBY1E/view?usp=sharing&lt;/A&gt;&lt;/P&gt;&lt;P&gt;Thank you&lt;/P&gt;</description>
      <pubDate>Fri, 05 Feb 2016 08:59:15 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/PigStorage-in-mapreduce-mode/m-p/127381#M90096</guid>
      <dc:creator>lenovomi</dc:creator>
      <dc:date>2016-02-05T08:59:15Z</dc:date>
    </item>
    <item>
      <title>Re: PigStorage in mapreduce mode</title>
      <link>https://community.cloudera.com/t5/Support-Questions/PigStorage-in-mapreduce-mode/m-p/127382#M90097</link>
      <description>&lt;P&gt;I successfully ran your load statement followed by a dump on the 2.3 sandbox. What's your complete script?&lt;/P&gt;</description>
      <pubDate>Fri, 05 Feb 2016 09:46:01 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/PigStorage-in-mapreduce-mode/m-p/127382#M90097</guid>
      <dc:creator>daijy</dc:creator>
      <dc:date>2016-02-05T09:46:01Z</dc:date>
    </item>
    <item>
      <title>Re: PigStorage in mapreduce mode</title>
      <link>https://community.cloudera.com/t5/Support-Questions/PigStorage-in-mapreduce-mode/m-p/127383#M90098</link>
      <description>&lt;A rel="user" href="https://community.cloudera.com/users/1997/lenovomi.html" nodeid="1997"&gt;@John Smith&lt;/A&gt;&lt;P&gt;Something is wrong with your environment; I was able to execute your statements in both mapred and tez modes.&lt;/P&gt;&lt;PRE&gt;grunt&amp;gt; sourceData = load 'CustomerData.csv' using PigStorage(';') as (nullname: chararray,customerId: chararray,VIN: chararray,Birthdate: chararray,Mileage: chararray,Fuel_Consumption: chararray);
grunt&amp;gt; describe sourceData;
sourceData: {nullname: chararray,customerId: chararray,VIN: chararray,Birthdate: chararray,Mileage: chararray,Fuel_Consumption: chararray}
grunt&amp;gt; b = limit sourceData 5;
grunt&amp;gt; dump b;
2016-02-05 02:43:32,930 [main] INFO  org.apache.pig.tools.pigstats.ScriptState - Pig features used in the script: LIMIT
2016-02-05 02:43:33,105 [main] WARN  org.apache.pig.data.SchemaTupleBackend - SchemaTupleBackend has already been initialized
2016-02-05 02:43:33,106 [main] INFO  org.apache.pig.newplan.logical.optimizer.LogicalPlanOptimizer - {RULES_ENABLED=[AddForEach, ColumnMapKeyPrune, ConstantCalculator, GroupByConstParallelSetter, LimitOptimizer, LoadTypeCastInserter, MergeFilter, MergeForEach, PartitionFilterOptimizer, PredicatePushdownOptimizer, PushDownForEachFlatten, PushUpFilter, SplitFilter, StreamTypeCastInserter]}
2016-02-05 02:43:33,179 [main] INFO  org.apache.hadoop.mapreduce.lib.output.FileOutputCommitter - File Output Committer Algorithm version is 1
2016-02-05 02:43:33,179 [main] INFO  org.apache.hadoop.mapreduce.lib.output.FileOutputCommitter - FileOutputCommitter skip cleanup _temporary folders under output directory:false, ignore cleanup failures: false
2016-02-05 02:43:33,209 [main] INFO  org.apache.pig.data.SchemaTupleBackend - Key [pig.schematuple] was not set... will not generate code.
2016-02-05 02:43:33,256 [main] INFO  org.apache.hadoop.mapreduce.lib.input.FileInputFormat - Total input paths to process : 1
2016-02-05 02:43:33,257 [main] INFO  org.apache.pig.backend.hadoop.executionengine.util.MapRedUtil - Total input paths to process : 1
2016-02-05 02:43:33,305 [main] INFO  org.apache.hadoop.mapreduce.lib.output.FileOutputCommitter - Saved output of task 'attempt__0001_m_000001_1' to hdfs://c6401.ambari.apache.org:8020/tmp/temp2063345867/tmp1865027526/_temporary/0/task__0001_m_000001
2016-02-05 02:43:33,333 [main] WARN  org.apache.pig.data.SchemaTupleBackend - SchemaTupleBackend has already been initialized
2016-02-05 02:43:33,336 [main] INFO  org.apache.hadoop.mapreduce.lib.input.FileInputFormat - Total input paths to process : 1
2016-02-05 02:43:33,336 [main] INFO  org.apache.pig.backend.hadoop.executionengine.util.MapRedUtil - Total input paths to process : 1
("Ronni Engelmann",93117643,"WBA68251082969954","1971-11-15",41,10.26)
("Kina Buttars",12452346,"WBA32649710927373","1968-08-14",68,10.551)
("Caren Rodman",18853438,"WBA56064572124841","1987-01-24",96,6.779)
("Tierra Bork",89673290,"WBA69315467645466","1958-11-22",52,10.109)
("Thelma Steve",97170856,"WBA73739033913927","1985-12-03",98,5.081)
&lt;/PRE&gt;&lt;P&gt;and output of illustrate in mapred mode&lt;/P&gt;&lt;PRE&gt;("Leonel Bullen",50258523,"WBA23530058599244","1986-08-26",27,8.673)
2016-02-05 02:43:47,393 [main] INFO  org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MRCompiler - File concatenation threshold: 100 optimistic? false
2016-02-05 02:43:47,393 [main] INFO  org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MultiQueryOptimizer - MR plan size before optimization: 1
2016-02-05 02:43:47,393 [main] INFO  org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MultiQueryOptimizer - MR plan size after optimization: 1
2016-02-05 02:43:47,394 [main] INFO  org.apache.pig.tools.pigstats.mapreduce.MRScriptState - Pig script settings are added to the job
2016-02-05 02:43:47,401 [main] INFO  org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.JobControlCompiler - mapred.job.reduce.markreset.buffer.percent is not set, set to default 0.3
2016-02-05 02:43:47,452 [main] WARN  org.apache.pig.data.SchemaTupleBackend - SchemaTupleBackend has already been initialized
2016-02-05 02:43:47,459 [main] INFO  org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.PigMapOnly$Map - Aliases being processed per job phase (AliasName[line,offset]): M: sourceData[4,13] C:  R:
2016-02-05 02:43:47,459 [main] INFO  org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MRCompiler - File concatenation threshold: 100 optimistic? false
2016-02-05 02:43:47,470 [main] INFO  org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MultiQueryOptimizer - MR plan size before optimization: 1
2016-02-05 02:43:47,470 [main] INFO  org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MultiQueryOptimizer - MR plan size after optimization: 1
2016-02-05 02:43:47,473 [main] INFO  org.apache.pig.tools.pigstats.mapreduce.MRScriptState - Pig script settings are added to the job
2016-02-05 02:43:47,473 [main] INFO  org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.JobControlCompiler - mapred.job.reduce.markreset.buffer.percent is not set, set to default 0.3
2016-02-05 02:43:47,520 [main] WARN  org.apache.pig.data.SchemaTupleBackend - SchemaTupleBackend has already been initialized
2016-02-05 02:43:47,536 [main] INFO  org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.PigMapOnly$Map - Aliases being processed per job phase (AliasName[line,offset]): M: sourceData[4,13] C:  R:
2016-02-05 02:43:47,538 [main] INFO  org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MRCompiler - File concatenation threshold: 100 optimistic? false
2016-02-05 02:43:47,542 [main] INFO  org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MultiQueryOptimizer - MR plan size before optimization: 1
2016-02-05 02:43:47,542 [main] INFO  org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MultiQueryOptimizer - MR plan size after optimization: 1
2016-02-05 02:43:47,545 [main] INFO  org.apache.pig.tools.pigstats.mapreduce.MRScriptState - Pig script settings are added to the job
2016-02-05 02:43:47,545 [main] INFO  org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.JobControlCompiler - mapred.job.reduce.markreset.buffer.percent is not set, set to default 0.3
2016-02-05 02:43:47,589 [main] WARN  org.apache.pig.data.SchemaTupleBackend - SchemaTupleBackend has already been initialized
2016-02-05 02:43:47,606 [main] INFO  org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.PigMapOnly$Map - Aliases being processed per job phase (AliasName[line,offset]): M: sourceData[4,13] C:  R:
2016-02-05 02:43:47,606 [main] INFO  org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MRCompiler - File concatenation threshold: 100 optimistic? false
2016-02-05 02:43:47,612 [main] INFO  org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MultiQueryOptimizer - MR plan size before optimization: 1
2016-02-05 02:43:47,612 [main] INFO  org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MultiQueryOptimizer - MR plan size after optimization: 1
2016-02-05 02:43:47,613 [main] INFO  org.apache.pig.tools.pigstats.mapreduce.MRScriptState - Pig script settings are added to the job
2016-02-05 02:43:47,613 [main] INFO  org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.JobControlCompiler - mapred.job.reduce.markreset.buffer.percent is not set, set to default 0.3
2016-02-05 02:43:47,665 [main] WARN  org.apache.pig.data.SchemaTupleBackend - SchemaTupleBackend has already been initialized
2016-02-05 02:43:47,668 [main] INFO  org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.PigMapOnly$Map - Aliases being processed per job phase (AliasName[line,offset]): M: sourceData[4,13] C:  R:
-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
| sourceData     | nullname:chararray     | customerId:chararray     | VIN:chararray       | Birthdate:chararray     | Mileage:chararray     | Fuel_Consumption:chararray     |
-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
|                | "Leonel Bullen"        | 50258523                 | "WBA23530058599244" | "1986-08-26"            | 27                    | 8.673                          |
-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
&lt;/PRE&gt;&lt;P&gt;and in tez mode&lt;/P&gt;&lt;PRE&gt;grunt&amp;gt; sourceData = load 'CustomerData.csv' using PigStorage(';') as (nullname: chararray,customerId: chararray,VIN: chararray,Birthdate: chararray,Mileage: chararray,Fuel_Consumption: chararray);
grunt&amp;gt; describe sourceData;
sourceData: {nullname: chararray,customerId: chararray,VIN: chararray,Birthdate: chararray,Mileage: chararray,Fuel_Consumption: chararray}
grunt&amp;gt; b = limit sourceData 5;
grunt&amp;gt; dump b;
2016-02-05 02:46:17,619 [main] INFO  org.apache.pig.tools.pigstats.ScriptState - Pig features used in the script: LIMIT
2016-02-05 02:46:17,698 [main] INFO  org.apache.pig.data.SchemaTupleBackend - Key [pig.schematuple] was not set... will not generate code.
2016-02-05 02:46:17,749 [main] INFO  org.apache.pig.newplan.logical.optimizer.LogicalPlanOptimizer - {RULES_ENABLED=[AddForEach, ColumnMapKeyPrune, ConstantCalculator, GroupByConstParallelSetter, LimitOptimizer, LoadTypeCastInserter, MergeFilter, MergeForEach, PartitionFilterOptimizer, PredicatePushdownOptimizer, PushDownForEachFlatten, PushUpFilter, SplitFilter, StreamTypeCastInserter]}
2016-02-05 02:46:18,039 [main] INFO  org.apache.hadoop.mapreduce.lib.output.FileOutputCommitter - File Output Committer Algorithm version is 1
2016-02-05 02:46:18,039 [main] INFO  org.apache.hadoop.mapreduce.lib.output.FileOutputCommitter - FileOutputCommitter skip cleanup _temporary folders under output directory:false, ignore cleanup failures: false
2016-02-05 02:46:18,143 [main] INFO  org.apache.pig.data.SchemaTupleBackend - Key [pig.schematuple] was not set... will not generate code.
2016-02-05 02:46:18,274 [main] INFO  org.apache.hadoop.mapreduce.lib.input.FileInputFormat - Total input paths to process : 1
2016-02-05 02:46:18,288 [main] INFO  org.apache.pig.backend.hadoop.executionengine.util.MapRedUtil - Total input paths to process : 1
2016-02-05 02:46:18,711 [main] INFO  org.apache.hadoop.mapreduce.lib.output.FileOutputCommitter - Saved output of task 'attempt__0001_m_000001_1' to hdfs://c6401.ambari.apache.org:8020/tmp/temp-785652652/tmp136925164/_temporary/0/task__0001_m_000001
2016-02-05 02:46:18,782 [main] WARN  org.apache.pig.data.SchemaTupleBackend - SchemaTupleBackend has already been initialized
2016-02-05 02:46:18,811 [main] INFO  org.apache.hadoop.mapreduce.lib.input.FileInputFormat - Total input paths to process : 1
2016-02-05 02:46:18,811 [main] INFO  org.apache.pig.backend.hadoop.executionengine.util.MapRedUtil - Total input paths to process : 1
("Ronni Engelmann",93117643,"WBA68251082969954","1971-11-15",41,10.26)
("Kina Buttars",12452346,"WBA32649710927373","1968-08-14",68,10.551)
("Caren Rodman",18853438,"WBA56064572124841","1987-01-24",96,6.779)
("Tierra Bork",89673290,"WBA69315467645466","1958-11-22",52,10.109)
("Thelma Steve",97170856,"WBA73739033913927","1985-12-03",98,5.081)
&lt;/PRE&gt;&lt;P&gt;There is a bug where the illustrate command doesn't work in tez mode yet.&lt;/P&gt;&lt;P&gt;Bottom line: I tested it with 'CustomerData.csv', and also with '/user/root/CustomerData.csv' and 'hdfs://fqdn:8020/user/root/CustomerData.csv'.&lt;/P&gt;</description>
      <pubDate>Fri, 05 Feb 2016 10:49:43 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/PigStorage-in-mapreduce-mode/m-p/127383#M90098</guid>
      <dc:creator>aervits</dc:creator>
      <dc:date>2016-02-05T10:49:43Z</dc:date>
    </item>
    <item>
      <title>Re: PigStorage in mapreduce mode</title>
      <link>https://community.cloudera.com/t5/Support-Questions/PigStorage-in-mapreduce-mode/m-p/127384#M90099</link>
      <description>&lt;P&gt;First, the error stack does not tell us much. You will need to go to the MapReduce Web UI, click the job, and find the real error message. Second, your input is a CSV file, yet you use ; as the delimiter for PigStorage. That sounds wrong unless you are sure the file really is semicolon-delimited.&lt;/P&gt;</description>
      <pubDate>Fri, 05 Feb 2016 13:15:20 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/PigStorage-in-mapreduce-mode/m-p/127384#M90099</guid>
      <dc:creator>daijy</dc:creator>
      <dc:date>2016-02-05T13:15:20Z</dc:date>
    </item>
    <item>
      <title>Re: PigStorage in mapreduce mode</title>
      <link>https://community.cloudera.com/t5/Support-Questions/PigStorage-in-mapreduce-mode/m-p/127385#M90100</link>
      <description>&lt;P&gt;Then what kind of environment issue could it be?&lt;/P&gt;&lt;P&gt;I only executed the mentioned command, nothing else.&lt;/P&gt;</description>
      <pubDate>Fri, 05 Feb 2016 16:43:47 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/PigStorage-in-mapreduce-mode/m-p/127385#M90100</guid>
      <dc:creator>lenovomi</dc:creator>
      <dc:date>2016-02-05T16:43:47Z</dc:date>
    </item>
    <item>
      <title>Re: PigStorage in mapreduce mode</title>
      <link>https://community.cloudera.com/t5/Support-Questions/PigStorage-in-mapreduce-mode/m-p/127386#M90101</link>
      <description>&lt;P&gt;This is odd:&lt;/P&gt;&lt;P&gt;When I do &lt;/P&gt;&lt;LI&gt;grunt&amp;gt; b = limit sourceData 5;&lt;/LI&gt;&lt;LI&gt;grunt&amp;gt; dump b;&lt;/LI&gt;&lt;P&gt;it works for me too. When I don't limit the result set and just execute dump sourceData; I get the same error.&lt;/P&gt;</description>
      <pubDate>Fri, 05 Feb 2016 17:03:59 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/PigStorage-in-mapreduce-mode/m-p/127386#M90101</guid>
      <dc:creator>lenovomi</dc:creator>
      <dc:date>2016-02-05T17:03:59Z</dc:date>
    </item>
    <item>
      <title>Re: PigStorage in mapreduce mode</title>
      <link>https://community.cloudera.com/t5/Support-Questions/PigStorage-in-mapreduce-mode/m-p/127387#M90102</link>
      <description>&lt;P&gt;I think it crashed on me when I dumped the whole dataset; there might be a problem with your dataset further down. &lt;A rel="user" href="https://community.cloudera.com/users/1997/lenovomi.html" nodeid="1997"&gt;@John Smith&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Fri, 05 Feb 2016 19:38:57 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/PigStorage-in-mapreduce-mode/m-p/127387#M90102</guid>
      <dc:creator>aervits</dc:creator>
      <dc:date>2016-02-05T19:38:57Z</dc:date>
    </item>
    <item>
      <title>Re: PigStorage in mapreduce mode</title>
      <link>https://community.cloudera.com/t5/Support-Questions/PigStorage-in-mapreduce-mode/m-p/127388#M90103</link>
      <description>&lt;P&gt;I am 100% sure there is no problem with the input dataset. I kept only the first 5 records in the file and it's the same issue.&lt;/P&gt;</description>
      <pubDate>Fri, 05 Feb 2016 22:45:46 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/PigStorage-in-mapreduce-mode/m-p/127388#M90103</guid>
      <dc:creator>lenovomi</dc:creator>
      <dc:date>2016-02-05T22:45:46Z</dc:date>
    </item>
    <item>
      <title>Re: PigStorage in mapreduce mode</title>
      <link>https://community.cloudera.com/t5/Support-Questions/PigStorage-in-mapreduce-mode/m-p/127389#M90104</link>
      <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/1997/lenovomi.html" nodeid="1997"&gt;@John Smith&lt;/A&gt; you got me there, as you see my attempt with your file worked. Alternatively take a look at CSVExcelStorage as that has more capability as opposed to PigStorage. &lt;A href="http://help.mortardata.com/technologies/pig/csv"&gt;link&lt;/A&gt;&lt;/P&gt;&lt;P&gt;I am not saying this is the case, I don't know what's wrong but here's a note, not sure how valid it is anymore as this note has been around for a while and they don't mention which version of Pig they were using&lt;/P&gt;&lt;H2&gt;Limitations&lt;/H2&gt;&lt;P&gt;PigStorage is an extremely simple loader that does not handle special cases such as embedded delimiters or escaped control characters; it will split on every instance of the delimiter regardless of context. For this reason, when loading a CSV file it is recommended to use &lt;A href="http://help.mortardata.com/integrations/amazon_s3/csv"&gt;CSVExcelStorage&lt;/A&gt; rather than PigStorage with a comma delimiter.&lt;/P&gt;</description>
      <pubDate>Fri, 05 Feb 2016 23:04:20 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/PigStorage-in-mapreduce-mode/m-p/127389#M90104</guid>
      <dc:creator>aervits</dc:creator>
      <dc:date>2016-02-05T23:04:20Z</dc:date>
    </item>
    <item>
      <title>Re: PigStorage in mapreduce mode</title>
      <link>https://community.cloudera.com/t5/Support-Questions/PigStorage-in-mapreduce-mode/m-p/127390#M90105</link>
      <description>&lt;P&gt;Well, CSVExcelStorage doesn't work either.&lt;/P&gt;&lt;P&gt;
2016-02-05 16:01:28,917 [main] INFO  org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MapReduceLauncher - Failed!
2016-02-05 16:01:29,745 [main] ERROR org.apache.pig.tools.grunt.Grunt - ERROR 1066: Unable to open iterator for alias sourceData
Details at logfile: /home/hdfs/pig_1454687855333.log
grunt&amp;gt;&lt;/P&gt;&lt;P&gt;I'm confused. What could it be?&lt;/P&gt;</description>
      <pubDate>Sat, 06 Feb 2016 00:01:44 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/PigStorage-in-mapreduce-mode/m-p/127390#M90105</guid>
      <dc:creator>lenovomi</dc:creator>
      <dc:date>2016-02-06T00:01:44Z</dc:date>
    </item>
    <item>
      <title>Re: PigStorage in mapreduce mode</title>
      <link>https://community.cloudera.com/t5/Support-Questions/PigStorage-in-mapreduce-mode/m-p/127391#M90106</link>
      <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/1997/lenovomi.html" nodeid="1997"&gt;@John Smith&lt;/A&gt; if you identified another bug, I'm going to buy a lottery ticket.&lt;/P&gt;</description>
      <pubDate>Sat, 06 Feb 2016 00:17:12 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/PigStorage-in-mapreduce-mode/m-p/127391#M90106</guid>
      <dc:creator>aervits</dc:creator>
      <dc:date>2016-02-06T00:17:12Z</dc:date>
    </item>
    <item>
      <title>Re: PigStorage in mapreduce mode</title>
      <link>https://community.cloudera.com/t5/Support-Questions/PigStorage-in-mapreduce-mode/m-p/127392#M90107</link>
      <description>&lt;P&gt;As I commented above, I cannot reproduce the error. The error you posted is too general. Can you go to the Hadoop Web UI and get the detailed message?&lt;/P&gt;</description>
      <pubDate>Sat, 06 Feb 2016 02:53:54 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/PigStorage-in-mapreduce-mode/m-p/127392#M90107</guid>
      <dc:creator>daijy</dc:creator>
      <dc:date>2016-02-06T02:53:54Z</dc:date>
    </item>
    <item>
      <title>Re: PigStorage in mapreduce mode</title>
      <link>https://community.cloudera.com/t5/Support-Questions/PigStorage-in-mapreduce-mode/m-p/127393#M90108</link>
      <description>&lt;P&gt;It's strange that you can't reproduce the error. Does it work for you?&lt;/P&gt;&lt;PRE&gt;
&lt;/PRE&gt;&lt;PRE&gt;Application application_1454923438220_0007 failed 2 times due to AM Container for appattempt_1454923438220_0007_000002 exited with exitCode: 1
For more detailed output, check application tracking page: http://sandbox.hortonworks.com:8088/cluster/app/application_1454923438220_0007 Then, click on links to logs of each attempt.
Diagnostics: Exception from container-launch.
Container id: container_e10_1454923438220_0007_02_000001
Exit code: 1
Stack trace: ExitCodeException exitCode=1:
	at org.apache.hadoop.util.Shell.runCommand(Shell.java:576)
	at org.apache.hadoop.util.Shell.run(Shell.java:487)
	at org.apache.hadoop.util.Shell$ShellCommandExecutor.execute(Shell.java:753)
	at org.apache.hadoop.yarn.server.nodemanager.DefaultContainerExecutor.launchContainer(DefaultContainerExecutor.java:211)
	at org.apache.hadoop.yarn.server.nodemanager.containermanager.launcher.ContainerLaunch.call(ContainerLaunch.java:302)
	at org.apache.hadoop.yarn.server.nodemanager.containermanager.launcher.ContainerLaunch.call(ContainerLaunch.java:82)
	at java.util.concurrent.FutureTask.run(FutureTask.java:262)
	at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
	at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
	at java.lang.Thread.run(Thread.java:745)

Container exited with a non-zero exit code 1
Failing this attempt. Failing the application.&lt;/PRE&gt;</description>
      <pubDate>Tue, 09 Feb 2016 21:37:50 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/PigStorage-in-mapreduce-mode/m-p/127393#M90108</guid>
      <dc:creator>lenovomi</dc:creator>
      <dc:date>2016-02-09T21:37:50Z</dc:date>
    </item>
    <item>
      <title>Re: PigStorage in mapreduce mode</title>
      <link>https://community.cloudera.com/t5/Support-Questions/PigStorage-in-mapreduce-mode/m-p/127394#M90109</link>
      <description>&lt;P&gt;I hit the same issue in the Hortonworks sandbox environment. The script was correct but was throwing this error:&lt;/P&gt;&lt;PRE&gt;Unable to open iterator for alias&lt;/PRE&gt;&lt;P&gt;I found that the JobHistory server was not running by default. I could not work out the connection between the two, but after starting the history server, my Pig script worked in both Tez and MapReduce mode. Try it and see if it works for you as well.&lt;/P&gt;&lt;PRE&gt;[mapred@sandbox ~]$ cd /usr/hdp/current/hadoop-mapreduce-historyserver/sbin
[mapred@sandbox sbin]$ ls 
mr-jobhistory-daemon.sh
[mapred@sandbox sbin]$ mr-jobhistory-daemon.sh start historyserver&lt;/PRE&gt;</description>
      <pubDate>Sat, 01 Apr 2017 08:40:08 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/PigStorage-in-mapreduce-mode/m-p/127394#M90109</guid>
      <dc:creator>shalini_goel</dc:creator>
      <dc:date>2017-04-01T08:40:08Z</dc:date>
    </item>
  </channel>
</rss>

