<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: Pig Load - ERROR 2118: Input path does not exist in Archives of Support Questions (Read Only)</title>
    <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Pig-Load-ERROR-2118-Input-path-does-not-exist/m-p/51857#M56321</link>
    <description>Argggg. Ok I need to find a wall and pound my head against it.&lt;BR /&gt;&lt;BR /&gt;The issue was I was running the first_etl.pig as&lt;BR /&gt;pig -x local first_etl.pig which runs it locally expecting a local file and what I want is to run this on the Hadoop cluster. Running this as pig first_etl.pig fires this off and finds the file.</description>
    <pubDate>Tue, 07 Mar 2017 04:34:07 GMT</pubDate>
    <dc:creator>peterlandis</dc:creator>
    <dc:date>2017-03-07T04:34:07Z</dc:date>
    <item>
      <title>Pig Load - ERROR 2118: Input path does not exist</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Pig-Load-ERROR-2118-Input-path-does-not-exist/m-p/51856#M56320</link>
      <description>&lt;P&gt;I'm a newbie at Pig scripting and just walking through some examples (Cloudera on demaind training to be specific). &amp;nbsp;Anyway I load a file&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;hdfs dfs -put $ADIR/data/ad_data1.txt /dualcore/&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;Check that the directory has proper permissions via hdfs dfs -l /&amp;nbsp;&lt;BR /&gt;&lt;BR /&gt;I can see it's chmod 777 for /dualcore and also check the /dualcore/ad_data1.txt is also set properly in HDFS.&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;Now when I try to the pig -x local first_etl.pig script I get the following&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;ERROR:&amp;nbsp;org.apache.pig.backend.executionengine.ExecException: ERROR 2118: Input path does not exist: file:/dualcore/ad_data1.txt&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;QUESTION: &amp;nbsp;The file is at the root /dualcore/ad_data1.txt. &amp;nbsp;When I cat the file [hdfs dfs -cat /dualcore/ad_data1.txt] it displays the data.&amp;nbsp;Do I need to specify something other than LOAD '/dualcore/ad_data1.txt' ?&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;SCRIPT:&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;data = LOAD '/dualcore/ad_data1.txt' using PigStorage(':') AS (keyword:chararray,&lt;BR /&gt;campaign_id:chararray,&lt;BR /&gt;date:chararray,&lt;BR /&gt;time:chararray,&lt;BR /&gt;display_site:chararray,&lt;BR /&gt;was_clicked:int,&lt;BR /&gt;cpc:int,&lt;BR /&gt;country:chararray,&lt;BR /&gt;placement:chararray);&lt;/P&gt;&lt;P&gt;reordered = FOREACH data GENERATE campaign_id,&lt;BR /&gt;date,&lt;BR /&gt;time,&lt;BR /&gt;UPPER(TRIM(keyword)),&lt;BR /&gt;display_site,&lt;BR /&gt;placement,&lt;BR /&gt;was_clicked,&lt;BR /&gt;cpc;&lt;/P&gt;&lt;P&gt;STORE reordered INTO '/dualcore/ad_data1/';&lt;/P&gt;</description>
      <pubDate>Fri, 16 Sep 2022 15:43:28 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Pig-Load-ERROR-2118-Input-path-does-not-exist/m-p/51856#M56320</guid>
      <dc:creator>peterlandis</dc:creator>
      <dc:date>2022-09-16T15:43:28Z</dc:date>
    </item>
    <item>
      <title>Re: Pig Load - ERROR 2118: Input path does not exist</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Pig-Load-ERROR-2118-Input-path-does-not-exist/m-p/51857#M56321</link>
      <description>Argggg. Ok I need to find a wall and pound my head against it.&lt;BR /&gt;&lt;BR /&gt;The issue was I was running the first_etl.pig as&lt;BR /&gt;pig -x local first_etl.pig which runs it locally expecting a local file and what I want is to run this on the Hadoop cluster. Running this as pig first_etl.pig fires this off and finds the file.</description>
      <pubDate>Tue, 07 Mar 2017 04:34:07 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Pig-Load-ERROR-2118-Input-path-does-not-exist/m-p/51857#M56321</guid>
      <dc:creator>peterlandis</dc:creator>
      <dc:date>2017-03-07T04:34:07Z</dc:date>
    </item>
  </channel>
</rss>

