<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: I am looking to generate sequence no in a file .Used RANK, but it's failing for files &amp;gt; 10GB.  Here is the code ​temp = LOAD 'abc.txt' using PigStorage(';','-tagFile'); test = RANK temp; DUMP test; in Archives of Support Questions (Read Only)</title>
    <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/I-am-looking-to-generate-sequence-no-in-a-file-Used-RANK-but/m-p/153187#M36162</link>
    <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/11546/koteswara-pilla.html" nodeid="11546"&gt;@Koti P&lt;/A&gt;
&lt;/P&gt;&lt;P&gt;I don't see a problem with your code, I'm able to execute your code using HDP 2.4 Sandbox&lt;/P&gt;&lt;PRE&gt;temp = LOAD 'abc.txt' using PigStorage(';','-tagFile');
test = RANK temp;
DUMP test;&lt;/PRE&gt;&lt;P&gt;my abc.txt looks like so&lt;/P&gt;&lt;PRE&gt;David,1,N
Tete,2,N
Ranjit,3,M
Ranjit,3,P
David,4,Q
David,4,Q
Jillian,8,Q
JaePak,7,Q
Michael,8,T
Jillian,8,Q
Jose,10,V
&lt;/PRE&gt;&lt;P&gt;and my output looks like so:&lt;/P&gt;&lt;PRE&gt;(1,abc.txt,David,1,N)
(2,abc.txt,Tete,2,N)
(3,abc.txt,Ranjit,3,M)
(4,abc.txt,Ranjit,3,P)
(5,abc.txt,David,4,Q)
(6,abc.txt,David,4,Q)
(7,abc.txt,Jillian,8,Q)
(8,abc.txt,JaePak,7,Q)
(9,abc.txt,Michael,8,T)
(10,abc.txt,Jillian,8,Q)
(11,abc.txt,Jose,10,V)
&lt;/PRE&gt;&lt;P&gt;I used tez as executing engine&lt;/P&gt;&lt;PRE&gt;pig -x tez&lt;/PRE&gt;</description>
    <pubDate>Tue, 02 Aug 2016 03:18:35 GMT</pubDate>
    <dc:creator>aervits</dc:creator>
    <dc:date>2016-08-02T03:18:35Z</dc:date>
    <item>
      <title>I am looking to generate sequence no in a file .Used RANK, but it's failing for files &gt; 10GB.  Here is the code ​temp = LOAD 'abc.txt' using PigStorage(';','-tagFile'); test = RANK temp; DUMP test;</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/I-am-looking-to-generate-sequence-no-in-a-file-Used-RANK-but/m-p/153186#M36161</link>
      <description />
      <pubDate>Thu, 28 Jul 2016 03:52:22 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/I-am-looking-to-generate-sequence-no-in-a-file-Used-RANK-but/m-p/153186#M36161</guid>
      <dc:creator>koteswara_pilla</dc:creator>
      <dc:date>2016-07-28T03:52:22Z</dc:date>
    </item>
    <item>
      <title>Re: I am looking to generate sequence no in a file .Used RANK, but it's failing for files &gt; 10GB.  Here is the code ​temp = LOAD 'abc.txt' using PigStorage(';','-tagFile'); test = RANK temp; DUMP test;</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/I-am-looking-to-generate-sequence-no-in-a-file-Used-RANK-but/m-p/153187#M36162</link>
      <description>&lt;P&gt;&lt;A rel="user" href="https://community.cloudera.com/users/11546/koteswara-pilla.html" nodeid="11546"&gt;@Koti P&lt;/A&gt;
&lt;/P&gt;&lt;P&gt;I don't see a problem with your code, I'm able to execute your code using HDP 2.4 Sandbox&lt;/P&gt;&lt;PRE&gt;temp = LOAD 'abc.txt' using PigStorage(';','-tagFile');
test = RANK temp;
DUMP test;&lt;/PRE&gt;&lt;P&gt;my abc.txt looks like so&lt;/P&gt;&lt;PRE&gt;David,1,N
Tete,2,N
Ranjit,3,M
Ranjit,3,P
David,4,Q
David,4,Q
Jillian,8,Q
JaePak,7,Q
Michael,8,T
Jillian,8,Q
Jose,10,V
&lt;/PRE&gt;&lt;P&gt;and my output looks like so:&lt;/P&gt;&lt;PRE&gt;(1,abc.txt,David,1,N)
(2,abc.txt,Tete,2,N)
(3,abc.txt,Ranjit,3,M)
(4,abc.txt,Ranjit,3,P)
(5,abc.txt,David,4,Q)
(6,abc.txt,David,4,Q)
(7,abc.txt,Jillian,8,Q)
(8,abc.txt,JaePak,7,Q)
(9,abc.txt,Michael,8,T)
(10,abc.txt,Jillian,8,Q)
(11,abc.txt,Jose,10,V)
&lt;/PRE&gt;&lt;P&gt;I used tez as executing engine&lt;/P&gt;&lt;PRE&gt;pig -x tez&lt;/PRE&gt;</description>
      <pubDate>Tue, 02 Aug 2016 03:18:35 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/I-am-looking-to-generate-sequence-no-in-a-file-Used-RANK-but/m-p/153187#M36162</guid>
      <dc:creator>aervits</dc:creator>
      <dc:date>2016-08-02T03:18:35Z</dc:date>
    </item>
    <item>
      <title>Re: I am looking to generate sequence no in a file .Used RANK, but it's failing for files &gt; 10GB.  Here is the code ​temp = LOAD 'abc.txt' using PigStorage(';','-tagFile'); test = RANK temp; DUMP test;</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/I-am-looking-to-generate-sequence-no-in-a-file-Used-RANK-but/m-p/153188#M36163</link>
      <description>&lt;P&gt;Thanks for the answer. I was out of town and could not get back. I have tested and looked good if we run with 'tez' engine. Is there anyway we can test with oozie flow. The same I am not able to test with oozie as it's taking mapreduce mode&lt;/P&gt;</description>
      <pubDate>Thu, 18 Aug 2016 22:36:49 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/I-am-looking-to-generate-sequence-no-in-a-file-Used-RANK-but/m-p/153188#M36163</guid>
      <dc:creator>koteswara_pilla</dc:creator>
      <dc:date>2016-08-18T22:36:49Z</dc:date>
    </item>
  </channel>
</rss>

