<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: Any option to SKIP header line(s) when using the Phoenix CsvBulkLoadTool? in Archives of Support Questions (Read Only)</title>
    <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Any-option-to-SKIP-header-line-s-when-using-the-Phoenix/m-p/157128#M20865</link>
    <description>&lt;P&gt;Thanks Predrag, that's what I thought.  psql.py is an option for our smaller files...  &lt;/P&gt;</description>
    <pubDate>Thu, 25 Feb 2016 10:58:19 GMT</pubDate>
    <dc:creator>bpreachuk</dc:creator>
    <dc:date>2016-02-25T10:58:19Z</dc:date>
    <item>
      <title>Any option to SKIP header line(s) when using the Phoenix CsvBulkLoadTool?</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Any-option-to-SKIP-header-line-s-when-using-the-Phoenix/m-p/157126#M20863</link>
      <description>&lt;P&gt;Running the following code: &lt;/P&gt;&lt;PRE&gt;hadoop jar /usr/hdp/current/phoenix-client/phoenix-4.4.0.2.3.4.0-3485-client.jar org.apache.phoenix.mapreduce.CsvBulkLoadTool 
-z &amp;lt;Zookeeper nodes&amp;gt;:2181:/hbase-unsecure  
-d $'\t' 
--g 
--table &amp;lt;DB&amp;gt;.&amp;lt;TBL&amp;gt; 
--input /data/product/inbound/&amp;lt;FNAME&amp;gt;.TXT&lt;/PRE&gt;&lt;P&gt;is there any way to skip the first line of the input file - is there a parameter on the CsvBulkLoadTool that would allow a skip row?  Specifically like what Hive gives you with 'tblproperties ("skip.header.line.count"="1")'.  &lt;/P&gt;&lt;P&gt;Thanks!&lt;/P&gt;</description>
      <pubDate>Thu, 25 Feb 2016 04:27:04 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Any-option-to-SKIP-header-line-s-when-using-the-Phoenix/m-p/157126#M20863</guid>
      <dc:creator>bpreachuk</dc:creator>
      <dc:date>2016-02-25T04:27:04Z</dc:date>
    </item>
    <item>
      <title>Re: Any option to SKIP header line(s) when using the Phoenix CsvBulkLoadTool?</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Any-option-to-SKIP-header-line-s-when-using-the-Phoenix/m-p/157127#M20864</link>
      <description>&lt;P&gt;Hi &lt;A rel="user" href="https://community.cloudera.com/users/235/bpreachuk.html" nodeid="235"&gt;@bpreachuk&lt;/A&gt;, according to &lt;A href="https://phoenix.apache.org/bulk_dataload.html"&gt;this page&lt;/A&gt; the bulk load tool doesn't have such a feature, but for smaller files, up to "tens of megabytes" you can use a single threaded psql.py tool which can interpret the first line as a list of columns by using the "-h in-line" option. Thinking about the bulk MR tool it's indeed hard to implement this because every mapper gets a chunk of the file, and we'd like only 1 mapper to remove the very first line, so it will have to be marked in a special way. More details about commands &lt;A href="https://github.com/apache/phoenix/tree/master/bin"&gt;here&lt;/A&gt;.&lt;/P&gt;</description>
      <pubDate>Thu, 25 Feb 2016 06:52:12 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Any-option-to-SKIP-header-line-s-when-using-the-Phoenix/m-p/157127#M20864</guid>
      <dc:creator>pminovic</dc:creator>
      <dc:date>2016-02-25T06:52:12Z</dc:date>
    </item>
    <item>
      <title>Re: Any option to SKIP header line(s) when using the Phoenix CsvBulkLoadTool?</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Any-option-to-SKIP-header-line-s-when-using-the-Phoenix/m-p/157128#M20865</link>
      <description>&lt;P&gt;Thanks Predrag, that's what I thought.  psql.py is an option for our smaller files...  &lt;/P&gt;</description>
      <pubDate>Thu, 25 Feb 2016 10:58:19 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Any-option-to-SKIP-header-line-s-when-using-the-Phoenix/m-p/157128#M20865</guid>
      <dc:creator>bpreachuk</dc:creator>
      <dc:date>2016-02-25T10:58:19Z</dc:date>
    </item>
  </channel>
</rss>

