<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Extract hbase cell command reference guide in Archives of Support Questions (Read Only)</title>
    <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Extract-hbase-cell-command-reference-guide/m-p/9388#M1620</link>
    <description>&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;I am trying to get more information on extracHbaseCells command.How ever I am unable to find it in the morphline reference guide.Can some one please &amp;nbsp;let me know where I can find the documentation on this.The following is the refernce guide Iam looking at.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;A target="_blank" href="http://cloudera.github.io/cdk/docs/0.9.1/cdk-morphlines/morphlinesReferenceGuide.html"&gt;http://cloudera.github.io/cdk/docs/0.9.1/cdk-morphlines/morphlinesReferenceGuide.html&lt;/A&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
    <pubDate>Fri, 16 Sep 2022 08:57:54 GMT</pubDate>
    <dc:creator>Nishan</dc:creator>
    <dc:date>2022-09-16T08:57:54Z</dc:date>
    <item>
      <title>Extract hbase cell command reference guide</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Extract-hbase-cell-command-reference-guide/m-p/9388#M1620</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;I am trying to get more information on extracHbaseCells command.How ever I am unable to find it in the morphline reference guide.Can some one please &amp;nbsp;let me know where I can find the documentation on this.The following is the refernce guide Iam looking at.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;A target="_blank" href="http://cloudera.github.io/cdk/docs/0.9.1/cdk-morphlines/morphlinesReferenceGuide.html"&gt;http://cloudera.github.io/cdk/docs/0.9.1/cdk-morphlines/morphlinesReferenceGuide.html&lt;/A&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Fri, 16 Sep 2022 08:57:54 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Extract-hbase-cell-command-reference-guide/m-p/9388#M1620</guid>
      <dc:creator>Nishan</dc:creator>
      <dc:date>2022-09-16T08:57:54Z</dc:date>
    </item>
    <item>
      <title>Re: Extract hbase cell command reference guide</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Extract-hbase-cell-command-reference-guide/m-p/9390#M1621</link>
      <description>See Sections "Creating a Morphline Configuration File? and "Understanding the extractHBaseCells morphline command? at &lt;A target="_blank" href="http://www.cloudera.com/content/cloudera-content/cloudera-docs/Search/latest/Cloudera-Search-User-Guide/csug_hbase_batch_indexer.html"&gt;http://www.cloudera.com/content/cloudera-content/cloudera-docs/Search/latest/Cloudera-Search-User-Guide/csug_hbase_batch_indexer.html&lt;/A&gt;&lt;BR /&gt;&lt;BR /&gt;Wolfgang.&lt;BR /&gt;&lt;BR /&gt;</description>
      <pubDate>Thu, 24 Apr 2014 21:02:09 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Extract-hbase-cell-command-reference-guide/m-p/9390#M1621</guid>
      <dc:creator>whosch</dc:creator>
      <dc:date>2014-04-24T21:02:09Z</dc:date>
    </item>
    <item>
      <title>Re: Extract hbase cell command reference guide</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Extract-hbase-cell-command-reference-guide/m-p/9392#M1622</link>
      <description>&lt;P&gt;Hi ,&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I understood that part.But let us say i extract an xml from &amp;nbsp; the hbase cell with following elements(name,city,country) and I want to index the &amp;nbsp;solr .My solr schema also has &amp;nbsp;fields (name,city and country).Now I need to parse &amp;nbsp;the xml ,ge these fileds and &amp;nbsp;index it to solr.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;extractHBaseCells {&lt;BR /&gt;mappings : [&lt;BR /&gt;{&lt;BR /&gt;inputColumn : "messages:name"&lt;BR /&gt;outputField : "name"&lt;BR /&gt;type : String&lt;BR /&gt;source : value&lt;BR /&gt;}&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;{&lt;BR /&gt;inputColumn : "messages:city"&lt;BR /&gt;outputField : "city"&amp;nbsp;&lt;BR /&gt;type : String&lt;BR /&gt;source : value&lt;BR /&gt;}&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;{&lt;BR /&gt;inputColumn : "messages:country"&lt;BR /&gt;outputField : "country"&amp;nbsp;&lt;BR /&gt;type : String&lt;BR /&gt;source : value&lt;BR /&gt;}&lt;BR /&gt;]&lt;BR /&gt;}&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;This would have been possible If was able to retrieve data from hbase in this format.But what &amp;nbsp;extractHbaseCell would give me is an xml file.I am loking for a way &amp;nbsp;to parse this using xquery and then assign then assign values to solr field.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Thu, 24 Apr 2014 21:29:07 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Extract-hbase-cell-command-reference-guide/m-p/9392#M1622</guid>
      <dc:creator>Nishan</dc:creator>
      <dc:date>2014-04-24T21:29:07Z</dc:date>
    </item>
    <item>
      <title>Re: Extract hbase cell command reference guide</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Extract-hbase-cell-command-reference-guide/m-p/9394#M1623</link>
      <description>You can just specify an extractHBaseCells command followed by an xquery command in the same morphline config file. Each command pipes into the subsequent command, and you can specify as many commands as you like. The links I mentioned contain a (commented out) example for extractHBaseCells followed by readAvroContainer. just uncomment that and replace readAvroContainer with xquery.&lt;BR /&gt;&lt;BR /&gt;Wolfgang.&lt;BR /&gt;&lt;BR /&gt;</description>
      <pubDate>Thu, 24 Apr 2014 21:33:09 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Extract-hbase-cell-command-reference-guide/m-p/9394#M1623</guid>
      <dc:creator>whosch</dc:creator>
      <dc:date>2014-04-24T21:33:09Z</dc:date>
    </item>
    <item>
      <title>Re: Extract hbase cell command reference guide</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Extract-hbase-cell-command-reference-guide/m-p/9448#M1624</link>
      <description>&lt;P&gt;Thank you.If I am not wrong can I can also put my java code to parse the &amp;nbsp;the soap message here.I am a java guy,so it seems to be easy for me.&lt;/P&gt;</description>
      <pubDate>Fri, 25 Apr 2014 20:05:43 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Extract-hbase-cell-command-reference-guide/m-p/9448#M1624</guid>
      <dc:creator>Nishan</dc:creator>
      <dc:date>2014-04-25T20:05:43Z</dc:date>
    </item>
    <item>
      <title>Re: Extract hbase cell command reference guide</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Extract-hbase-cell-command-reference-guide/m-p/9454#M1625</link>
      <description>Yes, you can write a custom morphline command in Java [1] and add the corresponding custom jar that to the classpath, e.g via the HBASE_INDEXER_CLASSPATH environment variable in menu ?Service-Wide/Advanced/Safety Valve? in Cloudera Manager (for Near Real Time Indexing) or via the --libjars CLI option on HBaseMapReduceIndexerTool (for Batch Indexing).&lt;BR /&gt;&lt;BR /&gt;Alternatively, you also write a mini script in Java and paste it into the body of the ?java" morphline command [2].&lt;BR /&gt;&lt;BR /&gt;[1] Section "Implementing your own Custom Command? at &lt;A target="_blank" href="http://cloudera.github.io/cdk/docs/current/cdk-morphlines/morphlinesReferenceGuide.html"&gt;http://cloudera.github.io/cdk/docs/current/cdk-morphlines/morphlinesReferenceGuide.html&lt;/A&gt;&lt;BR /&gt;&lt;BR /&gt;[2] &lt;A target="_blank" href="http://cloudera.github.io/cdk/docs/current/cdk-morphlines/morphlinesReferenceGuide.html#/java"&gt;http://cloudera.github.io/cdk/docs/current/cdk-morphlines/morphlinesReferenceGuide.html#/java&lt;/A&gt;&lt;BR /&gt;&lt;BR /&gt;</description>
      <pubDate>Fri, 25 Apr 2014 20:43:09 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Extract-hbase-cell-command-reference-guide/m-p/9454#M1625</guid>
      <dc:creator>whosch</dc:creator>
      <dc:date>2014-04-25T20:43:09Z</dc:date>
    </item>
    <item>
      <title>Re: Extract hbase cell command reference guide</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Extract-hbase-cell-command-reference-guide/m-p/11516#M1626</link>
      <description>&lt;P&gt;Hi,&lt;BR /&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;I was able to &amp;nbsp;parse the xml from &amp;nbsp; stored in hbase and then put all the values into the record object.How do i set this to the required solr fields now ?.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;My conf file is right npow like this&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;{extract habse}&lt;/P&gt;&lt;P&gt;{java # have all values extracted and set to the record object)&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Now how can I set these extracted values to solr fields?..&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thanks,&lt;/P&gt;&lt;P&gt;Nishanth&lt;/P&gt;</description>
      <pubDate>Tue, 29 Apr 2014 18:46:52 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Extract-hbase-cell-command-reference-guide/m-p/11516#M1626</guid>
      <dc:creator>Nishan</dc:creator>
      <dc:date>2014-04-29T18:46:52Z</dc:date>
    </item>
    <item>
      <title>Re: Extract hbase cell command reference guide</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Extract-hbase-cell-command-reference-guide/m-p/11522#M1627</link>
      <description>A next step is to configure Solr, in particular schema.xml and solrconfig.xml. For an example see &lt;A target="_blank" href="http://www.cloudera.com/content/cloudera-content/cloudera-docs/Search/latest/Cloudera-Search-User-Guide/csug_tutorial.html"&gt;http://www.cloudera.com/content/cloudera-content/cloudera-docs/Search/latest/Cloudera-Search-User-Guide/csug_tutorial.html&lt;/A&gt;&lt;BR /&gt;&lt;BR /&gt;</description>
      <pubDate>Tue, 29 Apr 2014 19:04:09 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Extract-hbase-cell-command-reference-guide/m-p/11522#M1627</guid>
      <dc:creator>whosch</dc:creator>
      <dc:date>2014-04-29T19:04:09Z</dc:date>
    </item>
    <item>
      <title>Re: Extract hbase cell command reference guide</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/Extract-hbase-cell-command-reference-guide/m-p/11576#M1628</link>
      <description>&lt;P&gt;Thanks a lot.I have created the solr cloud &amp;nbsp;and was able to index a sample data(extract the mesagae and put it into one solr field) just to check that my configuration is correct and it works.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;How ever when I try to extract &amp;nbsp;data and assign it to solr schema elements it does not work.Extract hbase cell looks like this.Do I need to have&amp;nbsp;&lt;SPAN&gt;_attachment_body" field or an "_attachment_mimetype" field defined in my schema?&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;extractHBaseCells {&lt;BR /&gt;mappings : [&lt;BR /&gt;{&lt;BR /&gt;inputColumn : "messages:*"&lt;BR /&gt;outputField : "_attachment_body"&lt;BR /&gt;type : byte[]&lt;BR /&gt;source : value&lt;BR /&gt;}&lt;BR /&gt;]&lt;BR /&gt;}&lt;BR /&gt;}&lt;/P&gt;</description>
      <pubDate>Wed, 30 Apr 2014 17:55:22 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/Extract-hbase-cell-command-reference-guide/m-p/11576#M1628</guid>
      <dc:creator>Nishan</dc:creator>
      <dc:date>2014-04-30T17:55:22Z</dc:date>
    </item>
  </channel>
</rss>

