<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Re: How can you query using JSON against HDP? in Archives of Support Questions (Read Only)</title>
    <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-can-you-query-using-JSON-against-HDP/m-p/94894#M8164</link>
    <description>&lt;P&gt;One of our prospects recently evaluated Drill. It worked for structured, self-describing formats without requiring a schema, but their experience was that data type resolution slowed performance down. In any case, HWX does not officially support Drill, so the onus will be on the customer to resolve any Drill-related issues when using it with HDP.&lt;/P&gt;&lt;P&gt;On the other hand, my comment to customers is that Hive provides a consistent approach, with semantics that database developers already know. Additionally, larger community involvement and the maturity of the product have hardened Hive over a number of years.&lt;/P&gt;&lt;P&gt;JSONSerde is the easy-to-use way to handle JSON in HDP. In return for a one-time table creation, you get better performance than with Drill, which does not seem like a bad trade-off at all.&lt;/P&gt;</description>
    <pubDate>Tue, 06 Oct 2015 02:02:08 GMT</pubDate>
    <dc:creator>bsaini</dc:creator>
    <dc:date>2015-10-06T02:02:08Z</dc:date>
    <item>
      <title>How can you query using JSON against HDP?</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-can-you-query-using-JSON-against-HDP/m-p/94891#M8161</link>
      <description>&lt;P&gt;The customer wants to use something like Apache Drill to query JSON data on HDP, because JSON is self-describing.&lt;/P&gt;</description>
      <pubDate>Mon, 05 Oct 2015 23:17:22 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-can-you-query-using-JSON-against-HDP/m-p/94891#M8161</guid>
      <dc:creator>mhendricks</dc:creator>
      <dc:date>2015-10-05T23:17:22Z</dc:date>
    </item>
    <item>
      <title>Re: How can you query using JSON against HDP?</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-can-you-query-using-JSON-against-HDP/m-p/94892#M8162</link>
      <description>&lt;P&gt;Take a look at Spark (and Spark SQL). It can automatically infer the schema of a JSON dataset:&lt;/P&gt;&lt;P&gt;&lt;A href="https://spark.apache.org/docs/1.4.1/sql-programming-guide.html#json-datasets" target="_blank"&gt;https://spark.apache.org/docs/1.4.1/sql-programming-guide.html#json-datasets&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Mon, 05 Oct 2015 23:24:54 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-can-you-query-using-JSON-against-HDP/m-p/94892#M8162</guid>
      <dc:creator>awatson</dc:creator>
      <dc:date>2015-10-05T23:24:54Z</dc:date>
    </item>
    <item>
      <title>Re: How can you query using JSON against HDP?</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-can-you-query-using-JSON-against-HDP/m-p/94893#M8163</link>
      <description>&lt;P&gt;Apache Drill supports JSON as a self-describing data format; you can find the usage &lt;A href="https://drill.apache.org/docs/json-data-model/"&gt;here&lt;/A&gt;. In Hive, HCatalog supports JSON as a SerDe format for reading data from and writing data into tables.&lt;/P&gt;</description>
      <pubDate>Mon, 05 Oct 2015 23:49:45 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-can-you-query-using-JSON-against-HDP/m-p/94893#M8163</guid>
      <dc:creator>deepesh1</dc:creator>
      <dc:date>2015-10-05T23:49:45Z</dc:date>
    </item>
    <item>
      <title>Re: How can you query using JSON against HDP?</title>
      <link>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-can-you-query-using-JSON-against-HDP/m-p/94894#M8164</link>
      <description>&lt;P&gt;One of our prospects recently evaluated Drill. It worked for structured, self-describing formats without requiring a schema, but their experience was that data type resolution slowed performance down. In any case, HWX does not officially support Drill, so the onus will be on the customer to resolve any Drill-related issues when using it with HDP.&lt;/P&gt;&lt;P&gt;On the other hand, my comment to customers is that Hive provides a consistent approach, with semantics that database developers already know. Additionally, larger community involvement and the maturity of the product have hardened Hive over a number of years.&lt;/P&gt;&lt;P&gt;JSONSerde is the easy-to-use way to handle JSON in HDP. In return for a one-time table creation, you get better performance than with Drill, which does not seem like a bad trade-off at all.&lt;/P&gt;</description>
      <pubDate>Tue, 06 Oct 2015 02:02:08 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Archives-of-Support-Questions/How-can-you-query-using-JSON-against-HDP/m-p/94894#M8164</guid>
      <dc:creator>bsaini</dc:creator>
      <dc:date>2015-10-06T02:02:08Z</dc:date>
    </item>
  </channel>
</rss>

