<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>question Downloading huge results from Hue in Support Questions</title>
    <link>https://community.cloudera.com/t5/Support-Questions/Downloading-huge-results-from-Hue/m-p/24135#M23082</link>
    <description>&lt;P&gt;Hi,&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;If I run a query in Hue that returns a huge number of rows, is it possible to download them through the UI? I tried it using a Hive query and .csv; the download was successful, but it turned out the file had exactly&amp;nbsp;100000001 rows, while the actual result should be bigger. Is 100 million some kind of limit, and if so, could it be lifted?&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;I was also thinking about storing the results in HDFS and downloading them through the File Browser, but the problem is that when you click "save in HDFS", the whole query runs again from scratch, so effectively you need to run it twice (and I haven't checked whether the result would be stored as one file and whether Hue could download it).&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;In short, is such a use case possible in Hue?&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
    <pubDate>Fri, 01 Nov 2019 12:44:25 GMT</pubDate>
    <dc:creator>Kranach</dc:creator>
    <dc:date>2019-11-01T12:44:25Z</dc:date>
    <item>
      <title>Downloading huge results from Hue</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Downloading-huge-results-from-Hue/m-p/24135#M23082</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;If I run a query in Hue that returns a huge number of rows, is it possible to download them through the UI? I tried it using a Hive query and .csv; the download was successful, but it turned out the file had exactly&amp;nbsp;100000001 rows, while the actual result should be bigger. Is 100 million some kind of limit, and if so, could it be lifted?&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;I was also thinking about storing the results in HDFS and downloading them through the File Browser, but the problem is that when you click "save in HDFS", the whole query runs again from scratch, so effectively you need to run it twice (and I haven't checked whether the result would be stored as one file and whether Hue could download it).&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;In short, is such a use case possible in Hue?&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Fri, 01 Nov 2019 12:44:25 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Downloading-huge-results-from-Hue/m-p/24135#M23082</guid>
      <dc:creator>Kranach</dc:creator>
      <dc:date>2019-11-01T12:44:25Z</dc:date>
    </item>
    <item>
      <title>Re: Downloading huge results from Hue</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Downloading-huge-results-from-Hue/m-p/24136#M23083</link>
      <description>&lt;P&gt;Erratum: the file had only 1 million lines, not 100 million.&lt;/P&gt;</description>
      <pubDate>Wed, 28 Jan 2015 13:36:23 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Downloading-huge-results-from-Hue/m-p/24136#M23083</guid>
      <dc:creator>Kranach</dc:creator>
      <dc:date>2015-01-28T13:36:23Z</dc:date>
    </item>
    <item>
      <title>Re: Downloading huge results from Hue</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Downloading-huge-results-from-Hue/m-p/24144#M23084</link>
      <description>This is &lt;A target="_blank" href="https://issues.cloudera.org/browse/HUE-2142"&gt;https://issues.cloudera.org/browse/HUE-2142&lt;/A&gt;&lt;BR /&gt;&lt;BR /&gt;In short, right now Hue will not perform well for downloading and streaming a lot of data to a browser, as it is not designed for that.&lt;BR /&gt;&lt;BR /&gt;</description>
      <pubDate>Wed, 28 Jan 2015 14:51:36 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Downloading-huge-results-from-Hue/m-p/24144#M23084</guid>
      <dc:creator>Romainr</dc:creator>
      <dc:date>2015-01-28T14:51:36Z</dc:date>
    </item>
    <item>
      <title>Re: Downloading huge results from Hue</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Downloading-huge-results-from-Hue/m-p/24148#M23085</link>
      <description>&lt;P&gt;But I don't need to see that data in a browser; I just want to download it to my PC...&lt;/P&gt;</description>
      <pubDate>Wed, 28 Jan 2015 15:05:41 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Downloading-huge-results-from-Hue/m-p/24148#M23085</guid>
      <dc:creator>Kranach</dc:creator>
      <dc:date>2015-01-28T15:05:41Z</dc:date>
    </item>
    <item>
      <title>Re: Downloading huge results from Hue</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Downloading-huge-results-from-Hue/m-p/24151#M23086</link>
      <description>The web server is sending it to your browser, and a web server is supposed to just send some web pages.&lt;BR /&gt;&lt;BR /&gt;</description>
      <pubDate>Wed, 28 Jan 2015 15:14:36 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Downloading-huge-results-from-Hue/m-p/24151#M23086</guid>
      <dc:creator>Romainr</dc:creator>
      <dc:date>2015-01-28T15:14:36Z</dc:date>
    </item>
    <item>
      <title>Re: Downloading huge results from Hue</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Downloading-huge-results-from-Hue/m-p/24153#M23087</link>
      <description>&lt;P&gt;I can download gigs of data from Google Drive or file hosting websites using my browser, so why wouldn't it be possible here?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;This means my only alternative is to tell users to install Hive and run something like&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;beeline -u jdbc:hive2://bla:10000 -n user -p password -f yourscript.q &amp;gt; yourresults.txt&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;which is a bit crap... (not to mention that until Hive 13 beeline doesn't report any progress on the operation). Or let them log in to my server directly and wreak havoc there &lt;span class="lia-unicode-emoji" title=":confused_face:"&gt;😕&lt;/span&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;All that Hue gives you already is awesome, but it needs to do more!&lt;/P&gt;</description>
      <pubDate>Wed, 28 Jan 2015 15:28:24 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Downloading-huge-results-from-Hue/m-p/24153#M23087</guid>
      <dc:creator>Kranach</dc:creator>
      <dc:date>2015-01-28T15:28:24Z</dc:date>
    </item>
    <item>
      <title>Re: Downloading huge results from Hue</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Downloading-huge-results-from-Hue/m-p/24154#M23088</link>
      <description>Please read the above JIRA for more details. Hue is only one lightweight Python server. Google, Dropbox, etc. have tens of servers dedicated to serving files rather than web pages (the download happens from another machine).&lt;BR /&gt;&lt;BR /&gt;In Hue 4 we will very probably introduce some new types of Hue servers that will take care of this part.&lt;BR /&gt;&lt;BR /&gt;Romain&lt;BR /&gt;&lt;BR /&gt;</description>
      <pubDate>Wed, 28 Jan 2015 15:35:36 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Downloading-huge-results-from-Hue/m-p/24154#M23088</guid>
      <dc:creator>Romainr</dc:creator>
      <dc:date>2015-01-28T15:35:36Z</dc:date>
    </item>
    <item>
      <title>Re: Downloading huge results from Hue</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Downloading-huge-results-from-Hue/m-p/24162#M23089</link>
      <description>&lt;P&gt;I see. Maybe then there should also be an option like "execute and save to HDFS", where Hue doesn't dump the results to the browser but puts them directly in one file in HDFS, so the user can get it by other means? I recently managed to store results in HDFS and then download a 600 MB CSV file using Hue, and it kind of worked (9 million lines, a new record). However, a few minutes later the service went down (not sure if it was because of that, or because I had just started presenting Hue to my boss), so I'm not sure this would work.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I guess we are going to instruct users to always use a LIMIT clause in their queries, telling them that this is to avoid overloading our servers (which is technically true).&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thanks for your help!&lt;/P&gt;</description>
      <pubDate>Wed, 28 Jan 2015 19:23:14 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Downloading-huge-results-from-Hue/m-p/24162#M23089</guid>
      <dc:creator>Kranach</dc:creator>
      <dc:date>2015-01-28T19:23:14Z</dc:date>
    </item>
    <item>
      <title>Re: Downloading huge results from Hue</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Downloading-huge-results-from-Hue/m-p/24174#M23090</link>
      <description>Hue has the option to save the results to HDFS, and it is very scalable: Hive does the writing to HDFS, and downloading from HDFS afterwards does not require much computation from Hue.&lt;BR /&gt;&lt;BR /&gt;But it does indeed re-execute the SQL with the INSERT INTO /... or CREATE TABLE AS SELECT ...&lt;BR /&gt;&lt;BR /&gt;Neither Hive nor Impala offers a way to both show the data on the Hue screen and make it easy to download.&lt;BR /&gt;&lt;BR /&gt;In the next version we should have some optimizations that should make it more stable to download or to bump the limit.&lt;BR /&gt;&lt;BR /&gt;In Hue 4, which is a big version, we will tackle this, as it would require a new twin server.&lt;BR /&gt;&lt;BR /&gt;So for now, for large result sets we recommend redoing the query and downloading directly from HDFS, rather than bumping the 'download_row_limit' limit.&lt;BR /&gt;&lt;BR /&gt;Romain&lt;BR /&gt;&lt;BR /&gt;</description>
      <pubDate>Wed, 28 Jan 2015 22:49:36 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Downloading-huge-results-from-Hue/m-p/24174#M23090</guid>
      <dc:creator>Romainr</dc:creator>
      <dc:date>2015-01-28T22:49:36Z</dc:date>
    </item>
    <item>
      <title>Re: Downloading huge results from Hue</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Downloading-huge-results-from-Hue/m-p/24437#M23091</link>
      <description>&lt;P&gt;Got it. We will go this way. Ironically, it turned out that due to some regulatory stuff, downloading raw data from our system shouldn't be too easy, so... we are going for the good old 'it's not a bug, it's a feature' &lt;span class="lia-unicode-emoji" title=":winking_face:"&gt;😉&lt;/span&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;FYI, I also tried this:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;beeline -u jdbc:hive2://hname:10000 -n bla -p bla -f query.q &amp;gt; results.txt&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;but it didn't do much, it just hung. Maybe hive2 (or beeline?) isn't powerful enough either.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thanks for all the clarifications!&lt;/P&gt;</description>
      <pubDate>Fri, 06 Feb 2015 21:40:39 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Downloading-huge-results-from-Hue/m-p/24437#M23091</guid>
      <dc:creator>Kranach</dc:creator>
      <dc:date>2015-02-06T21:40:39Z</dc:date>
    </item>
    <item>
      <title>Re: Downloading huge results from Hue</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Downloading-huge-results-from-Hue/m-p/24445#M23092</link>
      <description>Thanks for the info &lt;span class="lia-unicode-emoji" title=":slightly_smiling_face:"&gt;🙂&lt;/span&gt;&lt;BR /&gt;&lt;BR /&gt;</description>
      <pubDate>Sat, 07 Feb 2015 01:56:07 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Downloading-huge-results-from-Hue/m-p/24445#M23092</guid>
      <dc:creator>Romainr</dc:creator>
      <dc:date>2015-02-07T01:56:07Z</dc:date>
    </item>
    <item>
      <title>Re: Downloading huge results from Hue</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Downloading-huge-results-from-Hue/m-p/42869#M23093</link>
      <description>&lt;P&gt;I think that the best approach to solve this issue in Hue is:&lt;BR /&gt;&lt;BR /&gt;- Create an external table that stores the data in TEXT format&lt;BR /&gt;- Load/insert the data that you want to download into it&lt;BR /&gt;- Go to the File Browser and browse to the location of that external table&lt;BR /&gt;- Download the files inside that folder&lt;BR /&gt;&lt;BR /&gt;Best, Leandro&lt;/P&gt;</description>
      <pubDate>Thu, 14 Jul 2016 20:58:50 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Downloading-huge-results-from-Hue/m-p/42869#M23093</guid>
      <dc:creator>leandroMora</dc:creator>
      <dc:date>2016-07-14T20:58:50Z</dc:date>
    </item>
    <item>
      <title>Re: Downloading huge results from Hue</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Downloading-huge-results-from-Hue/m-p/281571#M209447</link>
      <description>&lt;P&gt;Could you please give an example for this:&lt;/P&gt;&lt;P&gt;- Create an external table that stores the data in TEXT format&lt;BR /&gt;- Load/insert the data that you want to download into it&lt;BR /&gt;- Go to the File Browser and browse to the location of that external table&lt;BR /&gt;- Download the files inside that folder&lt;/P&gt;</description>
      <pubDate>Tue, 29 Oct 2019 17:08:31 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Downloading-huge-results-from-Hue/m-p/281571#M209447</guid>
      <dc:creator>SumanD</dc:creator>
      <dc:date>2019-10-29T17:08:31Z</dc:date>
    </item>
    <item>
      <title>Re: Downloading huge results from Hue</title>
      <link>https://community.cloudera.com/t5/Support-Questions/Downloading-huge-results-from-Hue/m-p/281572#M209448</link>
      <description>&lt;P&gt;&lt;SPAN&gt;Can you please let us know how to do this to get an Excel/CSV file for the query SELECT * FROM test?&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;beeline -u jdbc:hive2://bla:10000 -n user -p password -f yourscript.q &amp;gt; yourresults.txt&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Tue, 29 Oct 2019 17:10:23 GMT</pubDate>
      <guid>https://community.cloudera.com/t5/Support-Questions/Downloading-huge-results-from-Hue/m-p/281572#M209448</guid>
      <dc:creator>SumanD</dc:creator>
      <dc:date>2019-10-29T17:10:23Z</dc:date>
    </item>
  </channel>
</rss>

