**** Update: This has been fixed in Ambari 2.5 (released April 4, 2017). See below for more details. ****
This has been observed in Ambari 2.4 using HDP 2.5.0/HDP 22.214.171.124-5.
Customer is having the issue and I am able to replicate it.
Using the Ambari Hive view, I'm running a simple select query. By using tpcds data, it is easy to spot the problem since the customer table has an auto-number row_id.
select c_customer_sk, c_customer_id, c_last_name from customer limit 204;
The query runs successfully & the results page in the browser every 50 rows.
The same pattern is displayed when using "Save Results" -> "Download as CSV" - every 101st row is skipped.
In Beeline when I run the same query - no issues. All rows are displayed.
Have attempted to set the following values in Ambari but they don't affect the results:
Thanks in advance for the assistance.
Hi @Ciarán Porter. There is a hotfix required for this issue. In the meantime you can use the beeline cli and send the output to a csv format. A good explanation is found here: https://community.hortonworks.com/questions/25789/how-to-dump-the-output-from-beeline.html
Update: This has been fixed in Ambari 2.5 (released April 4th, 2017). An added bonus is that Ambari 2.5 includes the new Hive View 2.0 - a vastly improved GUI for developing Hive apps. See attached picture: