Member since
10-18-2017
4
Posts
0
Kudos Received
0
Solutions
11-16-2017
03:27 PM
I am creating report on top of Hive jdbc connection. Below is the sql generating in reporting and it throws below error while executing. Can somebody please help me on this. Sales Amt column is double. Error: [Simba][HiveJDBCDriver](500051) ERROR processing query/statement. Error Code: 40000, SQL state: TStatus(statusCode:ERROR_STATUS, infoMessages:[*org.apache.hive.service.cli.HiveSQLException:Error while compiling statement: FAILED: ParseException line 1:189 extraneous input '_SYNTHJOIN_a22915c4___Table__3sales_amt_' expecting ) near '<EOF>':28:27, SQL: SELECT
Table__1.`chain_lvl2_node_nbr`,
Table__2.`sale_tkt_yr`, sum(Table__3.`sales_amt`) FROM
`poc`.`cust_subj_area` Table__1 JOIN `poc`.`pos_partition_yr_pd_full_new` Table__3 ON (Table__3.`dw_cust_id`=Table__1.`dw_cust_id`)
JOIN `poc`.`cao_time_subj_area` Table__2 ON (Table__2.`sale_tkt_dly_dt`=Table__3.`sls_dt`)
WHERE
Table__1.`chain_lvl2_node_nbr` = '1'
GROUP BY
Table__1.`chain_lvl2_node_nbr`,
Table__2.`sale_tkt_yr`
... View more
Labels:
10-20-2017
04:06 PM
Thanks @Sindhu. My goal is to test the query performance QA (4 data nodes) vs PROD (8 data nodes). let's say i have a SELECT HQL which gets result in say 90 secs. in QA (~1 billion records) . Can we assume that in PROD , the same HIVE table with same number of records going to fetch ~40-50% better than QA?
... View more
10-19-2017
09:19 PM
Is it possible to load data to just 2 data nodes instead of distributing across data nodes. Thanks
... View more
Labels:
10-18-2017
07:28 PM
I am fetching data from Hive tables from Teradata SQL assistant using TD query grid. Its taking around 30 to 40 sec. extra compared to direct querying in HIVE. Is there anyways to improve the fetching speed from Hadoop to TD. Sample SQL :
SELECT TOP 20 * FROM poc.POS_TABLE@hadoop_POS WHEREYR='2016';
The above sql is taking 1 second in HIVE but around 30-40 sec from Teradata.
... View more
Labels: