Member since
03-23-2015
1288
Posts
114
Kudos Received
98
Solutions
My Accepted Solutions
Title | Views | Posted |
---|---|---|
3344 | 06-11-2020 02:45 PM | |
5045 | 05-01-2020 12:23 AM | |
2851 | 04-21-2020 03:38 PM | |
3561 | 04-14-2020 12:26 AM | |
2347 | 02-27-2020 05:51 PM |
10-08-2017
03:14 AM
Glad that I am helpful here.
... View more
10-08-2017
02:52 AM
Hi, Full Hive documentation regarding table creation can be found here: https://cwiki.apache.org/confluence/display/Hive/LanguageManual+DDL I think that should be enough for you to start with.
... View more
10-08-2017
02:51 AM
It is CSV, so I assume that it is "," delimited? You will need to tell that to Hive: CREATE TABLE IF NOT EXISTS Auto_Insurance_Claims_US (Customer String,Country String,StateCode String,State String,ClaimAmount Float,Response String,Coverage String,Education String,EffectiveToDate String,EmploymentStatus String,Gender String,Income String,LocationCode String,MaritalStatus String,MonthlyPremiumAuto String,MonthsSinceLastClaim String,MonthsSincePolicyInception String,NumberOfOpenComplaints Int,NumberOfPolicies Int,PolicyType String,Policy String,ClaimReason String,SalesChannel String,TotalClaimAmount Float,VehicleClass String,VehicleSize String) ROW FORMAT DELIMITED FIELDS TERMINATED BY "," STORED AS TEXTFILE; Try and see if it works for you.
... View more
10-04-2017
03:38 AM
1 Kudo
Hi, The first one "Hive Metastore Connection Timeout" is the one you should try. If you look closely, "hive.metastore.client.socket.timeout" is just underneath it. Regarding the JIRA ID, it is an internal JIRA, so you do not have access to it. Another way is to setup a quick script to drop partition in batches and and then drop the table after number of partitions have reduced to a reasonable level.
... View more
10-02-2017
04:01 PM
1 Kudo
Hi, This is a known issue and we have an internal JIRA to track it. Current there is no better way to improve the performance. You can, however, to increase the timeout limit via "hive.metastore.client.socket.timeout", and set it to long time value (in seconds) to allow the query to finish. It should finish eventually, but might take sometime. Our engineers are still discussing internally to see the best fix for the issue, however, it is on going and we do not have solution yet at this stage.
... View more
09-20-2017
04:53 AM
1 Kudo
Impala won't be able to create gzip compression format for text file. Please refer to below documentation: https://www.cloudera.com/documentation/enterprise/latest/topics/impala_file_formats.html It mentioned below: For text format, if LZO compression is used, you must create the table and load data in Hive. If other kinds of compression are used, you must load data through LOAD DATA, Hive, or manually in HDFS. So the short answer is that you can't do it in Impala.
... View more
09-20-2017
04:26 AM
Maybe you are after this JIRA: https://issues.cloudera.org/browse/HUE-1450? I don't think Hue currently supports it through API call, and you might have to import through Hue UI again.
... View more
09-20-2017
04:17 AM
Have you tried to verify that you can telnet to sqlserver host from current host on port 1433? The error message pretty much indicated that your client host is not able to communicate with server host, it could be a network issue, or some firewall settings.
... View more
09-19-2017
12:05 AM
1 Kudo
Currently Hue is the only tool I know that provides full list of access to the Hadoop ecosystem for end users. Regarding tools for Data Scientists, have you tried to explore Cloudera Data Science Workbench (CDSW)?
... View more
09-18-2017
04:06 AM
You can't modify workflow setting outside of Hue, including job.properties and workflow.xml, as Hue will re-generate them when you submit the workflow again. If you want to stick with Hue, I do not think it will work. But let me check for you if Hue has such feature on the road map.
... View more