About EricL

EricL · ‎10-08-2017

Glad that I am helpful here.

EricL · ‎10-08-2017

Hi, the full Hive documentation on table creation can be found here: https://cwiki.apache.org/confluence/display/Hive/LanguageManual+DDL I think that should be enough for you to start with.

EricL · ‎10-08-2017

It is CSV, so I assume that it is "," delimited? You will need to tell Hive that:

CREATE TABLE IF NOT EXISTS Auto_Insurance_Claims_US (
  Customer String,
  Country String,
  StateCode String,
  State String,
  ClaimAmount Float,
  Response String,
  Coverage String,
  Education String,
  EffectiveToDate String,
  EmploymentStatus String,
  Gender String,
  Income String,
  LocationCode String,
  MaritalStatus String,
  MonthlyPremiumAuto String,
  MonthsSinceLastClaim String,
  MonthsSincePolicyInception String,
  NumberOfOpenComplaints Int,
  NumberOfPolicies Int,
  PolicyType String,
  Policy String,
  ClaimReason String,
  SalesChannel String,
  TotalClaimAmount Float,
  VehicleClass String,
  VehicleSize String
)
ROW FORMAT DELIMITED FIELDS TERMINATED BY ","
STORED AS TEXTFILE;

Try it and see if it works for you.
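
To see why the delimiter matters, here is a small Python sketch (not Hive itself). It relies on the fact that Hive's default field delimiter for delimited text tables is the Ctrl-A character (\x01). The sample row is made up for illustration, and `split_row` is a hypothetical helper, not part of any Hive API.

```python
# Hive's default field delimiter for delimited text tables is Ctrl-A (\x01).
HIVE_DEFAULT_DELIMITER = "\x01"


def split_row(line, delimiter):
    """Split one text line into fields the way a simple delimited reader would."""
    return line.rstrip("\n").split(delimiter)


# Made-up sample row, for illustration only.
sample = "AB12345,US,KS,Kansas,276.35\n"

# Without FIELDS TERMINATED BY ",": the line contains no Ctrl-A,
# so the whole line ends up in the first column.
default_fields = split_row(sample, HIVE_DEFAULT_DELIMITER)

# With FIELDS TERMINATED BY ",": each value gets its own column.
comma_fields = split_row(sample, ",")

print(len(default_fields), len(comma_fields))
```

Note that this plain split, like ROW FORMAT DELIMITED, does not handle quoted values that contain commas.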

EricL · ‎10-04-2017

Hi, the first one, "Hive Metastore Connection Timeout", is the one you should try. If you look closely, "hive.metastore.client.socket.timeout" is just underneath it. Regarding the JIRA ID, it is an internal JIRA, so you do not have access to it. Another way is to set up a quick script to drop partitions in batches, and then drop the table once the number of partitions has been reduced to a reasonable level.
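
A minimal sketch of such a batching script, in Python. It only generates the ALTER TABLE ... DROP PARTITION statements; how you obtain the partition list and run the statements is up to you. The table name `big_table`, the partition column `dt`, and the helper names are hypothetical, and partition values are assumed to be plain strings without quotes in them.

```python
def batched(items, size):
    """Yield consecutive slices of `items` with at most `size` elements."""
    if size < 1:
        raise ValueError("size must be at least 1")
    for start in range(0, len(items), size):
        yield items[start:start + size]


def drop_partition_statements(table, partitions, batch_size):
    """Build one ALTER TABLE ... DROP PARTITION statement per batch.

    `partitions` is a list of dicts mapping partition column to value,
    e.g. {"dt": "2017-01-01"}. Values are emitted as quoted strings.
    """
    statements = []
    for batch in batched(partitions, batch_size):
        specs = []
        for part in batch:
            pairs = ", ".join(f"{col}='{val}'" for col, val in part.items())
            specs.append(f"PARTITION ({pairs})")
        statements.append(
            f"ALTER TABLE {table} DROP IF EXISTS {', '.join(specs)};"
        )
    return statements


# Hypothetical example: five daily partitions, dropped two at a time.
parts = [{"dt": f"2017-01-{day:02d}"} for day in range(1, 6)]
statements = drop_partition_statements("big_table", parts, batch_size=2)
for stmt in statements:
    print(stmt)
```

The generated statements could be saved to a file and run one batch at a time (for example with beeline -f), so that each call to the metastore stays small enough to finish within the timeout.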

EricL · ‎10-02-2017

Hi, this is a known issue and we have an internal JIRA to track it. Currently there is no better way to improve the performance. You can, however, increase the timeout limit via "hive.metastore.client.socket.timeout" and set it to a large value (in seconds) to allow the query to finish. It should finish eventually, but it might take some time. Our engineers are still discussing the best fix for the issue internally; that work is ongoing and we do not have a solution at this stage.

EricL · ‎09-20-2017

Impala cannot write gzip-compressed text files. Please refer to the documentation below: https://www.cloudera.com/documentation/enterprise/latest/topics/impala_file_formats.html It states: "For text format, if LZO compression is used, you must create the table and load data in Hive. If other kinds of compression are used, you must load data through LOAD DATA, Hive, or manually in HDFS." So the short answer is that you can't do it in Impala.

EricL · ‎09-20-2017

Maybe you are after this JIRA: https://issues.cloudera.org/browse/HUE-1450? I don't think Hue currently supports it through an API call, so you might have to import through the Hue UI again.

EricL · ‎09-20-2017

Have you verified that you can telnet from the current host to the SQL Server host on port 1433? The error message indicates that your client host is not able to communicate with the server host. It could be a network issue or a firewall setting.
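
If telnet is not installed on the client host, the same check can be sketched in Python with the standard socket module. This is only a reachability test under stated assumptions: `can_connect` is a hypothetical helper and the host name in the usage comment is a placeholder for your SQL Server host.

```python
import socket


def can_connect(host, port, timeout=5.0):
    """Return True if a TCP connection to host:port succeeds within timeout."""
    try:
        # create_connection resolves the host and opens a TCP connection.
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and name resolution errors.
        return False


# Usage (placeholder host name, replace with your own):
# print(can_connect("sqlserver.example.com", 1433))
```

A False result from the host that runs the Sqoop job points to the network or firewall rather than to Sqoop itself.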

EricL · ‎09-19-2017

Currently Hue is the only tool I know of that gives end users access to the full Hadoop ecosystem. Regarding tools for data scientists, have you explored Cloudera Data Science Workbench (CDSW)?

EricL · ‎09-18-2017

You can't modify workflow settings outside of Hue, including job.properties and workflow.xml, as Hue will regenerate them when you submit the workflow again. If you want to stick with Hue, I do not think it will work. But let me check for you whether Hue has such a feature on the roadmap.

Last Visited	‎08-12-2020 03:17 AM

Member Since	‎03-23-2015 01:24 PM
Posts	1,288
Kudos received	113

Cloudera Community

Re: max() function generating an error in sqoop

Re: Add a dynamic variable to a Hive view

Re: Hive Server 2 failing to start CDP ,Cloudera M...

Re: Sqoop export from hive to teradata - > issue ...

Re: Cloudera Hadoop internal workings

Re: Can not ALTER or DROP a big partitionned table...

Re: How to create Hive table from file? Any tutori...

Re: Why all data is loaded to first column of Hive...

Re: Can not ALTER or DROP a big partitionned table...

Re: Can not ALTER or DROP a big partitionned table...

Re: Inserting to text table compressed

Re: How i can import workflow by command line?

Re: Sqoop import fail : TCP/IP Connection refused

Re: Hadoop Web Interface for data scientists or en...

Re: Externalize properties for Oozie workflows