About ekoifman

ekoifman · ‎12-26-2018

You can use the Export Table command https://cwiki.apache.org/confluence/display/Hive/LanguageManual+ImportExport#LanguageManualImportExport-ExportSyntax

ekoifman · ‎10-10-2018

This may work beeline> !run ///C:/ScriptFile

ekoifman · ‎09-12-2018

desc formatted <table> <column> https://cwiki.apache.org/confluence/display/Hive/StatsDev#StatsDev-Examples

ekoifman · ‎08-20-2018

hive.merge.cardinality.check=false is a bad idea. The logic controlled by this property checks if the ON clause of your Merge statement is such that more than 1 row from source side matches the same row from target side (which only happens in WHEN MATCHED clause). Logically what this means is that the query is asking the system to update 1 existing row in target in 2 (or more) different ways. This check is actually part of SQL standard definition of how Merge should work. You either need examine your data or the ON clause but disabling this check, when it throws a cardinality_violation error, may lead to data corruption later.

ekoifman · ‎08-17-2018

When you do SHOW COMPACTIONS, if compaction MR job was submitted, it will show Hadoop Job ID, which can be used to get more info if the problem is with the job in the Resource Manager UI. If it failed even before submitting the job to the cluster, the errors would be in the log of the standalone Hive Metastore running the compactor processes.

ekoifman · ‎08-09-2018

https://cwiki.apache.org/confluence/display/Hive/LanguageManual+SortBy

ekoifman · ‎04-11-2018

hive.support.concurrency property enables locking. When a queries is shutdown its locks should be released immediately. When dies abruptly it may leave locks behind. These will be cleaned up by a background process running from a standalone Hive metastore process. This process will consider locks abandoned if they have not heartbeated for (by default) 5 minutes. Metastore logfile should have entries from AcidHouseKeeperService - that is the clean up process.

ekoifman · ‎01-04-2018

Not generally. The data layout for transactional tables requires special logic to decide which directories to read and how to combine them correctly. Some data files may represent updates of previously written rows, for example. Also, if you are reading while something is writing to this table your read may fail (w/o the special logic) because it will try to read incomplete ORC files. Compaction may (again w/o the special logic) may make it look like your data is duplicated.

ekoifman · ‎01-03-2018

Spark doesn't support reading Hive Acid tables directly. (https://issues.apache.org/jira/browse/SPARK-15348/SPARK-16996) It can be done (WIP) via LLAP - tracked in https://issues.apache.org/jira/browse/HIVE-12991

ekoifman · ‎11-07-2017

That's not possible w/o rewriting data.

Online	Offline
Last Visited	‎01-02-2019 10:23 PM

Member Since	‎12-09-2015 05:12 PM
Last Visited	‎01-02-2019 10:23 PM
Posts	106
Kudos received	40

Cloudera Community

Re: Hive 3 export single ORC file

Re: Hive Compaction error

Re: What is difference between Distributed by,clus...

Re: Hive Transactional Tables are not readable by ...

Re: Updating the bucketted Hive table

Re: Hive 3 export single ORC file

Re: Hive upgrade HDP3 and Beeline

Re: Viewing Hive Column or Table level Statistics

Re: Hive - Merge command throwing error message

Re: Hive Compaction error

Re: What is difference between Distributed by,clus...

Re: What is causing Hive tables to become locked?

Re: Hive Transactional Tables are not readable by ...

Re: Hive Transactional Tables are not readable by ...

Re: Updating the bucketted Hive table