New Contributor
Posts: 4
Registered: ‎09-13-2017

External Hive table for the oracle .dmp file

I am unable to create a Hive external table over the Oracle .dmp files that we received from the Oracle team, who generated them in Data Pump format.

CREATE EXTERNAL TABLE table_name
ROW FORMAT
   SERDE 'oracle.hadoop.hive.datapump.DPSerDe'
STORED AS
   INPUTFORMAT  'oracle.hadoop.hive.datapump.DPInputFormat'
   OUTPUTFORMAT 'org.apache.hadoop.hive.ql.io.HiveIgnoreKeyTextOutputFormat'
LOCATION 'path/to/HDFS/directory';

Error:

FAILED: SemanticException Cannot find class 'oracle.hadoop.hive.datapump.DPInput

So, basically, I need an external table in Hive over the provided Oracle .dmp files, which carry all the table data and metadata with them.

Posts: 388
Topics: 11
Kudos: 60
Solutions: 34
Registered: ‎09-02-2016

Re: External Hive table for the oracle .dmp file


@maduri

 

Oracle provides Big Data Appliance (BDA) software. If you install BDA in your Hadoop environment, it will support the following input format. I am not sure whether BDA is available for free.

 

INPUTFORMAT  'oracle.hadoop.hive.datapump.DPInputFormat'
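The "Cannot find class" error in the question suggests that the Oracle classes are simply not on Hive's classpath. As a rough sketch only, assuming the Oracle software is installed and ships the oracle.hadoop.hive.datapump classes in a jar, registering it for the session might look like the following. The jar name orahivedp.jar and its path are assumptions and will differ per installation; the DDL is the one from the question.

```sql
-- Hypothetical jar name and path: point this at whichever Oracle jar
-- actually contains the oracle.hadoop.hive.datapump.* classes on your cluster.
ADD JAR /path/to/oracle/jlib/orahivedp.jar;

-- Same DDL as in the question. No column list is given here,
-- so the columns would have to come from the SerDe.
CREATE EXTERNAL TABLE table_name
ROW FORMAT
   SERDE 'oracle.hadoop.hive.datapump.DPSerDe'
STORED AS
   INPUTFORMAT  'oracle.hadoop.hive.datapump.DPInputFormat'
   OUTPUTFORMAT 'org.apache.hadoop.hive.ql.io.HiveIgnoreKeyTextOutputFormat'
LOCATION 'path/to/HDFS/directory';

-- If the table is created, this shows which columns Hive ended up with.
DESCRIBE table_name;
```

ADD JAR only affects the current Hive session, so it is a low-risk way to check whether the missing class is the only problem before changing any cluster-wide configuration.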

 

Note: Unless you have proper knowledge of BDA, I would not recommend installing it in your Hadoop environment.

 

Maybe you can use a custom input format, as described in the link below, or some other alternative.

 

https://analyticsanvil.wordpress.com/2016/03/06/creating-a-custom-hive-input-format-and-record-reade...

 

By default, Hive supports only the file formats listed at the link below (depending on your Hive version).

 

https://cwiki.apache.org/confluence/display/Hive/FileFormats

New Contributor
Posts: 4
Registered: ‎09-13-2017

Re: External Hive table for the oracle .dmp file

Thanks for your reply. In the first link you provided, the number of columns in the data file is known, so they can be specified when creating the external table. In this case, however, I have no idea what kind of data resides in the .dmp file.