Created 07-15-2017 10:45 AM
I am a beginner with oracle BDA and Cloudera ,after installing BDA to my cluster all other services like zookeper , hive went to halt , if I disable the BDA all of the services started working. Please help
Created 07-15-2017 08:53 PM
Have you tried to configure Oracle BDA on an existing cluster ? (or) created new cluster using mammoth (bdacli) commands ?
If your answer is yes for my first question, what are the steps you have followed to enable/disable Oracle BDA on an existing cluster?
Created 07-16-2017 02:09 AM
Created 07-16-2017 08:47 AM
From your details, my understanding is that you have Cloudera manager configured already and trying to setup Oracle BDA on top of that? is my understanding true? if so, if CM is already working then what is the point of setting up BDA on top of it?
Created 07-16-2017 10:40 AM
Hi,
Yes its an existing cloudera and its working , but here have a problem we need to move some 20 tb data from oracle RDBMS system to Hadoop and we dont want to use sqoop or any kind of such technology , so we want to read directly from oracle data pump file and there is only one serde available and which comes as part of BDA . so we want the BDA.
Created 07-16-2017 11:33 AM
If your requirement is to access 20 tb data from Oracle to hive/impala. And for only this purpose if you are trying to configure Oracle BDA then I don't think this is a right choice. Because BDA is not meant for that purpose and it will control your entire cluster. So just for one db/table you cannot switch your cluster to BDA, if so, you may get into different issues in future (ex: you may get error when upgrade your environment). So either configure your cluster via BDA (or) without BDA but not both (this is my understanding).
I understand your requirement that you are looking for a serde which is available with BDA. Did you check the serde availability anywhere in github?
Ex: use the below link only for understanding... it shows how to download, install & enable a library to read and write in JSON format from Hive. Similar to that you have to download your library from github or somewhere else to meet your requirement instead of configure BDA