Member since
03-31-2017
8
Posts
1
Kudos Received
0
Solutions
05-02-2017
07:24 AM
For submit Hive, Pig job in Azure HDInsight, I am using .Net SDK. After successfully output of Pig Job ,I used Pig output file to load into Hive table. Hive job executed successfully but Pig output is in BAG format , so its store data in Hive table like "(" , ")" characters. Means First and Last column of Hive table having these characters. I want to remove this characters from Hive table. Should I have to change the Pig output format or Is this anyway to remove above character from Hive Table ? Please give solution. Thank You.
... View more
Labels:
- Labels:
-
Apache Hive
-
Apache Pig
04-26-2017
10:19 AM
Thanks for reply.
It will really help me.
By mistake I wrote MapReduce Job, I should have to use HiveJob, PigJob, SqoopJob.
Thanks again.
I just go through Oozie. I didn't find exact Link for Oozie.
If I write a Pig script and than want to transfer those data in Hive. Then using Sqoop, export this data to SQL Server. How to connect all these process using Oozie.
Can you provide some reference Link?
... View more
04-26-2017
07:39 AM
1 Kudo
I have data in HDFS(Azure HDInsight) in csv format. I am using Pig to process this Data. After processing the Summarise data will be stored in Hive. And then Hive table is exported in RDBMS using Sqoop. Now I need to automate all this process. Is this possible that I will write particular method for all these 3 task in MapReduce, then run this MapReduce job, and all these task execute one by one. For create MapReduce job , I want to use .Net SDK. So my question is this possible, and if YES than suggest some steps and reference link for this Question.
Thank You.
... View more
Labels:
- Labels:
-
Apache Hadoop
-
Apache Hive
-
Apache Pig
04-18-2017
01:20 PM
Thanks for Reply. I was thinking like this, but confused. Now its clear.
Thanks again.
... View more
04-17-2017
02:17 PM
In Capacity Scheduler, How Resource Manage decide which Job runs on which Queue?
... View more
Labels:
- Labels:
-
Apache YARN