Member since
06-11-2016
22
Posts
1
Kudos Received
0
Solutions
07-01-2016
05:27 PM
Hi, I have a job which needs me to pull JSON file from a Hive table. After calling the file, there are business logic(calculations) which needs to be done on the file. Once the process is done the result needs to be captured in a JSON fle and store it back in Hive table. After processing (in the code) for every ID taken in there will 100 -to- 5000 records generated. Which needs to be taken in JSON File and inserted back in Hive.To accomplish the above task will writing a Python script be beneficial or a Hive UDF(Java code)?
Business wants it to be done in Hive. Any help or suggestion is highly appreciated.
... View more
Labels:
- Labels:
-
Apache Hive
06-19-2016
07:05 AM
@Dileep Kumar Chiguruvada Thanks a lot for sharing the article.The same was also suggested by @Sunile Manjee . Hive streaming is not possible in my case. So I am going the standard ways as of now. Thanks for your help.
... View more
06-19-2016
07:03 AM
@Sunile Manjee
No doubt the article was helpful in expanding the knowledge base but in my case its not feasible to use it.
As of now, I am getting the things done via standard ways not streaming. Thanks for your help.
... View more
06-16-2016
05:47 AM
@Rajkumar Singh
I have already tried that and getting NULL as the output.
Is there something I am doing wrong or is there other way round for getting the result?
... View more
06-16-2016
03:03 AM
@Paul Boal
This is what I was planning to do but after brainstorming. It was realized that there will be performance issue(s) seeing the future flow and volume of data. How about using Spark Dataframe for this purpose? It would be really helpful if I can get some insight about it too!
... View more
06-16-2016
02:57 AM
Thanks Dileep. The document is really helpful in increasing the knowledge base.
... View more
06-15-2016
03:41 AM
Thanks Sunile for guiding me on this.
Is there any case study available in this regard or something that can be helpful?
I have just started and this is my first time with Hive and related technologies/ ecosystems. .
Would really appreciate if you can guide further or point me towards right channel in this perspective.
... View more
06-15-2016
01:01 AM
I have a problem scenario :-
1. Need to capture ID and corresponding URL from a table in Teradata
2. Access the URL ---> this will open a JSON file and need to capture certain fields from the file
3. From the existing file need to access another URL ---> this will open another JSON file and capture some more fields from it
4. Finally need to load the captured fields/ entities in a Hive table
I was guessing whether this could be achieved plainly with the help of HiveQL or do I need to write a UDF for this?
Any suggestion or guidance is appreciated or if there are some case studies available then please let me know.
... View more
Labels:
- Labels:
-
Apache Hive
06-14-2016
04:34 PM
I have updated my question with the sample JSON data.
... View more
06-14-2016
06:20 AM
Thanks Rajkumar that was silly mistake.
Now when I am executing the below query :-
SELECT GET_JSON_OBJECT(Id.json,'$.Object.resource.Object.entries.Array.Object.payload.Object.Id') FROM Id;
OK
NULL
Time taken: 0.341 seconds, Fetched: 1 row(s)
hive>
Why there is Null and no values? Any suggestion please?
... View more
- « Previous
-
- 1
- 2
- Next »