Support Questions
Find answers, ask questions, and share your expertise
Announcements
Alert: Welcome to the Unified Cloudera Community. Former HCC members be sure to read and learn how to activate your account here.

Merge Points data with GEOJSON in Hive

Solved Go to solution

Merge Points data with GEOJSON in Hive

New Contributor

I have Points and shapefile converted to GEOJSON data uploaded in Hive. I'm doing visualization in Tableau with these files, but to do that I need to join both of the data.

The problem is the Points data don't have any common key if I want to do merging with the shapefile data.

I've done this with sample data in my desktop. The only think I know in order to merge the data, I need to assign the attribute of the shapefile to each of the points, and then do the merging based on the attributes.

I did it in QGIS, using Point Sampling Tools to get the attributes from the shapefile and then join the shapefile and the Points + attribute data in Tableau with common key (i.e. Region).

But since we are talking about big data here, around 19mio rows of data, I'm wondering if we can do the join in Hive?

Anyone that can help giving me ideas will be highly appreciated. Many thanks!

1 ACCEPTED SOLUTION

Accepted Solutions

Re: Merge Points data with GEOJSON in Hive

Hey @Melody S!
I'm not a specialist in GEO data, but, this link may serve to you:
https://community.hortonworks.com/articles/5129/geospatial-data-analysis-in-hadoop.html

Hope this helps! :)

2 REPLIES 2

Re: Merge Points data with GEOJSON in Hive

Hey @Melody S!
I'm not a specialist in GEO data, but, this link may serve to you:
https://community.hortonworks.com/articles/5129/geospatial-data-analysis-in-hadoop.html

Hope this helps! :)

Highlighted

Re: Merge Points data with GEOJSON in Hive

New Contributor

Tthank you! I'll take a look at it.