Support Questions

Find answers, ask questions, and share your expertise
Announcements
Celebrating as our community reaches 100,000 members! Thank you!

Merge Points data with GEOJSON in Hive

avatar
New Contributor

I have Points and shapefile converted to GEOJSON data uploaded in Hive. I'm doing visualization in Tableau with these files, but to do that I need to join both of the data.

The problem is the Points data don't have any common key if I want to do merging with the shapefile data.

I've done this with sample data in my desktop. The only think I know in order to merge the data, I need to assign the attribute of the shapefile to each of the points, and then do the merging based on the attributes.

I did it in QGIS, using Point Sampling Tools to get the attributes from the shapefile and then join the shapefile and the Points + attribute data in Tableau with common key (i.e. Region).

But since we are talking about big data here, around 19mio rows of data, I'm wondering if we can do the join in Hive?

Anyone that can help giving me ideas will be highly appreciated. Many thanks!

1 ACCEPTED SOLUTION

avatar

Hey @Melody S!
I'm not a specialist in GEO data, but, this link may serve to you:
https://community.hortonworks.com/articles/5129/geospatial-data-analysis-in-hadoop.html

Hope this helps! 🙂

View solution in original post

2 REPLIES 2

avatar

Hey @Melody S!
I'm not a specialist in GEO data, but, this link may serve to you:
https://community.hortonworks.com/articles/5129/geospatial-data-analysis-in-hadoop.html

Hope this helps! 🙂

avatar
New Contributor

Tthank you! I'll take a look at it.