Created on 03-17-2017 05:31 AM - edited 09-16-2022 04:16 AM
How can we implement fuzzy matching without Spark? We want to match fields such as addresses.
Created 03-21-2017 02:35 PM
If you are ruling Spark out, then:
- Look at Hive UDFs for fuzzy matching.
- Look at Solr and its fuzzy search support.
There are a number of articles on those subjects.
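To give a rough idea of what a fuzzy-matching UDF typically computes, here is a minimal sketch in plain Python (not Hive code). It normalizes two address strings and scores them by Levenshtein edit distance. The function names, the abbreviation table, and the 0.85 threshold are all illustrative assumptions, not taken from any particular library. Hive 1.2.0 and later also ship a built-in `levenshtein(string A, string B)` function, which may be enough for simple cases.

```python
import re

# Illustrative abbreviation table (an assumption, not a standard list).
ABBREVIATIONS = {"street": "st", "avenue": "ave", "road": "rd", "apartment": "apt"}


def normalize(address: str) -> str:
    """Lowercase, replace punctuation with spaces, and shorten common words."""
    tokens = re.sub(r"[^a-z0-9\s]", " ", address.lower()).split()
    return " ".join(ABBREVIATIONS.get(t, t) for t in tokens)


def levenshtein(a: str, b: str) -> int:
    """Dynamic-programming edit distance (insert, delete, substitute)."""
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(
                previous[j] + 1,         # delete ca
                current[j - 1] + 1,      # insert cb
                previous[j - 1] + cost,  # substitute (or keep when equal)
            ))
        previous = current
    return previous[-1]


def similarity(a: str, b: str) -> float:
    """Edit distance scaled to [0, 1]; 1.0 means identical after normalization."""
    na, nb = normalize(a), normalize(b)
    longest = max(len(na), len(nb))
    return 1.0 if longest == 0 else 1.0 - levenshtein(na, nb) / longest


def is_match(a: str, b: str, threshold: float = 0.85) -> bool:
    """Treat two addresses as the same when their similarity clears the threshold."""
    return similarity(a, b) >= threshold
```

For example, `is_match("123 Main Street", "123 Maine St")` compares `"123 main st"` with `"123 maine st"` after normalization. The threshold would need tuning against your own data, and comparing every pair of rows gets expensive quickly, so in practice you would usually restrict comparisons to rows that share a key such as a postal code.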
Created 03-22-2017 05:01 AM
Thanks @Graham Martin. If possible, could you please share a document or link that shows how to do this?
Created 04-21-2017 11:45 AM
Hi all, are there any reusable UDFs for implementing fuzzy address matching in Hive?
Created 11-02-2018 10:59 AM
Here are a couple of publicly available Git repos with Hive UDFs for fuzzy matching: