how to implement fuzzy matching without spark. We want to match fields like Address etc.
If you are ruling Spark out, then:
- Look at Hive UDFs for Fuzzy Matching.
- Look at using Solr, and Solr search for Fuzzy Matching
There are a number of articles on those subjects.
Thanks @Graham Martin. If possible can you please provide any document or link to achieve the same.
Hi All, Is there any reusable UDF's for implement fuzzy address match in HIVE.
Here are a couple publicly available Git Repos for fuzzy matching Hive UDFs: