Support Questions

Find answers, ask questions, and share your expertise

how to implement fuzzy matching without spark

how to implement fuzzy matching without spark. We want to match fields like Address etc.

4 REPLIES 4

Expert Contributor

@Gaurav Jain

If you are ruling Spark out, then:

- Look at Hive UDFs for Fuzzy Matching.

- Look at using Solr, and Solr search for Fuzzy Matching

There are a number of articles on those subjects.

Thanks @Graham Martin. If possible can you please provide any document or link to achieve the same.

Hi All, Is there any reusable UDF's for implement fuzzy address match in HIVE.

Cloudera Employee

Here are a couple publicly available Git Repos for fuzzy matching Hive UDFs:

https://github.com/ychantit/fuzzymatch_hiveUDF

https://github.com/rueedlinger/hive-udf