I have an email list of customers and a list of companies we want to exclude. The email list has almost 8,000 entries so it's an overwhelming project to do manually. I run fuzzy lookup and it is skipping some obvious (to me) matches but matching others that have a similar structure/complexity (Threshold 85). I have double-checked spelling, spaces, etc. Below is a rough example of my results. If I decrease the Similarity Threshold (70), it matches the missing ones but also matches thousands of additional customers.
Table 1 | Table 2 | Fuzzy Lookup Similarity | |
Bayshore Health | Bayshore Health | 1.0000 | |
Bayshore Health- Main campus | Ashland Hospital | 0.0000 | |
Bayshore Health- Boca Raton 94th Street | | 0.0000 | Why No match!?!? |
Bayshore Health- SW (FKA Atlantic Imaging) | | 0.0000 | Why No match!?!? |
Bayshore Health- NW (FKA Radiology NW) | | 0.0000 | Why No match!?!? |
Bayshore Health- West | | 0.0000 | Why No match!?!? |
Bayshore Health- East | | 0.0000 | Why No match!?!? |
Bayshore Health- South | | 0.0000 | Why No match!?!? |
Bayshore Health- North | | 0.0000 | Why No match!?!? |
Bayshore HealthCare | | 0.0000 | Why No match!?!? |
Ashland Hospital East | | 0.9800 | |
Oregon Ashland Hospital Emergency Room | | 0.9200 | |
George Welch Family Ashland Hosp | | 0.9100 | |