FuzzyLookup Errors

ExcelMac720

New Member
Joined
Sep 10, 2021
Messages
2
Office Version
  1. 365
  2. 2019
Platform
  1. Windows
Hey Everyone,

I have two list, each of which contain close to 10,000 companies and various information about both. There are duplicates in both List A and List B, however, only one match is needed from List B. I matched the companies from List A (the more important list) to List B, using FuzzyLookup with the parameters of '1 match' and a similarity threshold of '.85'. The connections it made are fairly accurate. However, upon further review there are many obvious matches it did not make. I've tried other similarity thresholds with the remaining companies from List A to the original List B. The results become much less accurate and the obvious matches were still not made. Does anyone have any suggestions on how to fix this? Or have I exasperated all of my options?

Thanks
 

Excel Facts

Select all contiguous cells
Pressing Ctrl+* (asterisk) will select the "current region" - all contiguous cells in all directions.
Hey Everyone,

I have two list, each of which contain close to 10,000 companies and various information about both. There are duplicates in both List A and List B, however, only one match is needed from List B. I matched the companies from List A (the more important list) to List B, using FuzzyLookup with the parameters of '1 match' and a similarity threshold of '.85'. The connections it made are fairly accurate. However, upon further review there are many obvious matches it did not make. I've tried other similarity thresholds with the remaining companies from List A to the original List B. The results become much less accurate and the obvious matches were still not made. Does anyone have any suggestions on how to fix this? Or have I exasperated all of my options?

Thanks
Can you share a sample data
 
Upvote 0
Are you using the Add-In or Power Query ?
Unfortunately the process is still a best first pass and it will need additional validation or scrutiny.
The Add-In has the advantage of being able to output the probability match as a column, so you can just set a lower probability and then eyeball which is the best match.

Power Query, I think they suggest setting a lower match figure and then selecting that you only want one result and it will return the match with the higher result.
It also lets you build a helper table to include in the Fuzzy Lookup so you can put your identified matches in that helper table, to help whittle down the unmatched.

If both lists have something like and ABN (australian business registration no) or country equivalent or address or email address you can see if that works better in a fuzzy match
 
Upvote 0

Forum statistics

Threads
1,214,848
Messages
6,121,914
Members
449,054
Latest member
luca142

We've detected that you are using an adblocker.

We have a great community of people providing Excel help here, but the hosting costs are enormous. You can help keep this site running by allowing ads on MrExcel.com.
Allow Ads at MrExcel

Which adblocker are you using?

Disable AdBlock

Follow these easy steps to disable AdBlock

1)Click on the icon in the browser’s toolbar.
2)Click on the icon in the browser’s toolbar.
2)Click on the "Pause on this site" option.
Go back

Disable AdBlock Plus

Follow these easy steps to disable AdBlock Plus

1)Click on the icon in the browser’s toolbar.
2)Click on the toggle to disable it for "mrexcel.com".
Go back

Disable uBlock Origin

Follow these easy steps to disable uBlock Origin

1)Click on the icon in the browser’s toolbar.
2)Click on the "Power" button.
3)Click on the "Refresh" button.
Go back

Disable uBlock

Follow these easy steps to disable uBlock

1)Click on the icon in the browser’s toolbar.
2)Click on the "Power" button.
3)Click on the "Refresh" button.
Go back
Back
Top