Hi,
I'm trying to find a formula or UDF code that will allow me to compare name cells from one year (~30,000) records to another year. I am aware of vlookup and fuzzylookup add in, etc., and they all work well to an extent. My data is really ugly with some names being entered as firstname lastname and others as lastname firstname with no way for me to actually know which is which in any particular record with confidence. I'd like to find a way for a formula or code that would recognize John Davis and Davis John as the same person (and return a true or 1 or something like that) while it would not recognize Jane Davis as the same person (and would return false or 0). I found a Similarity UDF that works for many cases, but it does not perform well for simple name reversals (Davis John vs. John Davis). It will score that around a .5 while it will score John M. Davis vs. John F. Davis as .9 or so yet those are two different people. Having a way to filter out simple reversals will help reduce the number I have to manually review. Any insight would be appreciated.
Thanks!
John Davis
I'm trying to find a formula or UDF code that will allow me to compare name cells from one year (~30,000) records to another year. I am aware of vlookup and fuzzylookup add in, etc., and they all work well to an extent. My data is really ugly with some names being entered as firstname lastname and others as lastname firstname with no way for me to actually know which is which in any particular record with confidence. I'd like to find a way for a formula or code that would recognize John Davis and Davis John as the same person (and return a true or 1 or something like that) while it would not recognize Jane Davis as the same person (and would return false or 0). I found a Similarity UDF that works for many cases, but it does not perform well for simple name reversals (Davis John vs. John Davis). It will score that around a .5 while it will score John M. Davis vs. John F. Davis as .9 or so yet those are two different people. Having a way to filter out simple reversals will help reduce the number I have to manually review. Any insight would be appreciated.
Thanks!
John Davis