I'm looking to remove duplicates across several Excel spreadsheets. Each spreadsheet has 10-20 unique headers (categories) and many rows (entries). Taking one of the sheets as an example we can see that row 1 and row 200 are duplicate entries but they are only partially duplicate, row 200 may have some blank cells and there are headers such as date of entry, time of entry and unique identifier that would make this entry appear unique across those headers.
Is there a way to take into consideration all of the headers and identify which entries are duplicates based on the similarity? Say row 1 and 200 have entries that are duplicate for 15/20 of the headers but unique for 5/20 but they are still duplicate entries and therefore need to be flagged/removed.
Is there a way to take into consideration all of the headers and identify which entries are duplicates based on the similarity? Say row 1 and 200 have entries that are duplicate for 15/20 of the headers but unique for 5/20 but they are still duplicate entries and therefore need to be flagged/removed.