First off this is only in the planning stages so I don't need any definitive answers yet. I'm sure this is quite a common task to perform so I'm just asking for advise at the moment.
I have about 150,000 stock records and I need to somehow remove duplicates.
This would be easy but they will not be exact duplicates. For instance
Circuit Breaker CZ-123456 should match
CB Main CZ-123456 and
Circuit Breaker 220V 12A 123456
Anyone come across this sort of thing before? Any advise welcome
I have about 150,000 stock records and I need to somehow remove duplicates.
This would be easy but they will not be exact duplicates. For instance
Circuit Breaker CZ-123456 should match
CB Main CZ-123456 and
Circuit Breaker 220V 12A 123456
Anyone come across this sort of thing before? Any advise welcome