Churchy LaFemme
Board Regular
- Joined
- Sep 22, 2010
- Messages
- 135
Not sure how to phrase this. Have reviewed prior questions but most seem to be about matching known text.
I have a product column with almost 100,000 entries. Some are straight up duplicates (e.g., "Home Movies") but many included duplicpated text as part of the product name (e.g., "Home Movies Superstars").
I need a formula (although it's not advisable to use a formula on that many rows) that will say TRUE for any line for which the text in column A includes a text string that matches any other cell or cells in column A.
<tbody>
</tbody><colgroup><col><col></colgroup>
I have a product column with almost 100,000 entries. Some are straight up duplicates (e.g., "Home Movies") but many included duplicpated text as part of the product name (e.g., "Home Movies Superstars").
I need a formula (although it's not advisable to use a formula on that many rows) that will say TRUE for any line for which the text in column A includes a text string that matches any other cell or cells in column A.
Shows | Formula Results |
The Oddball Comedy & Curiosity Festival featuring Flight of the Conchords | TRUE |
A Very Conshords Christmas | TRUE |
Behind the Conchords | TRUE |
Dr. Katz Bikini Beach Party | FALSE |
Flight of the Conchords, Pilot | FALSE |
Flight of the Conchords, Season 2 | TRUE |
Flight of the Conchords, Season 3 | TRUE |
Flight of the Conchords: Censored Tracks | TRUE |
Home Movies: Brendon Gets Rabies | TRUE |
Home Movies: Brendon's Hat | TRUE |
Home Movies: Get Away From My Mom | TRUE |
Home Movies: Yoko | TRUE |
Rhys Darby - Conchords Exposed! | TRUE |
Science Court Outakes and Bloopers | FALSE |
Short Poppies | FALSE |
The Black Seeds (Debut) The Mighty Boosh | FALSE |
In this example, both the extact phrase "Conchords" and "Home Movies" have a "TRUE" result. This is fine. My goal at this point is to upload into Access and remove everything that is "FALSE." Trimming won't help because the duplications may not be the first x characters in the cell. (X is probably 5, but I can play with this if that's too short.) I have done this in the past by trimming both from left and from right and looking for duplicates, but this was a pain for a few thousand lines. For tens of thousands, a manual process would kill me. Any ideas? | FALSE |
<tbody>
</tbody><colgroup><col><col></colgroup>