Remove duplicates (including original) based on a conditional?

scootsy · Mar 23, 2018

I need to find and remove all instances of duplicate values including the original, if any of the duplicates are status 1. I can't just find and delete all status 1 entries, because any names repeated that are at status 2 would need to be deleted as well. Would it be easier to search all status 1's, take the relative reference name and delete all instances of that? or is there a simpler way?

For example:

Column A Column B

Status	Name
2	A
2	B
1	B
2	C
2	C
1	D
2	D

Should become:

Status	Name
2	A
2	C
2	C

Eric W · Mar 23, 2018

Welcome to the Board!

You can do this with a couple of array formulas:

	A	B	D	E
1	Status	Name	Status	Name
2	2	A	2	A
3	2	B	2	C
4	1	B	2	C
5	2	C
6	2	C
7	1	D
8	2	D

Sheet10

Array Formulas

Cell	Formula
D2	{=IFERROR(INDEX($A$2:$A$8,SMALL(IF(COUNTIFS($A$2:$A$8,1,$B$2:$B$8,$B$2:$B$8)=0,ROW($B$2:$B$8)-ROW($B$2)+1),ROWS($D$2:$D2))),"")}
E2	{=IFERROR(INDEX($B$2:$B$8,SMALL(IF(COUNTIFS($A$2:$A$8,1,$B$2:$B$8,$B$2:$B$8)=0,ROW($B$2:$B$8)-ROW($B$2)+1),ROWS($D$2:$D2))),"")}

Entered with Ctrl+Shift+Enter. If entered correctly, Excel will surround with curly braces {}.
Note: Do not try and enter the {} manually yourself

Depending on your layout and size of data, a macro might be worth considering too.

Peter_SSs · Mar 23, 2018

Welcome to the MrExcel board!

Your requirement is not clear to me. If the very first status value in your sample data was 1 instead of 2, would that first row appear in the required result or not?
I ask because Eric's formula would remove that row but to me it isn't a duplicate so shouldn't be removed.

If my interpretation above is correct, then perhaps you could consider this manual method.
1. Formula in C2 is copied down.
2. Use AutoFilter to filter column C for TRUE values.
3. When filtered, delete all the rows below the header.
4. Delete column C entirely

Excel Workbook

A

B

C

1

Status

Name

Check

2

A

FALSE

3

2

B

TRUE

4

1

B

TRUE

5

2

C

FALSE

6

2

C

FALSE

7

1

D

TRUE

8

2

D

TRUE

Delete Dupes

Aladin Akyurek · Mar 24, 2018

I think the point Peter makes is sensible...

Row\Col	A	B	C	D	E	F
1	Status	Name		3
2	1	A		Idx	Status	Name
3	2	B		1	1	A
4	1	B		4	2	C
5	2	C		5	2	C
6	2	C
7	1	D
8	2	D

In D1 control+shift+enter, not just enter:

=SUM(IF(ISNA(MATCH(B2:B8,IF(COUNTIFS(B2:B8,IF(A2:A8=1,B2:B8))>1,B2:B8),0)),1))

In D3 control+shift+enter, not just enter, and copy down:

=IF(ROWS($D$3:D3)>$D$1,"",SMALL(IF(ISNA(MATCH($B$2:$B$8,IF(COUNTIFS($B$2:$B$8,IF($A$2:$A$8=1,$B$2:$B$8))>1,$B$2:$B$8),0)),ROW($A$2:$A$8)-ROW($A$2)+1),ROWS($D$3:D3)))

In E3 just enter, copy across to F3, and down:

=IF($D3="","",INDEX(A$2:A$8,$D3))

Remove duplicates (including original) based on a conditional?

scootsy

New Member

Excel Facts

Eric W

MrExcel MVP

Peter_SSs

MrExcel MVP, Moderator

Aladin Akyurek

MrExcel MVP

Similar threads

Forum statistics

Share this page

Remove duplicates (including original) based on a conditional?

scootsy

New Member

Excel Facts

Eric W

MrExcel MVP

Peter_SSs

MrExcel MVP, Moderator

Aladin Akyurek

MrExcel MVP

Similar threads

Forum statistics

Share this page

We've detected that you are using an adblocker.

Which adblocker are you using?

Disable AdBlock

Disable AdBlock Plus

Disable uBlock Origin

Disable uBlock