Filtering for duplicates in large data set, having crashing issues

Excelnoob97

New Member
Joined
May 25, 2020
Messages
3
Office Version
  1. 365
Platform
  1. Windows
I’m trying to remove the unique values from a large data set so im just left with the duplicates. I’ve tried filtering by color but it crashes excel every time. I’ve tried several things but none of them have worked. If anyone could offer some help I’d greatly appreciate it. The file has over 300000 rows that have to be checked for duplicates and filtered.
 

Excel Facts

Best way to learn Power Query?
Read M is for (Data) Monkey book by Ken Puls and Miguel Escobar. It is the complete guide to Power Query.
maybe try Power Query
Rich (BB code):
let
    Source = Excel.CurrentWorkbook(){[Name="Table1"]}[Content],
    Duplicates = let columnNames = {"raw"}, addCount = Table.Group(Source, columnNames, {{"Count", Table.RowCount, type number}}), selectDuplicates = Table.SelectRows(addCount, each [Count] > 1), removeCount = Table.RemoveColumns(selectDuplicates, "Count") in Table.Join(Source, columnNames, removeCount, columnNames, JoinKind.Inner)
in
    Duplicates
it will remove unique and you will stay with duplicates only

post an example of your data with real structure
 
Upvote 0
maybe try Power Query
Rich (BB code):
let
    Source = Excel.CurrentWorkbook(){[Name="Table1"]}[Content],
    Duplicates = let columnNames = {"raw"}, addCount = Table.Group(Source, columnNames, {{"Count", Table.RowCount, type number}}), selectDuplicates = Table.SelectRows(addCount, each [Count] > 1), removeCount = Table.RemoveColumns(selectDuplicates, "Count") in Table.Join(Source, columnNames, removeCount, columnNames, JoinKind.Inner)
in
    Duplicates
it will remove unique and you will stay with duplicates only

post an example of your data with real structure
First NameMiddle NameLast NameCompany NameStreet 1 Street 2
7092 Hwy 6 N
7092 Hwy 6 N
50 Town and Country Blvd #710
2000 Willowbrook D
50 Town and Country Blvd #710
 
Upvote 0
Its something like that but i only need to find duplicates within the Street 1/Address column.
 
Upvote 0
First NameMiddle NameLast NameCompany NameStreet 1Street 2
7092 Hwy 6 N
7092 Hwy 6 N
50 Town and Country Blvd #710
2000 Willowbrook D
50 Town and Country Blvd #710
Street 1
7092 Hwy 6 N
7092 Hwy 6 N
50 Town and Country Blvd #710
50 Town and Country Blvd #710


Rich (BB code):
let
    Source = Excel.CurrentWorkbook(){[Name="Table5"]}[Content],
    Duplicates = let columnNames = {"Street 1"}, addCount = Table.Group(Source, columnNames, {{"Count", Table.RowCount, type number}}), selectDuplicates = Table.SelectRows(addCount, each [Count] > 1), removeCount = Table.RemoveColumns(selectDuplicates, "Count") in Table.Join(Source, columnNames, removeCount, columnNames, JoinKind.Inner),
    TSC = Table.SelectColumns(Duplicates,{"Street 1"})
in
    TSC
 
Last edited:
Upvote 0

Forum statistics

Threads
1,215,054
Messages
6,122,895
Members
449,097
Latest member
dbomb1414

We've detected that you are using an adblocker.

We have a great community of people providing Excel help here, but the hosting costs are enormous. You can help keep this site running by allowing ads on MrExcel.com.
Allow Ads at MrExcel

Which adblocker are you using?

Disable AdBlock

Follow these easy steps to disable AdBlock

1)Click on the icon in the browser’s toolbar.
2)Click on the icon in the browser’s toolbar.
2)Click on the "Pause on this site" option.
Go back

Disable AdBlock Plus

Follow these easy steps to disable AdBlock Plus

1)Click on the icon in the browser’s toolbar.
2)Click on the toggle to disable it for "mrexcel.com".
Go back

Disable uBlock Origin

Follow these easy steps to disable uBlock Origin

1)Click on the icon in the browser’s toolbar.
2)Click on the "Power" button.
3)Click on the "Refresh" button.
Go back

Disable uBlock

Follow these easy steps to disable uBlock

1)Click on the icon in the browser’s toolbar.
2)Click on the "Power" button.
3)Click on the "Refresh" button.
Go back
Back
Top