Big table with long texts (>500chars/cell) eliminate duplicates create matrix

CsJHUN

Active Member
Joined
Jan 13, 2015
Messages
360
Office Version
  1. 365
  2. 2021
  3. 2019
Platform
  1. Windows
  2. Mobile
Hi guys,
I have a 26 column, 30k rows base table
30k rows belong to 106 doc
There is duplication in 30k rows (concatenate each column for each row) Removing duplication not work, always miss ~100 rows, and there is duplication within that 100.
I tried the built-in remove duplicates, used macro and used arrays to remove duplicated items. Looks like not working (11925 unique row) or the next step missing something
The concatenate cell lenght (as value) is over 500, some even over 1k character long
I want to create a matrix, with unique rows and docs on column.
Here is a sample from start
start
Itemdoc
a1
b1
c1
b2
c2
d2
a3
d3

(30k rows, 26 column)
And the result
Result
Item / Doc123
axx
bxx
cxx
dxx

(~11k row, 106 column)

step i tried:
- create the 30k rows from the 106 doc (each separate xlsx file)
- concatenate the data for each row (its a must, imho, thats makes it unique)
- create unique list of docs (106 pcs)
- create unique list of the 30k row. (~11k pcs)
- find the 30k row if its fit inside the 11k, which is not! some of the 30k has 0 result on 11k
- mark the 1st occurence of the 11k in each column of the 106 docs

Any advice is welcome.
Cheers
J
 

Excel Facts

Ambidextrous Undo
Undo last command with Ctrl+Z or Alt+Backspace. If you use the Undo icon in the QAT, open the drop-down arrow to undo up to 100 steps.

Forum statistics

Threads
1,214,975
Messages
6,122,538
Members
449,088
Latest member
RandomExceller01

We've detected that you are using an adblocker.

We have a great community of people providing Excel help here, but the hosting costs are enormous. You can help keep this site running by allowing ads on MrExcel.com.
Allow Ads at MrExcel

Which adblocker are you using?

Disable AdBlock

Follow these easy steps to disable AdBlock

1)Click on the icon in the browser’s toolbar.
2)Click on the icon in the browser’s toolbar.
2)Click on the "Pause on this site" option.
Go back

Disable AdBlock Plus

Follow these easy steps to disable AdBlock Plus

1)Click on the icon in the browser’s toolbar.
2)Click on the toggle to disable it for "mrexcel.com".
Go back

Disable uBlock Origin

Follow these easy steps to disable uBlock Origin

1)Click on the icon in the browser’s toolbar.
2)Click on the "Power" button.
3)Click on the "Refresh" button.
Go back

Disable uBlock

Follow these easy steps to disable uBlock

1)Click on the icon in the browser’s toolbar.
2)Click on the "Power" button.
3)Click on the "Refresh" button.
Go back
Back
Top