Compare two large lists of data

Peter_Alan

New Member
Joined
Aug 23, 2011
Messages
7
Every week I have a report generated and handed to me with more than 60,000 entries. I need to isolate which items are different than the week before. At the moment I have a macro that opens the previous weeks report, deletes all but four columns then copies that sheet in the book the macro is running from. The same is repeated for the most current report. The sheets are named "old" and "new" respectively.

I thought the simplest way to determine which items were new would be to have a formula in (for example) E2 =A2 & B2 & C2 & D2 down the whole column on both sheets. The only way to tell an item has been changed is to review all four column entries.

Up to this point excel does great, it isn't really slow.

My problem arises when I try to run a =match() with E2 against column E from the "old" sheet. Excel doesn't handle it so well. Any suggestions on how to do a quick column comparison?

The idea, after the column had ran the match function, is I could do a filter and delete out the items that had a numerical value returned. Returning unique items only.
 

Excel Facts

Formula for Yesterday
Name Manager, New Name. Yesterday =TODAY()-1. OK. Then, use =YESTERDAY in any cell. Tomorrow could be =TODAY()+1.
Your method is sound, I don't see the problem.

In sheet2, in F2, this formula:

=MATCH(E2, Sheet1!$E:$E, 0)

...then copied down will give you numbers and errors.

Highlight column F, press F5, select Special > Formulas > Numbers (uncheck the rest) > OK and you've selected all the matches. Delete those rows.
 
Upvote 0
jbeaucaire, for the match function, excel is basically reviewing 60,000 items, in 60,000 items, which is 3,600,000,000 checks and then it does it again for another page. Excel hasn't made it through them all yet.
 
Upvote 0
jbeaucaire, for the match function, excel is basically reviewing 60,000 items, in 60,000 items, which is 3,600,000,000 checks and then it does it again for another page. Excel hasn't made it through them all yet.

OH SNAP!!! :eeek: Good call....I didn't even think about that one....hmmmm.....
 
Upvote 0
Yeah, on my computer it took about 3 minutes. You do have a large dataset, you do want the comparison to be done, so...
 
Upvote 0

Forum statistics

Threads
1,224,603
Messages
6,179,855
Members
452,948
Latest member
UsmanAli786

We've detected that you are using an adblocker.

We have a great community of people providing Excel help here, but the hosting costs are enormous. You can help keep this site running by allowing ads on MrExcel.com.
Allow Ads at MrExcel

Which adblocker are you using?

Disable AdBlock

Follow these easy steps to disable AdBlock

1)Click on the icon in the browser’s toolbar.
2)Click on the icon in the browser’s toolbar.
2)Click on the "Pause on this site" option.
Go back

Disable AdBlock Plus

Follow these easy steps to disable AdBlock Plus

1)Click on the icon in the browser’s toolbar.
2)Click on the toggle to disable it for "mrexcel.com".
Go back

Disable uBlock Origin

Follow these easy steps to disable uBlock Origin

1)Click on the icon in the browser’s toolbar.
2)Click on the "Power" button.
3)Click on the "Refresh" button.
Go back

Disable uBlock

Follow these easy steps to disable uBlock

1)Click on the icon in the browser’s toolbar.
2)Click on the "Power" button.
3)Click on the "Refresh" button.
Go back
Back
Top