Cross-referencing a large set of categories pair by pair

crackinthekraken

New Member
Joined
Mar 12, 2019
Messages
1
Hey guys, let's say you had to compare a very large group of categories, each with a set of defined traits. The goal is to compare every category against every other category and identify the pairs that are MOST SIMILAR in terms of their trait set. That is, all those category pairs that are mostly identical, with less than 5 traits don't match in between them. Is there a way to have Excel spit out a list of all the mostly-identical pairs?

I thought of the following steps we could do through if we were to do this manually, but with a large set of categories (67, so 67*67 is a REALLY big number) the number quickly becomes unmanageable.
  1. List out every pair of categories in adjacent columns.
  2. Use conditional formatting to shade unique values (traits) amongst each pair.
  3. Use Excel's Count feature to identify the number of shaded values in each pair.
  4. Identify and plot all the counts, filter by those with a count less than or equal to 5.
  5. Bonus points if we can list out every discrepant trait for each pair.


I've pasted an extract of a sample category list so you can get a better picture. There are 4 categories here, so we would we taking Category 1 (ASC OR TECH_NA) and comparing it against Category 2 (IP VIEW ONLY WITH TREATMENT TEAM) and then Category 3 (IP CPOE VIEW ONLY) and so on.

Category NameASC OR TECH_NAIP VIEW ONLY WITH TREATMENT TEAMIP CPOE VIEW ONLYIP VIEW ONLY
TraitsBuild Custom DocumentsBuild Custom DocumentsBuild Custom DocumentsBuild Custom Documents
Patient SummaryChart ReviewCare PlanCharge Capture - Enter/Edit
Documentation FlowsheetChart SearchChart ReviewCharge Capture - File
Order ReviewCreate Patient Staff RelationshipChart SearchChart Review
View Only DemographicsPatient ListsEducation RecordChart Search
Order EntryPatient SummaryPatient ListsPatient Lists
Event LogResults ReviewPatient SummaryPatient Summary
Edit ResultsView Only DemographicsResults ReviewResults Review
View Only DemographicsView Only Demographics

<tbody>
</tbody>
 

Excel Facts

What is the fastest way to copy a formula?
If A2:A50000 contain data. Enter a formula in B2. Select B2. Double-click the Fill Handle and Excel will shoot the formula down to B50000.

Forum statistics

Threads
1,215,059
Messages
6,122,918
Members
449,093
Latest member
dbomb1414

We've detected that you are using an adblocker.

We have a great community of people providing Excel help here, but the hosting costs are enormous. You can help keep this site running by allowing ads on MrExcel.com.
Allow Ads at MrExcel

Which adblocker are you using?

Disable AdBlock

Follow these easy steps to disable AdBlock

1)Click on the icon in the browser’s toolbar.
2)Click on the icon in the browser’s toolbar.
2)Click on the "Pause on this site" option.
Go back

Disable AdBlock Plus

Follow these easy steps to disable AdBlock Plus

1)Click on the icon in the browser’s toolbar.
2)Click on the toggle to disable it for "mrexcel.com".
Go back

Disable uBlock Origin

Follow these easy steps to disable uBlock Origin

1)Click on the icon in the browser’s toolbar.
2)Click on the "Power" button.
3)Click on the "Refresh" button.
Go back

Disable uBlock

Follow these easy steps to disable uBlock

1)Click on the icon in the browser’s toolbar.
2)Click on the "Power" button.
3)Click on the "Refresh" button.
Go back
Back
Top