Count unique values, with a twist

brainiack189

New Member
Joined
Jun 18, 2017
Messages
4
Hi all,

I'm trying to use a formula to count the number of unique "blocks" of values. Below is an example.

Row #Column 1Column 2
11A
22B
32B
42B
52C
62C
73A
83A
93A
103B

<tbody>
</tbody>

In this example there are 3 blocks: Block one is rows 2-4, Block two is rows 5-6, Block 3 is rows 7-9. Basically whenever Columns 1 and 2 match and have more than 1 match counts as a block. So rows 1 and 10 do not count as blocks because there is no other row that matches them.

Is there a way to use a formula to do this? I have to do this for about 300,000 rows and it'd be nice to have the formula refresh when the number of rows changes. Thank you!
 

Excel Facts

Format cells as date
Select range and press Ctrl+Shift+3 to format cells as date. (Shift 3 is the # sign which sort of looks like a small calendar).
I think a formula will be slow; and a query faster. With your table named 'MyTable', use query

Code:
SELECT COUNT(*) AS [My Block Count]
FROM (SELECT  [Column 1],[Column 2], COUNT(*) AS MyCount
FROM MyTable
GROUP BY [Column 1],[Column 2]
HAVING COUNT(*) > 1)

Refresh as required.
 
Upvote 0
Fazza,

I tried your suggestion. It worked but i had to do a small adjustment since table MyTable was not visible in Microsoft Query even with Show System Tables checked (options). I think a named range would be visible but it does not automatically resize with new data.

Assuming:
File named TestQuery.xlsx in C:\Temp
The table in Sheet1
this worked for me

Code:
SELECT COUNT(*) AS [My Block Count]
FROM (SELECT  [Column 1],[Column 2], COUNT(*) AS MyCount
[COLOR=#0000ff]FROM `C:\Temp\TestQuery.xlsx`.`Sheet1$` `Sheet1$`[/COLOR]
GROUP BY [Column 1],[Column 2]
HAVING COUNT(*) > 1)

Query Result (new Workbook)

A
1
My Block Count​
2
3​
3

<tbody>
</tbody>


M.
 
Last edited:
Upvote 0
Yes, Marcelo

using a sheet name is better - as you explain, all rows will be picked up and this doesn't happen when rows vary and the basic named range is used.

note you should be able to use
FROM [Sheet1$]

instead of
FROM `C:\Temp\TestQuery.xlsx`.`Sheet1$` `Sheet1$`

I find it easier to explain in forum posts to use defined names but the worksheet name is better - so long as it doesn't change
 
Upvote 0
some of what I posted is unnecessary. can shorten to

SELECT COUNT(*) AS [My Block Count]
FROM (SELECT [Column 1],[Column 2]
FROM MyTable
GROUP BY [Column 1],[Column 2]
HAVING COUNT(*) > 1)
 
Upvote 0

Forum statistics

Threads
1,215,355
Messages
6,124,468
Members
449,163
Latest member
kshealy

We've detected that you are using an adblocker.

We have a great community of people providing Excel help here, but the hosting costs are enormous. You can help keep this site running by allowing ads on MrExcel.com.
Allow Ads at MrExcel

Which adblocker are you using?

Disable AdBlock

Follow these easy steps to disable AdBlock

1)Click on the icon in the browser’s toolbar.
2)Click on the icon in the browser’s toolbar.
2)Click on the "Pause on this site" option.
Go back

Disable AdBlock Plus

Follow these easy steps to disable AdBlock Plus

1)Click on the icon in the browser’s toolbar.
2)Click on the toggle to disable it for "mrexcel.com".
Go back

Disable uBlock Origin

Follow these easy steps to disable uBlock Origin

1)Click on the icon in the browser’s toolbar.
2)Click on the "Power" button.
3)Click on the "Refresh" button.
Go back

Disable uBlock

Follow these easy steps to disable uBlock

1)Click on the icon in the browser’s toolbar.
2)Click on the "Power" button.
3)Click on the "Refresh" button.
Go back
Back
Top