Hi everyone, I'm using a function where the total count is a different result to what I get when manually filtering using the same criteria. This is a sample of the dataset:
The goal is to count the unique number of individuals (using the email address as the unique identifier) who participated in either Startup Arabic OR Startup English. All individuals in this sample dataset have participated (denoted by "Yes" in column C). In the full dataset, there is the possibility of "No" in column C to denote those who didn't attend. The full dataset contains about 25,000 rows. When I use the below formula, I get a result of 1,272:
=IFERROR(ROWS(UNIQUE(FILTER('All Session Data'!$G:$G,IFERROR(FIND("Startup",'All Session Data'!$B:$B)>0,0)*('All Session Data'!$C:$C="Yes")),0)))
However, when I manually filter the data, I get a result of just 1,127. To filter, I first filter column B to include Startup Arabic and Startup English. Then, I filter column C to be "Yes", and finally, I remove the duplicates from the resulting list of email addresses.
I can't figure out why manually filtering versus using the above formula is yielding such a different result. If anyone could give some advice as to what might be wrong with the formula, I'd really appreciate it.
Session 1 | Startup Arabic | Yes | sun@gmail.com | |||
Session 1 | Startup Arabic | Yes | stars@gmail.com | |||
Session 1 | Startup Arabic | Yes | earth@gmail.com | |||
Session 1 | Startup Arabic | Yes | mars@gmail.com | |||
Session 2 | Startup Arabic | Yes | sun@gmail.com | |||
Session 2 | Startup Arabic | Yes | jupiter@gmail.com | |||
Session 2 | Startup Arabic | Yes | pluto@gmail.com | |||
Session 1 | Startup English | Yes | pluto@gmail.com | |||
Session 2 | Startup English | Yes | mercury@gmail.com |
The goal is to count the unique number of individuals (using the email address as the unique identifier) who participated in either Startup Arabic OR Startup English. All individuals in this sample dataset have participated (denoted by "Yes" in column C). In the full dataset, there is the possibility of "No" in column C to denote those who didn't attend. The full dataset contains about 25,000 rows. When I use the below formula, I get a result of 1,272:
=IFERROR(ROWS(UNIQUE(FILTER('All Session Data'!$G:$G,IFERROR(FIND("Startup",'All Session Data'!$B:$B)>0,0)*('All Session Data'!$C:$C="Yes")),0)))
However, when I manually filter the data, I get a result of just 1,127. To filter, I first filter column B to include Startup Arabic and Startup English. Then, I filter column C to be "Yes", and finally, I remove the duplicates from the resulting list of email addresses.
I can't figure out why manually filtering versus using the above formula is yielding such a different result. If anyone could give some advice as to what might be wrong with the formula, I'd really appreciate it.