How to pull certain words from a long list of “messages” and count how many instances of them are mentioned?

uvarg002

New Member
Joined
May 20, 2022
Messages
2
Office Version
  1. 365
Platform
  1. Windows
The column (I) in question has 4 phrases in one cell (I2). I want to be able to grab the information in column (H) using the phrases from column I.

For example. The words say assignment update, required document, assignment needed, document assigned. I want to be able to see how many times each instance is mentioned throughout column H. Just the word, not the entire message.

I tried conditional formatting > highlight cell rules > text that contains. Then added a filter for the highlighted ones and copied them to a pivot table. However, the pivot tables display the full text and not an actual number of how many instances it has been mentioned.

Excel version: 365, 2021.
 

Excel Facts

Test for Multiple Conditions in IF?
Use AND(test, test, test, test) or OR(test, test, test, ...) as the logical_test argument of IF.
You didnt specify if you are interested in a specific list of words or all the possible words in the text; I'll assume this second option.

Your Office 365 should include the functions SPLIT and LET, thus:
-the combined phrases are in I2
-insert in K2 the formula =SPLIT(SUBSTITUTE(I2,",","")," ")
Note that I AM NOT sure that the name of the function is SPLIT, in english
This will list in K2 and down all the words available in cell I2, using "comma" or "space" as separator. If there are not enough free cells in K3 and down then you will have the #SPILL! error

Now insert in L2 the formula =LET(iTxt,SUBSTITUTE($I$2,","," ")&" ",cWd,K2&" ",(LEN(iTxt)-LEN(SUBSTITUTE(iTxt,cWd,"")))/LEN(cWd))
This will return how many times the word in K2 is available in the initial text; copy down as needed

In case that you are interested only in a certain list of words, then insert the words in K2 and down, rather then using the formula in K2

Try...
 
Upvote 0
You didnt specify if you are interested in a specific list of words or all the possible words in the text; I'll assume this second option.

Your Office 365 should include the functions SPLIT and LET, thus:
-the combined phrases are in I2
-insert in K2 the formula =SPLIT(SUBSTITUTE(I2,",","")," ")
Note that I AM NOT sure that the name of the function is SPLIT, in english
This will list in K2 and down all the words available in cell I2, using "comma" or "space" as separator. If there are not enough free cells in K3 and down then you will have the #SPILL! error

Now insert in L2 the formula =LET(iTxt,SUBSTITUTE($I$2,","," ")&" ",cWd,K2&" ",(LEN(iTxt)-LEN(SUBSTITUTE(iTxt,cWd,"")))/LEN(cWd))
This will return how many times the word in K2 is available in the initial text; copy down as needed

In case that you are interested only in a certain list of words, then insert the words in K2 and down, rather then using the formula in K2

Try...
Correct, I want all the possible words in the text. So "assignment update" I wanted to see how many times it was mentioned through column H. I thought a pivot table was needed but it's pulling out the entire text WITH the message.

The split and substitute does not seem to be working. I admit I am not sure what to fill in the the comma after the I2.
 
Upvote 0
I have Office 365, (Version 2204 Build 16.0.15128.20210), no Office Insider option (ie standard user) and SPLIT has been distribuited recently; LET is available since months.
If you have an English version, you gave to use the formula as suggested; if you use a different language then you need to translate the name of the Functions in your local language
 
Upvote 0
Let's forget about SPLIT, that could not yet be available in your Office (it depends on the updating channel)
-the combined phrases are in I2
-insert in K2 the formula
Excel Formula:
'=UNIQUE(LET(txt,SUBSTITUTE(I2,","," "),sep," ",( FILTERXML("<atag><btag>"&SUBSTITUTE(TRIM(RIGHT("00.00.",MAX(0,7-LEN(txt)))&txt),sep,"</btag><btag>")&"</btag></atag>","//btag"))))
This will list in K2 and down all the words available in cell I2, using "comma" or "space" as separator. Again, if there are not enough free cells in K3 and down then you will have the #SPILL! error

-set in L2 the same formula given before:
Excel Formula:
=LET(iTxt,SUBSTITUTE($I$2,","," ")&" ",cWd,K2&" ",(LEN(iTxt)-LEN(SUBSTITUTE(iTxt,cWd,"")))/LEN(cWd))

We use LET, that was released mid of 2021, and thus should be available even to users set on the semiannual update channel.
 
Upvote 0

Forum statistics

Threads
1,215,066
Messages
6,122,948
Members
449,095
Latest member
nmaske

We've detected that you are using an adblocker.

We have a great community of people providing Excel help here, but the hosting costs are enormous. You can help keep this site running by allowing ads on MrExcel.com.
Allow Ads at MrExcel

Which adblocker are you using?

Disable AdBlock

Follow these easy steps to disable AdBlock

1)Click on the icon in the browser’s toolbar.
2)Click on the icon in the browser’s toolbar.
2)Click on the "Pause on this site" option.
Go back

Disable AdBlock Plus

Follow these easy steps to disable AdBlock Plus

1)Click on the icon in the browser’s toolbar.
2)Click on the toggle to disable it for "mrexcel.com".
Go back

Disable uBlock Origin

Follow these easy steps to disable uBlock Origin

1)Click on the icon in the browser’s toolbar.
2)Click on the "Power" button.
3)Click on the "Refresh" button.
Go back

Disable uBlock

Follow these easy steps to disable uBlock

1)Click on the icon in the browser’s toolbar.
2)Click on the "Power" button.
3)Click on the "Refresh" button.
Go back
Back
Top