I am attempting to count words extracted from unstructured data. To understand my current method please consider my formulas as indicated below:
="^"&SUBSTITUTE(SUBSTITUTE(SUBSTITUTE(SUBSTITUTE(SUBSTITUTE(SUBSTITUTE(SUBSTITUTE(SUBSTITUTE(SUBSTITUTE(SUBSTITUTE(SUBSTITUTE(SUBSTITUTE(SUBSTITUTE(SUBSTITUTE(SUBSTITUTE(SUBSTITUTE(SUBSTITUTE(SUBSTITUTE(SUBSTITUTE(SUBSTITUTE(A2,",","^"),".","^"),"?","^"),"!","^"),";","^"),":","^"),"""","^"),"'","^"),"(","^"),")","^"),",","^"),CHAR(13),"^"),CHAR(10),"^"),CHAR(160),"^")," ","^"),"[","^"),"]","^"),"/","^"),"'","^"),"–","^")&"^"
I use the output from above to obtain unique words by using text to data using ^ as a delimiter and sorting them 1 per row in a single column and removing duplicates.
I then use the following formula to obtain a count
=COUNTIF(Master!$C$2:$C$2335,"*^"&A2&"^*")
Please consider this methodology and indicate how I may improve or make it more efficient.
Thank you
="^"&SUBSTITUTE(SUBSTITUTE(SUBSTITUTE(SUBSTITUTE(SUBSTITUTE(SUBSTITUTE(SUBSTITUTE(SUBSTITUTE(SUBSTITUTE(SUBSTITUTE(SUBSTITUTE(SUBSTITUTE(SUBSTITUTE(SUBSTITUTE(SUBSTITUTE(SUBSTITUTE(SUBSTITUTE(SUBSTITUTE(SUBSTITUTE(SUBSTITUTE(A2,",","^"),".","^"),"?","^"),"!","^"),";","^"),":","^"),"""","^"),"'","^"),"(","^"),")","^"),",","^"),CHAR(13),"^"),CHAR(10),"^"),CHAR(160),"^")," ","^"),"[","^"),"]","^"),"/","^"),"'","^"),"–","^")&"^"
I use the output from above to obtain unique words by using text to data using ^ as a delimiter and sorting them 1 per row in a single column and removing duplicates.
I then use the following formula to obtain a count
=COUNTIF(Master!$C$2:$C$2335,"*^"&A2&"^*")
Please consider this methodology and indicate how I may improve or make it more efficient.
Thank you