Hi, I'm trying to categorise/cluster a large number of words and phrases in a single column (around 20K), but I don't know in advance which clusters or categories to use for them.
Does anyone know a way to go through this list and create groups or clusters based on how many times a single word (or, ideally, a sequence of 2, 3, or 4 words) appears throughout the list? Ideally, I'd like to see which 1-, 2-, 3-, and 4-word sequences appear most often across the list (to find out which clusters to focus on) and then get the list of words/phrases that belong to each cluster.
Thanks in advance to anyone who will offer a solution.
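
To make the request concrete, here is a rough Python sketch of the kind of counting I have in mind. It is only an illustration under assumptions: the function names `ngrams` and `cluster_by_ngrams` and the sample `phrases` list are made up for this post, the column is assumed to be already loaded into a plain list of strings, and words are split on whitespace after lowercasing (no punctuation or stop-word handling).

```python
from collections import Counter, defaultdict


def ngrams(tokens, n):
    """Return every contiguous n-word sequence in a list of tokens."""
    return [" ".join(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


def cluster_by_ngrams(phrases, max_n=4):
    """Count 1..max_n word sequences and record which phrases contain each.

    Hypothetical helper: returns (counts, members), where counts[n] is a
    Counter of n-word sequences and members[gram] lists the phrases that
    contain that sequence.
    """
    counts = {n: Counter() for n in range(1, max_n + 1)}
    members = defaultdict(list)
    for phrase in phrases:
        tokens = phrase.lower().split()  # assumption: whitespace tokenisation
        for n in range(1, max_n + 1):
            # set() so a phrase is counted once per sequence, even if the
            # sequence repeats inside that phrase
            for gram in set(ngrams(tokens, n)):
                counts[n][gram] += 1
                members[gram].append(phrase)
    return counts, members


# Made-up sample data standing in for the real column
phrases = [
    "cheap running shoes",
    "running shoes for men",
    "best running shoes for women",
    "cheap flights",
]

counts, members = cluster_by_ngrams(phrases)

# Most frequent 1-, 2-, 3- and 4-word sequences: candidate clusters
for n in range(1, 5):
    print(n, counts[n].most_common(3))

# All phrases that fall into the "running shoes" cluster
print(members["running shoes"])
```

With something like this, the top entries of each `counts[n]` would suggest which clusters to focus on, and `members[...]` would give the phrases in a chosen cluster. A phrase can land in several clusters at once, which may or may not be what is wanted.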