Any comments on this Wordle data?

JenniferMurphy

Well-known Member
Joined
Jul 23, 2011
Messages
2,532
Office Version
  1. 365
Platform
  1. Windows
Several other family members (not me) are Wordle nuts. They debate with each other about the best words to start with. It seemed to me that there ought to be a way to analyze the probabiities, so I decided to do a little "research". I downloaded several lists of 5-letter words, merged them, and removed duplicates. I ended up with 5,760 words. I wrote some code to count how often each letter occurs in each position in each word (1-5). I got this chart.

.....
Wordle-letter-frequencies.jpg


I was surprised by the variability from position to position. "S", for example, is the most common letter in 1st and last position, but only #17 in 2nd. "K" is #7 overall and #4 in 4th position. So it seems pretty clear that the position of a letter really matters.

Picking letters from this chart, I suggested 3 possible first words: SAINT, CORES, & COATS. They seemed to contain all letters fairly high of the chart.

Then it occurred to me that I could calculate the average frequency of the letters in each word and then sort that list. Here are the top 150 words. My word CORES came in at #8, but the other two did not fare so well. COATS is #264 and SAINT is #2080, both off the chart below.

.....
image.png


I think the minisheet plus the VBA code is a bit too much to post here, so I've uploaded them both to this OneDrive folder. This was a quick and dirty coding run, so Im sure there are far better ways to do it. But I'd still be interested in any comments and/or suggestions for different or better data.


Cheers...
 

Attachments

  • 1679269530267.png
    1679269530267.png
    225.2 KB · Views: 4

Excel Facts

Select all contiguous cells
Pressing Ctrl+* (asterisk) will select the "current region" - all contiguous cells in all directions.
Bill's take on it (never tested it myself as I have no interest in playing)

 
Upvote 0
To me the important thing is to identify as many vowels as possible
I start with AUDIO followed by PESKY

This covers all vowels as well the Y

Guess 3 then is a little simpler
One of our Wordle players told me that the consonants are more important. She says that there are a lot of words that rhyme, like CAKES, FAKES, LAKES, MAKES, RAKES, TAKES, & WAKES. Having 4 of the 5 letters still leaves you with many choices. And if yiua re playing the "Hard" version of the game, you have to use all known letters in subsequent guesses.

I'll suggest she try your method.
 
Upvote 0
Bill's take on it (never tested it myself as I have no interest in playing)

Interesting
 
Upvote 0
To me the important thing is to identify as many vowels as possible
I start with AUDIO followed by PESKY

This covers all vowels as well the Y

Guess 3 then is a little simpler
I use Shout and Bread. :)
 
Upvote 0
To me the important thing is to identify as many vowels as possible
I start with AUDIO followed by PESKY

This covers all vowels as well the Y

Guess 3 then is a little simpler
On my rating list based on letter frequencies by position, AUDIO is #256, PESKY is # 3598.
 
Upvote 0
Several other family members (not me) are Wordle nuts. They debate with each other about the best words to start with. It seemed to me that there ought to be a way to analyze the probabiities, so I decided to do a little "research". I downloaded several lists of 5-letter words, merged them, and removed duplicates. I ended up with 5,760 words. I wrote some code to count how often each letter occurs in each position in each word (1-5). I got this chart.

.....
Wordle-letter-frequencies.jpg


I was surprised by the variability from position to position. "S", for example, is the most common letter in 1st and last position, but only #17 in 2nd. "K" is #7 overall and #4 in 4th position. So it seems pretty clear that the position of a letter really matters.

Picking letters from this chart, I suggested 3 possible first words: SAINT, CORES, & COATS. They seemed to contain all letters fairly high of the chart.

Then it occurred to me that I could calculate the average frequency of the letters in each word and then sort that list. Here are the top 150 words. My word CORES came in at #8, but the other two did not fare so well. COATS is #264 and SAINT is #2080, both off the chart below.

.....
image.png


I think the minisheet plus the VBA code is a bit too much to post here, so I've uploaded them both to this OneDrive folder. This was a quick and dirty coding run, so Im sure there are far better ways to do it. But I'd still be interested in any comments and/or suggestions for different or better data.


Cheers...

Do your statistics include words where one letter appears more than once - like PAPER, APPLY etc. ? Those ones can really trip you up as having already guessed one correct letter you are then looking for letters other than the ones already used!
 
Upvote 0

Forum statistics

Threads
1,214,986
Messages
6,122,611
Members
449,090
Latest member
vivek chauhan

We've detected that you are using an adblocker.

We have a great community of people providing Excel help here, but the hosting costs are enormous. You can help keep this site running by allowing ads on MrExcel.com.
Allow Ads at MrExcel

Which adblocker are you using?

Disable AdBlock

Follow these easy steps to disable AdBlock

1)Click on the icon in the browser’s toolbar.
2)Click on the icon in the browser’s toolbar.
2)Click on the "Pause on this site" option.
Go back

Disable AdBlock Plus

Follow these easy steps to disable AdBlock Plus

1)Click on the icon in the browser’s toolbar.
2)Click on the toggle to disable it for "mrexcel.com".
Go back

Disable uBlock Origin

Follow these easy steps to disable uBlock Origin

1)Click on the icon in the browser’s toolbar.
2)Click on the "Power" button.
3)Click on the "Refresh" button.
Go back

Disable uBlock

Follow these easy steps to disable uBlock

1)Click on the icon in the browser’s toolbar.
2)Click on the "Power" button.
3)Click on the "Refresh" button.
Go back
Back
Top