I updated that big ranked "phraser" word list (and also the even bigger ranked phrase list). It counts words (and phrases) from different sources than it did before.
- The Expanded Crossword Name Database lists people, places, and things associated with "groups/identities/people that are often excluded from crossword grids". This meant adding support to phraser for reading Crossword Compiler word lists, so I also tossed in a couple of other word lists I found.
- I stopped getting movie titles from OMDb. The guy who runs OMDb (reasonably) doesn't want some goofus pulling down many, many movie titles. He set up a payment system. The OMDb data wasn't making a big difference to the word lists: phraser still gets most of its data from Wikipedia, and editors apparently love adding movie information to Wikipedia. I'm a cheapskate, so that OMDb data is now gone.
- Google Books updated their "ngrams" data for the first time in several years. They changed the format to be more efficient, yay. The word list I generated from the new Books data is kinda unbalanced. It takes days to download all the Books data via my apartment's DSL internet connection. My plan was to start the download going and then go for a 7–9-day walk around San Francisco Bay. But this heat wave came along, and so I'm back home, out of the sun. I did a Ctrl-C on that big download. Thus I have a lot of ngrams that start with Aardvarks, America, Anderson, … British but nothing after the Bs.
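For anyone curious what counting from the newer Books data might look like, here is a minimal sketch. It is not phraser's actual code. It assumes each line holds an ngram followed by tab-separated `year,match_count,volume_count` triples; the helper name `total_matches` and the sample numbers are made up for illustration.

```python
def total_matches(line):
    """Return (ngram, total match count) for one line of ngram data.

    Assumed layout (not confirmed by this post):
        ngram<TAB>year,match_count,volume_count<TAB>year,match_count,volume_count...
    """
    ngram, *year_fields = line.rstrip("\n").split("\t")
    total = 0
    for field in year_fields:
        # Each field is one year's worth of counts; only match_count is summed.
        _year, match_count, _volume_count = field.split(",")
        total += int(match_count)
    return ngram, total


# Made-up sample line, just to show the shape of the data.
sample = "Aardvarks\t1999,12,5\t2000,30,11\n"
print(total_matches(sample))
```

Summing across all years gives one count per ngram, which is the kind of number a ranked list needs. A partial download would still parse fine; it would just be missing everything that sorts after the last file fetched.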
Also, I got an updated copy of Wikipedia (that knows about newfangled things that have come along in the last few years like wandavision, which didn't usedta be a word, you know), plus fresh snapshots of the fandom.com wikis that had new ones. And probably some other things I forgot. (I don't do well in heat waves. I just want to drink more ice water. Why am I tinkering with computers?) Anyhow, please enjoy these updated word and phrase lists.