Larry Hosken: New: Tag: vintage-computing

Book Report: The Theory that would Not Die

It's a book on the history of Bayes' Theorem. Bayes' Theorem is, roughly, a handy tool for practical probability problems. Suppose you are an email system's spam filter. You see a new email message that says "Best bargains Vi@gra". You need to put this message in the Spam folder or the Inbox. What do you do? Bayes says you can figure probabilities.

60% of email messages are spam.
In a set of 1000 not-spam messages, 2 mentioned "Best"
In a set of 1000 spam messages, 3 mentioned "Best"

Bayes gives you a nice way to multiply together the relevant numbers: we can ignore all those non-"Best" messages and concentrate on the relative probabilities: .003 * 60% spammy cases versus .002 * 40% non-spammy cases. So if you're looking at a message that contains "Best" and trying to decide if it's spam, considering just that word the odds are 18:8 that this message is spam. And you can get more information from the other words in the message.

At least I think that's roughly how Bayes works. This book traces the struggle of Bayesians versus some other group who use "Frequency Probability". Unfortunately, all I know about statistics is a few techniques. Of the tools in my toolbox... I don't know whether they're Bayesian or Frequentist or Pickle Sandwich or whatever. I have a hard time understanding frequentism. This book only tries to kinda hand-wavily describe it; I guess the author doesn't want to lose non-technical readers. So... I knew just enough to find myself confused nonetheless.

Mmmmaybe the difference is: I carefully computed that "60% of email is spam" statistic. (Where by "carefully computed", I mean "Looked in the email and spam folders on one email account and eyeballed a rough count.") But what if I didn't have that historical data? If I understand this book correctly, the Bayesian answer is "We need a decision. So plug in an estimate. What percentage of mail do you think is spam? Now you can use that to multiply with the other numbers. (But be sure to update that guess when you know more)"; but the Frequentist answer is "Give up! Wait until you have a significant number of emails to count!!" That sounds weird to me. Anyhow.

I still enjoyed this book, even though I didn't understand the struggle that it used as a framing story. Why? Because it presented some interesting statistics problems that have occurred through history. Interesting problems are good for the brain.

Laplace rediscovered Bayes' Rule... actually, this book makes a good case that Bayes' Rule, as stated by Bayes was not so useful. That maybe we should call it Laplace's Rule or something. Anyhow, Laplace applied the rule to many things, including jury trials. There were a lot of guessed factors in there, few known statistics. Still, he made a pretty good case that juries were wrong... not super-often, but not infitessimally-often, either. He used this as an argument against capital punishment: being found guilty wasn't a strong enough indicator of guilt.

A bunch of the stories involve looking for things at sea. Suppose you're a WWII US Navy fleet commander. Many merchant convoys are trying to cross the Atlantic to get supplies from the USA to England. U-Boats prowl the Atlantic, sinking the convoys. You have some destroyers, some search planes... but not enough to patrol all of the Atlantic. How do you organize your search? How long should a destroyer search in one place before moving on to another?

Or what about the Broken Arrow incidents at Palomares and Thule? You want to find a nuclear bomb that's... in the water. Wow, the Earth has a lot of water. Again, how do you figure out where to look? When do you decide to give up on that spot and look somewhere else?

There was election prediction. You might think that Nate Silver has a tough job, but when Tukey ran a group predicting elections for the TV news, one year his bosses sequestered the team because they didn't trust the prediction. It's never a good sign when someone locks up the statisticians in a room. Anyhow.

There's also some love for computing here: statistics, whether Bayesian, Frequentist, or whatever wasn't super-practical until it was easy to work with big piles of data. Maybe the anti-Bayesians had a point: until you had computers, if you didn't have enough data to get a scientifically-significant result, why on earth would you spend weeks of your life wrestling with data to get an answer that's not going to be that much better than the guess you'd make by eyeball? Nowadays, gathering data is still tough; but once you've got it, it's relatively easy to press a button and say "Computer, tell me about any weird correlations in here".

All in all, a good read. I was tempted to put the book down a few pages in when it said "Laplace emerged from Caen a swashbuckling mathematical virtuoso..." and then there was no swashbuckling; it was bad like when Steven Levy is bad. Fortunately, McGrayne isn't Levy-bad nearly as often as Levy is, so I kept reading.

Permalink
& Comments

Book Report: Brain Storm This novel is by Richard Dooling, the same guy who wrote Bet Your Life, one of my favorite books of 2003. This book was pretty good, too. It's a legal thriller—hey, come back! It's a legal th...

Permalink & Comments

Book Report: Hackers It's another Steven Levy book about the history of technology. As with other Levy books, I keep spotting things that I know are wrong, so it makes me not trust Levy to tell me things I don't know. ...

Permalink & Comments

Book Report: The Mythical Man-Month (a Study Guide) If this book report seems a little heavy on the questions? It's because it's the first draft of a study guide? For people reading the book? Oh man it's way too long? But hey give me a break, it's...

Permalink & Comments

Book Report: Pattern Recognition I'd heard that William Gibson had written Pattern Recognition, this book that wasn't science fiction. So I didn't read it. That was years ago. More recently, I read Spook Country that wasn't exactl...

Permalink & Comments

Book Report: Applied Cryptography This is an old textbook about applying cryptography; that is, it's about computer security. It's the textbook by Bruce Schneier, the book he later said wasn't so important--you can get this stuff ri...

Permalink & Comments

Book Report: The Psychology of Computer Programming How to get programmers to get along together. Attempts to use psychology to design easier-to-use computer language features. Discussion of which is better for your organization's culture: batch proc...

Permalink & Comments

Book Report: The Man Who Loved China Back in 2002, I went to the British Museum where an old illustration maybe showed a punch-card controlled loom from ancient China--long before such were invented in the West. Bookish fellow that I a...

Permalink & Comments

Book Report: iWoz It is Steve Wozniak's autobiography, as told to Gina Smith. It's a fun read. Keep your wits about you as you read--they didn't fact-check all of this material. So when Wozniak tells you what was g...

Permalink & Comments

Book Report: Anathem Yesterday, I watched a co-worker give a "practice" thesis defense. My workplace has plenty of grad students who are just, uhm, taking a little break from school. He's one of them. I, on the other h...

Permalink & Comments

Book Report: On the Edge I posted that link to that "Another Bubble" video. Computer nostalgia is easy, you don't have to look back, the past just keeps coming back. That viper Wade Randlett who spread lies about the "New...

Permalink & Comments

Book Report: In Search of Stupidity I'm not working on gPhone the Open Handset Alliance. There were various internal recruiting drives for the project; I slunk away from those, kept my head down. I've worked on some mobile phone plat...

Permalink & Comments

Book Report: Core Memory I like old computers. This is a book of photos from the Computer History Museum. The photographer, Mark Richards, gave a talk at work a while back. When people asked him how he chose which things ...

Permalink & Comments

Book Report: The Man Behind the Microchip Lea W. is in town, visiting from Cincinnati. Several folks gathered at Yancy's Saloon on Irving to kick it with Lea. Michael asked the question: "What do you love to do? There are a bunch of things...

Permalink & Comments

Book Report: Dealers of Lightning Sometimes, it's good to be wrong. For example, I claim to be pretty jaded. But when I saw a little dog, a Yorkshire terrier-style dog, walking along this morning carrying a rubber chicken, I was fil...

Permalink & Comments

Updates:

Tags