I am scheduled for HEAD & NECK SURGERY. It says so, in all-capital letters on the appointment form. Don't worry, mom, HEAD & NECK SURGERY is a scary-sounding category of things, but really someone is just going to cut this bump off of my lip. I guess to make it sound less scary, they could have given a sub-category: HEAD & NECK SURGERY / LIP FIXING. But figuring out categories is hard, figuring out subcategories is harder and it's silly to waste time figuring out if some procedure is LIP FIXING or CUPID'S BOW REARRANGEMENT when you could spend that time cutting bumps off of lips instead.
If you have a bump on your lip, searching for medical information on the internet is frustrating. If you search for [lip], these sites serve up results for herpes. If you look over the description of herpes and say, "nope, I just have this one bump", the sites don't know what to say. I think that's because these sites are organized by condition--and for a lip-bump the doctor doesn't diagnose the exact medical condition. "Is it one of these scary things? Nope. Then let's just cut that bump off, whatever it is." I've had medical self-help books that are organized by initial diagnosis, not by medical condition. You start with "bump on lip" and go through a flow chart. They capture the case of "we don't know exactly what it is, but it's probably not too serious" better.
Categorizing things can be tricky. Figuring out which things are the things you're talking about can be tricky.
The first time I ever heard of Peter Morville, information architecture
pundit, was when
he gave a talk at work a few months ago.
I got the impression that he hated tagsonomies, hated
users annotating web pages/photos/anything.
I thought he only wanted
librarians to have that Mysterious Classification Power.
(Hey, bear with me, Peter Morville fans. I now realize I was wrong.)
It upset me and made me think he was a jerk.
So, what did he say?
This is the free-tagging and the folksonomies of flickr and del.icio.us,
and there's almost a sort of religous revolutionary zeal that's
wrapped up with this notion of free-tagging.
Get rid of the librarians, the information architects,
the taxonomies, the controlled vocabularies, and just
let the users tag stuff with anything they want!
And in that sort of spirit, David Weinberger, who's got a new
book out called Everything is Miscellaneous, said
"The old way creates a tree, the new rakes leaves together."
So the old way was about taxonomies and tree structures, and
the new is about these wonderful self-organizing clusters.
When I saw that, I thought you know that's the perfect metaphor.
Because we know what happens to those lovely piles of leaves we
shuffle through each fall.
They very quickly rot, and they return to the ground
where they become food for trees,
which come in many shapes and sizes and live a very long time.
I actually think that David's book is brilliant; I
think that he's a really smart guy.
I'm not sure he's totally fair to librarians but I'm of course
a little biased. But I actually think that the answer lies in
the genius of the "and", in figuring out
how do we bring these traditional and novel organization approaches
together...
I hadn't read Everything is Miscellaneous. More to the point, I
hadn't seen some of the reactions it drew from a set of idiot
blowhards. Thus, I didn't interpret Morville's words as "tagsonomies and
professionally-put-together taxonomies help each other." I just heard
"this new crap will rot and then my beloved librarians will have the power
back neiner neiner." His talk's ending didn't help much.
He told the story of the three stonecutters. Here, I'll summarize the
story: ask three stonecutters in a quarry what they're doing. First
one answers "I'm making a living." Second one answers, "I am honing my
craft." Third one gets starry-eyed and says, "I am building a cathedral."
Third one has his eye on the big picture. So, what does this story
illustrate...
I've always thought of libraries as more than just warehouses
of information. I've thought of them, to some degree, as
cathedrals of knowledge. Sort of lifting us up and inspiring us,
making us aware of the human potential to create and share knowledge
and work together. And my hope is that we move further into
the internet age
that we take some of those values with us and that we also create
and share with one another new sources of inspiration. That we are
seeing the big picture in what we're doing,
and that we're not just marching forward with this sense of some sort of
pre-ordained techno-utopian future, but that we're actually
taking the time to think about a future that we want to create,
and that we end up
working towards desirable futures.
When you're speaking to computer programmers, don't
present the "cathedral" as a good thing. When you say "cathedral",
programmers sweep the stonecutter story out of their brains,
and instead remember the essay
The Cathedral and the Bazaar. This essay is about the advantages of
very-open-source programming--the model in which developers around the
world can see a program's progress as that program is being written,
can contribute to that program, all very open. That's the "bazaar".
This essay's thesis is that the "bazaar" model works much better than
the old model of programming: a team doesn't show their source code to
the world as they work (or does so only rarely); the team doesn't accept
fixes/code from the outside world; the resulting software is buggier
because not as many people look at the source code. That's the "cathedral",
the team that doesn't listen.
Don't call yourself a Cathedral when you're talking to programmers,
not when you're trying to convince us that a small team of elites
is doing something wonderful that a huge crowd of enthusiasts couldn't do.
I'm sure Morville has read The Cathedral and the Bazaar, but
does he understand how much more it resonates than that
stonecutter story?
Oh, I'm getting worked up again. What was I talking about? Oh right--I
didn't understand
the context of Morville's statements. Morville talks to librarians.
A lot. A lot of them had read this book Everything is
Miscellaneous. And this book says--well, first:
It doesn't say that librarians aren't talented; it doesn't say that
librarians are stupid. It does say that as information moves
of of paper and onto the web, many activities
that librarians have historically spent a lot of time on--those activities
aren't going to be so useful. But librarians still have useful skills.
Many librarians understand this. Some don't; they say that this book
is an attack on their profession. Some idiot blowhards, not librarians
themselves, are tut-tutting the book, wrapping their
toxicity in a cloak of "I'm just trashing this book
because I love librarians."
That's a pretty slick maneuver. Everybody loves librarians, so folks might
not figure out that you're just being an idiot blowhard. But that's
not what Everything is Miscellaneous is about.
Who are these idiot blowhards?
[In an earlier draft of this book report, here I quoted one of these
fight-picking morons and then pointed out how they'd misinterpreted
the book, why their misinterpretation wasn't even internally consistent...
Ahem. But pointing at someone else and calling him a fight-picking
moron... Well, that's not setting a great example of how not
to be a fight-picking moron...]
It is good to remember that these idiot blowhards are out there.
Some of them don't like it when you tell them that the Dewey Decimal
System has not aged well. I bet they've whined at Peter Morville about it
until he stopped wondering "Did I overlook something in that book?" and
started wondering "Why is that David Weinberger being so mean
to librarians?"
I finally read Everything is Miscellaneous. And reading it made
me want to go back and watch the video of Morville's talk. EisM talks
about subject classifications and tagsonomies. That reminded me
that Morville had talked about librarians vs tagsonomies.
I hadn't understood that
Morville's "yay librarians/boo tagsonomy" remark was
in the context of talking about Everything is Miscellaneous and
the reaction.
Now that I look at the talk again, I don't think he was really saying
"boo tagsonomies". Morville likes them. He likes having someone with a
taxonomic attitude make the first organization for information, but
he doesn't mind users annotating.
So what is this book that baited so many idiot blowhards to bloviate?
(Since I'm writing about it, I guess it's lured in one more...)
This book is about how we organize knowledge. It's about categories,
about ontologies, about taxonomies, about indexes, about card catalogs,
about the Dewey Decimal system, about books, about the web, about
Web2.0, about tagsonomies, about user-supplied annotations, about
user-supplied content, about how we perceive the world. And it's written
very understandably. So you can see why some people would have opinions
about it.
He talks about how we organize physical objects so that people can find
them. You can categorize things--in an office supply store, you might
put "printer stuff" in a section. You could try just storing everything
in alphabetical order based upon some name provided by the manufacturer.
But what if someone fails to find the Printer Cable they're looking for
just because the manufacturer called it a "parallel cable"?
Categorization can also help you to identify things. Fun fact I
learned from this book: Linnean classification predates acceptance
of evolution. Similarly-classified things weren't necessarily
supposed to spring from common ancestry. The classification was just
a way of helping to identify the thingies you were talking about.
Which flower? The one with the split stamen, five petals, etc etc.
There wasn't a great reason to canonicalize your classification to
be "first describe the stamen, then number of petals", but you did
need some canonical order because you were writing all of these things
down. And if you wrote down all possible orderings, then the resulting
index would be larger than your botanical garden's library building.
Card catalogs have subject cards. But there was pressure on librarians
not to assign a book to too many subjects--there wouldn't be enough
physical space in the card catalog to hold that many subject cards.
If a book was mostly about the Crimean War and had some interesting
things to say about caring for horses--you'd probably never know about
the horses from the card catalog.
The Dewey Decimal system is an attempt to order books by category.
It was very impressive. It is showing its age. Why is the "Religion"
section so big and why is Christianity so large within that section?
Well, back when the system was devised, that might have made sense.
Why doesn't Chinese get more of a... Uhm, the Dewey Decimal system is
showing its age. (Though Weinberger doesn't talk about it, the Library
of Congress system seems like it's skewing out of balance too. I think
that's what they use at the UC Berkeley library. I find myself
in some "subjects" all the time and others not at all--with less balance
than I'd expect even with my narrow interests. I bet the Cutter System
has similar problems.)
The web is here and it's popular. Now it's easy for people to annotate
things. They can point out links between things. They can comment on
things. And they can categorize things. It's very exciting. It's not
limited by physical space. So an electronic card catalog could have 2000
"subject cards" for each book.
Our categories are fuzzy. Hamlet is a tragedy. What about
Charlotte's Web? Well, it has sad parts. Maybe it's kinda tragic.
The chapter called Smart Leaves nudged me out into new-to-me
mental territory:
The things that we're trying to describe/categorize are themselves fuzzy.
What is Hamlet? Is it the First Folio edition? There were a
couple of editions before that. Is a book that combines all of those,
showing the differences--is that Hamlet, too? How about
Rosencrantz and Guildenstern Are Dead, is that Hamlet?
(I'd run into this problem while using
allconsuming,
a web site that lets you comment on books. It pulls some basic
information about books from Amazon. But Amazon doesn't list "Hamlet";
it lists every edition of Hamlet that is has for sale. So when I want
to comment on a book on allconsuming, I have to choose which edition
to comment on. I usually look for the one that the most other people
chose. But that's hardly scientific.)
It talks about..
It talks about the issues that you hit every day on the internet.
Yet, I learned from it. And even when I was reading things that I
already knew, it was sufficiently well-written such that I didn't
get bored. Check it out.
Labels: book, phylogeny, tagging, words