Earlier today I was in a presentation where my boss had used the Wordle tool to generate a word cloud that described our library. A word cloud visually captures a snapshot of text and presents the words in different font sizes and weights to illustrated the frequency of a word used within that text.
This is a tool I've heard of before, and had played around a little with, but had never seen used in a meaningful or illustrative way. I wondered if I could find an interesting way to generate a genealogy-related word cloud. Particularly, I wondered if I could somehow extract surnames from my genealogy software and create a cloud that would illustrate how frequently a family name appears in my database.
I have to say I'm fairly pleased with the results:
For privacy reasons, I decided to first exclude all living people from my cloud, since there are a handle of names used in recent generations only among the living. To generate the data used (using Reunion):
- Identified all non-living people using one of Reunion's preset searches. It finds all people with a death date, death place, burial date, or who is over 100 years. Imperfect, I know, since it is possible to have relatives living more than 100 years (I have none). It also omits anyone who has no birth or death dates at all -- many of whom in my database are in fact deceased. Nevertheless, I got a decent sized sample of 590 people.
- I marked the resulting people Reunion had identified as non-living, then exported a text file of their surnames.
- Copy and pasted the list of names into Wordle to generate the word cloud. Once in Wordle, you can play with fonts, colors and layouts, though Wordle determines the sizes of the words. (Regarding privacy: By not saving my Wordle to their gallery, the site claims none of my text was saved to their site: http://www.wordle.net/faq#secure)