TYWKIWDBI ("Tai-Wiki-Widbee"): You are invited to decipher Civil War telegrams

02 January 2017

You are invited to decipher Civil War telegrams

As part of a crowdsourced project:

Volunteers are logging on — at decodingthecivilwar.org — to transcribe thousands of long-lost wartime missives using a crowdsourcing platform developed and managed, in part, by the University of Minnesota...

...in 2009, two wooden trunks holding 35 ledger and code books turned up and were sold at auction. The archive had been apparently been taken by Thomas T. Eckert, head of the Civil War telegraph program, when he left government service. With almost 16,000 messages, the telegram cache is so dense, so massive, that Einaudi predicted that it would take a full-time staff as long as a decade to painstakingly decipher the messages one by one.

That’s where the crowdsourcing comes in...

Starting this past June, volunteers from across the country have been able to register on the Decoding the Civil War website, take a quick tutorial and begin scrutinizing the scanned copies of the original telegrams.

Note this is not "codebreaking" per se - it's more a matter of deciphering sometimes-illegible handwriting.

Because the handwriting of the 1860s is filled with flourishes and quirks that have long passed from style, volunteers often debate the meaning of the scribblings on the website’s talk boards. “We have people online 24 hours a day,” Einaudi said. “When you increase the eyeballs, you increase consensus, the wisdom of the crowd.”..

“We can’t build an algorithm that can do this kind of work,” said Lucy Fortson, U Zooniverse director. “Humans have developed this beautiful visual cortex that allows them to see complex patterns, distinguish visual information and type what they see.

“Machines can do data analysis, but they aren’t any good at reading handwriting.”..

“A certain type of person is predisposed to volunteering on a site like this. We see a hankering to be involved with something meaningful, and research translates as meaningful. With crowdsourcing, they have a bit of ownership in what we might find.”

If you're interested, here is the project's website.

11 comments:

Marlys Hesch SebaskyJanuary 2, 2017 at 2:10 PM
I've been de-coding all summer for Zooniverse. It's really interesting, and when I describe it to friends, they get it: this is something that would take an intern 20 years to do, and the lack of feedback would be disconcerting. But here on line, we have other people to check with, and scholars to help. Some of my own genealogical research helps here, like noticing the written double s. In German script, it looks like fs (as in "necefsary"). A few American telegraphers still used it, especially when they were writing fast. I feel pretty cocky just noticing it ☺ BTW, there are dozens of projects to get involved with there--great fun.
ReplyDelete
Replies
Marlys Hesch SebaskyJanuary 2, 2017 at 2:16 PM
https://www.zooniverse.org/
ReplyDelete
Replies
RoseJanuary 2, 2017 at 2:25 PM
Wow, the project is over half-done. The participation has been dropping off since it started, though. They are estimating completion in 180 days.
ReplyDelete
Replies
Miss CellaniaJanuary 2, 2017 at 3:21 PM
There are very few words here that I cannot easily read at a glance. My older daughter would struggle mightily, and my youngest would give up immediately -and they were taught cursive! Too bad I don't have the time to volunteer.
ReplyDelete
Replies
Aritê gunê AkasaJanuary 2, 2017 at 10:16 PM
Similarly corwdsourced projects at the University of Iowa as well. Civil war diaries and letters, medieval manuscripts, pioneer diaries, vaudeville theater reports, natural history museum specimen cards. Stuff like this is a great way for museums and other collections to both make up for how understaffed they are and also have more behind-the-scenes sorts of interactions with the public.
ReplyDelete
Replies
nolanddaJanuary 5, 2017 at 12:13 PM
I would caution against saying machines "aren't any good at reading hadwriting". It is true that humans are quite good at these visual pattern matching tasks, but we must not think we are special. There is simply not the economic incentive to program machines read this type of handwriting.

As a counterexample here is a US Post Office Advanced Facer-Canceler System (AFCS) capable of reading 40,000 handwritten addresses per hour "As the name suggests, this machine 'faces' the mail – detecting the presence of postage and making sure it faces in the right direction for canceling. Then it cancels the postage, reads the address on the letter, compares it to the database of addresses, and sprays a florescent orange barcode on the back of each letter [so simpler & cheaper machines can use the barcode for further sorting all the way down the delivery chain]."
ReplyDelete
Replies

Add comment