|
|
How Accurate is the Ellis Island Database Transcription?
Last April, the estimated 100 million Americans having at least one ancestor who
entered the U.S. through Ellis Island cheered when 22 million records from 1892-1924 were
uploaded to the Internet for free searching at: www.ellisislandrecords.org I decided to conduct a little experiment using my own surname. Smolenyak works well as a case study because it has that foreign sound of many Ellis Island immigrants and can be misspelled in countless ways. If anything, I thought that it would provide a very rigorous testing of the EIDB because it is more prone to misspelling than most names. I also conveniently had paper copies of Smolenyak arrivals obtained through traditional, microfilm research at the National Archives. I began my experiment by taking the paper copies and transcribing the Smolenyak entries as best I could, letter by letter. Then I used the EIDB to pull up these same pages. Since there are actually no entries for "Smolenyak" in the EIDB, I did this with a little reverse- engineering, by locating someone else on the hard copy pages whose name was fairly obvious and searching for them. Once I found the page that contained this individual in the EIDB, it was only a matter of scrolling to find the Smolenyak on the same page. By clicking on the "view text version manifest" option, I could see how the Smolenyak name had been transcribed. In all, I had fifteen of my own transcriptions to compare to the EIDB ones and was very pleased to find that ten of the fifteen matched perfectly. Of the remaining five, three of the mismatches were easily understood. Inspecting the original manifest, for instance, I could see how Smoleniak could be interpreted as Smolenick. The last two both surprised and impressed me: Szuwlyenak and C . . . oleniak! Using my surname as something of a barometer then, I concluded that about two-thirds are flawlessly transcribed and another 20 percent are very close. The last 13 percent were apparently going to be little more challenging to find. My conclusion? I don't want to overstate my interpretation since the findings were based on a single surname, but I share it here as at least one indication. Yes, there are some names in the EIDB that are apt to be elusive unless you get creative. But given that I used a name that practically invites misspelling for my experiment, I would surmise that most names are completely or fairly accurately transcribed in perhaps 90 percent of the cases. And when you factor in that I had the advantage of being familiar with variations of my own name and Eastern European spelling quirks, as well as the luxury of not having to transcribe thousands of other entries under deadline, I can only express my admiration for the volunteers who logged countless hours building this database and still managed to be remarkably precise. But just in case you're worried about that last ten or so percent, Stephen P. Morse's One-Step search form, mentioned previously in a pair of Michael John Neill's articles, is the just the tool you need. In a future article, I'll tell you about another experiment I did to gauge its effectiveness as compared to traditional, microfilm research and using just the EIDB. In the meantime, I'd be delighted to hear from any fellow EIDB experimenters about results they achieved with other surnames.
Megan Smolenyak, author of In Search of Our Ancestors, companion book to the 2000 PBS Ancestors series, and the forthcoming Honoring Our Ancestors: Inspiring Stories of the Quest for Our Roots, can be reached through www.honoringourancestors.com. |
|
![]() © 2001-2002 UGS, Lighthouse Genealogy Service with Iwaniw & Associates B.E.
|