Collaborative Transcription Tools
Institution | Hosted | Licence | Software | Text Type | TEI? | CMS Integration | Unique Features | URL | Code
TextLab | John Bryant et al., Hofstra University | No | ? | ? | Free-form | Yes | ? | Direct annotation of TEI add/del tags onto images. | http://mel.hofstra.edu/textlab.html | Melville Electronic Library
ScribeAPI | Zooniverse | CoffeeScript / Ruby on Rails | Structured data + Free-form | No | https://github.com/zooniverse/scribeAPI | https://github.com/zooniverse/scribeAPI
Wiki::Score (music scores in ABC) | ? | DokuWiki | Collaborative music edition | http://www.wiki-score.org/doku.php
National Archives Transcription Pilot Project | U.S. National Archives | 1,000+ pages (300+ records) | Drupal | Free-Form | Drupal | Difficulty rating (Beginner, Intermediate, Advanced), lock-out feature, commenting, links to online catalog | http://transcribe.archives.gov/
Islandora TEI Editor (defunct) | UPEI, Robertson Library | No | GPL 3.0 | Drupal/Fedora | Free-form | Yes | Fedora | TEI mark-up of documents hosted in Fedora | http://wiki.tei-c.org/index.php/IslandoraTEIEditor | https://github.com/Islandora/islandora_tei_editor | Public Records Office, Victoria http://prov.versi.edu.au/
World Archives Project | Ancestry.com | Yes | Proprietary | Installed .exe client | Structured data (Genealogy) | ? | ? | Difficulty rating, context-based help, multiple archive sources | http://community.ancestry.co.uk/awap | World Archive project and http://www.worldmemoryproject.org/ (essentially another 'way in')
FieldData | Atlas of Living Australia/Gaia Resources | Mozilla Public License 1.1 | Java | No | http://www.ala.org.au/get-involved/citizen-science/fielddata-software/ | http://code.google.com/p/ala-citizenscience/ | http://volunteer.ala.org.au/project/index/42780
intranda viewer | intranda GmbH (http://www.digiverso.com) | optional | Proprietary | Java / Tomcat | Free-Form / Structured data | yes (as Export) | none | http://www.digiverso.com/de/products/viewer | http://sl07.kobv.de/viewer/
Virtual Transcription Laboratory | Poznań Supercomputing and Networking Center | yes | Java Enterprise Edition + Tesseract | Structured data | No | Integrated OCR tool trained for Polish historical documents; a facility for OCR training is also available. Simple transcription editor that allows text-image linking. | http://wlt.synat.pcss.pl/ | http://wlt.synat.pcss.pl/wlt-web/index.xhtml
T-PEN | St. Louis U Center for Digital Theology | As of 11/20/2012: 1,088 projects created by 538 users for a total of 61,162 lines transcribed | ? | EPL 2.0 | Java/JavaScript | Line-based medieval | Yes | Users can create export pipelines that export transcriptions directly into a CMS database (such as Drupal) | Direct linking of transcription to lines of text in image | http://digital-editor.blogspot.com/ | https://github.com/jginther/T-PEN | http://t-pen.org/TPEN/
Mirador (transcription branch) | IIIF | No | Apache 2.0 | JavaScript | Free-form | No | IIIF | projectmirador.org | https://github.com/IIIF/mirador/tree/transcription
Alto Editor (kinda working, but obsolete) | IMPACT Centre of Competence | No | Apache 2.0 | JavaScript/Ruby | No | none | https://github.com/impactcentre/alto-editor
Scribe (Zooniverse) | Zooniverse | Upon application. | MIT | jQuery/Ruby on Rails | Structured data | No | none | Blind triple-keying, data linked to images | http://github.com/zooniverse/Scribe | What's the Score at the Bodleian (earlier versions at OldWeather.org); FreeREG/FreeCEN rewrite for FreeUKGen will extend Scribe in 2012.
Bentham Transcription Desk | University of London Computer Centre; UCL Bentham Project | As of 31 August 2012: 4,168 manuscripts transcribed or partially transcribed (c. 2 million words, plus extensive TEI markup), of which 3,949 (93%) are complete. | Yes | GPL 2.0 | MediaWiki | Yes | Full TEI mark-up support; customised toolbar to automatically apply TEI tags to transcript | http://www.ucl.ac.uk/transcribe-bentham | https://github.com/onothimagen/cbp-transcription-desk | Transcribe Bentham; Public Record Office of Victoria's (Melbourne) Transcription Pilot
Wikisource | Wikimedia | http://toolserver.org/~phe/statistics.php | Yes | GPL 2.0 | MediaWiki | Free-form | No | Archive.org | Workflow management | http://en.wikisource.org/wiki/Main_Page | http://www.mediawiki.org/wiki/MediaWiki | NARA Citizen Archivist Dashboard
Dromio | Folger Library | Yes | ? | PHP | Free-Form | Yes | Customizable set of elements, multiple transcriptions allowed, transcriptions can be collated | dromio.folger.edu; http://collation.folger.edu/2014/12/a-transcriba-what/
Scripto | Center for History and New Media at George Mason University | No | GPL 3.0 | PHP library, MediaWiki | Free-form, wikitext | No | Omeka, WordPress, Drupal | Can be integrated into potentially any CMS or personal archive | http://scripto.org | https://github.com/chnm/Scripto; https://github.com/omeka/plugin-Scripto; https://github.com/chnm/scripto-wordpress-plugin; https://github.com/chnm/scripto-drupal-module | Papers of the War Department, 1784-1800
Unbindery | Ben Crowder | 10,000+ book pages transcribed (Project Gutenberg Thailand); 3000+ pages transcribed (Mormon Texts Project) | Yes | MIT | PHP/JavaScript | Free-form | No | Is a CMS | http://bencrowder.net/coding/unbindery/ | https://github.com/bencrowder/unbindery | Project Gutenberg Thailand http://gutenbergthai.org; Mormon Texts Project http://mormontextsproject.org/
Biodiversity Volunteer Portal (rebranded as DigiVol, below) | Atlas of Living Australia and Museum Australia | http://volunteer.ala.org.au/about/index | Yes | Mozilla Public License 1.1 | Postgres/Java/Grails/Apache | ? | ? | ? | https://code.google.com/p/ala-volunteer/source/checkout | Atlas of Living Australia
PyBOSSA | Citizen Cyberscience Centre/OKFN | ? | AGPL 3.0 | Python/GDocs | Tabular | No | Data entry via GDoc spreadsheet | http://pybossa.com/ | https://github.com/PyBossa/pybossa | Transcribe Bleek & Lloyd
Scott Transcription | @anthonygoddard / zerosixzero.org | Yes | Apache 2 | Ruby | Free-form | Designed for Scott / Terra Nova log entries | http://scott-transcription.zerosixzero.org | tbd | http://scott-transcription.zerosixzero.org
FromThePage | Ben Brumfield / Brumfield Labs | 7500+ pages transcribed, 2000+ indexed with 5000 subjects mentioned 28000 times as of 2017-07-01 (number from fromthepage.com; more pages on other sites) | Yes | AGPL 3.0 (inquire for dual license) | Ruby on Rails | Free-form | Yes (as export) | Archive.org, Omeka, any IIIF-compliant digital library system | Semantic mark-up for indexing/annotation, OCR correction, collaborative translation | http://fromthepage.com/ | http://github.com/benwbrum/fromthepage/wiki | LA County Public Libraries, Fordham University, University of Texas, Indianapolis Public Library, Yaquina Head Lighthouses, New York Botanical Garden, University of Virginia Law Library, University College Dublin; San Diego Natural History Museum: Laurence M. Klauber Field Notes; Southwestern University: Zenas Matthews Diary; Rhodes College: Shelby Foote Diaries (private); Northwestern University Library: private project; Penn State U.: Philip K. Dick folders; Mosman Council (NSW): WW1 letters, diaries, and inscriptions; U. Delaware: Civil War diary of Joseph Brown
Son of Suda On-Line | Integrating Digital Papyrology | As of 3/28/12: 3,694 submissions approved | No | GPL 3.0 | Ruby on Rails | Free-Form | Yes | none | Git backend | https://github.com/sosol/sosol | papyri.info
Perseids | Tufts | Yes | GPL 3.0 | Ruby on Rails | Free-Form | Yes | http://perseids.org/ | https://github.com/perseids-project/perseids_docs
What's On the Menu? (custom Ruby on Rails app) | New York Public Library | 796,136 dishes from 12,541 menus | Ruby on Rails | Structured data | No | http://menus.nypl.org/
Transcribable | ProPublica | Ruby on Rails (extension to ActiveRecord) | Structured | No | DocumentCloud | https://projects.propublica.org/free-the-files/ | https://github.com/propublica/transcribable
Boolean Transcriptor | University College Cork, Rep. of Ireland | 377 items, 2495 scans | No | To Be Decided | Ruby on Rails + plugins, Bootstrap, jQuery + plugins | Plain text at the moment; in time transitioning to Markdown | Not Yet :) | No | Nice navigation, highlighting, YAML driven | http://boole-papers.electropoiesis.org/ | https://github.com/igravious/-boolean-transcriptor
DIYHistoryTranscribe | University of Iowa Libraries | Yes | ? | Scripto, Omeka, MediaWiki | Free-form | http://diyhistory.lib.uiowa.edu/code.html | https://github.com/ui-libraries | http://diyhistory.lib.uiowa.edu/
Veridian Software | DL Consulting | Optional | Proprietary | Veridian | Allows free-form transcription as well as structured line-by-line transcription (if METS/ALTO is used for underlying data). | TEI is supported but METS/ALTO is recommended. | No | Supports line-by-line correction of errors in OCRed material (similar to the Trove and British Newspaper Archive sites); supports transcription of unstructured text (with side-by-side image/transcription display); supports structured metadata entry/transcription, in addition to text transcription; transcriptions/corrections instantly become "live" (i.e. are searchable). | http://veridiansoftware.com | http://cdnc.ucr.edu/; http://cambridge.dlconsulting.com/; http://virginiachronicle.com/
Letters of 1916 | Trinity College Dublin, Rep. of Ireland | WordPress, Scripto, DIYHistory, Bentham TEI Toolbar | http://dh.tcd.ie/letters1916/
Itineranova-Editor | Stadsarchief Leuven / CCeH | 4 000 records | GPL3 | XRX/JavaScript | Line-based medieval | Yes (basic transcription features) | http://www.mom-wiki.uni-koeln.de/ | https://subversion.rrz.uni-koeln.de/trac/eXist-A/browser/trunk/my/XRX/www/in | http://www.itineranova.be/
VdU-Editor | Monasterium.net/HKI Cologne | GPL3 | XRX/JavaScript | Free-Form/structured data | possible (configurable for any xsd-schema) | full XML-editor with hidden XML-syntax | https://github.com/icaruseu/mom-ca/wiki/How-to-Use-EditMOM3-Environment | https://github.com/icaruseu/mom-ca | www.monasterium.net; Virtuelles deutsches Urkundennetzwerk: www.vdu.uni-koeln.de
Crowd-Ed (this tool is not precisely for transcription; it is designed for metadata editing that will, among other descriptions, indicate type vs. manuscript docs to help determine which documents require OCR and which require manual transcription) | Martha Berry Digital Archive Project | c. 1500 (edited, but not yet transcribed) | yes | free, open source | Zend / PHP | structured | no | Omeka | crowdsourced metadata editing aligned with Dublin Core | https://github.com/gsbodine/crowd-ed | https://mbda.berry.edu
Bentham TSX | UCL and tranScriptorium | Yes | Yes | http://www.transcribe-bentham.da.ulcc.ac.uk/TSX/ | http://transcriptorium.eu/
Canadian Census - 1901, 1911, etc. | Automated Genealogy | >13 million census records transcribed, proofreading underway | Yes | Structured data (census records) | No | http://automatedgenealogy.com/census/
Velehanden.nl | http://picturae.com/nl/ | as of 6/18/12: >240,000 records | yes | Closed | Structured data | Volunteers earn points that can be used to purchase scans | http://www.velehanden.nl | militieregisters.nl
Apiary (defunct) | Botanical Research Institute of Texas | ? | Structured | No | OCR integration; Regions of Interest | http://www.apiaryproject.org/ | https://github.com/jbest/Apiary
Civil War Diaries & Letters Transcription Project (see DIYHistoryTranscribe) | The University of Iowa Libraries | As of 2/24/12: 9,043 pages | Free-Form | http://papyri.github.com/documentation/
DigiVol | Australian Museum
Family Search Indexing | Family Search | Proprietary | Structured data (Genealogy) | No | https://indexing.familysearch.org/newuser/nuhome.jsf?3.9.6
Harold "Doc" Edgerton Project | MIT | ? | Free-Form | http://edgerton-digital-collections.org/notebooks
North American Bird Phenology Program | USGS | 560,271 cards transcribed; 1,104,494 cards scanned | Structured data | No | http://www.pwrc.usgs.gov/bpp/
Smithsonian Digital Volunteer | Smithsonian museum | Many projects completed | No | https://transcription.si.edu/
typewright | 18c connect (Texas A&M) | Line-based OCR correction | No | ECCO/EEBO | http://www.18thconnect.org/typewright/
Hive | New York Times | No | Apache 2 | Go, JSON, Elasticsearch | Free-form transcription and structured tags | http://nytlabs.com/blog/2014/12/09/hive-open-source-crowdsourcing-framework/ | https://github.com/nytlabs/hive | Used for NYT's Madison project: http://madison.nytimes.com/