Note: The new and improved FLOSS Inventory can be found at:
Name | Description | Developers | Project website | Code repository | Quality of Documentation | Ease of Adaptation | Code Quality | Documentation (status/language) | Technical support available | License | Examples | Notes | Last release | Europeana Affiliation (if any) | Last activity | CATEGORY 1 | CATEGORY 2 | CATEGORY 3 | CATEGORY 4 | CATEGORY 5 | Available on Labs
Ace ACE (Auditing Control Environment) is a system that incorporates a new methodology to address the integrity of long-term archives using rigorous cryptographic techniques. ACE continuously audits the contents of the various objects according to the policy set by the archive, and provides mechanisms for an independent third-party auditor to certify the integrity of any object. ACE consists of two components. The first is an Audit Manager (AM) that checks files locally to ensure they have not been compromised. The second, the Integrity Management Service (IMS), issues tokens that the AM can use to verify that its local store of file digests has not been tampered with.
University of Maryland Institute for Advanced Computer Studies Preservation
Active_fedoraRubydora and ActiveFedora provide a set of Ruby gems for creating and managing objects in the Fedora Repository Architecture. ActiveFedora is loosely based on “ActiveRecord” in Rails. The 3.x series of ActiveFedora depends on Rails 3, specifically activemodel and
Project Hydra 2.05/2/2015Collection Management
ActiveTriplesAn ActiveModel-like interface for RDF data. Models graphs as Resources with property/attribute configuration, accessors, and other methods to support Linked Data in a Ruby/Rails environment.

This library was extracted from work on ActiveFedora. It is closely related to (and borrows some syntax from) Spira, but does some important things differently.
DPLA 2.022/8/2014Linked Open Data
AdveneThe Advene project (Annotate Digital Video, Exchange on the NEt) aims at providing a model and a format to share annotations about digital video documents (movies, courses, conferences...), as well as tools to edit and visualize the hypervideos generated from both the annotations and the audiovisual documents. With the Advene software, teachers, moviegoers, etc. can exchange multimedia comments and analyses about video documents. The project also aims at studying the way that communities of users (teachers, moviegoers, students...) will use these self-publishing tools to share their audiovisual "readings", and to envision new editing and viewing interfaces for interactive comment and analysis of audiovisual content.LIRIS Laboratory GPL V2.0 Annotation/TaggingMedia Applications
ALTO-EditorBrowser-based post-correction tool for ALTO XML filesRene van der Ark 2014Media Annotation/Tagging
AmaraAmara gives individuals, communities, and larger organizations the power to overcome accessibility and language barriers for online video. Amara is composed of three main parts: a subtitle creation and viewing tool (aka the widget), a collaborative subtitling website, and an open protocol for subtitle search/delivery. Participatory Culture Foundation GPL2/6/4/2014Media Applications
Amplifon: Sounds of Street View Sounds of Street View is a digital explorative sound experiment by Amplifon which gives users a 3-dimensional sound experience in a Street View environment. Utilising the Web Audio API platform, sounds are designated as though they were ordinary Google Maps markers, but instead of an image and information being assigned to the marker, a sound is
Amplifon License ManagementGeo-spatial applicationsMedia applications
AnnotatorThe Annotator is an open-source JavaScript library and tool that can be added to any webpage to make it annotatable. Open Knowledge Foundation is a well-structured GitHub with well-written documentation, including API docs. Also there is a dedicated website listing information about the tool and providing links to all the important information such as a tutorial, a link to a live demonstration page where you can test out the tool, a link to a plugin development page, etc. The extensibility of the tool is promoted on the main website. There is a link to a plugin development page containing a well-written, comprehensive tutorial. The code is maintained in GitHub. CoffeeScript is used as the main language and all files are packaged in a logical structure. In general, the code includes extensive comments. The code is published for download as a minified JavaScript file. A file with the authors of the code is available, listing 20 contributors to the code.Complete / ENyeseither MIT or GNU GPLv3 Annotation/CurationYes
AnnotoriousAnnotorious is a JavaScript annotation library for images and zoomable images. Link the Annotorious source files (CSS and JavaScript) into an existing HTML page, and images inside this page will be enhanced with an interactive drawing and commenting tool.Rainer Simon
Peter Pilgerstorfer Paul Weichhart
http://annotorious.github.io main website is very informative and its main menu includes links to a demo page, a getting started page, a plugin page, an API documentation page, and an about page. There is a plugin development page available. In combination with a comprehensive GitHub page, it shouldn’t be too hard to develop plugins. The code is maintained in GitHub. The JavaScript (using Google Closure) code seems clearly structured and includes comments. There is a useful page with information on how the code was built and can be adapted.yes (english)YesLGPL /
v0.6 (14/08/2013)actively ongoingMedia Annotation/TaggingYes
Apache PDFBoxThe Apache PDFBox™ library is an open source Java tool for working with PDF documents. This project allows creation of new PDF documents, manipulation of existing documents and the ability to extract content from documents. The Apache Software Foundation main website clearly lists a lot of useful information about the tool, e.g. software dependencies, the architecture and its underlying ideas, API docs and information on individually available command line tools. The software does not have a hands-on, easy-to-use plugin structure. The code is hosted in an SVN repository. This page explains how to build the software and how to create new branches for further development. Moreover, a page on coding conventions is linked from the main site.complete / ENyes, community basedApache License, Version 2.0Digital Asset Management Collection ManagementYes
Apache StanbolApache Stanbol (currently in incubation) is an open source modular software stack and reusable set of components for semantic content management. main website clearly lists various links with useful information such as the ideas and architecture of the software, usage scenarios, and a comprehensive ‘getting started’ section. There seems to be no direct plugin architecture; however, there is extensive documentation on development practices, which includes an overview of how to contribute your (Java) code to the SVN by following the Apache guidelines. When browsing the source code, it seems that it is well structured and that it incorporates Maven (pom.xml), which ensures the neat handling of dependencies and versioning. Moreover, it includes a benchmark directory, which indicates the system is well tested.complete / ENyesApache License, Version 2.0
5/31/2012have done a demo with LOD from EuropeanaSemantic EnrichmentCollection ManagementDigital Asset ManagementYes
Apache TikaThe Apache Tika toolkit detects and extracts metadata and structured text content from various documents using existing parser libraries. the home page there are links to a dedicated wiki, an issue tracker, mailing lists and for each version of Tika there is documentation on how to get started (you have to download the sources and build them with Maven), API docs and more. Tika has a plugin structure for parsers, which enables developers to extend Tika’s supported formats with their own. Developers are encouraged to use existing libraries to implement these custom parsers.Because it is from Apache and by looking at the structure of Java classes in the API docs, the quality of the code seems perfectly fine.complete / ENyes Apache License, Version 2.07/3/2013actively ongoingMetadata Retrieval ServicesSemantic EnrichmentYes
ArchivematicaArchivematica is a free and open-source digital preservation system that is designed to maintain standards-based, long-term access to collections of digital objects. This project is managed by Artefactual Systems in collaboration with the UNESCO Memory of the World's Subcommittee on Technology, the City of Vancouver Archives, the Museum of Modern Art (MoMA), the University of Alberta Libraries, the University of British Columbia Library, the Rockefeller Archive Center, Simon Fraser University Archives and Records Management, Yale University Library and a number of other collaborators. documentation, arranged by versions, divided between user and administration manual, well structured and informative, illustrated with screenshots that are aligned with the documentation text. Professionally created screencast gives a high level overview. Documentation hosted on Wiki. Microservices allow adapting workflows and easily integrating new services. Scale out is possible by adding new processing nodes. Service oriented, separation between ingest, storage, and access services, all have well defined APIs. Code is hosted on GitHub. High code quality (Python 89.7% JavaScript 6.3% Shell 3.2% Other 0.9%), well structured, not many comments, no continuous integration, tests are available.yes (EN)yesAGPL3 (user name:, password: demo)v0.10.1-vm (beta) May 2013actively ongoingDigital PreservationYes
ArchivesSpaceA next-generation archives management application that will incorporate the best features of Archivist’s Toolkit (AT) and Archon. The project team is developing a technical platform, governance structure, and service model that will provide the archival community with a cutting-edge, extensible, and sustainable platform for describing analog and born-digital archival materials. The ArchivesSpace product is being developed using an Agile scrum process, guided by a Product Vision for ArchivesSpace. Hudson Molonglo www.archivesspace.org documentation, well structured, providing many additional documentation resources. Extensive documentation index. ArchivesSpace has a backend for the major workflows, and a REST API and many interfaces for adapting the system. Code is hosted on GitHub. High code quality (Ruby 76.6% XSLT 8.9% JavaScript 7.7% CSS 6.0% Shell 0.8%), well structured, not many comments, no continuous integration, tests are available.Basic technical / ENyesECL 2.0This initiative is led by New York University, University of Illinois (Urbana-Champaign), and University of California, San DiegoArchivesSpace (March 2014) on-goingCollection ManagementYes
Archivists' Toolkit (AT)The AT is an open source archival data management system to provide support for the management of archives. The main goals of the AT are to support archival processing and production of access instruments, promote data standardization, promote efficiency, and lower training costs. Currently, the application supports accessioning and describing archival materials. Future functionality will be built to support repository user/resource use information, appraisal for archival materials, expressing and managing rights information, and interoperability with user authentication systems.Nathan Stevens on Github is sparse without links pointing to other documentation sources. Impression of a quick copy&paste documentation, unstructured text, no headlines, paragraphs, etc. Guidelines for contributing code. Javadoc for different versions. No instructions how to build the code on Github. Structure of documentation not easily comprehensible.Developer guide available as PDF. Some interfaces documented in short PDF Guide for developers. Medium code quality containing many trial and error sections that could be removed. Swing GUI codeGood / ENYesEducational Community License, v. 1.025/01/2013actively ongoingMetadata Mapping/Conversion/normalisation
Art4europeArt4Europe is an application that makes it easier for users to learn about historical monuments. During a museum visit, the user can take a picture of a painting, and the application retrieves data about the selected painting straight from the Europeana API, using sophisticated image-comparison methods.iTraff Technology: Tomasz Grzywalski,
Jakub Jurkiewicz,
Jakub Porzuczek, Marcin Szajek documentation available.No documentation available.No code available. NonenoUnreleased prototype, but will be open source; How many times have you been in an art museum without knowing anything about the paintings you were looking at? Imagine you could take a photo of any painting and get the description of the painting in a matter of seconds! And what if you could translate the description to any of the EU languages? And even have it read aloud for you? You can do all this with the Europeana dataset and the Art4Europe project!Unreleased prototypeHackathon winner6/8/2011Metadata Retrieval ServicesInteractive User Interface
Artoo.jsartoo.js is a piece of JavaScript code meant to be run in your browser's console to provide you with some scraping utilities.

This nice droid is loaded into the JavaScript context of any webpage
through a handy bookmarklet you can instantly install by dropping the
above icon onto your bookmark bar.
Guillaume Plique LicenseMetadata Retrieval Services
ASSETS 3D search demo EKETA-CERT:Michalis Lazaridis documentation available.No documentation available.Java code, well structured. Maven-based. Tests available. Single-person development. Complete /ENyesOpen Source / ASSETS6/13/2012Image SearchImage Similarity
ASSETS Annotation PropagationEKETA-CERT:Michalis Lazaridis website is a blog titled ‘European Education Technology Terms’. There is no search bar, so it would be very tedious to find out whether there is mention of this tool. The link to the code repository is a bare SVN repository without documentation. Since there is no documentation and just a bare SVN with a Java (Maven) project, it would be tedious to adapt this tool. The code looks like a standard Java / Maven project, including tests. There is no documentation whatsoever, so there is no urge to inspect this code any further.NonenoASSETSEnriching metadata
ASSETS Audio Search BMAT: Oscar Paytuvi website is a blog titled ‘European Education Technology Terms’. There is no search bar, so it would be very tedious to find out whether there is mention of this tool. There is no link to a code repository.There is no link to a code repository.There is no link to a code repository.NonenoASSETSInformation retrieval servicesSimilarity Search
ASSETS ClassificationCNR: Fabrizio Sebastiani, Andrea Esuli website is a blog titled ‘European Education Technology Terms’. There is no search bar, so it would be very tedious to find out whether there is mention of this tool. The link to the code repository is a bare SVN repository without documentation. Since there is no documentation and just a bare SVN with a Java (Maven) project, it would be tedious to adapt this tool. The code looks like a standard Java / Maven project, including tests. There is no documentation whatsoever, so there is no urge to inspect this code any further.NonenoASSETSData processing
ASSETS image search demoCNR: Giuseppe Amato website is a blog titled ‘European Education Technology Terms’. There is no search bar, so it would be very tedious to find out whether there is mention of this tool. The link to the code repository is a bare SVN repository without documentation. Since there is no documentation and just a bare SVN with a Java (Maven) project, it would be tedious to adapt this tool. The code looks like a standard Java / Maven project, including tests. There is no documentation whatsoever, so there is no urge to inspect this code any further.none /ENyesOpen Source / ASSETS6/12/2012Image similarityContent retrieval services Image search
ASSETS Knowledge ExtractionCNR: Fabrizio Sebastiani, Andrea Esuli website is a blog titled ‘European Education Technology Terms’. There is no search bar, so it would be very tedious to find out whether there is mention of this tool. The link to the code repository is a bare SVN repository without documentation. Since there is no documentation and just a bare SVN with a Java (Maven) project, it would be tedious to adapt this tool. The code looks like a standard Java / Maven project, including tests. There is no documentation whatsoever, so there is no urge to inspect this code any further.NonenoASSETSData processing
ASSETS Manual Annotation ToolINA: Patrick Couronet website is a blog titled ‘European Education Technology Terms’. There is no search bar, so it would be very tedious to find out whether there is mention of this tool. There is no link to a code repository.There is no link to a code repository.There is no link to a code repository.NonenoASSETSEnriching metadata
ASSETS Metadata CleaningCNR: Fabrizio Sebastiani, Andrea Esuli processing
ASSETS Metadata-based RankingCNR: Diego Ceccarelli, Claudio Lucchese SourceASSETSInformation retrieval services
ASSETS Preservation-NormalizationAIT: Sergiu Gordea, Andrew Lindley, Roman Graf SourceASSETSDigital Preservation
ASSETS Preservation-NotificationENG: Luigi Briguglio, Massimiliano Nigrelli, Marco Guarini SourceASSETSDigital Preservation
ASSETS Preservation-Risk ManagementAIT: Sergiu Gordea, Andrew Lindley, Roman Graf SourceASSETSDigital Preservation
ASSETS Query PersonalizationUPS: Tsuyoshi Sugibuchi, Nicolas Spyratos retrieval servicesUser engagement
ASSETS Query SuggestionCNR: Diego Ceccarelli, Claudio Lucchese SourceASSETSInformation retrieval services
ASSETS Relevance FeedbackEKETA-CERT:Michalis Lazaridis retrieval services
ASSETS Semantic Cross-LinkingEKETA-CERT:Michalis Lazaridis retrieval services
ASSETS Taxonomy-based NotificationUPS: Tsuyoshi Sugibuchi, Nicolas Spyratos processing
ASSETS User Generated ContentsCNR: Carlo Meghini, Nicola Aloia, Cesare Concordia SourceASSETSUser engagement
ASSETS Video SummarisationUAM: Fernando Lopez retrieval servicesSimilarity search
ASV ToolboxASV Toolbox is a modular collection of tools for the exploration of written language data. They work either on word lists or text and solve several linguistic classification and clustering tasks. The topics covered contain language detection, POS-tagging, base form reduction, named entity recognition, and terminology extraction. On a more abstract level, the algorithms deal with various kinds of word similarity, using pattern based and statistical approaches. The collection can be used to work on large real world data sets as well as for studying the underlying algorithms. The ASV Toolbox can work on plain text files and connect to a MySQL database. While it is especially designed to work with corpora of the Leipzig Corpora Collection, it can easily be adapted to other sources.University of Leipzig MITCollection ManagementOntology/Vocabulary Building and Management
Avalon Media SystemThe Avalon Media System is an open source system for managing large collections of digital audio and video filesIndiana University and Northwestern Universityhttp://www.avalonmediasystem.org / ENyes, community basedApache 2.0 Management
AVI MetaEditSimilar to the BWF MetaEdit tool that AVPS also developed in collaboration with FADGI, AVI MetaEdit supports embedding and validating metadata in RIFF-based AudioVisual Interleave format (AVI) video files. AVI is currently the target format for creation of Preservation Masters within the Digitization Services Branch at NARA.FADGI / NARA / ENyesNARA OPEN SOURCE AGREEMENT VERSION 1.32/17/20122/17/2012Metadata Mapping/Conversion/normalisationMetadata Checker/ValidationDigital Asset Management

Binarization and Conversion Tool

The BCT tool can be used to produce JPEG2000 or JPEG files of newspaper master images for presentation on the web. This tool calls two other tools: a binarization method from Basilis Gatos, which is optimised for OCR, and Kakadu, a software development kit for creating JPEG2000 images. Therefore, in order to fully use the features of BCT, both tools must be installed and licensed. However, feel free to call your own tools from BCT. University of Innsbruck NewspapersPublicationImage SearchMedia Checker/Validation
BitCurator AccessBitCurator Access software tools will assist collecting institutions (libraries, archives, and museums) in providing web-based and local access to born-digital materials held on disk images. BitCurator Access will focus on software that simplifies access to raw and forensically-packaged disk images, allowing collecting institutions to incorporate these objects into access environments in a manner that reflects the original order and relevant environmental context. The use of open source digital forensics software will allow for detailed analysis of file and file system provenance, quality and accessibility of files, metadata in files and the file system, and residual (non-file system) data contained within disk images.
University of North Carolina at Chapel Hill v 330/12/2014Collection Management
BHL Scan ListThis catalogue allows users to browse and search titles held by BHL-Europe and BHL member institutions using advanced filtering. It will be further developed and extended towards the Global References Index to Biodiversity (GRIB).Wolfgang Koller seem to be no mention of any downloadable source code. The only thing that might get you started is to use one of the listed contact email addresses.As there seems to be only an email address of the creators available, nothing can be said about the adaptability of the system.As there seems to be only an email address of the creators available, nothing can be said about the code quality.Minimal / ENyesBSD Retrieval Services
BIBFRAMETwo tools designed to help you evaluate MARC Bibliographic data in the BIBFRAME model: a comparison service and a transformation serviceLibrary of Congress Network Development and MARC Standards Office on the concepts behind BIBFRAME are quite extensive; however, documentation for developers seems to be limited to a not very descriptive README in GitHub. There is no plugin structure mentioned, but the tool’s functionalities (“convert MARC/XML Bibliographic records to BIBFRAME resources”) are fairly straightforward (there is an XQuery file for each format), so extending the tool for other types of bibliographic records should not be too hard. The most important bits of code are written in XQuery. Next to this there is an exhibit app in HTML. Both parts seem clear and are not hard to comprehend. However, there are not many comments.Complete / ENno Checker/ValidationMetadata Mapping/Conversion/normalisation
BibServerBibServer is a tool for quickly and easily sharing collections of bibliographic metadata. By making it simple for people to use this information, we make scholarship better. Open Knowledge Foundation the GitHub page, the functionalities of the tool are quite clearly stated (also a link to a live demo is provided). The link towards the detailed documentation is very useful in providing detailed information about the setup, configuration and usage of the tool. A mailing list is present as well. Other than a dedicated parser directory in the source code, there seems to be no direct explanation or means for extending the tool conveniently. The code structure seems organized. Besides overall documentation, there are not many comments in the code itself. The (Python) code is readable.Minimal / ENyesGNU Affero GPL 2012Metadata Retrieval ServicesCollection ManagementYes
BlacklightRoR gem discovery interface for any Solr index home page quickly and clearly gives an introduction to Blacklight and shows all the necessary links, e.g. towards a quickstart guide and the GitHub, to get started as a developer. The overall documentation is available in a wiki and seems mostly well written and clear.There is a dedicated section on how to contribute to Blacklight as well as a page that lists the (un)official add-ons that are available. With several examples and clear instructions, as well as a developer mailing list, it seems fairly straightforward to create custom extensions.The Ruby code contains useful comments. The main app is nicely structured according to MVC and includes YAML config files. The main JavaScript function clearly explains the structure and rationale of the code; the aim is to use or extend JQuery plugins as much as possible. Overall the code seems to be of good quality.Good / ENyesCreative Commons Attribution-Share Alike 3.0 United States License. and Browsingyes
BoltBolt is a tool for Content Management, which strives to be as simple and straightforward as possible. With this general user manual you will be able to use Bolt to produce and edit content for your website.Bob den Otter Management
Brat rapid annotation toolThis online tool can be used for a wide variety of annotation tasks, including visualization and collaboration.
brat is designed in particular for structured annotation, where the notes are not freeform text but have a fixed form that can be automatically processed and "interpreted" by a computer. brat also supports the annotation of n-ary associations that can link together any number of other annotations participating in specific roles. brat also implements a number of features relying on natural language processing techniques to support human annotation efforts. Other applications include entity mention detection, event extraction, coreference resolution, chunking, dependency syntax, meta-knowledge, and corpora annotation.
Sampo Pyysalo, Pontus Stenetorp, Goran Topić, Tomoko Ohta Annotation/CurationSemantic Enrichment
BWF MetaEdit ToolThis tool permits embedding, editing, and exporting of metadata in Broadcast WAVE Format (BWF) files. This tool can also enforce metadata guidelines developed by the Federal Agencies Audio-Visual Working Group, as well as recommendations and specifications from the European Broadcasting Union (EBU), Microsoft, and IBM.FADGI documentation is quite technical and not very attractive, but just enough to make use of the tool. There does not seem to be a way to download the source code. The tool itself does work well (used by the reviewer). There does not seem to be any way of getting or contributing to the source code. Nothing can be said of the code quality due to the unavailability of the code.Minimal / ENyes3/1/2012Metadata Mapping/Conversion/normalisationMetadata Checker/ValidationDigital Asset Management
Casual CuratorDeprecated prototype of the CasualCurator web application developed for the Hack4Europe hackathonDecipher team:
Mike Stapleton, Systems Simulation;
Mike Selway, Systems Simulation;
Eoin Kilfeather, Dublin Institute of Technology, GitHub page clearly states that the tool is no longer under development and is not being supported. There is no documentation, so the ease of adapting the system depends entirely on the quality of the code and how easy it is to get the system running. The Ruby code seems fairly straightforward, but hardly contains comments. Some tests are included. Minimal / ENnoThe Casual Curator prototype is no longer under development or 6/9/2011Hackathon winner6/9/2011Semantic ExtractionInteractive User InterfaceContent Retrieval Services
CatmanduCatmandu provides a suite of Perl modules to ease the import, storage, retrieval, export and transformation of metadata records. Nicolas Steenlant, Patrick Hochstenbach main site offers a brief introduction of the capabilities of Catmandu and offers an extensive tutorial. The GitHub code repository also contains developer documentation, but it needs to be generated after downloading (or you can find it by browsing the code repository). The developer documentation seems well written and quite extensive. The code repository in GitHub also includes a page directed to contributors, but does not seem to describe any plugin architecture. Since the code seems professionally maintained and fairly well documented, adding customizations to the code seems worth considering. The Perl code looks well structured, well written, and contains comments in the form of perldoc. There is a long list of tests available. Also the code is included in CPAN (a repository of Perl libraries).Good / ENnoGPL-2, or later11/2013ongoingMetadata Retrieval ServicesMetadata Mapping/Conversion/normalisationCollection ManagementYes
CesiumCesium is a JavaScript library for creating 3D globes and 2D maps in a web browser without a plugin. It uses WebGL for hardware-accelerated graphics, and is cross-platform, cross-browser, and tuned for dynamic-data visualization.AGI 2.0 Applications
CHContextCHContext is a JavaScript-based widget that provides a list of links to cultural heritage materials based on a predefined item of the website on which it is embedded (selected via a given jQuery HTML selector).Digital Libraries Team of Poznań Supercomputing and Networking Center GitHub page neatly explains how to install, configure and use the widget and also describes in what ways it can be extended. The widget can be easily extended with new search providers. Adding new languages and styling the widget is also conveniently adaptable. Other modifications should also be fairly easy because of the minimal amount of code. There is not a lot of code (most of it is in a single file). The code is not hard to understand.Good / ENyesEUPL V. 1.1 AwarenessContent Retrieval ServicesYes
ChicagoCodeXAn authoring and publishing environment for online catalogues with full scholarly apparatusThe Art Institute of Chicago and the Indianapolis Museum of Art. GPL ManagementCollection Management
CKANCKAN is an open-source DMS (data management system) for powering data hubs and data portals. CKAN makes it easy to publish, share and use data. Open Knowledge Foundation quality of the website and the available documentation is very good. The purpose and features, including the API, of CKAN are all documented very well. There is dedicated documentation on the writing of extensions. Moreover, there are 60 external extensions available, which can serve as examples. The code base is large, but the documentation is good, so writing these extensions shouldn’t be too hard. The Python code looks good. There are not many comments, but for the important objects, references to the API docs are given within the comments. The API docs are very detailed.Complete / ENyesAffero GNU GPL v3.0 ManagementYes
CMUSphinxCMUSphinx is a speaker-independent large vocabulary continuous speech recognizer released under BSD style license. It is also a collection of open source tools and resources that allows researchers and developers to build speech recognition systems.Carnegie Mellon University / ENyesBSD licenseEnriching metadataSpeech Recognition
CollectionSpaceCollectionSpace is an open-source collections management application that meets the needs of museums, historical societies, and other collection-holding organizations. CollectionSpace is designed to be configurable to each organization’s needs, serving as a gateway to digital and physical assets across an institution. The software is freely distributed via open-source licensing, and an active developer community ensures that CollectionSpace is continually improving. January 2014: Lyrasis is now the organizational home of CollectionSpace. (The project was initiated and led by Museum of the Moving Image. Transition team includes developers based at University of California, Berkeley and Fluid Project at Ontario College of Art and Design; and Jesse Martinez, Freelance developer and Service Provider.) Project Partners 2008-2013 included: University of California Berkeley, Fluid Project at Ontario College of Art and Design, and Centre for Applied Research in Educational Technologies (CARET) University of Cambridge. www.collectionspace.orgCurrent release 4.0: home page clearly provides a link to the documentation, which is very thorough and covers the most relevant topics such as: system requirements, how to configure & install it, how to use and maintain it and also how to develop customizations. The overall quality of this documentation is fairly good.CollectionSpace describes having a so-called hook system in place (which is used in e.g. Drupal or Wordpress), a design pattern to conveniently extend a system. Hook systems in general take some time to learn and in this case specific documentation on the hooks are yet to be added (some limited examples are available though).The different parts, namely the UI, services, application, tools, etc, are in different sections in GitHub, making the distinction clear. Each of these sections is sparsely documented, however the code looks good with JavaDoc and maven pom.xml files. 
Various guides / ENyesECL 2.0www.demo.collectionspace.orgCollection Management10/2013on-goingCollection ManagementStaging
Collective AccessWhirl-i-Gig documentation available on Wiki, well structured. Demo, Installation instructions, Upgrade instructionsPHP Web Application, easily extensible. No standards. Documentation for adaptation, API documentation available.Mainly PHP-based (PHP 86.7% JavaScript 10.8% CSS 1.9% Other 0.6%). Code hosted on GitHub. Highly active, last commit 2 days old. Code is well written and documented. Separation into modules, clean code layout. Bugtracker available.Complete /ENyesGNU GPL v.2 act as both OAI-PMH client and server1/2012None (though used by some Europeana data providers)Collection ManagementContent Retrieval ServicesExhibition Management
collection-wallcollection-wall is an early labs experiment for displaying collection objects as a waterfall-like wall of images.Cooper Hewitt VisualizationMedia Applications
COMETPerl-based MARC21 analysis tool and Perl-based MARC21 to RDF publishing toolEd Chamberlain, Cambridge University Library / ENnoGPL2011Metadata Retrieval ServicesLinked Open Data
ConjectureConjecture is a modular, extensible, open-source C++ framework for Optical Character Recognition (OCR). Conjecture is not a single OCR, but rather is an extensible collection of OCRs that can be explored, analyzed, compared, extended, modified, and merged within a unified environment. unknown Asset Management
ConservationSpaceThe goal of ConservationSpace is to develop an open-source software application that will address a core need of the conservation community for a shared solution to the problem of documentation management. The conservation community has long recognized that a digital approach to managing its documentation would improve continuity in procedures, increase access, expand research opportunities, and better ensure the preservation of its documents.www.conservationspace.orgStill no documentation about software available.Still no documentation about software available.No code available
The project is being led by the National Gallery of Art (Washington, D.C., USA), with participation from Yale University Special Collections Library, Courtauld Institute, British Museum, Indianapolis Museum of Art, Metropolitan Museum of Art, Statens Museum for Kunst, and the Denver Art Museum. Sirma ITT was awarded the development contract in 2013. Version 1.0 is expected to be released in 2014. Collection Management
Constraint Analysis Tool Data and digital analysis tools.Kathleen Kenney documentation available, not sufficiently structured, more notes than software documentation.Set of PHP pages can be easily integrated.Pure PHP code, MySQL database (two tables), no separation between code logic and layout (HTML interwoven with PHP). Last commit two years ago.Minimal / ENyes11/2011Content Retrieval ServicesCrawl analysis
CoralCORAL is an Electronic Resources Management System consisting
of interoperable modules designed around the core components of
managing electronic resources.
University of Notre Dame's Hesburgh Libraries presentation, well-structured, demo available. Documentation is extensive, available on GitHub.Set of PHP pages, MySQL database.PHP application (PHP 58.5% JavaScript 38.2% CSS 3.4%), medium activity, no separation between code logic and layout (HTML interwoven with PHP), last commit 7 months ago. Complete / ENyesGeneral Public License Retrieval ServicesCollection Management
CubicWebCubicWeb is a semantic web application
framework, licensed under the LGPL, that empowers developers to efficiently
build web applications by reusing components (called cubes)
and following the well known object-oriented design principles.
Its main features are:
an engine driven by the explicit data model of the application;
a query language named RQL, similar to W3C’s SPARQL;
a selection+view mechanism for semi-automatic XHTML/XML/JSON/text generation;
a library of reusable components (data model and views) that fulfill common needs;
the power and flexibility of the Python programming language;
the reliability of SQL databases, LDAP directories, Subversion and Mercurial for storage backends.
LogiLab ManagementSemantic ExtractionDigital Asset Management
Cross-Platform Authentication - Authorization ProviderHybrid media devices, which can deliver audio, video and interactive content over both broadcast and broadband, create new opportunities and
challenges for broadcasters. Augmenting the broadcast experience with interactive content delivered over the Internet changes the classic one-to-many paradigm bringing it closer to a one-to-one relationship.
Cross-Platform Authentication (CPA) offers an open standard for associating any media device with an online identity, which facilitates delivery of personalized services to these devices.
31/7/2014Media Applications
Cultural Enrichment Mashificator Collaboration tools.Jeremy Ottevanger presentation of the idea together with a demo. There does not seem to be any intention to disseminate the code. No documentation available. Broken links.PHP integration documented.No code available NonenoUnreleasedEuropeanaHackathonsContent Retrieval ServicesMetadata Retrieval Services
D3-cartogramContinuous area cartograms with d3 and TopoJSONShawn Allen main site briefly describes what the tool is and what dependencies the tool has. A prominent, but somewhat easy to overlook, link to GitHub is available. On GitHub there is a short readme, which mainly points to an example index.html.The example HTML file includes quite a number of JavaScript functions that need to be examined in order to understand how it all works. Besides this, the code is small and it shouldn’t be too difficult to adapt the code to use different (geographical) input.There is not much code and it does look readable, but there are not many comments to help you. Because the code uses d3.js and geographical algorithms, it takes a while to get into.NonenoMIT licenseGeo-spatial ApplicationsData VisualisationYes
D3D3.js is a JavaScript library for manipulating documents based on data. D3 helps you bring data to life using HTML, SVG and CSS. D3’s emphasis on web standards gives you the full capabilities of modern browsers without tying yourself to a proprietary framework, combining powerful visualization components and a data-driven approach to DOM manipulation. Dustin Ewers License Visualization
DAITSSDAITSS is a digital preservation software applicationFlorida Center for Library Automation (FCLA) is a detailed wiki on how to install the system. The layout of the system architecture is available in well written, and extensive, PDF documents. The code base is huge as it is a complete digital preservation system. There does not seem to be a straightforward plugin structure available, but with the available documentation the system could be adapted (if necessary).The code is very extensive and uses both Ruby and Java and several other different technologies. There are tests available. The code is occasionally commented.Complete / ENnoGNU General Public License, Version 3 PreservationYes
datdat is an open source tool that enables the sharing of large datasets, allowing for a decentralized collaboration flowMax Ogden main page embeds a very informative YouTube video on a talk the creator gives about the tool. The documentation on GitHub is extensive, well written and even illustrated. It gets you started and enthuses you to contribute.The code base is well documented and quite small. Adapting the system, e.g. to support new formats, should not be too hard.The code is well documented, concise and straightforward to understand. In the code itself, however, there appear to be no comments.Good / ENnoBSD Licensestill pre-alphaMetadata Mapping/Conversion/normalisationCollection ManagementYes
Data ExplorerData import, graphing and sophisticated data transformation. It builds on ReclineJS + oauth / github connectors
borrowed from Prose. It's part of an ongoing plan to create a "Data Orchestra" of
lightweight data services that can play nicely together with each
other and connect to things like the DataHub or GitHub.

Rufus Pollock, Michael
Aufreiter, Chris Herwig, Mike Morris and others main site basically shows a link towards an example (and a way to try out the tool with your own data) and a link towards GitHub. The GitHub documentation is minimal. Even the developer documentation, found in a separate folder on GitHub, is not very extensive, although it does include a picture of the system architecture.There is no particular documentation on how to extend the tool. Also there does not seem to be a straightforward plugin structure. The code base however is not too large, so adaptation could be done with some effort.There do not seem to be descriptive comments in the code. The JavaScript code uses Backbone.js so it has a logical MVC structure, making it easier to decompose and understand.Good/ENnoMIT license Retrieval ServicesData Visualisation
Data PipesData Pipes is a service to provide streaming, "pipe-like" data transformations on the web – things like deleting rows or columns, find and replace, head, grep etc.Open Knowledge Foundation site starts off with a useful example of what the tool can do. Following the example the API is detailed. At the bottom you can find a link to GitHub, which contains basic practical documentation on how to install & run the tool.The documentation mentions that contributing is “super easy” and points you to a list of issues. Other than that there does not seem to be a specific plugin architecture or obvious way to extend the tool.The code base is small. The code looks well written. Occasionally there are comments.Good/ENnoMIT License Mapping/Conversion/normalisationYes
DBpedia SpotlightDBpedia Spotlight is a tool for automatically annotating mentions
of DBpedia resources in text, providing a solution for linking
unstructured information sources to the Linked Open Data cloud through
Pablo Mendes (Freie Universität Berlin), Jun 2010-present.
Jo Daiber (Charles University in Prague), Mar 2011-present.
Prof. Dr. Chris Bizer (Freie Universität Berlin), supervisor, Jun 2010-present. documented, short documentation on GitHub, further documentation on Wiki, well structured and written.REST Webservices allow integration and adaptation.Java/Scala based project (Java 48.7% Scala 48.1% Shell 1.7% Python 1.5%), actively maintained, last commit days ago, continuous integration, well-structured, Maven-based build, tests available. Many branches and contributors.complete / ENyesApache License, 2.0, LingPipe10/2012Linked Open DataData Annotation/CurationYes
Dédalo: Intangible Heritage management and Oral History Semantic RDF data sourcesJuan Franciso Onielfa, Alejandro Peñahttp://www.fmomo.org only available in Spanish, PDF documents.No documentation available.Web application (HTML, CSS, JavaScript, PHP, MySQL). Download only possible after registration, code not available.Complete /ESYesGNU GPL v3 2012Collection ManagementContent Retrieval Services
DedupeA Python library for accurate and scalable data deduplication and entity-resolution.Gregg, Forest, and Derek Eder documented and structured documentation. Extensible.Java/Scala based (Java 79.7% Scala 11.7% XSLT 5.1% JavaScript 3.1% Groovy 0.5%), hosted on GitHub, actively maintained, tests available, code documented, well written and structured.good/enMIT License2013Collection Management
Delving Platform: CultureHub and SIP-CreatorDelving has developed and refined an open source platform with tools specifically tailored to the needs of the cultural heritage domain.

Manuel Bernhardt, Gerald de Jong, Eric van der Meulen, Sjoerd Siebinga, Thomas Wikman, Juliane Stillerhttp://delving.eu documented and structured documentation. Extensible.Java/Scala based (Java 79.7% Scala 11.7% XSLT 5.1% JavaScript 3.1% Groovy 0.5%), hosted on GitHub, actively maintained, tests available, code documented, well written and structured.Good / ENYesEUPL, Apache 2.0 contract: Delving CreditsMonthly cycleSjoerd, Eric, Gerald were original members of the Europeana development teamActive nowMetadata Mapping/Conversion/normalisationMetadata Checker/ValidationMetadata Retrieval ServicesYes
Digital Preservation Software Platform (DPSP)The DPSP is a collection of software applications which support the goal of digital preservation.National Archives of Australia presentation, documentation short and concise, documentation hosted on SourceForge. UnknownBuilds hosted on SourceForge, no source code available.Complete / ENnoGPLv37/31/2013Digital Preservation
DigitalLabelsDjango app to generate an interactive gallery label interface, optimised for touch screens and tablets.V&A Digital Media team presentation, Basic documentation, well written and structured, documentation hosted on GitHub Pages. No APIs or documentation about interfaces available.Python/JavaScript based (Python 65.2% JavaScript 34.8%), hosted on GitHub, not being actively maintained, not many comments in code, but readable.Good / ENnoBSD LicenseExhibition Management
DigitateDigitate is a free application designed for use on the iOS platform, specifically on iPad devices. The application allows scholars and enthusiasts with an interest in the visual and material elements of a cultural artefact to make notes and annotations directly on an image of such an artefact. For example, a literary scholar might use it to annotate the material or bibliographic elements of a rare text or first edition, while an art historian might do the same on an image of a painting. OpenDHAT BY-SAMedia Annotation/Tagging
digilibDigilib is a web-based client/server technology for images. The image content is processed on-the-fly by a Java Servlet on the server side so that only the visible portion of the image is sent to the web browser on the client side. It supports a wide range of image formats and viewing options on the server side while only requiring an internet browser with JavaScript and a low bandwidth internet connection on the client side. digilib enables very detailed work on an image as required by scientists, with elaborate zoom features such as an option to show images on the screen in their original size. digilib facilitates cooperation of scientists over the internet and novel uses of source material by image annotations and stable references that can be embedded in URLs. digilib is used in the ALCATRAZ framework of image, text and annotation tools and many other projects. for the History of Sciencethe University of Bern GPLMedia Annotation/Tagging
Diva.jsDiva.js (Document Image Viewer with AJAX) is a
JavaScript frontend for viewing documents, designed to work with digital
libraries to present multi-page documents as a single, continuous item.
Only the pages that are being viewed at any given time are actually
present in the document, with the rest appended as necessary, ensuring
efficient memory usage and high loading speeds. Written as a jQuery plugin, diva.js requires the jQuery JavaScript library. Diva's back end is the IIPImage server.

Distributed Digital Music Archives and Libraries w/ attribution9/12/2014PublicationExhibition Management
djatokadjatoka is open source Java software that builds upon a rich set of APIs and libraries to provide a service framework for the dynamic dissemination of JPEG 2000 image files. available on wiki, many broken links. Last wiki update 2009.No plugin mechanism. Adaptation requires changing Java code.Ant-based build, Java project. Code is well structured and documented. Complete / ENyes, community basedLGPLv2 Retrieval ServicesCollection Management
Doc ViewerHTML5 document viewerThe New York Times documentation, well written and structured, easily comprehensible.Project fulfills specific purpose well, adaptation is easily possible by adapting Ruby scripts.Mainly JavaScript, some Ruby project (JavaScript 73.0% CSS 26.9% Ruby 0.1%). Clean code, many comments support readability of code. Actively maintained, many contributors, code hosted on GitHub.Minimal / ENnoApache License, v2.0Media ApplicationsYes
DocSplitDocsplit is a command-line utility and Ruby library for splitting apart documents into their component parts: searchable UTF-8 plain text, page images or thumbnails in any format, PDFs, single pages, and document metadata (title, author, number of pages...)Jeremy Ashkenas, DocumentCloud and concise documentation, well written and structured, easily comprehensible.Project fulfills specific purpose well, adaptation is easily possible by adapting Ruby scripts.Pure Ruby project. Clean code, tests available, many comments support readability of code. Actively maintained, many contributors, code hosted on GitHub.Complete / ENnoLGPLv 0.7.2 February 21, 2013Collection ManagementYes
DSpaceDSpace open source software is a turnkey repository application DSpace Foundation high quality software documentation, commercial support.Complex product ecosystem.Maven-based, highly modularized, tests available, clean code, mainly Java, many comments explaining concepts and supporting readability. Good object oriented design.Complete/ENyesOpen Source: ManagementMetadata Retrieval ServicesRepository SoftwareYes
EADitor Data and digital management tools.Ethan Gruber, American Numismatic Society documentation available on GitHub reads somewhat like an unstructured copy-and-paste text blog. But there is a lot of information available, and blog posts explain the concepts of the software and give further examples.REST or SOAP interfaces allow easy integration. No plugin mechanism.CSS/XSLT/JavaScript-based project hosted on GitHub (CSS 62.3% XSLT 17.3% JavaScript 15.0% XProc 5.4%). One contributor (highly active GitHub committer), the code is being actively maintained. A lot of XSLT is not commented but readable.Good / ENyesApache License 2.0 dissemination through Atom and OAI-PMH.1112 beta (December 2011)June 2012Collection ManagementMetadata Retrieval Services
easyLODSimple framework for exposing Linked Open Data from arbitrary sources.Mark Jordan available on GitHub page. Documentation and instructions for applying data source plugins, resource URIs and mapping namespaces in resource URIs to data source plugins. Also provides info on installing and testing Easy LOD. Instructions also available within the code repository. Easy to access, read, and follow.The GitHub page explains having a plugin structure for converting different data sources (to LOD), so creating a plugin for custom content sources should be fairly straightforward. Moreover, the codebase is small and clear, so further adapting the tool should not be too difficult.The Slim PHP micro framework is used, making the code and all configurations not hard to grasp. The code itself contains several useful comments.Minimal / ENnoMIT LicenseLinked Open DataMetadata Retrieval ServicesYes
EasyRDFA PHP library to quickly parse RDF. It supports URL redirect and simple "queries" via API. 3-Clause 31/21/2013Linked Open DataMetadata Retrieval Services
Echoprint Internet design and authoring tools.The Echonest in partnership with MusicBrainz available that is easy to navigate; however, it’s possible to get linked back to the massive Echonest webpage. Plenty of resources and tutorials to help users get started and along the way. Wrong link listed for code repository at GitHub; the current link takes you to all the Echonest apps. This is the more specific link: offers several useful services which can be used to build your own applications. Extending these services for your own purposes is most likely not needed, except for maybe solving bugs.The code of most tools/services seems to be fairly well organized and commented.Good / ENyesThe code generator is MIT and the server is Apache 222/4/2013
Content Retrieval ServicesMetadata Retrieval ServicesYes
EFG Authority File Manager Web Publication Paolo ManghiNo links or content available. Demo link does not work.No links or content available. No links or content available. Noneno metadata
EFG Metadata Editor Franca DeboleNo available or working links.No available or working links.No available or working links.Noneno metadata
ElasticSearchElasticSearch is a distributed RESTful search engine built for the cloud. user friendly website with extremely thorough documentation. Training, development support, and production support are all available. GitHub page with notes and directions. Blog is updated regularly and is up-to-date. There are also very interesting and helpful case studies available to see how groups have made use of ElasticSearch.
One downside of the documentation is that it’s sometimes hard to find concrete examples of everyday use cases.
ElasticSearch has many configuration options for building the search engine needed for your use case. Extending the tool itself is possible by contributing on GitHub, but probably isn’t something you would need to do.The (mainly Java) code seems well documented and commented. Complete / ENyesApache License, Version 2.0Search and BrowsingContent Retrieval ServicesMetadata Retrieval ServicesCollection Management
ERMesElectronic Resource Management (ERM) SoftwareWilliam Doering, Galadriel Chilton not very clear and requires a lot of downloading. There is a semi-active forum for discussion and help and some user generated instructional videos have been provided. No readily visible documentation.The code is only available as an Access 2007 file, which makes adapting the code not very easy for most contemporary developers.Without Access 2007 the code could not be inspected by the reviewer.Good / ENyesAttribution Assurance License6/1/2010Metadata Retrieval Services
Europeana 4DOur europeana 4D interface – e4D – enables comparative visualisation of multiple queries and supports data annotated with time span data.Ralf Stockmann, Gerik Scheuermann, Stefan Jänicke, Christian Mahnke / ENyesUnreleased prototype, but will be open source prototypeThoughtLab / EuropeanaConnect2011-07-29Metadata Retrieval ServicesData VisualisationGeo-spatial ApplicationsYes
Europeana API geo-wrapperAdrià Mercader, Independent developer / ENNo thin wrapper around the Europeana API that transforms the results feed into GeoRSS. It also includes a demo application that uses the geo wrapper to perform spatial queries. winner4/5/2011Metadata Retrieval ServicesGeo-spatial Applications
Europeana Geo Search app for AndroidGeorg Petz, Austrian National Library Android App shows you records from Europeana with geo coordinates near your location. It uses to get GeoRSS results. 4/2/2011Hackathon winner4/2/2011Content Retrieval ServicesMetadata Retrieval ServicesGeo-spatial Applications
Europeana Language Resource RepositoryThe Europeana Language Resource Repository collects resources which are necessary for query translation and vocabulary mapping in Europeana.Aaron Kaplan - EuropeanaConnect (EN)NoUnknownEuropeanaConnectMultilingualityOntology/Vocabulary Building and Management
Europeana Search API component for Joomla 1.6Europeana Search API component for Joomla 1.6.Pictura: Mark Lindeman / ENNoGPL2011Hackathon winnerCollection ManagementMetadata Retrieval Services