Enhancing Biodiversity Database Curation through Automated Extraction of Information from Unstructured Data
Roselyn S. Gabud
Solution
We developed information extraction methodologies to automatically extract information relevant to the distribution, reproduction, and habitat of species, that will aid human curators in updating biodiversity databases.
Problem Statement
Since much of our knowledge and investments about the natural world still remain in and are disseminated through scientific literature, manual creation and updating of biodiversity databases is becoming increasingly laborious.
Features
Textual Documents
Database
(e.g., graph database)
Unstructured Data
Structured Data
Information Extraction Tools
human curator
DB
“The main observation site was conserved forest at Mariveles, Bataan.”
Habitat
Geographic Location
NER
RE
[occur in]
Research Interests: Text Mining, Natural Language Processing, Information Extraction, Biodiversity Informatics