Dataset | Title | Description | Organization | part of dataset | Website | License | License Document | Publication | Comment | Source Data Update Frequency | Bio2RDF script up to date | Bio2RDF Example Record URI | Bio2RDF SPARQL endpoint | Bio2RDF RDF Download URL | CKAN
affymetrix | Affymetrix Probesets | Affymetrix probesets | Affymetrix |  | http://affymetrix.com | non-commercial redistribution | http://media.affymetrix.com/support/developer/SDK_and_NetAffx_Data_Non_Commercial_Use.pdf |  | data no longer available | yearly | yes | http://bio2rdf.org/affymetrix:246206_at | http://affymetrix.bio2rdf.org/sparql | http://download.bio2rdf.org/release/2/ | http://thedatahub.org/dataset/bio2rdf-affymetrix
archival | NLM archives | References from toxicology literature | NLM | toxkb |  | research non distributable to third party without license |  |  |  |  | yes | http://bio2rdf.org/archival:archival_e8613685f300bfc7d7b9741c3418f430 | http://toxkb.bio2rdf.org/sparql |  |
atlas |  |  |  |  |  |  |  |  |  |  |  | http://bio2rdf.org/omim:604640, http://bio2rdf.org/uniprot:Q92781 |  |  |
bind | Biomolecular Interaction Network Database | The BIND collection of interactions, molecular complexes and pathways. |  | irefindex |  | public domain |  | pubmed:15608229 |  | none | N/A | http://bio2rdf.org/bind:2375 | http://bind.bio2rdf.org/sparql |  |
bio2rdf.atlas | Bio2RDF Atlas | A warehouse of Bio2RDF datasets - omim, pubmed, mgi, hgnc, chebi, prodom, mesh, obo, pdb, geneid, ligand, pfam, genbank, uniprot, uniref | Bio2RDF |  |  | public domain |  | pubmed:18472304 | one-time data warehouse | none | N/A |  | http://atlas.bio2rdf.org/sparql |  |
biocarta | BioCarta | The BioCarta collection of pathways. |  | pathwaycommons | http://www.biocarta.com/genes/index.asp | by attribution; no resell | http://www.biocarta.com/legal/terms.asp |  |  |  | N/A | http://bio2rdf.org/biocarta:h_HivnefPathway | http://biocarta.bio2rdf.org/sparql |  |
biocyc | BioCyc | The BioCyc collection of Pathway/Genome Databases (PGDBs) | SRI | pathwaycommons | http://biocyc.org/ | research and education; fee for commercial use | http://biocyc.org/download.shtml |  |  |  | N/A | http://bio2rdf.org/biocyc:catalysis1399454 | http://biocyc.bio2rdf.org/sparql |  |
biomodels | BioModels | BioModels Database is a repository of peer-reviewed, published, computational models | EBI |  | http://www.ebi.ac.uk/biomodels-main/ | public domain |  |  |  |  | yes | http://bio2rdf.org/biomodels:BIOMD0000000214 | http://biomodels.bio2rdf.org/sparql |  |
biopax | BioPax | Contains cpath, reactome, biocyc |  |  |  | public domain |  |  |  |  | N/A | http://bio2rdf.org/cpath:CPATH-9978CPATH-9978 | http://biopax.bio2rdf.org/sparql |  |
bioportal | NCBO bioportal | The National Center for Biomedical Ontology provides 300+ terminologies and ontologies through BioPortal | NCBO |  | http://bioportal.bioontology.org/ | various |  |  |  | continuous | yes | http://bio2rdf.org/fma:37633, http://bio2rdf.org/cl:0000825 |  | http://download.bio2rdf.org/release/2/ncbo/ |
ccris | Chemical Carcinogenesis Research Information System | Carcinogenicity and mutagenicity test results for over 8,000 chemicals | NLM |  | http://toxnet.nlm.nih.gov/cgi-bin/sis/htmlgen?CCRIS | research non distributable to third party without license |  |  |  |  | yes | http://bio2rdf.org/ccris:ccris_5d53ce200cde223b5bc3d72eb509ddd0 | http://toxkb.bio2rdf.org/sparql |  |
cebs | Chemical Effects in Biological Systems | The CEBS database houses data of interest to environmental health scientists. CEBS is a public resource, and has received depositions of data from academic, industrial and governmental laboratories. CEBS is designed to display data in the context of biology and study design. | NIEHS |  | http://www.niehs.nih.gov/research/resources/databases/cebs/index.cfm | public |  |  | SIFT file format; may have changed |  | no | http://bio2rdf.org/cebs:7700e18d1d6fcf41dc18ed212578ff49 | http://toxkb.bio2rdf.org/sparql |  |
chebi | Chemical Entities of Biological Interest | Chemical Entities of Biological Interest (ChEBI) is a freely available dictionary of molecular entities focused on ‘small’ chemical compounds. | EBI | bioportal | http://www.ebi.ac.uk/chebi/ | public domain |  |  |  | monthly | N/A | http://bio2rdf.org/chebi:37889 | http://chebi.bio2rdf.org/sparql |  | http://thedatahub.org/dataset/bio2rdf-chebi
chembl | ChEMBL | ChEMBL is a database of bioactive drug-like small molecules; it contains 2-D structures, calculated properties (e.g. logP, Molecular Weight, Lipinski Parameters, etc.) and abstracted bioactivities (e.g. binding constants, pharmacology and ADMET data). | EBI |  | https://www.ebi.ac.uk/chembl/ | CC-SA | https://www.ebi.ac.uk/chembl/ |  |  |  | yes |  |  |  |
ctd | Comparative Toxicogenomics Database | Curated data describing cross-species chemical–gene/protein interactions and chemical– and gene–disease associations to illuminate molecular mechanisms underlying variable susceptibility and environmentally influenced diseases. |  |  | http://ctdbase.org | research and education; cite & link | http://ctdbase.org/about/legal.jsp |  |  |  | yes | http://bio2rdf.org/mesh:D001554, http://bio2rdf.org/ctd_vocabulary:Chemical-Disease-Association | http://ctd.bio2rdf.org/sparql | http://download.bio2rdf.org/release/2/ctd |
dbpedia | DBPedia | The structured data of Wikipedia |  |  | http://dbpedia.org | public domain |  |  |  |  | yes |  | http://dbpedia.bio2rdf.org/sparql |  |
drugbank | DrugBank | Resource that combines detailed drug (i.e. chemical, pharmacological and pharmaceutical) data with comprehensive drug target (i.e. sequence, structure, and pathway) information. |  |  | http://www.drugbank.ca | research and education; cite | http://www.drugbank.ca/about |  |  |  | yes | http://bio2rdf.org/drugbank:DB01381 | http://drugbank.bio2rdf.org/sparql | http://download.bio2rdf.org/release/2/drugbank |
dsstox | Distributed Structure Searchable Toxicity Database | Resource that houses and publishes downloadable, structure-searchable, standardized chemical structure files associated with chemical inventories or toxicity data sets of environmental relevance. | U.S. EPA |  | http://www.epa.gov/ncct/dsstox/ | public |  |  |  |  | yes |  |  |  |
ec | Enzyme Classification | The IUBMB Enzyme Classification |  |  |  |  |  |  |  |  | no | http://bio2rdf.org/ec:1.1.1.1 | http://ec.bio2rdf.org/sparql |  |
emic | Environmental Mutagen Information Center | References from toxicology literature | NLM |  |  | research non distributable to third party without license |  |  |  |  | yes | http://bio2rdf.org/emic:emic_2ba37517b06cd6e01aaff1b68de65264 |  |  |
genbank | Genbank | NIH genetic sequence database, an annotated collection of all publicly available DNA sequences | NCBI |  | http://www.ncbi.nlm.nih.gov/genbank/ | public domain | http://www.ncbi.nlm.nih.gov/About/disclaimer.html |  |  |  | no | http://bio2rdf.org/genbank:AK096541 | http://genbank.bio2rdf.org/sparql | http://download.bio2rdf.org/release/2/genbank |
geneid | NCBI Gene | NCBI Gene supplies gene-specific connections in the nexus of map, sequence, expression, structure, function, citation, and homology data. Unique identifiers are assigned to genes with defining sequences, genes with known map positions, and genes inferred from phenotypic information | NCBI |  | http://www.ncbi.nlm.nih.gov/gene | public domain | http://www.ncbi.nlm.nih.gov/About/disclaimer.html |  |  |  | yes | http://bio2rdf.org/geneid:4348855 | http://geneid.bio2rdf.org/sparql | http://s4.semanticscience.org/bio2rdf_download/rdf/geneid |
genetox | NLM Genetic Toxicology Databank | Genetox is a datafile containing legacy experiments testing for potential genotoxicity | NLM |  | http://toxnet.nlm.nih.gov/cgi-bin/sis/htmlgen?GENETOX | research non distributable to third party without license |  |  |  | quarterly | yes |  |  |  |
goa | Gene Ontology Annotations | high-quality Gene Ontology (GO) annotations to proteins in UniProt |  |  | http://www.ebi.ac.uk/GOA/ | copyright |  |  |  |  | yes | http://bio2rdf.org/goa_resource:101M_A_1 | http://goa.bio2rdf.org/sparql | http://s4.semanticscience.org/bio2rdf_download/rdf/goa |
hgnc | HUGO Gene Nomenclature Committee | unique gene symbols and names to over 33,000 human loci |  |  | http://www.genenames.org/ | copyright |  |  |  |  | yes | http://bio2rdf.org/hgnc:25990 | http://hgnc.bio2rdf.org/sparql | http://s4.semanticscience.org/bio2rdf_download/rdf/hgnc |
hivdb | human immunodeficiency virus type 1 (HIV-1) Human protein interaction database | A database of HIV-1 human protein interactions |  |  | http://gnode1.mib.man.ac.uk/HIV1-text-mining | public domain |  |  | data no longer available | none | no | http://bio2rdf.org/hhpid:155945-23291-1 | http://hhpid.bio2rdf.org/sparql | http://s4.semanticscience.org/bio2rdf_download/rdf/hivdb |
homologene | Homologene | automated detection of homologs among the annotated genes of several completely sequenced eukaryotic genomes. | NCBI |  | http://www.ncbi.nlm.nih.gov/homologene | public domain | http://www.ncbi.nlm.nih.gov/About/disclaimer.html |  |  | yearly | yes | http://bio2rdf.org/homologene:47 | http://homologene.bio2rdf.org/sparql | http://s4.semanticscience.org/bio2rdf_download/rdf/homologene |
hsdb | Hazardous Substances Databank | Comprehensive, peer reviewed toxicology data for approximately 5000 chemicals | NLM |  | http://toxnet.nlm.nih.gov/cgi-bin/sis/htmlgen?HSDB | research non distributable to third party without license |  |  |  |  | yes | http://bio2rdf.org/hsdb:ac3870fcad1cfc367825cda0101eee62 | http://toxkb.bio2rdf.org/sparql |  |
inoh | Integrating Network Objects with Hierarchies Pathway Database | A database of metabolic and signalling pathways |  |  | http://www.inoh.org/ | copyright |  |  |  | yearly | no |  | http://inoh.bio2rdf.org/sparql |  |
interpro | InterPro | InterPro is an integrated database of predictive protein signatures used for the classification and automatic annotation of proteins and genomes. InterPro classifies sequences at superfamily, family and subfamily levels. InterPro has the following member databases: GENE3D, HAMAP, PANTHER, PIRSF, PRINTS, PROSITE patterns, PROSITE profiles, Pfam, PfamB, ProDom, SMART, SUPERFAMILY, TIGRFAMs | EBI |  | http://www.ebi.ac.uk/interpro/ | copyright; free to use and distribute | http://www.ebi.ac.uk/interpro/release_notes.html | pubmed:22096229 |  | 2-3 months | yes | http://bio2rdf.org/interpro:IPR001478 | http://interpro.bio2rdf.org |  |
ipi | International Protein Index | A database of cross references between the primary data sources | EBI |  | http://www.ebi.ac.uk/IPI/ |  |  |  | deprecated. final release was produced on the 27th September 2011. replaced by uniprot | none | yes |  | http://ipi.bio2rdf.org/sparql | http://download.bio2rdf.org/release/2/ipi/ |
iproclass | iProClass | The iProClass database provides value-added information reports for UniProtKB and unique UniParc proteins, with links to over 90 biological databases, including databases for protein families, functions and pathways, interactions, structures and structural classifications, genes and genomes, ontologies, literature, and taxonomy. |  |  | http://pir.georgetown.edu/iproclass/ |  |  |  |  |  | yes | http://bio2rdf.org/uniprot:Q6GZX0 | http://iproclass.bio2rdf.org/sparql | http://s4.semanticscience.org/bio2rdf_download/rdf/iproclass |
irefindex | iRefIndex | iRefIndex provides an index of protein interactions available in a number of primary interaction databases including BIND, BioGRID, CORUM, DIP, HPRD, InnateDB, IntAct, MatrixDB, MINT, MPact, MPIDB, MPPI and OPHID |  |  | http://irefindex.uio.no | CC-BA | http://irefindex.uio.no/wiki/README_MITAB2.6_for_iRefIndex#License | pubmed:18823568 |  | twice a year | yes | http://bio2rdf.org/irefindex_irogid:16826204 | http://irefindex.bio2rdf.org/sparql | http://s4.semanticscience.org/bio2rdf_download/rdf/irefindex |
kegg | Kyoto Encyclopedia of Genes and Genomes | KEGG is a database resource for understanding high-level functions and utilities of the biological system, such as the cell, the organism and the ecosystem, from genomic and molecular-level information. |  |  | http://www.genome.jp/kegg/ | licensing required |  |  | subscription required as of July 1, 2011 | every two months | no | http://bio2rdf.org/path:mka00350 | http://kegg.bio2rdf.org/sparql | http://s4.semanticscience.org/bio2rdf_download/rdf/kegg |
mesh | Medical Subject Headings | MeSH is the National Library of Medicine's controlled vocabulary thesaurus. |  |  | http://www.nlm.nih.gov/mesh/ | licensing required |  |  |  |  | yes | http://bio2rdf.org/mesh:D006948 | http://mesh.bio2rdf.org/sparql |  |
mgi |  | MGI is the international database resource for the laboratory mouse, providing integrated genetic, genomic, and biological data to facilitate the study of human health and disease. |  |  | http://www.informatics.jax.org/ |  |  |  |  |  | no | http://bio2rdf.org/mgi:3027896 | http://mgi.bio2rdf.org/sparql | http://s4.semanticscience.org/bio2rdf_download/rdf/mgi |
ndc | National Drug Code | The Food and Drug Administration (FDA) current list of all drugs manufactured, prepared, propagated, compounded, or processed by it for commercial distribution. | FDA |  | http://www.fda.gov/Drugs/InformationOnDrugs/ucm142438.htm | public domain |  |  |  | twice a month | yes | http://bio2rdf.org/ndc_resource:9321661b46400c27651a09266462217e | http://ndc.bio2rdf.org/sparql |  |
omim | Online Mendelian Inheritance in Man | OMIM is a comprehensive, authoritative, and timely compendium of human genes and genetic phenotypes. | NCBI |  | http://www.ncbi.nlm.nih.gov/omim/ | research only. no distribute to third party without license | http://omim.org/downloads |  |  | monthly | yes |  | http://omim.bio2rdf.org/sparql | http://s4.semanticscience.org/bio2rdf_download/rdf/omim |
pathwaycommons | pathway commons | Pathway Commons is a central resource for pathways and interactions |  |  | http://webservice.baderlab.org:48080/ |  |  |  | to be replaced by pathwaycommons | none | yes |  | http://cpath.bio2rdf.org/sparql | http://s4.semanticscience.org/bio2rdf_download/rdf/pathwaycommons |
pdb | Protein Data Bank | The Protein Data Bank (PDB) archive is the single worldwide repository of information about the 3D structures of large biological molecules, including proteins and nucleic acids. |  |  | http://www.rcsb.org/pdb | public domain |  |  |  | weekly | yes |  |  | http://download.bio2rdf.org/release/2/pdb/ |
pharmgkb | The Pharmacogenomics Knowledgebase | The PharmGKB is a pharmacogenomics knowledge resource that encompasses clinical information including dosing guidelines and drug labels, potentially clinically actionable gene-drug associations and genotype-phenotype relationships. PharmGKB collects, curates and disseminates knowledge about the impact of human genetic variation on drug responses. |  |  | http://www.pharmgkb.org | copyright |  |  |  | monthly | yes |  | http://pharmgkb.bio2rdf.org/sparql | http://s4.semanticscience.org/bio2rdf_download/rdf/pharmgkb |
pid | Pathway Interaction Database | The Pathway Interaction Database is a highly-structured, curated collection of information about known biomolecular interactions and key cellular processes assembled into signaling pathways |  |  | http://pid.nci.nih.gov/ |  |  |  | contains biocarta, reactome and NCI-curated pathways |  | no |  | http://pid.bio2rdf.org/sparql |  |
pubchem | PubChem | PubChem provides physical-chemical information on small molecules and their biological activities | NCBI |  | http://pubchem.ncbi.nlm.nih.gov/ | public domain | http://www.ncbi.nlm.nih.gov/About/disclaimer.html |  |  | continuous | no |  | http://pubchem.bio2rdf.org/sparql |  |
pubmed | PubMed | PubMed comprises more than 21 million citations for biomedical literature from MEDLINE, life science journals, and online books. | NCBI |  | http://www.ncbi.nlm.nih.gov/pubmed | licensed |  |  |  |  | yes |  | http://pubmed.bio2rdf.org/sparql | http://s4.semanticscience.org/bio2rdf_download/rdf/pubmed |
reactome | REACTOME | REACTOME is an open-source, open access, manually curated and peer-reviewed pathway database |  |  | http://www.reactome.org/ReactomeGWT/entrypoint.html | Creative Commons Attribution 3.0 Unported License. |  |  | part of pathwaycommons | every three months | N/A |  | http://reactome.bio2rdf.org/sparql | http://s4.semanticscience.org/bio2rdf_download/rdf/reactome |
refseq | RefSeq | The Reference Sequence (RefSeq) collection aims to provide a comprehensive, integrated, non-redundant, well-annotated set of sequences, including genomic DNA, transcripts, and proteins. | NCBI |  | http://www.ncbi.nlm.nih.gov/RefSeq/ | public domain | http://www.ncbi.nlm.nih.gov/About/disclaimer.html |  |  | every two months | no |  | http://refseq.bio2rdf.org/sparql | http://s4.semanticscience.org/bio2rdf_download/rdf/refseq |
sgd | Saccharomyces Genome Database | The SGD project provides encyclopedic information about the yeast genome and its genes, proteins, and other encoded features. Experimental results on the functions and interactions of yeast genes, as reported in the peer-reviewed literature, are extracted by high-quality manual curation and integrated within a well-developed database. |  |  | http://www.yeastgenome.org/ | public domain |  |  |  |  | yes |  | http://sgd.bio2rdf.org/sparql | http://s4.semanticscience.org/bio2rdf_download/rdf/sgd |
taxonomy | NCBI Taxonomy | The Taxonomy Database is a curated classification and nomenclature for all of the organisms in the public sequence databases | NCBI |  | http://www.ncbi.nlm.nih.gov/taxonomy | public domain | http://www.ncbi.nlm.nih.gov/About/disclaimer.html |  | part of ncbo | weekly | yes |  | http://taxonomy.bio2rdf.org/sparql |  |
toxkb | ToxKB | A warehouse of toxicology resources including archival, ccris, cebs, cis, cpdbas, ctd, emic, genetox, hsdb, toxcast. |  |  |  | public domain |  |  | data warehouse |  | yes |  | http://toxkb.bio2rdf.org/sparql |  |
uniparc | UniParc | UniParc is a non-redundant archive of protein sequences extracted from public databases UniProtKB/Swiss-Prot, UniProtKB/TrEMBL, PIR-PSD, EMBL, EMBL WGS, Ensembl, IPI, PDB, PIR-PSD, RefSeq, FlyBase, WormBase, H-Invitational Database, TROME database, European Patent Office proteins, United States Patent and Trademark Office proteins (USPTO) and Japan Patent Office proteins. |  |  | http://www.ebi.ac.uk/uniparc/ | public domain |  |  |  |  | yes |  | http://uniparc.bio2rdf.org/sparql |  |
uniprot | UniProt | The UniProt Knowledgebase (UniProtKB) is the central hub for the collection of functional information on proteins, with accurate, consistent and rich annotation |  |  | http://www.uniprot.org/ | cc-attribution-no derivatives | http://www.uniprot.org/help/license | pubmed:22102590 |  |  | yes |  | http://uniprot.bio2rdf.org/sparql |  |
unists | UniSTS | UniSTS is a comprehensive database of sequence tagged sites (STSs) derived from STS-based maps and other experiments. | NCBI |  | http://www.ncbi.nlm.nih.gov/unists/ | public domain |  |  |  | weekly | yes |  | http://unists.bio2rdf.org/sparql |  |