Possible CKAN/LOD Cloud Candidates
 Share
The version of the browser you are using is no longer supported. Please upgrade to a supported browser.Dismiss

 
$
%
123
 
 
 
 
 
 
 
 
 
 
 
 
 
 
ABCDEFGHIJKLMNOPQRSTUVWXYZAA
1
Datahub/
CKAN URL
URLCommentRDF DumpExample URL (Linked Data?) use "house" if possiblesparql urlgraphdemo queryvisualisationsTypeTriple size (guessed)Status: R = RDF, LOD = in official LOD Cloud, L = some Links existDomainLanguagesPossible links (Wi = Wiktionary, Wo = Wortschatz, D = DBpedia)Licence
2
Multi layer sentiment analysishttp://thedatahub.org/en/dataset/mlsahttp://iggsa.sentimental.li/index.php/contact/CKAN level 2http://ckan.net/storage/f/file/12be3509-4cd1-4fc5-8c95-d0d3a7121766http://mlode-sparql.nlp2rdf.org/sparqlselect distinct * where { ?s ?p ?o. ?s <http://www.w3.org/1999/02/22-rdf-syntax-ns#type> <http://asv.informatik.uni-leipzig.de/mlsa/PositiveWord>.}corpus21000RSentiment analysis on German sentences1WiCC-BY-SA
3
Multext-Easthttp://thedatahub.org/dataset/multext-easthttp://nl.ijs.si/ME/V4/http://mlode-sparql.nlp2rdf.org/sparqldata (corpus)other (TEI), L (scheme in OWL, Chiarcos)parallel corpus of 15 eastern european languages + English and Farsi15academic
4
Wiktionaryhttp://ckan.net/package/wiktionaryhttp://wiktionary.orgno RDFhttp://en.wiktionary.org/wiki/househttp://wiktionary.dbpedia.org/sparqlDict200 Mioin progress, SebastianDictionary170Wortschatz, DBpediacc-by-sa
5
DBpediahttp://ckan.net/package/dbpediahttp://dbpedia.org/Aboutstable, from LODhttp://wiki.dbpedia.org/Downloads38http://dbpedia.org/resource/Househttp://dbpedia.org/snorqlhttp://dbpedia.orgselect distinct ?Concept where {[] a ?Concept} LIMIT 100http://graves.cl/visualRDF/?url=http%3A%2F%2Fdbpedia.org%2Fdata%2FBerlin.rdf1200000000LOD1702000-us-census-rdf, dbtune-musicbrainz, education-data-gov-uk, eunis, flickr-wrappr, freebase, fu-berlin-dailymed, fu-berlin-dblp, fu-berlin-diseasome, fu-berlin-drugbank, fu-berlin-eurostat, fu-berlin-project-gutenberg, fu-berlin-sider, geonames-semantic-web, geospecies, italian-public-schools-linkedopendata-it, linkedgeodata, linkedmdb, nytimes-linked-open-data, opencyc, rdf-book-mashup, reference-data-gov-uk, revyu, tcmgenedit_dataset, transport-data-gov-uk, uk-legislation-api, w3c-wordnet, wikicompany, world-factbook-fu-berlin, yago
6
SFB632, QUIS-corporaNOT FOUNDdata (glosses)other (PAULA)Questionaire for Information Structure10/20/2011OLiAopen/t.b.a
7
PropBankNOT FOUNDcorpusother (Palmer et al, 2005) approximately 113,000 annotated verb tokens. These verb tokens include all those occurring in over one million words of the Wall Street Journal section of the Penn Treebank PennTreebank (WSJ)closed, LDC-licensed
8
Lexvo.orghttp://ckan.net/package/lexvohttp://www.lexvo.org/CKAN level 2http://www.lexvo.org/page/term/eng/househttp://lod.openlinksw.com/sparqlSchemaLODlanguage metadata (orthography, etc.)?CC-BY-SA
9
lingvoj.orghttp://ckan.net/package/lingvojhttp://lingvoj.orgCKAN level 2http://mlode.nlp2rdf.org/downloads/mlsa.nt.gzhttp://www.lingvoj.org/lang/frhttp://graves.cl/visualRDF/?url=http%3A%2F%2Fwww.lingvoj.org%2Flang%2Ffr?LOD??
10
OPUS http://opus.lingfil.uu.se/No RDFdata (corpus)othercollection of parallel open source corpora> 20OLiAopen (partly GPL, LGPL and others)
11
JRC-AcquisNOT FOUNDhttp://langtech.jrc.ec.europa.eu/JRC-Acquis.htmldata (corpus)otherJRC-Acquis (European legislation text)21 EU languages
12
PanLexhttp://thedatahub.org/dataset/panlexhttp://panlex.orghttp://panlex.org/cgi-bin/plxl.cgi?lv=2&ex=438882other (lexical database)450 Mioother (db)database of translations among lexemes6,900 languageslinks to about 3,600 other URLsfreely available
13
AutotypNOT FOUNDOther200 MioproprietaryTypological Data from languagesAllclosed
14
Corpus of Historical American English (1810-2009)NOT FOUNDhttp://corpus.byu.edu/coha/select distinct * where {<http://mlode.nlp2rdf.org/jrc-names/Muammar_Gaddafi> ?p ?o}data (corpus)otherCorpus of Historical American English (1810-2009)American Englishfree
15
PROIELNOT FOUNDhttp://foni.uio.no:3000/data (corpus)other (TIGER XML)historical translations of the New TestamentAncient Greek, Latin, Gothic, Old Church SlavicOLiA, bible translationscc attribution noncommercial sa
16
Arabic CorporaNOT FOUNDhttp://aracorpus.e3rab.com/index.php?content=englishdata (corpus)otherArabic CorporaArabicfree access
17
SUSANNE, CHRISTINE, LUCYNOT FOUNDdata (corpus)othercorpora by Geoffrey SampsonBritish EnglishOLiAno licence specified, downloadable
18
Catalan WordNethttp://ckan.net/package/catalan-wordnethttp://nlp.lsi.upc.edu/web/index.php?option=com_docman&Itemid=135no RDF------------------------LSR----otherCatalan WordNetCatalan----GPL
19
Resnik's Bible corporaNOT FOUNDhttp://www.umiacs.umd.edu/~resnik/parallel/bible.htmldata (corpus)other (CES)several (mostly modern) translations of the BibleCebuano, Chinese, Danish, Early Modern English, Finnish, French, Greek (Koine), Indonesian, Latin, Spanish, Swahili, Swedish, Vietnameseall bible translationsunclear (downloadable)
20
Danish WordnetNOT FOUNDhttp://www.wordnet.dk/LSRotherDanish WordNetDanishopen
21
Cornettohttp://ckan.net/package/cornettohttp://www2.let.vu.nl/oz/cltl/cornetto/no RDF Dump----------------http://graves.cl/visualRDF/?url=http%3A%2F%2Fpurl.org%2Fvocabularies%2Fcornetto%2Fsynset-iets-2-noun.rdfLSRLODDutch WordnetDutch Links to package:vu-wordnet and package:w3c-wordnet.
22
Wordnet (Princeton)http://ckan.net/package/wordnethttp://semanticweb.cs.vu.nl/lod/wn30/http://wordnetweb.princeton.edu/perl/webwn?s=houseLSRLODEngish
23
ConceptNethttp://ckan.net/package/conceptnethttp://csc.media.mit.edu/conceptnet/gethttp://conceptnet5.media.mit.edu/
no RDF
----http://conceptnet5.media.mit.edu/web/c/en/house----------------LSR----other (db)WordNet-like concept databaseEnglishDBpedia, WordNetGPL / CC-by
24
cornell Movie dialog corpusNOT FOUNDhttp://www.cs.cornell.edu/~cristian/Cornell_Movie-Dialogs_Corpus.htmlcorpus304713 utterancesothermetadata-rich collection of fictional conversations extracted from raw movie scriptsEnglishLSRsunclear, downloadable
25
Corpora of misspellingsNOT FOUNDhttp://www.dcs.bbk.ac.uk/~ROGER/corpora.htmlother (orthographic DB)other (DB)word lists of misspelled words, can be used in to learn orthographic rules orfor spellingcorrectionEnglishwith English LSRs or corporaunclear, downloadable
26
LCS DatabaseNOT FOUNDhttp://www.umiacs.umd.edu/~bonnie/LCS_Database_Documentation.htmlLSRother (lexical conceptual structures, LCS)verbal semanticsEnglishWordNetdistributable with attribution
27
Manually Annotated Sub-Corpus (MASC)http://thedatahub.org/en/dataset/masc/ http://www.anc.org/MASC/http://www.anc.org/MASC/download/MASC-1.0.3.zipcorpusotherManually annotated american corpusenglishwiktionary, dbpediaother (open)
28
Name ListNOT FOUNDhttp://nlp.cs.qc.cuny.edu/ngram_genderanimacy.zipLSRothername lists with gender and animacy information discovered from Google n-grams (version II) (Ji and Lin, 2009)EnglishOLiA ?
29
OCASNOT FOUNDhttp://idocument.opendfki.de/wiki/Evaluation/Corpus/OlympicGames2004data (corpus)Rsemantically annotated corpusEnglish
30
SemCor CorpusNOT FOUNDhttp://multisemcor.fbk.eu/semcor.phpCorpusotherSense-Tagged CorporaEnglish?
31
Sentiment-annotated quotation corpusNOT FOUNDhttp://langtech.jrc.ec.europa.eu/JRC_Resources.htmldata (corpus)other (Excel)Sentiment-annotated quotation corpusEnglishOLiA ?free + attribution
32
Verb Semantics OntologyNOT FOUNDhttp://www-csli.stanford.edu/~arunm/LSRother (Prolog, CSV)another ontology of verb semantics, includes Roget's ThesaurusEnglishno license statement, downloadable
33
Wordnet ( W3C )http://ckan.net/package/w3c-wordnethttp://www.w3.org/TR/wordnet-rdf/http://graves.cl/visualRDF/?url=http%3A%2F%2Fwww.w3.org%2F2006%2F03%2Fwn%2Fwn20%2Finstances%2Fwordsense-entity-noun-1http://graves.cl/visualRDF/?url=http%3A%2F%2Fwww.w3.org%2F2006%2F03%2Fwn%2Fwn20%2Finstances%2Fwordsense-entity-noun-1LSRLODEnglish
34
Link Grammar (Parser)none yethttp://www.link.cs.cmu.edu/link/parser for English, includes a dictionary; here, we are only considering the dictionarymorpho-syntactic dictionary60K word formsotherEnglishEnglish corpora/LSRsGPL-compatible
35
WikicorpusNOT FOUNDhttp://www.lsi.upc.edu/~nlp/wikicorpusdata (corpus)otherWikicorpus, v. 1.0English, Catalan, SpanishWordnet, OLiA (POS)cc
36
English-Persian Parallel CorpusNOT FOUNDhttp://ece.ut.ac.ir/NLP/resources.htmdata (corpus)otherMohammad Taher Pilevar, NLP Lab, University of Tehran, Iran.English, Farsi(no annotations yet)free (no licence specified)
37
NunavutHansardNOT FOUNDhttp://www.inuktitutcomputing.ca/NunavutHansard/en/index.htmldata (corpus)otherEnglish-Inuktitut parallel corpus (morphological annotations can be obtained with http://www.inuktitutcomputing.ca/Uqailaut/en/IMA.html)English, Inuktitutdownloadable, academic
38
GeoWordNethttp://ckan.net/package/geowordnethttp://geowordnet.semanticmatching.org/no RDF Dumphttp://geowordnet.semanticmatching.org/synset-dependent_political_entity-noun-1.rdf--------http://graves.cl/visualRDF/?url=http%3A%2F%2Fgeowordnet.semanticmatching.org%2Fsynset-dependent_political_entity-noun-1.rdfdict53390969RGeoWordNet is a semantic resource built from the full integration of WordNet, GeoNames and the Italian part of MultiWordNet.English, Italiangeonames-semantic-web, vu-wordnetCC-BY
http://creativecommons.org/licenses/by/3.0/
39
TEP: Tehran English-Persian Parallel Corpusnot yethttp://ece.ut.ac.ir/NLP/resources.htmEnglish-Persian parallel corpus, subtitlescorpus (unannotated)4 mio tokens per languageotherEnglish, Persian (Farsi)Farsi LSR, English LSRUsage of this package for any research or non-commercial purposes requires the precondition that you cite the following paper: M. T. Pilevar, H. Faili, and A. H. Pilevar, “TEP: Tehran English-Persian
Parallel Corpus”, in proceedings of 12th International Conference on
Intelligent Text Processing and Computational Linguistics
(CICLing-2011).
40
French TimeBankhttp://ckan.net/package/french-timebankhttps://gforge.inria.fr/projects/fr-timebank/no RDF------------------------corpusother (XML)The French TimeBank consists of a set of 109 journalistic articles from 7 different sub-genres annotated according to the ISO-TimeML standard, adapted for the French language. Eventualities (events and states) and temporal expressions (dates, durations, frequencies, quantified intervals) are marked up with in-line annotation.FrenchLGPL (LGPL-LR)
41
Le Petit PrinceNOT FOUNDhttp://www.unlweb.net/unlariumdata (corpus)other (UNL)Le Petit PrinceFrenchcc-by-sa
42
Perceo CorpusNOT FOUNDhttp://cnrtl.fr/corpus/perceo/corpus100,000 tokensothera collection of lemmatized and POS-tagged spoken French transcriptions. The data contains over 100,000 tokens automatically tagged and manually checked.FrenchFrench LSRsfreely downloadable
43
WOLFNOT FOUNDhttp://alpage.inria.fr/~sagot/wolf-en.htmlLSRotherFrench WordNetFrenchCecill-C license (LGPL compatible)
44
Wiktionary RDF dumpNOT FOUNDhttp://kaiko.getalp.org/dbnary/static/LSRRyet another wiktionary rdf dupGerma, Eglish, Finnish, French, Itlian, PolishWias wiktionary(no explicit statement)
45
Deutsches Morphologie-Lexikonnon yethttp://www.danielnaber.de/morphologie/----------------------------LSR (morphology only)otherGerman ophological lexiconGermanwith German LSRsCC-BY
46
SentiWShttp://thedatahub.org/dataset/sentiwshttp://asv.informatik.uni-leipzig.de/download/sentiws.htmlCKAN level 2 (minimal)http://mlode.nlp2rdf.org/downloads/sentiws.ttl.gzhttp://mlode-sparql.nlp2rdf.org/sparql?default-graph-uri=http%3A%2F%2Fmlode.nlp2rdf.org&query=select+distinct+*+where+%7B%3Chttp%3A%2F%2Fmlode.nlp2rdf.org%2Fsentiws%2Fword%2FAbd%C3%A4mpfung%3E+%3Fp+%3Fo%7D+LIMIT+100&format=text%2Fhtml&timeout=0&debug=on
webstore_last_updated
webstore_url
http://mlode-sparql.nlp2rdf.org/sparql----select distinct * where {<http://mlode.nlp2rdf.org/sentiws/word/Abdämpfung> ?p ?o} LIMIT 100http://graves.cl/visualRDF/?url=http%3A%2F%2Fmlode.nlp2rdf.org%2Fsentiws%2Fword%2FAbd%C3%A4mpfungCorpus30339otherSentimentWortschatz, or SentiWS for short, is a publicly available German-language resource for sentiment analysis, opinion mining etc.GermanWiktionary 4233downloadable, cc-by-nc-sa
47
GermaNetNOT FOUNDLSRotherGerman wordnetGermanacademic
48
GrammisNOT FOUNDhttp://hypermedia.ids-mannheim.de/pls/public/ontologie.htmlSchemaother (SQL)grammis ontologie, ontology of linguistic terminology, GermanGermanclosed, browseable
49
NEGRA corpusNOT FOUNDdata (corpus)R (chiarcos)German newpaper corpusGermanOLiAannotations: academic, source text: proprietary
50
Open ThesaurusNOT FOUNDhttp://www.openthesaurus.de/export/OpenThesaurus-Textversion.zipLSRother (txt)German thesaurusGermanLGPL
51
SalsaNOT FOUNDdata (corpus)otherframenet-annotations for TIGERGermanFrameNet, TIGERannotations: academic, source text: proprietary
52
TIGER corpusNOT FOUNDdata (corpus)R (hellmann)German newpaper corpusGermanOLiAannotations: academic, source text: proprietary
53
TüBa-D/Z corpusNOT FOUNDhttp://www.sfs.uni-tuebingen.de/en/de_tuebadz.shtmldata (corpus)otherGerman newpaper corpusGermanOLiA, GermaNetannotations: academic, source text: proprietary
54
Linguee German-English dictionaryNOT FOUNDhttp://www.linguee.com/downloads/completeDict-latin9.txtdictionaryotherGerman-English dictionaryGerman, EnglishGerman and English LSRsGPL
55
SMULTRONNOT FOUNDhttp://www.cl.uzh.ch/kitt/smultron/data (corpus)otherStockholm MULtilingual TReebank) is a parallel treebank which contains around 1000 sentences in English, German and SwedishGerman, English, SwedishOLiAacademic
56
Anatolian word listsNOT FOUNDhttp://ferheng.org/?Daxistindictotherpair-wise word-lists of the languages in column H, partly with parts of speechGerman, Turkish, English, Soranî (Kurdish), Kurmanci (Kurdish), Kurdi (Kurdish), Swedish, CzechGPL
57
Haitian Creole Lang Data, Carnegie MellonNOT FOUNDhttp://www.speech.cs.cmu.edu/haitian/data (corpus)otherHaitian Creole spoken and text dataHaitian(no annotations yet)minimal restrictions
58
Hebrew WordNetNOT FOUNDhttp://cl.haifa.ac.il/projects/mwn/index.shtmlLSRotherHebrew WordNetHebrew"free download"
59
Hindi WordNetNOT FOUNDhttp://www.cfilt.iitb.ac.in/wordnet/webhwn/LSRotherHindi WordNetHindiopen source (GNU FDL)
60
IcePaHCNOT FOUNDhttp://www.linguist.is/icelandic_treebank/DownloadNo RDFdata (corpus)otherIcelandic Parsed Historical CorpusIcelandicOLiAopen (LGPL)
61
Inuktitut - A Multi-dialectal Outline DictionaryNOT FOUNDhttp://www.inuktitutcomputing.ca/Spalding/en/spalding.shtmldictother (HTML)Inuktitut dictionaryInuktitut, Englishdownloadable, no licence statement
62
LSG ("Líonra Séimeantach na Gaeilge")NOT FOUNDhttp://borel.slu.edu/lsg/LSRotherIrish WordNetIrish Gaelic"free download"
63
Japanese WordNetNOT FOUNDhttp://nlpwww.nict.go.jp/wn-ja/index.en.htmlLSRotherJapanese WordNretJapaneseopen
64
Multi-Lingual Semantic Network NOT FOUNDhttp://two.dcook.org/software/mlsn/about/download.htmlLSRotherJapanese/Chinese/German/English WordNetJapanese/Chinese/German/Englishfree download
65
Arabic Online Commentary Dataset v1.1NOT FOUNDhttp://www.cs.jhu.edu/~ozaidan/AOC/AOC_readme.txtcorpus52 mio wordsotherArabic newswire, April-Oct 2010Jordanian, Saudi and Egyptian ArabicLSRs (if any for Arabic available, Wiktionary ?)unclear, downloadable
66
Macedonian WordNetNOT FOUNDhttp://www.time.mk/trajkovski/papers/is2010.pdfLSRotherMacedonian WordNetMacedonianCreative Commons, Attribution-NonCommercial 3.0 Unported license
67
ODINNOT FOUNDhttp://www.csufresno.edu/odin/odin-overview.htmlcorpus (glosses)otherglosses, es gab mal pläne, das mit GOLD zu verbindenmany"open", no licence specified
68
BabelNetNOT FOUNDhttp://lcl.uniroma1.it/babelnet/LSRotheralignment of WordNet with multilingual Wikipedia categoriesmultilingualDBpedia, WordNetdownloadable
69
TMC: Tehran Monolingual Corpusnone yethttp://ece.ut.ac.ir/NLP/resources.htmPersian corpuscorpus (unannotated)250M wordsotherPersian (Farsi)Persian LSR"freely available" (research/NC only ?), To have a copy of this corpus contact us at: t.pilevar {at} ece.ut.ac.ir or nlp {at} ece.ut.ac.ir
70
Bijankhan corpusnone yethttp://ece.ut.ac.ir/DBRG/Bijankhan/morphosyntactically annotated Persian corpus, dependency syntax to be released (http://stp.lingfil.uu.se/~mojgan/persian_dependency_treebank.pdf)corpus2.6M tokensPersian (Farsi)Farsi LSRs All rights of this corpus and the tools that are included in this package are reserved for University of Tehran - Database Research Group. Usage of this package for any research or non-commercial purposes is free with the precondition that you cite the related papers below.
71
Persian Link Grammarnone yethttp://www.ling.ohio-state.edu/~jonsafari/persianlg/see English Link GrammarPersian (Farsi)Persian corpora, LSRsunclear, same as English Link Grammar ?
72
Hamshahri CLEF corpusnone yethttp://ece.ut.ac.ir/dbrg/hamshahri/Hamshahri collection is a standard reliable Persian text collection that was used at Cross Language Evaluation Forum (CLEF) during years 2008 and 2009 for evaluation of Persian information retrieval systems.corpus (not annotated)318K documentsPersian (Farsi)Persian LSRs and dictionaryAll rights of the corpus' news are reserved for Hamshahri newspaper. All rights of the corpus' data and the tools that are included in this website are reserved for University of Tehran - Database Research Group. Usage of this package for any research or non-commercial purposes is free with the precondition that you cite paper number [1] of publications section.
73
Persian Dependency Treebank (PerDT)none yethttp://dadegan.ir/en/perdtcorpus (dep-parsed)30K sentencesPersian (Farsi)Persian LSRs and dictionaryto be checked, research only (?, http://aclweb.org/aclwiki/index.php?title=Resources_for_Persian)
74
Per­si­an Tree­bank (Per­Tree­Bank)none yethttp://hpsg.fu-berlin.de/~ghayoomi/PTB.htmlcorpus (HPSG-parsed)1000 sentencesPersian (Farsi)Persian LSRs and dictionary"freely available" (pers. comm., Ma­so­od Ghayoo­mi, Jan 17, 2013) [= research only?]
75
diverse VOA corporanone yethttp://www.ling.ohio-state.edu/~jonsafari/corpora/harvested from Voice of Americacorpora (unannotated)Persian (Farsi), Urdu, Pashto, DariFarsi morphosyntactic annotations, Farsi LSRpublic domain (see www.voanews.com)
76
plWordNetNOT FOUNDhttp://www.plwordnet.pwr.wroc.pl/main/?lang=enLSRotherPolish WordNetPolishacademic
77
Punjabi Morphology, corpus and lexiconNOT FOUNDhttp://www.lama.univ-savoie.fr/~humayoun/punjabi/index.htmldata (corpus), dictotherPunjabi Morphology, corpus and lexiconPunjabifree
78
Russian WordNetNOT FOUNDhttp://wordnet.ru/LSRotherRussian WordNetRussianfree download
79
The Manually Annotated Sub-Corpus NOT FOUNDhttp://www.anc.org/MASCcorpusother (XCES, GrAF)subcorpus of the ANCs
80
Corpus of Modern Scottish Writing (CMSW)NOT FOUNDhttp://www.scottishcorpus.ac.uk/cmsw/data (corpus)otherCorpus of Modern Scottish Writing (CMSW)Scotsfreely available
81
African BiblesNOT FOUNDhttp://visionneuse.free.fr/index.htm?version=BIBdata (corpus)other (XML)various Bibles, mostly from African languages, represent a parallel corpus (although not announced as such)sentisentiany other Bible corpusunclear, downloadable
82
Apertium project lexiconsNOT FOUNDhttp://sourceforge.net/projects/apertium/dictotherApertium project lexiconsseveralGPL
83
sloWNetNOT FOUNDhttp://lojze.lugos.si/~darja/slownet.htmlLSRotherSlovene WordNetSloveneCreative Commons License (attribution, non-commercial, share-alike)
84
SPLLOCNOT FOUNDwww.splloc.soton.ac.uk, www.talkbank.orgdata (corpus)other (CHILDES)corpus of oral L2 Spanish, universities of Southampton, Newcastle, and York in the UK. SpanishOLiA (POS tags)academic
85
TamilWordNetNOT FOUNDhttp://www.nrcfosshelpline.in/code/wiki/TamilWordnetLSRotherTamil WordNetTamilopen source
86
Asian WordnetNOT FOUNDhttp://www.asianwordnet.org/LSRotherAsian WordNetThai ,Korean, Japanese, Indonesian, Myanmar, Vietnamese, Mongolian, Bengaliopen (BSD)
87
JRC-Nameshttp://thedatahub.org/dataset/jrc-nameshttp://langtech.jrc.it/JRC-Names.htmlCKAN level 2 (minimal)http://mlode.nlp2rdf.org/downloads/jrc-names.ttl.gz
http://mlode.nlp2rdf.org/downloads/jrc-names-links.nt.gz
http://mlode.nlp2rdf.org/jrc-names/Muammar_Gaddafihttp://mlode-sparql.nlp2rdf.org/sparqlhttp://thedatahub.org/dataset/jrc-namesselect distinct * where {<http://mlode.nlp2rdf.org/jrc-names/Muammar_Gaddafi> ?p ?o}http://graves.cl/visualRDF/?url=http%3A%2F%2Fmlode.nlp2rdf.org%2Fjrc-names%2FMuammar_Gaddafi1458828Rhighly multilingual named entity resource for person and organisation namesToo manyDBpediahttp://langtech.jrc.ec.europa.eu/Resources/LICENCE-EULA_JRC-Names_2011.pdf
88
baby namesNOT FOUNDhttp://www.nyc.gov/html/doh/downloads/pdf/public/press09/pr076-09-babynames.pdfword listpdfethnicity- and gender-classified first namesUScorporaunclear, downloadable
89
US Census name listsNOT FOUNDhttp://www.census.gov/genealogy/www/data/1990surnames/index.htmlword list5500 first names, several thousand last namesother (CSV)1990 first names, female and male
1990 last names
2000 last names
2000 last names (Spanish)
can be used for gender detection and NER
UScorporaunclear, downloadable
90
Printed Book Auction Catalogueshttp://thedatahub.org/en/dataset/printed-book-auction-catalogueshttp://keithalexander.co.uk/pbac
??http://keithalexander.co.uk/pbac/identified/agents/405.rdf??? CKAN url not working, but might have one------
91
Intercontinental Dictionary Series http://thedatahub.org/dataset/idshttp://lingweb.eva.mpg.de/ids/CKAN level 2http://mlode.nlp2rdf.org/downloads/ids.nt.gzhttp://mlode-sparql.nlp2rdf.org/sparql
92
Lemon Wiktionaryhttp://thedatahub.org/en/dataset/lemonwiktionaryhttp://monnetproject.deri.ie/lemonsource/wiktionary__en (down)CKAN level 2http://monnetproject.deri.ie/lemonsource/Special:Dump/wiktionary.tar.bz2
93
Lemon Wordnethttp://thedatahub.org/dataset/lemonwordnethttp://monnetproject.deri.ie/lemonsource/wordnetCKAN level 2http://monnetproject.deri.ie/lemonsource/Special:Dump/wordnet.ziphttp://monnetproject.deri.ie/lemonsource/wordnet/house-nounhttp://monnetproject.deri.ie/lemonsource_query/http://graves.cl/visualRDF/?url=http%3A%2F%2Fmonnetproject.deri.ie%2Flemonsource%2Fwordnet%2Fcat-noun.rdfdictRWordNet
94
Open Data Thesaurushttp://thedatahub.org/dataset/open-data-thesaurushttp://vocabulary.semantic-web.at/PoolParty/wiki/OpenDatahttp://vocabulary.semantic-web.at/PoolParty/sparql/OpenDataLSRRDFThesaurus
95
VU WordNethttp://thedatahub.org/en/dataset/vu-wordnethttp://semanticweb.cs.vu.nl/lod/wn30/CKAN level 2http://eculture.cs.vu.nl/git/public/?p=vocs/wordnet.git;a=tree;f=rdf;hb=HEADhttp://semanticweb.cs.vu.nl/europeana/lod/purl/vocabularies/princeton/wn30/synset-house-noun-1.rdf??
96
WALShttp://thedatahub.org/dataset/walswww.wals.infoCKAN level 2http://mlode.nlp2rdf.org/downloads/wals.nt.gzhttp://wals.info/languoid/lect/wals_code_hauhttp://mlode-sparql.nlp2rdf.org/sparqlOthertypological database
97
Zhishi-mehttp://thedatahub.org/en/dataset/zhishi-meCKAN Level 2n/ahttp://zhishi.me/data/zhwiki/resource/Shanghaihttp://zhishi.me/sparql
98
RKB Explorer Wordnethttp://thedatahub.org/en/dataset/rkb-explorer-wordnethttp://wordnet.rkbexplorer.com/CKAN level 3http://wordnet.rkbexplorer.com/models/dump.tgzhttp://wordnet.rkbexplorer.com/id/synset-odd-toed_ungulate-noun-1http://wordnet.rkbexplorer.com/sparql/--
99
Sanskrit English Lexiconhttp://thedatahub.org/en/dataset/sanskrit-english-lexiconhttp://blog.kasabi.com/about/Data available, but not online, huge project, unsure, what to do with this
100
SIMPLE Ontology, Lexicon http://thedatahub.org/en/dataset/simple-ontology-lexiconIn RDF, Not hostedhttp://www.languagelibrary.eu/owl/simple/simpleindividuals.owl
Loading...
Main menu