Apache UIMA HMM Tagger FR Models

JavaScript isn't enabled in your browser, so this file can't be opened. Enable and reload.

Models for the Apache UIMA Hidden Markov Model Tagger Annotator [1] (from the sandbox UIMA Addons)
* Part of speech (POS)
* Grammatical subcategorization (Subcat)
* Morphological inflection (Mph)
* Lemma (canonical form)
* Ee (POS + Subcat + Mph)

Models have been built with the addon's version 2.4.
Training has been performed using the French Treebank corpus [2] (version 2010).
Its licence does not prevent to distribute its analysis results under whatever licence but it mentions that the ftb should be used only for research purpose.
Consequently we restrict the use of these models only for research purposes.

To get the '.dat', unzip and have a look to the '/HMMTrainerTagger/french/' dir

[1] http://uima.apache.org/sandbox.html#tagger.annotator
[2]For more on the French Treebank, see Abeille, A., L. Clement, and F. Toussenel. 2003. `Building a treebank for French', in A. Abeille (ed) Treebanks , Kluwer, Dordrecht. http://www.llf.cnrs.fr/Gens/Abeille/French-Treebank-fr.php

Name *

Email *

Affiliation

Country

Purpose *

A brief description of your project. Do not hesitate to give a link.

Submit

Clear form

Never submit passwords through Google Forms.

This content is neither created nor endorsed by Google. - Terms of Service - Privacy Policy

Does this form look suspicious? Report

Forms