Apache UIMA HMM Tagger FR Models
Models for the Apache UIMA Hidden Markov Model Tagger Annotator [1] (from the sandbox UIMA Addons)
  * Part of speech (POS)
  * Grammatical subcategorization (Subcat)
  * Morphological inflection  (Mph)
  * Lemma (canonical form)
  * Ee (POS + Subcat + Mph)

Models have been built with the addon's version 2.4.
Training has been performed using the French Treebank corpus [2] (version 2010).
Its licence does not prevent to distribute its analysis results under whatever licence but it mentions that the ftb should be used only for research purpose.
Consequently we restrict the use of these models only for research purposes.

To get the '.dat', unzip and have a look to the '/HMMTrainerTagger/french/' dir

[1] http://uima.apache.org/sandbox.html#tagger.annotator
[2]For more on the French Treebank, see Abeille, A., L. Clement, and F. Toussenel. 2003. `Building a treebank for French', in A. Abeille (ed) Treebanks , Kluwer, Dordrecht. http://www.llf.cnrs.fr/Gens/Abeille/French-Treebank-fr.php
Sign in to Google to save your progress. Learn more
Name *
Email *
Affiliation
Country
Purpose *
A brief description of your project. Do not hesitate to give a link.
Submit
Clear form
Never submit passwords through Google Forms.
This content is neither created nor endorsed by Google. - Terms of Service - Privacy Policy

Does this form look suspicious? Report