Part-of-speech tagging
A simple but useful form of linguistic analysis
Christopher Manning
Christopher Manning
Parts of Speech
Christopher Manning
Open class (lexical) words
Closed class (functional)
Nouns
Verbs
Proper
Common
Modals
Main
Adjectives
Adverbs
Prepositions
Particles
Determiners
Conjunctions
Pronouns
… more
… more
IBM
Italy
cat / cats
snow
see
registered
can
had
old older oldest
slowly
to with
off up
the some
and or
he its
Numbers
122,312
one
Interjections
Ow Eh
Christopher Manning
Open vs. Closed classes
Christopher Manning
POS Tagging
Christopher Manning
POS Tagging
Penn Treebank POS tags
Christopher Manning
POS tagging performance
Christopher Manning
Deciding on the correct part of speech can be difficult even for people
Christopher Manning
How difficult is POS tagging?
Christopher Manning
Part-of-speech tagging
A simple but useful form of linguistic analysis
Christopher Manning
Christopher Manning
Part-of-speech tagging revisited
A simple but useful form of linguistic analysis
Christopher Manning
Christopher Manning
Sources of information
Christopher Manning
More and Better Features 🡺 Feature-based tagger
Christopher Manning
Overview: POS Tagging Accuracies
Most errors on unknown words
Christopher Manning
How to improve supervised results?
PRP VBD IN RB IN PRP VBD .
They left as soon as he arrived .
NNP NNS VBD VBN .
Intrinsic flaws remained undetected .
RB
JJ
Christopher Manning
Tagging Without Sequence Information
t0
w0
Baseline
t0
w0
w-1
w1
Three Words
Model | Features | Token | Unknown | Sentence |
Baseline | 56,805 | 93.69% | 82.61% | 26.74% |
3Words | 239,767 | 96.57% | 86.78% | 48.27% |
Using words only in a straight classifier works as well as a basic (HMM or discriminative) sequence model!!
Christopher Manning
Summary of POS Tagging
For tagging, the change from generative to discriminative model does not by itself result in great improvement
One profits from models for specifying dependence on overlapping features of the observation such as spelling, suffix analysis, etc.
An MEMM allows integration of rich features of the observations, but can suffer strongly from assuming independence from following observations; this effect can be relieved by adding dependence on following words
This additional power (of the MEMM ,CRF, Perceptron models) has been shown to result in improvements in accuracy
The higher accuracy of discriminative models comes at the price of much slower training
Christopher Manning
Part-of-speech tagging revisited
A simple but useful form of linguistic analysis
Christopher Manning
Christopher Manning