Model 2
Word Level Analysis & Syntactic Analysis
Content:
Word Level Analysis: Regular Expressions, Finite-State Automata, Morphological Parsing, Spelling Error Detection and Correction, Words and Word Classes, Part-of Speech Tagging.
Syntactic Analysis: Context-Free Grammar, Constituency, Top-down and Bottom-up Parsing, CYK Parsing.
Textbook 1: Ch. 3, Ch. 4
Chapter 3- Word Level Analysis
Regular expressions (RegEx) are sequences of characters used to find or replace patterns within text. They are essential tools in Natural Language Processing (NLP) for tasks such as data pre-processing, pattern matching, text feature engineering, web scraping, and data extraction.
3.3 FINET AUTOMATA
3.4 MORPHOLOGICAL PARSING
What is Morphological Parsing?
Morphological parsing is the process of breaking down a word into its morphemes, which are the smallest units of meaning. This process helps in understanding the word's structure and its role in a sentence.
Why is it Important?
Morphological parsing is crucial for various NLP tasks, such as:
3.5 Spelling Error Detection and Correction
What is Spelling and Error Detection?
Spelling and error detection involves identifying and correcting mistakes in written text. These errors can include:
Why is it Important?
Accurate text is essential for effective communication. Spelling and error detection is crucial for:
3.6 WORDS AND WORD CLASSES
3.7 Part ofSpeech Tagging
Part-of-Speech (POS) tagging is a fundamental NLP task that involves assigning a part of speech to each word in a sentence. Here’s an overview:
What is POS Tagging?
POS tagging is the process of identifying the grammatical category of each word in a given text, such as nouns, verbs, adjectives, etc.
Why is it Important?
How Does it Work?
Example
Chapter 4- Syntactic Analysis
4.2 CONTEXT FREE GRAMMAR(CFG)
PARSING