Contents
�What is a machine translation?
�What Is A Machine Translation?
Machine Translation is the process of converting the text in a source language to a required target language.
�What Is A Machine Translation?
Given a sequence of text in a source language, there is no one single best translation of that text to another language.
This is because of the natural ambiguity and flexibility of human language, especially with most of the Indian Languages that are rarely spoken in other countries.
�What Is A Machine Translation?
Traditionally, Natural Language Processing of both spoken and written language has been regarded as consisting of the following stages:
�What Is A Machine Translation?
Here is the pictorial representation of Bernard Vauquois' pyramid, showing comparative depths of intermediary representation, interlingual machine translation at the peak, followed by transfer-based, then direct translation.
�Types Of Machine Translation
There are different types of machine translation…..
�Statistical Machine Translation or SMT
�Statistical Machine Translation or SMT
�Statistical Machine Translation or SMT
The most abstract view of Statistical Machine Translation can be understood from the image below:
�Statistical Machine Translation or SMT
�Rule-based Machine Translation or RBMT
�Rule-based Machine Translation or RBMT
�Hybrid Machine Translation or HMT
�Hybrid Machine Translation or HMT
�Hybrid Machine Translation or HMT
�Hybrid Machine Translation or HMT
�Neural Machine Translation or NMT�
�What Are The Benefits Of Machine Translation?
�Applications of machine translation
�Machine Translation vs Human translation
On the other hand, human translation is incredible for those undertakings that require additional consideration and subtlety. Talented translators work on your image's substance to catch the first importance and pass on that feeling or message basically in another assortment of work.
�Machine Translation vs Human translation
�Machine Translation vs Human translation
�Machine Translation vs Human translation
Speech recognition
Building a Speech Recognizer
Difficulties in developing a speech recognition system
Difficulties in developing a speech recognition system
Developing a high quality speech recognition system is really a difficult problem.
The difficulty of speech recognition technology can be broadly characterized along a number of dimensions as discussed below −
Difficulties in developing a speech recognition system
Size of the vocabulary
− Size of the vocabulary impacts the ease of developing an ASR.
Consider the following sizes of vocabulary for a better understanding.
Note that, the larger the size of vocabulary, the harder it is to perform recognition.
Difficulties in developing a speech recognition system
Channel characteristics −
Channel quality is also an important dimension.
For example, human speech contains high bandwidth with full frequency range, while a telephone speech consists of low bandwidth with limited frequency range. Note that it is harder in the latter.
Difficulties in developing a speech recognition system
Speaking mode
− Ease of developing an ASR also depends on the speaking mode, that is whether the speech is in isolated word mode, or connected word mode, or in a continuous speech mode.
Note that a continuous speech is harder to recognize.
Difficulties in developing a speech recognition system
Speaking style
− A read speech may be in a formal style,
or spontaneous and conversational with casual style.
The latter is harder to recognize.
Difficulties in developing a speech recognition system
Speaker dependency
− Speech can be speaker dependent, speaker adaptive, or speaker independent.
A speaker independent is the hardest to build.
Difficulties in developing a speech recognition system
Type of noise
− Noise is another factor to consider while developing an ASR.
Signal to noise ratio may be in various ranges, depending on the acoustic environment that observes less versus more background noise −
For example, the type of background noise such as stationary, non-human noise, background speech and crosstalk by other speakers also contributes to the difficulty of the problem.
Difficulties in developing a speech recognition system
Type of noise
− Noise is another factor to consider while developing an ASR.
Signal to noise ratio may be in various ranges, depending on the acoustic environment that observes less versus more background noise −
For example, the type of background noise such as stationary, non-human noise, background speech and crosstalk by other speakers also contributes to the difficulty of the problem.
Difficulties in developing a speech recognition system
Microphone characteristics
− The quality of microphone may be good, average, or below average.
Also, the distance between mouth and micro-phone can vary.
These factors also should be considered for recognition systems.
Difficulties in developing a speech recognition system
Microphone characteristics
− The quality of microphone may be good, average, or below average.
Also, the distance between mouth and micro-phone can vary.
These factors also should be considered for recognition systems.
Types of Models in Speech Recognition
Models in speech recognition can conceptually be divided into an acoustic model and a language model.
The acoustic model solves the problems of turning sound signals into some kind of phonetic representation.
The language model houses the domain knowledge of words, grammar, and sentence structure for the language.
Phonetics
Phonetics
Phonetics