
Back to the noisy channel

Taro Watanabe

taro at is.naist.jp

NAIST NLP


Early MT research


When I look at an article in Russian, I say: "This is really written in English, but it has been coded in some strange symbols. I will now proceed to decode." (Warren Weaver, 1947)

... I frankly am afraid the boundaries of words in different languages are too vague ... to make any quasi-mechanical translation scheme very hopeful. (Norbert Wiener, 1947)


History of MT

1950: Code breaking

1960: Rule-based MT (Systran); ALPAC report

1990: Example-based MT; IBM Model

2000: Statistical MT (Phrase-based MT, Syntax-based MT)

2013: Neural MT

2016: Google NMT


MT as transfer

昨日麒麟を散歩した。 → I walked a giraffe yesterday.

Source analysis: 散歩 [arg0: ?, arg1: 麒麟, temp: 昨日]

Target structure: walk [arg0: I, arg1: giraffe, temp: yesterday]

Event: walk(?, giraffe), date(yesterday)


MT as transfer: The Vauquois triangle

The same example, placed on the Vauquois triangle: transfer can operate at any of its levels, from Words up through Syntax and Semantics to an Interlingua. The semantic transfer maps 散歩 [arg0: ?, arg1: 麒麟, temp: 昨日] to walk [arg0: I, arg1: giraffe, temp: yesterday]; the interlingua is the event walk(?, giraffe), date(yesterday).


Example-based MT

Look up similar examples and edit them.


Bilingual Data


上海浦东开发与法制建设同步

新华社上海二月十日电(记者谢金虎、张持坚)

上海浦东近年来颁布实行了涉及经济、贸易、建设、规划、科技、文教等领域的七十一件法规性文件,确保了浦东开发的有序进行。

浦东开发开放是一项振兴上海,建设现代化经济、贸易、金融中心的跨世纪工程,因此大量出现的是以前不曾遇到过的新情况、新问题。

对此,浦东不是简单的采取“干一段时间,等积累了经验以后再制定法规条例”的做法,而是借鉴发达国家和深圳等特区的经验教训,聘请国内外有关专家学者,积极、及时地制定和推出法规性文件,使这些经济活动一出现就被纳入法制轨道。

去年初浦东新区诞生的中国第一家医疗机构药品采购服务中心,正因为一开始就比较规范,运转至今,成交药品一亿多元,没有发现一例回扣。

The development of Shanghai's Pudong is in step with the establishment of its legal system

Xinhua News Agency, Shanghai, February 10, by wire (reporters Jinhu Xie and Chijian Zhang)

In recent years Shanghai's Pudong has promulgated and implemented 71 regulatory documents relating to areas such as economics, trade, construction, planning, science and technology, culture and education, etc., ensuring the orderly advancement of Pudong's development.

Pudong's development and opening up is a century-spanning undertaking for vigorously promoting Shanghai and constructing a modern economic, trade, and financial center. Because of this, new situations and new questions that have not been encountered before are emerging in great numbers.

In response to this, Pudong is not simply adopting an approach of "work for a short time and then draw up laws and regulations only after waiting until experience has been accumulated." Instead, Pudong is taking advantage of the lessons from experience of developed countries and special regions such as Shenzhen by hiring appropriate domestic and foreign specialists and scholars, by actively and promptly formulating and issuing regulatory documents, and by ensuring that these economic activities are incorporated into the sphere of influence of the legal system as soon as they appear.

Precisely because as soon as it opened it was relatively standardized, China's first drug purchase service center for medical treatment institutions, which came into being at the beginning of last year in the Pudong new region, in operating up to now, has concluded transactions for drugs of over 100 million yuan and hasn't had one case of kickback.


Guess Translation

(The same parallel text as above.)


Two modeling approaches to MT

  • Direct modeling (i.e., classification)
    • Good learning algorithm + good model
    • Huge numbers of example inputs (in the source language) and outputs (in the target language).
  • Code breaking (i.e., noisy channel, or generative model)
    • Assume that the target language is known.
    • Many examples of translated texts in the source language.
    • Split into sub-models + complex decoding


Code Breaking

  • Assume that the output y was distorted by noise (the noisy channel).
  • Separate the problem into two parts: a translation model and a language model.

[Diagram: Source → Y → Noisy channel → X → Decoder → Y′]
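In equations, the decoder inverts the channel with Bayes' rule, and decoding seeks

\[
\hat{y} = \operatorname*{argmax}_{y} P(y \mid x)
        = \operatorname*{argmax}_{y} \underbrace{P(x \mid y)}_{\text{translation model}}\,\underbrace{P(y)}_{\text{language model}},
\]

since P(x) is constant for a given input.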


Statistical MT

昨日麒麟を散歩した。 ⇄ I walked a giraffe yesterday.

Translation model: P(昨日 | yesterday), P(散歩 | walked), P(麒麟 | giraffe), P(麒麟 | dog), ...; with context: P(昨日 | yesterday I), P(麒麟 | a giraffe), P(散歩した | I walked), P(散歩した | I walk), ...

Language model: P(I walked a giraffe ...), P(he walked a dog ...)

MT as code breaking.


Direct Modeling

  • Directly represent the translation process with a single model.
  • Learn the model parameters from bilingual data, i.e., pairs of source/target texts.

[Diagram: X → transfer → Y]


Neural MT


昨日麒麟を散歩した。

I walked a giraffe yesterday.

Direct modeling by Neural Networks


Deeper model with residual connection

Stack layers for better representations.

Residual connections to avoid vanishing gradients.

RNNs are slow.

Similar networks in Zhou et al. (2016) and Wu et al. (2016).


Parallel computation

  • Efficient training by convolution (CNN), which allows faster computation.
  • However, it requires many layers, e.g., 40, to capture long-distance relations.

Encoder and decoder are both built from multiple layers of CNN + gating.


Long-distance relations

  • Efficient training by parallel computation, with long-distance relations represented by self-attention.
  • Inefficient decoding, since it needs to memorize a long history.

Attention is position-neutral; the Transformer is attention-heavy.


Quality Improvements on WMT14 EN-FR by Neural MT (NMT)



Research Trends

  • Data: Noise / Low resource / Multilingual data
  • Capacity: Deeper modeling / Smallish model
  • OOV: Rare words / Named entities
  • Features: Context / Domain / Style / Syntax / Semantics
  • Inference: Streaming / Non-autoregressive / Quantization
  • Mode: Supervised / Unsupervised
  • Others: Explainability / Controllability / Bias / Hallucination
  • Evaluation: Count-based / Model-based / Learning-based


Back to the noisy channel


History of MT

(Same timeline as above.)


Why noisy channel?

The direct model is biased toward highly predictive outputs, a.k.a. the explaining-away effect (Klein and Manning, 2001).

  • Extremely large data is necessary to avoid the bias.
  • The distribution of x is unknown during decoding, so the model's behavior on unseen inputs is unpredictable.


Why noisy channel?

The noisy channel model selects likely outputs a priori and then explains the observed distribution of x during decoding.

  • The channel model is robust to noise in x, since the prior encodes a strong belief about what the model should predict.
  • Usually small(ish) data is sufficient to train P(x | y).


Noisy channel with NN

Machine Translation

  • Simple and Effective Noisy Channel Model (Yee et al., 2019)
  • Neural Noisy Channel Model (Yu et al., 2017)
  • Document-level MT (Yu et al., 2020)

Other tasks


Simple and Effective Noisy Channel Model

Employ a standard NMT, e.g., Transformer (Vaswani et al., 2017), for P(x | y).

  • Reranking is easy. However, decoding is complex and slow.
  • Model combination of two directions, p(x | y) and p(y | x).
  • Follow-up work for empirically faster decoding (Bhosale et al., 2020).


Decoding: Direct Model

At each time step, keep the k best candidates in beam search.

  • For each prefix, compute p(y | y1 … yt, x) for every candidate next word y, multiplied by p(y1 … yt | x).

[Beam-search lattice: <s> → {I, He} → {walked, walk, walks} → {a, the} → {giraffe, dog, cat}; e.g., p(I | x) and p(He | x), then p(walked | I, x) × p(I | x) and p(walked | He, x) × p(He | x), and so on.]
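To make the bookkeeping concrete, here is a minimal beam-search sketch for the direct model; log_p_next(prefix, x) is a hypothetical scoring function returning next-token log-probabilities, not an API from any of the cited papers.

```python
from heapq import nlargest

def beam_search(x, log_p_next, k=2, max_len=20, eos="</s>"):
    """Direct-model beam search: keep the k best prefixes per step,
    scored by the cumulative log p(y_1 ... y_t | x)."""
    beams = [(0.0, ["<s>"])]  # (cumulative log-prob, prefix)
    for _ in range(max_len):
        expanded = []
        for score, prefix in beams:
            if prefix[-1] == eos:          # finished hypotheses carry over
                expanded.append((score, prefix))
                continue
            for tok, lp in log_p_next(prefix, x).items():
                expanded.append((score + lp, prefix + [tok]))
        beams = nlargest(k, expanded, key=lambda b: b[0])
    return beams
```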


Decoding: Noisy Channel Model

For each prefix, compute p(x | y) and p(y).

  • p(y) is easy when using an RNN.
  • p(x | y) has to be recomputed for every candidate next word of y.
  • Non-monotonicity problem: unlike the direct model, extending y can increase p(x | y), so prefix scores do not shrink monotonically.

[The same lattice, but each prefix is scored by the channel model and LM: p(x | I) × p(I) and p(x | He) × p(He), then p(x | I walked) × p(I walked) and p(x | He walked) × p(He walked), and so on.]


Decoding: Approximation

Filter the vocabulary space by p(y | x).

  • Compute p(x | y) for the k × k candidates in a batched manner.
  • However, the non-monotonicity problem still exists.

[The same lattice: at each step the direct model p(y | y1 … yt, x) proposes the next words for each beam, and only those k × k continuations are rescored with p(x | y) × p(y).]
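A sketch of one step of this approximation, assuming hypothetical scoring functions direct_topk, channel_logp, and lm_logp; the exact weighting in Yee et al. (2019) may differ.

```python
from heapq import nlargest

def channel_step(beams, x, direct_topk, channel_logp, lm_logp, k=2, lam=1.0):
    """One beam-search step: the direct model proposes k next tokens per
    beam, then the k*k continuations are rescored with the channel model
    and language model, log p(x|y) + lam * log p(y)."""
    candidates = []
    for _, prefix in beams:
        for tok in direct_topk(prefix, x, k):   # filter by p(y | x)
            y = prefix + [tok]
            score = channel_logp(x, y) + lam * lm_logp(y)
            candidates.append((score, y))
    return nlargest(k, candidates, key=lambda c: c[0])
```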


Model Combination

Combine p(y | x) and p(x | y) with length normalization.

  • Additional parameter tuning for λ on validation/development data.
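One plausible form of the length-normalized combination (the exact weighting in Yee et al., 2019, may differ):

\[
\frac{1}{|y|}\Bigl[\log p(y \mid x) + \lambda\bigl(\log p(x \mid y) + \log p(y)\bigr)\Bigr],
\]

with λ tuned on validation/development data.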


Experimental Results

[Chart: WMT17 De-En BLEU. Best results by the noisy channel model (NCM); reranking results show larger gains.]


Neural Noisy Channel Model

Introduce a latent variable z to indicate a monotonic alignment between x and y.

  • Based on a segment-based direct model (Yu et al., 2016).
  • Factorize the channel model into two sub-components: alignment and word probabilities.
  • Inference is carried out efficiently by dynamic programming.
  • Decoding is approximated with the direct model.
  • Model combination.


Factorization

Split into two sub-models using z: an alignment model and a word model.

  • p(zi | zi-1, ...): predict the next "segment" of y.
  • p(xi | ...): predict the next word of x.

[Equation: the channel model factorized into the alignment model and the word model via the latent alignment variable z.]
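Written out (a reconstruction following Yu et al., 2017, with conditioning details abbreviated):

\[
p(x \mid y) = \sum_{z} \prod_{i=1}^{|x|}
  \underbrace{p(z_i \mid z_{i-1}, x_{1:i-1}, y)}_{\text{alignment model}}\;
  \underbrace{p(x_i \mid x_{1:i-1}, y_{1:z_i})}_{\text{word model}}
\]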


Alignment: z

For each position in x, specify how to monotonically segment y, i.e., the end position of each span.

  • The idea is very similar to IBM Models (Brown et al., 1993).

[Example: z = (2, 2, 3, 4, 4, 5, 5, 5, 5, 7, 7, 8), i.e., x1 and x2 are generated from y1:2, x3 from y1:3, x4 and x5 from y1:4, and so on.]


Action sequence: a

Transitions are modeled by an action sequence a ∈ {SHIFT, EMIT}^(|x| + |y|), one SHIFT per word of y and one EMIT per word of x.

  • SHIFT: read the next word of y, i.e., increment the index for zi.
  • EMIT: stay and emit xi.

[Example: per position of x, the actions are (SHIFT, EMIT), (EMIT), (SHIFT, EMIT), (SHIFT, EMIT), (EMIT), (SHIFT, EMIT), (EMIT).]
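A tiny sketch (an illustration, not from the paper) of how a monotone alignment z expands into an action sequence of length |x| + |y|:

```python
def z_to_actions(z):
    """Convert a monotone alignment z, where z[i] is the number of y words
    read when emitting x[i], into a SHIFT/EMIT action sequence."""
    actions, read = [], 0
    for zi in z:
        actions += ["SHIFT"] * (zi - read)  # read y up to position zi
        actions.append("EMIT")              # emit the next word of x
        read = zi
    return actions

# e.g. z_to_actions([2, 2, 3])
# -> ['SHIFT', 'SHIFT', 'EMIT', 'EMIT', 'SHIFT', 'EMIT']
```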


Alignment Model

p(zi | zi-1, ...) is modeled through the action sequence a.

  • The alignment probability depends only on prefixes of x and y.


Instantiated as NN

The model is instantiated as two LSTMs, one for x and the other for y.

  • Probabilities for the sub-components are simply MLPs over the concatenated hidden states.

[Diagram: an LSTM over x1, x2, x3 and an LSTM over y1, y2, y3; their hidden states are concatenated.]


Inference

Forward-backward algorithm (Rabiner, 1989) for efficient inference.

  • Backward is not necessary for exponential-family models; you can simply backprop (Eisner, 2016).

[Equation: the forward recurrence; a table memorizes intermediate computations, and the recurrence sums over all alignments z.]
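A standard forward recursion for this model (a reconstruction; α(i, j) accumulates the probability of emitting x1:i with zi = j):

\[
\alpha(i, j) = p(x_i \mid x_{1:i-1}, y_{1:j}) \sum_{j' \le j} \alpha(i-1, j')\, p(z_i = j \mid z_{i-1} = j'),
\qquad
p(x \mid y) = \sum_{j} \alpha(|x|, j).
\]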


Decoding

Approximate decoding by searching for the maximum over y and z.

  • Use the direct model p(y | x) to filter the search space.
  • Less prone to search errors, since the model can compute prefix scores (?).


Model Combination

Model combination of the two directions plus a length bias.

  • No length normalization (?)
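A plausible form of such a combination, with tunable weights (not necessarily the paper's exact formula):

\[
\log p(y \mid x) + \lambda_1 \log p(x \mid y) + \lambda_2 \log p(y) + \lambda_3 |y|.
\]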


Experimental Results

  • Complex decoding similar to Yee et al. (2019), but much faster: O(|x| + |y|) in contrast to O(|x||y|).

[Table: LDC zh-en results.]


Document-level MT

Extend the MT task to document-level context.

  • No special context-aware modeling (cf. Voita et al., 2018):
    • a standard MT model, e.g., Transformer (Vaswani et al., 2017), for P(x | y);
    • a standard LM, e.g., Transformer-XL (Dai et al., 2019), for P(y).
  • Decoding is sentence-wise reranking with model combination.


Factorization

Translate the whole document x into y with a noisy channel model.

  • Sentence translation model: p(xi | yi)
  • Document language model: p(yi | y<i)
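Putting the two together over sentence indices i, with xi and yi the i-th source/target sentences:

\[
p(x, y) = \prod_{i} p(y_i \mid y_{<i})\; p(x_i \mid y_i).
\]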


Graphical Model

A very strong conditional independence assumption between x and y.

  • However, x influences the choice of y during decoding.
  • The same trick as in naïve Bayes classifiers.


Decoding

Compute k-best translations for all sentences in x using q(y | x), then rescore with beam search using p(x | y) p(y).

  • q(y | x) is either a sentence-wise or a document-wise model.

[Diagram: for each sentence xi, candidates yi,1, yi,2, yi,3 are proposed by p(y | x); beam search over the candidate lists with p(x | y) p(y).]
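A minimal sketch of this propose-and-rerank scheme, assuming hypothetical scoring functions channel_logp and doclm_logp:

```python
from heapq import nlargest

def rerank_document(cands, channel_logp, doclm_logp, k=2):
    """cands[i] is the k-best candidate list for sentence i. Keep a beam
    over partial documents y1..yi scored by log p(x|y) + log p(y)."""
    beams = [(0.0, [])]
    for i, options in enumerate(cands):
        expanded = []
        for score, prefix in beams:
            for y in options:
                s = score + channel_logp(i, y) + doclm_logp(prefix, y)
                expanded.append((s, prefix + [y]))
        beams = nlargest(k, expanded, key=lambda b: b[0])
    return beams[0][1]   # best document translation
```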


Model Combination



Experimental Results

[Table: LDC zh-en results.]

Other tasks

Text Classification

Given a label, generate the input text (Ding and Gimpel, 2019).

  • A latent variable captures topics.
  • The generative model, i.e., the channel model, is based on Yogatama et al. (2017).


Experimental Results


Question Answering

Find an answer (a) to a question (q) given a context (c), e.g., a document or an image (Lewis and Fan, 2019).

  • p(a | c): prior for all possible answers.
  • p(q | a, c): conditional language model for questions.
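In noisy channel form, the answer is found by Bayes' rule:

\[
\hat{a} = \operatorname*{argmax}_{a} p(a \mid q, c)
        = \operatorname*{argmax}_{a} p(q \mid a, c)\, p(a \mid c).
\]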


Experimental Results


Robust results on adversarial SQuAD (Jia and Liang, 2017).

Good results on multi-paragraph inputs, even though the model was not trained with such contexts.


Noisy channel modeling is explainable


Interpret which question words are explained by the answer.


Dialogue

Given a context (C), predict the state (B), dialogue act (A), and response (R) (Liu et al., 2021).

  • p(C, B): prior over context and state.
  • p(A, R | C, B): conditional language model.


Experimental Results: MultiWOZ


Grammatical Error Correction

A straightforward application of noisy channel modeling (Flachs et al., 2019).

  • A simple dictionary is used for p(x | c).
  • BERT (Devlin et al., 2019) or GPT-2 (Radford et al., 2019) as the prior.

Sources for the confusion dictionary:

  • Wikipedia edit history
  • Spell checker
  • Number agreement
  • Verb forms


Experimental Results

[Table: BEA 2019 Shared Task results.]


Zero/few-shot text classification

Given a verbalized classification label, predict the input text (Min et al., 2021).

  • A noisy channel variant of Brown et al. (2020).
  • Employ a uniform distribution as the label prior.

[Scoring variants: Zero-shot, Concat, and Ensemble (formulas omitted).]
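A minimal sketch of channel-style zero-shot scoring, assuming a hypothetical LM scoring function lm_logprob; each label is scored by the probability of the input given its verbalized label, with a uniform label prior.

```python
def channel_classify(text, verbalizers, lm_logprob):
    """verbalizers: dict mapping label -> verbalized string,
    e.g. {"positive": "It was great.", "negative": "It was terrible."}.
    lm_logprob(prefix, continuation): log p(continuation | prefix) under an LM."""
    return max(verbalizers,
               key=lambda lbl: lm_logprob(verbalizers[lbl], text))
```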


Experimental Results


Noisy channel with NN

Other tasks

  • Your task/application!


Back to the noisy channel

"The crude force of computers is not science." (COLING review of Brown et al., 1988)

[Slide overlay: "deep learning"]

YANS 2021: Back to the noisy channel

67 of 68

Recipes for the noisy channel

  • Think in the inverse direction.
    • A SOTA direct model in the inverse direction is usually sufficient, without deeper structure.
  • Design sub-components for efficient training and decoding.
    • This is not trivial, and many open questions remain.
  • Complex decoding: use a direct model to filter the search space.
    • Surprisingly, many tricks have already been investigated in the past.
  • Extra model combination and tuning are needed.
    • Usually more stable than data augmentation/fine-tuning of a direct model.


Questions?
