Project Aristotle
Almost AGI
Q&A chatbot
Fix universe
No agency
The philosopher... as a goal!
Project Aristotle
Factoid question answering
User: What is the date of birth of Mozart?
Aristotle: The famous music composer?
User: Yes.
Aristotle: Wolfgang Amadeus Mozart is born in January 27, 1756
Complex questions
User: How many different president have been in office in the 20th century?
Aristotle: You are referring to US presidents?
User: Yes
Aristotle: Are you referring to full term of do I include partial term?
User: Include partial term.
Aristotle: There we 17 distinct president that we in office for at least some time in the 20th century.
General intelligence
User: Which 21st US president implemented the most efficient policy in term of economic growth.
Aristotle: Do you equate economic growth to GDP?
User: …
Dialectic of representation
The Assayer
Philosophy is written in this grand book, which stands continually open before our eyes (I say the “Universe”), but can not be understood without first learning to comprehend the language and know the characters as it is written. It is written in mathematical language, and its characters are triangles, circles and other geometric figures, without which it is impossible to humanely understand a word; without these one is wandering in a dark labyrinth.
Galileo Galilei, The Assayer (1623)
Concept
Instances
Logical positivism
Law: x ∈ Man → x ∈ Mortal
Fact: Socrates ∈ Man
Conclusion: Socrates ∈ Mortal
Logical positivism
Ludwig Wittgenstein
Tractatus
Investigation
Meaning by usage
Can Quantum Mechanical Description of Physical Reality Be Considered Complete?
Einstein, Podolsky and Rosen
1935
A mix of natural language and mathematics
Dialectic of representation
Logic | Language | World |
Discrete | Discrete (almost) | Continuous |
Unambiguous | Ambiguous | Raw |
Multiple | One | One |
Constant | Drifting | Changing |
Formal theories | Text | Video |
Dense | Dense | Complete |
Dialectic of representation
Discrete ↔ Continuous
Idealism ↔ Empiricism
Logic ↔ Senses
Tractatus ↔ Investigations
Word token ↔ Word embedding
Thinking slow ↔ Thinking fast
System 1 ↔ System 2
Modus ponen ↔ Bayes rule
Transparent ↔ Opaque
Old school AI ↔ Deep learning
Daniel Kahneman
Daniel Kahneman (1934 – ) is an Israeli-American psychologist and economist notable for his work on the psychology of judgment and decision-making, as well as behavioral economics, for which he was awarded the 2002 Nobel Memorial Prize in Economic Sciences. His empirical findings challenge the assumption of human rationality prevailing in modern economic theory.
Thinking, fast and slow
System 2
System 1
Thinking, fast and slow
System 2
System 1
22
Morphemes and Sememes
Sememes hypothesis
I believe that that language has to be treated using discrete unit similar to words. Unfortunately tokenization in word is problematic for several reason and the use of morphemes is not necessarily more practical.
In this project we aim to learn a latent semantic vocabulary (sememes) in which every language can be faithfully and efficiently translated.
We hope that this latent sememe language will be useful in the field of linguistic and natural language processing.
Words and Morphemes
A word is the smallest element that can be uttered in isolation with objective or practical meaning.
A morpheme is the smallest unit of meaning but will not necessarily stand on its own.
A word can concatenate concept
Prefix
Sufix
antifreeze
defrost
disagree
encode
embrace
forecast
injustice
impossible
interact
midway
misfire
nonsense
overlook
return
semicircle
submarine
superstar
transport
personal
hopped
wooden
higher
worker
biggest
careful
linguistic
running
attraction
infinity
plaintive
fearless
quickly
enjoyment
kindness
joyous
comfortable
Compoundwords
candlestick campfire candytuft cannot cardboard carefree careless caretaker carport cartwheel catfish catnap | checkmate checkroom checkup chestnut chickpea childbirth childcare childfreeh childlike childproof childrearing chopstick | clotheshorse coastline cobweb copycat coldframe coldhearted coldsore commonsense cookbook cookout cooktop cookwear | cornbread corncob corndog cornmeal cornstalk cottonmouth countdown counterattack counterbalance counterweight countryside courthouse | cowslip crabgrass craftsman crawfish crossbow crossroad crosswalk crossword crowbar cubbyhole cupboard cupcake |
Some groups of words act as a single word
French | English |
fin de semaine | weekend |
en réalité | as a matter of fact |
avant | prior to |
de temps en temps | occasionally |
trotteuse | second hand |
Words as a basic unit
English German Modern mandarin
taxi driver Taxifahrer 出租车司机
taxi driver taxi driver go out-rent-car-control-machine�
Words and sememes
Example of sememes tokenization
cats cat▪s�Canadian Canada▪ian�Italian Italy▪ian�French France▪ian�countdown count▪down �autotomy casting▪off▪limb
Embeddings
Embeddings hypothesis
One a procedure exists to transform document into tokens, for example using sememes, I believe that an independent machinery should produce embeddings based on large corpus of document without a specific task in mind.
Embeddings
Recursive AutoEncoder (RAE)
The black dog bites the white cat
The black dog bites the white cat
Towards Lossless Encoding of Sentences
Prato, Chandar, Tapp�ACL 2019 Submission
Accuracy for exact and complete phrase reconstruction.
Embedding size: 300, 512 and 1024
Towards Lossless Encoding of Sentences
Sentiment Analysis
Stanford Sentiment Treebank
RAE is our approach
SST-2: complete sentence
SST-5: all sub phrases
Linguistic structure
This pusillanimus Canadian works at the White House.
This pusillanimus Canadian works at the White House.�
▪↑ this▪show lack courage▪Canada ian▪work s▪at▪the▪↑ white▪↑ house▪
▪↑ this▪show lack▪courage▪Canada ian▪work s at the▪↑ white ↑ house▪
Linguistic structure
This pusillanimus Canadian works at the White House.
▪↑ this▪show lack▪courage▪Canada ian▪work s at the▪↑ white ↑ house▪
Knowledge representation and reasoning
Formal reasoning does not capture common sense
The airplane is heavy, otherwise I could carry it.�The airplane is not heavy, otherwise it would not fly.
Because cheap horses are rare�and rare horses are expensive I claim that �cheap horses are expensive.
This is why old school AI did not succeed.
Independently consistent collection of theories
Quantum Mechanics
General
Relativity
Thermodynamics
Newton mechanics
Biology
Independently consistent collection of models
Common human
Literary human
Transhuman human
Ideal human
Biology
human
Independently consistent collection of models
Christian
Consumer
Green
NRA
UFO
Knowledge representation and reasoning
Logic
First Order Logic (FOL)
Ex 1: All member of MILA has a key and a code.
Ex 2: ∀𝑥, ∃𝑦, (𝑃(𝑥)∨𝑃(𝑦))∧(𝑓(𝑥)=𝑓(𝑦))
Von Neumann–Bernays–Gödel set theory
NBG is Von Neumann–Bernays–Gödel set theory.
NBG can express all sciences!
Knowledge representation and reasoning
System 2 = FOL
Metamath
Metamath is a language for developing strictly formalized mathematical definitions and proofs accompanied by a proof checker for this language and a growing database of thousands of proved theorems covering conventional results in logic, set theory, number theory, group theory, algebra, analysis, and topology, as well as topics in Hilbert spaces and quantum logic.
Aristotle architecture
Project Aristotle
Memory
Linguistic
Learning
Logic
Project Aristotle
Aristotle
Knowledge
Knowledge graph and ontology
Linguistic