History of AI
1.Inception of AI (1943 – 1955)
They proposed a model of artificial neurons in which each neuron is characterized by a sufficient number of neighbouring neurons.
HEBBIAN LEARNING: McCulloch and Pitts also suggested that suitably defined networks could learn. Donald Hebb(1949) demonstrated a simple updating rule for modifying the connection strengths between neurons.
1.Inception of AI (1943 – 1955)
2.The Birth of AI (1956)
2. 1956: Logic Theorist-Allen Newell and Herbert Simon
3.Early enthusiasm, great expectations(1952-1969)
Computers where designed for arithmetic operations and nothing else but AI researchers intellectual establishment, by large, preferred to believe that
“a machine can never do X”.
They focused on the tasks including games, puzzles, mathematics and IQ.
General Problem Solver, or GPS, Newell and Simon’s early success was followed up the GPS, unlike Logic Theorist, this program was designed from the start to imitate human problem-solving protocols. Within the limited class of puzzles it could handle, it turned out that the order in which the program considered subgoals and possible actions was similar to that in which humans approached the same problems.
Physical symbol system: hypothesis, which states that “a physical symbol system has the necessary and SYSTEM sufficient means for general intelligent action.” What they meant is that any system (human or machine) exhibiting intelligence must operate by manipulating data structures composed of symbols.
3.Early enthusiasm, great expectations(1952-1969)
At IBM, Nathaniel Rochester and his colleagues produced some of the first AI programs.
1952, Arthur Samuel wrote a series of programs for checkers(draughts) that eventually learned to play at a strong amateur level.
1958, In MIT AI lab, Mc-Carthy defined the high-level language Lisp, that has dominated the next 30 years programming language.
1958, Mc-Carthy published a paper entitled programs with common sense, in which he described the Advice Taker, a hypothetical program that can be seen as the first complete AI system.
1959: Herbert Gelernter constructed the Geometry Theorem Prover, which was able to prove theorems of mathematics that students would find quite tricky.
3.Early enthusiasm, great expectations(1952-1969)
1963 Microworlds.
James Slagle’s SAINT program(1963) was able to solve closed-form calculus integration problem typical of first-year college courses.
Tom Evans’s ANALOGY program(1968) solved geometric analogy problems that appear in IQ tests.
Daniel Bobrow’s STUDENT program(1967) solved algebra story problems, such as the following:
if the number of customers Tom gets it twice the square of 20% of the number of advertisements he runs, and the number of advertisements he runs 45, what is the number of customers Tom gets?
3.Early enthusiasm, great expectations(1952-1969)
3.Early enthusiasm, great expectations(1952-1969)
Hebb’s learning methods were enhanced by Bernie Widrow (widrow and Hoff, 1960; Widrow, 1962), who called his networks adalines, and by Frank Rosenblatt(1962) with his perceptions.
The perception convergence theorem(Block et al.., 1962) says that the learning algorithm can adjust the connection strengths of a perceptron to match any input data, provided such a match exists.
4.A dose of reality (1966-1973)
Reasons for failure?
5.Knowledge-based systems: The key to power?(1969-1979)
Early AI systems had adopted the general purpose search mechanism to solve the problem which proved to be weak as it did not scale up on large data.
The first kind of difficulty arose because most early programs knew nothing of their subject matter, they succeeded by means of simple syntactic manipulations.
The alternative to weak methods is to use more powerful, domain-specific knowledge that allows larger reasoning steps and can more easily handle typically occurring cases in narrow areas of expertise. To solve the more hard problem, you must know the result. DENDRAL program was an example of this type.
Expert systems are more Knowledge-intensive systems, Stanford began Heuristic programming project(HPP) to understand how new method of expert system can be applied to other areas.
MYCIN was system to diagnose the blood infection based on450 rules and was better than junior doctors which employed calculus of uncertainty called certainty factors to fit well how doctors diagnose on impact of evidence on the diagnosis.
6.AI becomes an industry(1980-present)
7. The return of neural networks(1986-present)
In mid 1980 four groups reinvented the back propagation learning algorithms and were applied on the many learning problems in CSE and psychology.
The widespread dissemination of these results in collection parallel distributed processing caused great excitements. They have capability to learn from the examples.
These connectionist models were competitors to the symbolic model proposed by newel and simons and logistic approach od Mc-Carthy. As human manipulate symbols.
These connectionists can compare the predicted true value to the expected output and can modify their parameters to decrease the in differences.
8.AI adopts the scientific method(1987-present)
Brittleness to expert system lead to incorporation of new more scientific approach.
Probability in place of Boolean logic.
Machine learning rather than hand coding.
Experimental results rather than philosophical claims.
Standards to note the progress
Soon UC Irvine was used as a standard repository for ML datasets,
The international planning competition for planning algorithms
LibreSpeech corpus for speech recognition,
The MNIST data set for handwritten digit recognition
ImageNet and COCO for image object recognition
SQUAD for NLP answering
Hidden Markov model dominated the area of speech recognition during 1980 which is based on the real, and large corpus of speech dataset to ensure the performance is robust.
Note that there was no scientific claim that human use HMM but HMM provided mathematical framework for understanding and solving problems.
1988 was important year for connection between the AI and other fields like statistics, OR, decision theory, and control systems.
Pearl’s probabilistic reasoning in intelligent systems lead to new acceptance of probability and decision theory in AI.
Pearl’s Baysian networks yielded the rigorous and efficient formalism for representing uncertain knowledge as well practise algorithms for probabilistic reasoning.
Rich sutans reinforcement learning was the major contribution in 1988.
9. The emergence of intelligent agents(1995-present)
10. The availability of very large data sets(2001-present)