1 of 48

Generative Models, �LLMs & more

2 of 48

Generative models

3 of 48

Generative models

4 of 48

Generative models

5 of 48

Generative models

6 of 48

Generative models

7 of 48

Generative models

8 of 48

Generative models

9 of 48

Generative models

10 of 48

Generative models

Teaching a computer to read: Classification

Teaching a computer to write: Generative

11 of 48

LLMs

Generative models for language

12 of 48

LLMs: A simplified setting

  •  

13 of 48

LLMs: A simplified setting

  •  

14 of 48

Challenge 1: Words aren’t enough!

  • Typos: What if a paragraph contains “ardvarrk”?�
  • Different versions of words: �Listen, listened, listening, listener, …�Biology, biologist, biological, biologically,

Remedy: Use tokens = short character subsequences instead of words

15 of 48

Challenge 1: Words aren’t enough!

  • Typos: What if a paragraph containsardvarrk”?�
  • Different versions of words: �Listen, listened, listening, listener, …�Biology, biologist, biological, biologically,

Remedy: Use tokens = short character subsequences instead of words

16 of 48

Tokenization

For tokenizing, we start with tokens for all characters, then continue finding most common next-shortest subsequences.

17 of 48

Tokenization

For tokenizing, we start with tokens for all characters, then continue finding most common next-shortest subsequences.

single-char tokens

18 of 48

Tokenization

For tokenizing, we start with tokens for all characters, then continue finding most common next-shortest subsequences.

two-char tokens

19 of 48

Tokenization

For tokenizing, we start with tokens for all characters, then continue finding most common next-shortest subsequences.

two-char tokens

20 of 48

Tokenization

For tokenizing, we start with tokens for all characters, then continue finding most common next-shortest subsequences.

two-char tokens

21 of 48

Tokenization

For tokenizing, we start with tokens for all characters, then continue finding most common next-shortest subsequences.

two-char tokens

22 of 48

Tokenization

For tokenizing, we start with tokens for all characters, then continue finding most common next-shortest subsequences.

3+ char tokens

23 of 48

Challenge 2: Language is not fixed length

  •  

24 of 48

Challenge 2: Language is not fixed length

  •  

25 of 48

Challenge 2: Language is not fixed length

  •  

26 of 48

Challenge 2: Language is not fixed length

  •  

27 of 48

Challenge 2: Language is not fixed length

  •  

28 of 48

LLMs

  • Tokenize language
  • Predict the next token based off the previous tokens
  • Use encoding / embedding of tokens as vectors
  • Large neural network (“transformer” architecture) �as generative model�
  • Extremely successful! (clearly!)

But: still largely based on patterns of language

29 of 48

30 of 48

31 of 48

Why is this wrong?

32 of 48

Why is this wrong?

“Given that a ball is red, is it more likely in the left jar or in the right jar?”

33 of 48

Why is this wrong?

“Given the person is republican, are they more likely from California or Louisiana?”

34 of 48

Consequences of Language Models

The association between tokens “Lousiana” and “Republican”

is very strong!!

And “California” to “Democrat” is very strong!

  • LLM answers with Republican = Louisiana

But: This is also a counterintuitive question that people get wrong ☺�And: “Reasoning” / “Thinking” models get the question right

35 of 48

Consequences of Language Models

Be aware of these when using LLMs!

36 of 48

Life Lessons from Data Analysis…

37 of 48

Life Lessons from Data Analysis…

38 of 48

Life Lessons from Data Analysis…

  • Eats healthy
  • Exercises
  • Non smoker
  • Lives to be 100 years old

39 of 48

Life Lessons from Data Analysis…

  • Goes to class
  • Does the homework
  • Studies
  • Gets an A in the class

40 of 48

Life Lessons from Data Analysis…

  • Goes to class
  • Does the homework
  • Studies
  • Gets an A in the class

41 of 48

Life Lessons from Data Analysis…

  • Goes to class
  • Does the homework
  • Studies
  • Gets an A in the class

42 of 48

Life Lessons from Data Analysis…

  • Goes to class
  • Does the homework
  • Studies
  • Gets an A in the class

43 of 48

Life Lessons from Data Analysis…

  • Goes to class
  • Does the homework
  • Studies
  • Gets an A in the class

44 of 48

Recent Research: Generative Models + Quantum

  • Quantum Annealing (very cool) ��
  • + generative model fit for the hardware ��

45 of 48

Quantum Annealing

  • Looks really cool �
  • Solves a certain type of optimization problem����
  • In a nutshell: �Freeze some really small thingies to absolute zero with some magnets to solve Sudokus

46 of 48

Generative Model: Image Denoising

“Noise”: Pixels are damaged in an image

Generative Model:

  • Learn what numbers look like
  • Fix pixels that look wrong

47 of 48

Generative Model: Image Denoising

Determining which pixels are “wrong”:

  • Optimization problem for a quantum computer �(with the right generative model)

Catch:

Largest image a quantum computer can handle is…

12 x 12 pixels black and white!! ☹

48 of 48

Generative Model: Image Denoising

Lots of interesting things still being developed!