1 of 37

ChatGPT, eará AI, ja sámegiella

Professor Lars Ailo Bongo�Informatihka instituhtta, �UiT Norgga árktalaš universitehta

Netsam. 14.09.23. Áltá

DALL-E: “a bright future in sapmi science fiction style”

2 of 37

Eará AI?

3 of 37

4 of 37

5 of 37

Ipmirda go ChatGPT �sámegiela?

6 of 37

Stable Diffusion: “sami reindeer herders in traditional clothing”

DALL-E: “sami reindeer herders in traditional clothing”

7 of 37

Nugo láve?

Stable Diffusion: “same old story depressive style”

8 of 37

Outline

  1. Technical background
  2. Issues with state-of-the-art
  3. 💡Sami-AI💡

DALL-E: “a bright future in sapmi science fiction style”

9 of 37

Rule-based coding (Python)

From: https://github.com/uit-hdl/HEImmune

10 of 37

…with the help of ChatGPT

11 of 37

Rule-based visual coding (Scratch)

Source: https://medium.com/scratchteam-blog/3-things-to-know-about-scratch-3-0-18ee2f564278

12 of 37

AI buzzwords

https://medium.com/swlh/artificial-intelligence-machine-learning-and-deep-learning-whats-the-real-difference-94fe7e528097

Source: https://www.spiceworks.com/tech/big-data/articles/what-is-support-vector-machine/

Source: https://www.javatpoint.com/deep-learning

13 of 37

Big data

  • big computer

Source: The Economist

(ChatGPT = 100,000,000,000,000,000,000,000,000 FLOPS)

14 of 37

Training a model

Label = Zebra

Label = antelope

Label = zebra

Repeat a few million times

15 of 37

Using a model (inference)

Label = ?

Label = zebra

16 of 37

Fine tuning a model (transfer learning)

label = gabba

label = muzet

Repeat a few hundred times

17 of 37

💡Boazu �identification💡

18 of 37

From cats and dogs to abnormal lung sounds

19 of 37

From master’s thesis table to medical device

Conflict of interest: LAB own shares in Medsensio.

20 of 37

Gos leat sámi govat?

21 of 37

💡DataverseSME💡

label = gabba

Mas leat govvat!

Ja chat árpput!

22 of 37

23 of 37

Diffusion models

Source: https://stable-diffusion-art.com/how-stable-diffusion-work/

24 of 37

💡Fine-tuned sámi image generator model💡

25 of 37

💡Ii leat vattis!💡

26 of 37

27 of 37

Reinforcement learning

28 of 37

ChatGPT

Source: https://openai.com/blog/chatgpt

29 of 37

LLMs (ChatGPT) is the fastest evolving technology on earth!

30 of 37

💡Small is interesting (and beautiful)💡

31 of 37

$10M vs $100 language models

32 of 37

33 of 37

On the Dangers of Stochastic Parrots: Can Language Models Be Too Big? 🦜

Unfathomable training data:

  1. Size doesn’t guarantee diversity
  2. Static data/ changing social views
  3. Encoding bias

34 of 37

💡“take inspiration from movements to decolonize education”💡

35 of 37

Ernie Bot (Baidu)

36 of 37

ChatGPT @ UiT

37 of 37

Oktigeassu

  • ChatGPT ja Stable Diffusion
    • Mii ja movt
    • Manin eai doaimma (vuos) sámegilli
  • Sámi AI
    • Ráhkadit govaid: hui álki
    • AI modelat: veháš váttis
    • Giella: hui váttis
  • Lea dárbu leat mielde AI ráhkadeames!