GenAI, quantum computing, and the power of convergence
DECEMBER 2023
1
Topics for today's session
The roadblocks preventing it from scaling
1
3
2
2
The power of generative AI (I/II)
Simulated exams | Score | Percentile |
Bar Exam (MBE+MEE+MPT)1 | 298/400 | 90th |
LSAT | 163/180 | 88th |
SAT Reading & Writing | 710/800 | 93rd |
SAT Math | 700/800 | 89th |
GRE Quant | 163/170 | 80th |
GRE Verbal | 169/170 | 99th |
GRE Writing | 4/6 | 54th |
USABO Semifinal Exam 2020 | 87/150 | 99-100th |
AP Art History | 5/5 | 86th - 100th |
AP Biology | 5/5 | 85th-100th |
AP Calculus BC | 4/5 | 43rd – 59th |
The value of GenAI
Generative AI is the most viral technology of all time…
… & the first to generalize human-level intelligence
0
25
50
75
100
125
150
0
# of days
~75 days
ChatGPT
~5 days
Spotify
~150 days
# Users
GPT-4 Test Results
3
The power of generative AI (II/II)
What mood does the attached work of art convey? Why? What aesthetic choices does the artist make that contribute to the mood?
ChatGPT response
The value of GenAI
4
GenAI today primarily used for enterprise operations
Source: June 2023 AI Adoption survey, N=125; BCG analysis
Value creation with GenAI today
FMs can be used to automate call centers, but we are going one step further, creating predicative call and text functionality that allows us to know ahead of time why customers are calling
We are using FMs in drug discovery to assist with gene sequencing and small molecule simulation to speed up pre-clinical studies and selection of leads for clinical studies
Operational efficiency use cases
(e.g. extracting clauses from legal documents)
Product & CX enhancement
(e.g. financial robo-advisor)
New products and business models
(e.g. GenAI drug discovery)
% of value creation by use case type
The value of GenAI
We are enhancing our chatbots with GPT trained on 30 years of documentation…it's like having our CSO sitting next to you when you're on a phone with a client.
$100B+ market size for tech providers by 2027
5
5 GenAI capabilities that drive value for enterprises
Tech. capabilities
Description
Illustrative use cases
Note: FM = foundation model
1. GenAI transformations can leverage multiple tech. capabilities (e.g., ChatGPT leverages content generation & creativity)
UNDERSTAND
language / image
EXTRACT
knowledge
SUMMARIZE
& transcribe
GENERATE
content
ANALYZE
data & text
The value of GenAI
6
Model performance scales linearly with model size…
Expert systems
175 B
10k
0
10 M
GPT-3
1.7 T
GPT-41
…
Ability to complete multiple tasks on text format
Ability to reason and control other systems
Ability to complete multiple tasks on multiple formats (texts, image, audio, video)
…but what made it possible?
The value of GenAI
7
The rise of GenAI is fueled by three primary factors
1
2
3
The value of GenAI
8
Algorithm advances | Transformer models
Transformer Model Overview |
|
First described in 2017 by Google researchers in a paper titled "Attention is All You Need"
Transformers are a neural network that can apply mathematical techniques called "attention" to detect how data may be related to each other
Transformers are now the dominant models used to "pre-train" foundation models
Source: Attention is all you need, Proceeding of NIPS (2017), pp. 5998-6008; https://blogs.nvidia.com/blog/2022/03/25/what-is-a-transformer-model/
Before transformers, neural networks were trained on data labelled by humans (e.g., this image is a cat), which is expensive and a huge limitation to training data availability.
With "attention" mechanisms, transformers can tease out subtle ways different parts of the data are related through self-learning (e.g., "cats" are related to "pets")
Using both vast amounts of training data, and learning around context, transformers now had generative ability.
The value of GenAI
9
Digital data growth | 10 to 120 exabytes in 10 yrs�80% is unstructured, requiring "attention"
Requires “attention mechanism” to avoid human labeling bottleneck
80%+
of digitized data
Unstructured�no data model
Structured�well-defined, easily-organized database information
The value of GenAI
10
Compute | Exponential increase via Moore's Law
Larger models perform better than smaller, pushing practitioners to build larger models…
Human ability to detect if AI generated news article
Error
Model parameters
100M
10B
100B
1B
50% line of random chance = human cannot distinguish AI generated news articles from human generated ones
Source: Brown Tom B. et al., 2020. Language models are few-shot learners; "In-Datacenter Performance Analysis of a Tensor Processing Unit", Google
Moore's law provides exponential advances in compute power
The value of GenAI
11
Each enabler also a roadblock for scaling GenAI
Roadblock
Evidence
We are running out of training data to feed the models
Roadblocks for GenAI
12
Key roadblocks | Will we run out of data?
…cutting error rates in half
(Tokens from the internet can increase from ~1012 to ~1014 and reduce model error from ~20% to ~10%)
Source | Tokens in�current models | Tokens accessible�in principle |
Internet | ~1012 | 5 x 1014 |
Books | 5 x 1011 | 1013 |
Wikipedia (English) | 6.5 x 109 | 6.5 x 109 |
Wikipedia (All) | 2.5 x 1010 | 3.9 x 1010 |
Scientific papers | 2.7 x 1010 | 1.5 x 1012 |
Few | 2.5 x 1013 | |
Text Messages | 0 | 1012/year |
Youtube | 0 | 4 x 1012 |
80%+ of model improvement due to amount of data vs. model design
Available internet training data can grow another 2 orders of magnitude …
Source: dynomight.net, BCG analysis
Roadblocks for GenAI
13
Key roadblocks | Will we run out of compute?
Meta RSC AI supercluster
6,080 Nvidia's latest A100 GPUs
~1.9 exaflop/s (1018 float-point operations per second)
~15 hours
~60 days
~16 YEARS
(or 600,000 GPUs)
* GPT-4 uses ~100X computing power in training vs. GPT-3
Roadblocks for GenAI
14
Convergence possibilities for Quantum & GenAI
Quantum Computer x GenAI to work side by side
Quantum Computing to solve challenges in scaling GenAI
GenAI accelerating progress in Quantum Computing
Convergence potential
A
C
B
15
Convergence | QC to resolve GenAI roadblocks
Roadblock
Use case
We are running out of training data to feed the models
Example research in the field
A
Zapata researchers explored substituting parts of MolGAN with variational quantum circuits (VQCs), resulting in quantum GANs surpassing classical GANs in performance
Convergence potential
16
Convergence | GenAI to accelerate QC timeline
Potential to impact fundamental QC algo research?
qecGPT Project1: A framework called qecGPT has been proposed for decoding quantum error-correcting codes using generative modeling. This model employs autoregressive neural networks, specifically Transformers, to learn the joint probability of logical operators and syndromes
B
Source: 1. arvix.org H Cao, F Pan, Y Wang, P Zhang 2. GitHub, 3. Electronic Engineering Journal, BCG analysis
GenAI massively impacting classical programming today
Benefit
E.g. Copilot, Codey
E.g. Tabine, PyCharm, Visual Studio Code
E.g. TensorFlow
Convergence potential
17
Registered chemicals
275,000,000
Entire chemical space - # of compounds:
1,000,000,000,000,000,000,000,000,000,000,000,000,000,000,000,000,000,000,000,000,000,000 (1060)
Approved medicines
Today ~10,000
Ultralarge chemistry databases
100,000,000,000,000,000,000,000,000 (1026)
Illustrative drug discovery workflow
Completely inaccessible today…
Not even theoretically
C
Convergence potential
18
Registered chemicals
275,000,000
Entire chemical space - # of compounds:
1,000,000,000,000,000,000,000,000,000,000,000,000,000,000,000,000,000,000,000,000,000,000 (1060)
Approved medicines
Today ~10,000
Ultralarge chemistry databases
100,000,000,000,000,000,000,000,000 (1026)
GenAI to generate potential chemical formulas that are likely synthesizable with desired properties
Quantum computer to verify exact properties with quantum simulation on atomic scale and find drug candidates
GenAI generated candidates
Tomorrow could be millions times more efficient
C
Convergence potential
Illustrative drug discovery workflow
19
Our goal is to compress the next 250 years of chemistry and materials science progress into the next 25
Convergence potential
C
20
Traditional computers had many applications prior to the development of error correction
1937 | Atanasoff-Berry Computer solves systems of linear equations for astronomy research
1941 | British Bombe deciphers German Enigma codes
1944 | IBM Harvard Mark I simulates atomic reactions for Manhattan Project
1945 | ENIAC calculates artillery firing tables for the US Army
1950 | Hamming "error correction" codes are introduced
Why collaborate now
21
Government funding, and public-private collaboration around core tech critical ingredients
Research
Technology
Investment
Users
Research
Technology
Investment
Users
Why collaborate now
1951
1952
1953
1954
1955
1955
1956
1956
1953
IBM Harvard Mark I (1939-1944)
1941
1943
1943
UNIVAC I (1950-1956)
1952
Source: 1. US Census Bureau
22
We are in early stages today but components in place
Research
Technology
Investment
Users
We must unify these efforts to scale effectively
In April 2023, IBM and Moderna launch partnership to explore the use of quantum computing and GenAI to advance and accelerate mRNA research & science.
Moderna kicks off the build of a quantum- and GenAI-ready workforce, while IBM explores how quantum technology can apply to mRNA research.4
Source: 1. J. Chem. Inf. Model. 2023, 63, 11, 3307–3318, 2. arvix.org H Cao, F Pan, Y Wang, P Zhang, 3. Global Quantum Intelligence, 4. IBM newsroom
May 2023: Fundamental research into GenAI and quantum computing convergence1
July 2023: Researchers proposed a framework called qecGPT for decoding quantum error-correcting codes using generative modeling2
Governments have pledged $55B+ to quantum computing to date and over $200B by 2030.3
Why collaborate now
23
Key takeaways
GenAI and Quantum Computing independently have transformative value creation potential
Convergence opportunities include both (a) roadmap acceleration with QC advancing GenAI and vice versa, and (b) workflow transformation where they create exponential benefits working side-by-side
But the expertise and resources required for technology convergence on this scale overwhelms single-company efforts and even small partnerships
What it requires is concerted, integrated and multi-disciplinary consortium-led efforts to define goals, guide research and deliver results
24
Thank you
25