State of AI Report
October 12, 2021
#stateofai
stateof.ai
Ian Hogarth
Nathan Benaich
About the authors
Nathan is the General Partner of Air Street Capital, a venture capital firm investing in AI-first technology and life science companies. He founded RAAIS and London.AI (AI community for industry and research), the RAAIS Foundation (funding open-source AI projects), and Spinout.fyi (improving university spinout creation). He studied biology at Williams College and earned a PhD from Cambridge in cancer research.
Nathan Benaich
Ian Hogarth
Ian is an angel investor in 100+ start-ups. He is a Visiting Professor at UCL working with Professor Mariana Mazzucato. Ian was co-founder and CEO of Songkick, the concert service. He studied engineering at Cambridge where his Masters project was a computer vision system to classify breast cancer biopsy images. He is the Chair of Phasecraft, a quantum software company.
stateof.ai 2021
Introduction | Research | Talent | Industry | Politics | Predictions
#stateofai | 2
Artificial intelligence (AI) is a multidisciplinary field of science and engineering whose goal is to create intelligent machines.
We believe that AI will be a force multiplier on technological progress in our increasingly digital, data-driven world. This is because everything around us today, ranging from culture to consumer products, is a product of intelligence.
The State of AI Report is now in its fourth year. Consider this Report as a compilation of the most interesting things we’ve seen with a goal of triggering an informed conversation about the state of AI and its implication for the future.
We consider the following key dimensions in our report:
Collaboratively produced by Ian Hogarth (@soundboy) and Nathan Benaich (@nathanbenaich).
stateof.ai 2021
#stateofai | 3
Introduction | Research | Talent | Industry | Politics | Predictions
stateof.ai 2021
Reviewers
Markus Anderljung, Ali Eslami,
Rob Ferguson, Yanping Huang,
Chip Huyen, Andrej Karpathy,
Allie Miller, Moritz Mueller-Freitag, Torsten Reil, Sebastian Ruder,
Shubho Sengupta, Jaime Teevan,
Nu (Claire) Wang, and Diane Wu.
Thank you!
Othmane Sebbouh
Research Assistant
Othmane is a PhD student in ML at ENS Paris, CREST-ENSAE and CNRS. He holds an MsC in management from ESSEC Business School and a Master in Applied Mathematics from ENSAE and Ecole Polytechnique.
#stateofai | 4
Introduction | Research | Talent | Industry | Politics | Predictions
Contributors
Artificial intelligence (AI): A broad discipline with the goal of creating intelligent machines, as opposed to the natural intelligence that is demonstrated by humans and animals. It has become a somewhat catch all term that nonetheless captures the long term ambition of the field to build machines that emulate and then exceed the full range of human cognition.
Machine learning (ML): A subset of AI that often uses statistical techniques to give machines the ability to "learn" from data without being explicitly given the instructions for how to do so. This process is known as “training” a “model” using a learning “algorithm” that progressively improves model performance on a specific task.
Reinforcement learning (RL): An area of ML concerned with developing software agents that learn goal-oriented behavior by trial and error in an environment that provides rewards or penalties in response to the agent’s actions (called a “policy”) towards achieving that goal.
Deep learning (DL): An area of ML that attempts to mimic the activity in layers of neurons in the brain to learn how to recognise complex patterns in data. The “deep” in deep learning refers to the large number of layers of neurons in contemporary ML models that help to learn rich representations of data to achieve better performance gains.
Definitions
stateof.ai 2021
#stateofai | 5
Introduction | Research | Talent | Industry | Politics | Predictions
Algorithm: An unambiguous specification of how to solve a particular problem.
Model: Once a ML algorithm has been trained on data, the output of the process is known as the model. This can then be used to make predictions.
Supervised learning: A model attempts to learn to transform one kind of data into another kind of data using labelled examples. This is the most common kind of ML algorithm today.
Unsupervised learning: A model attempts to learn a dataset's structure, often seeking to identify latent groupings in the data without any explicit labels. The output of unsupervised learning often makes for inputs to a supervised learning algorithm at a later point.
Transfer learning: An approach to modelling that uses knowledge gained in one problem to bootstrap a different or related problem, thereby reducing the need for significant additional training data and/or boosting performance.
Natural language processing (NLP): Enabling machines to analyse, understand and manipulate human language.
Computer vision: Enabling machines to analyse, understand and manipulate images and video.
Definitions
stateof.ai 2021
#stateofai | 6
Introduction | Research | Talent | Industry | Politics | Predictions
Research
Talent
Industry
Politics
Executive Summary
stateof.ai 2021
#stateofai | 7
Introduction | Research | Talent | Industry | Politics | Predictions
Scorecard: Reviewing our predictions from 2020
stateof.ai 2021
#stateofai | 8
Introduction | Research | Talent | Industry | Politics | Predictions
Our 2020 Prediction | Grade | Evidence |
The first 10 trillion parameter dense model. | Yes | Microsoft demonstrated that it can train models with up to 32 trillion parameters. But it is unclear if these can learn better representations than existing large models. |
Attention-based neural networks achieve state of the art result in computer vision. | Yes | Vision Transformers are #1 on ImageNet. |
A major corporate AI lab shuts down as its parent company changes strategy. | Sort of | Alibaba’s AI lab fizzles out as part of an internal restructuring. |
Chinese and European defense-focused AI startups collectively raise over $100M in the next 12 months. | No | Funding did not reach this level, yet. |
One of the leading AI-first drug discovery startups either IPOs or is acquired for >$1B. | Yes | NASDAQ IPOs: Recursion on April 16, 2021 and Exscientia on October 1, 2021. |
DeepMind makes a major breakthrough in structural biology and drug discovery beyond AlphaFold. | Yes | DeepMind released AlphaFold 2. |
Facebook makes a major breakthrough in AR/VR with 3D computer vision. | No | Nothing major in 3D computer vision. |
NVIDIA does not end up completing its acquisition of Arm. | Yes | The acquisition has not completed by its deadline and is under active investigation. |
stateof.ai 2021
#stateofai | 9
Introduction | Research | Talent | Industry | Politics | Predictions
Section 1: Research
stateof.ai 2021
#stateofai | 10
Introduction | Research | Talent | Industry | Politics | Predictions
In our 2020 Report, we predicted: “Attention-based neural networks move from NLP to computer vision in
achieving state of the art results.”
2020 Prediction: Vision Transformers
stateof.ai 2021
#stateofai | 11
Introduction | Research | Talent | Industry | Politics | Predictions
stateof.ai 2021
Facebook AI introduces SEER, a 1.3B parameter self-supervised model pre-trained on 1B Instagram images that achieves 84.2% top-1 accuracy on ImageNet, comfortably surpassing all existing self-supervised models.
Self-supervision is taking over computer vision
#stateofai | 12
Introduction | Research | Talent | Industry | Politics | Predictions
Researchers compare a self-supervised ViT (SSViT) to fully supervised ViTs and convnets, and find that SSViTs learn more powerful representations.
What do self-supervised Vision Transformers see in an image that other models don’t?
stateof.ai 2021
#stateofai | 13
Introduction | Research | Talent | Industry | Politics | Predictions
Self-attention is the basic building block of SOTA models on speech recognition...
Transformers take over other major AI applications, e.g. audio and 3D point clouds
stateof.ai 2021
… and on 3D point cloud classification.
#stateofai | 14
Introduction | Research | Talent | Industry | Politics | Predictions
DeepMind’s Perceiver is one such architecture. It solves the Transformers’ quadratic dependence on the input
length by computing attention between the input and a low-dimensional learnable vector, rather than between
the input and itself.
Transformers extend into efficient self-attention-based architectures
stateof.ai 2021
#stateofai | 15
Introduction | Research | Talent | Industry | Politics | Predictions
stateof.ai 2021
Researchers from UC Berkeley, Facebook AI and Google show that you don’t need to fine-tune the core parameters of a language pre-trained Transformer in order to obtain very strong performance on a different task.
More evidence for the general purpose nature of Transformers
#stateofai | 16
Introduction | Research | Talent | Industry | Politics | Predictions
stateof.ai 2021
While pre-trained transformers have taken the ML world by storm, new research shows that convolutional neural networks (CNNs) and multi-layered perceptrons (MLPs) shouldn’t be an afterthought. When trained properly, they are competitive with transformers on several NLP and computer vision tasks.
Beyond transformers: MLPs and CNNs make a comeback
#stateofai | 17
Introduction | Research | Talent | Industry | Politics | Predictions
stateof.ai 2021
Neural Radiance Fields (NeRF) already achieves SOTA results on view synthesis. New applications further highlight how impressive it is.
Remarkable progress in Novel View Synthesis
Rotating the blue object.
Adding more objects.
360º car rotation.
#stateofai | 18
Introduction | Research | Talent | Industry | Politics | Predictions
In our 2020 Report we predicted: “DeepMind makes a major breakthrough in structural biology and drug discovery beyond AlphaFold.”
2020 Prediction: AlphaFold 2
stateof.ai 2021
#stateofai | 19
Introduction | Research | Talent | Industry | Politics | Predictions
Half a year after DeepMind presented their AlphaFold 2 (AF2) method at the CASP14 conference, the Baker lab at the University of Washington created their own protein structure prediction system using related ideas and managed to attain accuracies approaching the original AF2 without detailed access to its methodology.
The ideas behind AlphaFold 2 rapidly diffused into academia and open source
stateof.ai 2021
#stateofai | 20
Introduction | Research | Talent | Industry | Politics | Predictions
Proteins found in nature today are the product of evolution. But what if AI could generate artificial proteins with useful functionality beyond what evolution has designed?
Large language models can generate functional proteins that are unseen in nature
stateof.ai 2021
#stateofai | 21
Introduction | Research | Talent | Industry | Politics | Predictions
stateof.ai 2021
Language models trained on viral sequences can predict mutations that preserve infectivity but induce high antigenic change, akin to preserving “grammaticality” but inducing high “semantic change”.
Learning the language of Covid-19 to predict its evolution and escape mutants
#stateofai | 22
Introduction | Research | Talent | Industry | Politics | Predictions
Single-stranded RNAs (e.g mRNAs) fold into well-defined 3D structures to effect their biological function. Unlike proteins, we know little about RNA folding and the number of available RNA structures is 1% of that for proteins.
New state-of-the-art for predicting the 3D structure of RNA molecules
stateof.ai 2021
#stateofai | 23
Introduction | Research | Talent | Industry | Politics | Predictions
Cryogenic electron microscopy (cryo-EM) empirically determines the structure of macromolecules at near atomic-resolution without the need for their crystallisation. Cryo-EM involves shooting electron beams at a flash-frozen sample of protein or molecule of interest. The microscope generates images of these molecules that are then combined to reconstruct its 3D structure. All stages of the cryo-EM workflow are amenable to AI, ranging from specimen preparation and data collection to structure determination and atomic interpretation.
Cryo-EM and AI: the next frontier in structural biology and drug discovery
stateof.ai 2021
#stateofai | 24
Introduction | Research | Talent | Industry | Politics | Predictions
stateof.ai 2021
Combination therapy could improve cancer patient outcomes, but empirically testing a large number of them is unfeasible in the lab setting. Here, self-supervision is used to observe cells treated with a finite number of drug combinations and to predict the effect of unseen combinations.
Predicting and prioritising novel drug combinations, dosages, and timing for therapy
#stateofai | 25
Introduction | Research | Talent | Industry | Politics | Predictions
stateof.ai 2021
Deep learning models can learn drug-protein binding relationships from a small number of empirical experiments in order to help prioritise which areas of vast chemical spaces to virtually screen.
Accelerating high-throughput virtual drug screening with model-guided search
#stateofai | 26
Introduction | Research | Talent | Industry | Politics | Predictions
stateof.ai 2021
The yield of a chemical reaction describes the percentage of reactants that are transformed into the desired product and is a key metric for reaction performance. Predicting reaction yields helps chemists to navigate chemical reaction space and design more sustainable, economical and effective synthesis plans.
Predicting chemical reaction performance using Transformers
#stateofai | 27
Introduction | Research | Talent | Industry | Politics | Predictions
stateof.ai 2021
MuZero is the latest member of DeepMind’s “Zero” family. It matches AlphaZero’s performance on Go, chess and Shogi, and outperforms all existing models on the Atari benchmark while learning solely within a world model. Muzero appeared in Nature in December 2020.
Games continue to drive Reinforcement Learning research
#stateofai | 28
Introduction | Research | Talent | Industry | Politics | Predictions
stateof.ai 2021
DreamerV2 is the first model-based RL agent trained on a single GPU to surpass human level performance on 55 popular tasks of the Atari benchmark. The agent learns behaviors purely within the latent space of a world model trained from pixels, which makes these behaviors more generalisable to solving future tasks more efficiently.
Superhuman world models for Atari, but on a budget
#stateofai | 29
Introduction | Research | Talent | Industry | Politics | Predictions
stateof.ai 2021
RL agents have shown impressive performance on challenging individual tasks. But can they generalize to tasks they never trained on? DeepMind trained RL agents on 3.4M tasks across a diverse set of 700k games in a 3D simulated environment, and show they can generalize to radically different games without additional training.
Zero-shot generalisation in reinforcement learning
Figure: Examples of XLand environments.
Figure: Test metrics progress during training.
#stateofai | 30
Introduction | Research | Talent | Industry | Politics | Predictions
stateof.ai 2021
Soon after AlphaGo was published in 2016, a software implementation called Leela was made available. To assess its impact on the performance of Go players, researchers studied 750K Go moves from 1,200+ players between 2015 and 2019. They show that the advent of Leela coincided with a significant improvement in move quality.
Trained by AI: AlphaGo coaches professional Go players
All professional players
Old vs. Young
China vs. Japan vs. Korea
#stateofai | 31
Introduction | Research | Talent | Industry | Politics | Predictions
stateof.ai 2021
The increasing complexity of RL benchmarks and the computational power required to solve them have led researchers to evaluate their models using fewer and fewer runs. Yet, most still report only point estimates, like median scores. The result is a very noisy picture of the performance rankings of SOTA RL models.
Researchers call for more rigorous use of statistics in Reinforcement Learning
#stateofai | 32
Introduction | Research | Talent | Industry | Politics | Predictions
stateof.ai 2021
To solve video-and-language (V&L) tasks like video captioning, ClipBERT only uses a few sparsely sampled short clips. It still outperforms existing methods that exploit full-length videos.
Less is more: watching a few clips is enough to learn how to caption a video
#stateofai | 33
Introduction | Research | Talent | Industry | Politics | Predictions
stateof.ai 2021
Google researchers tackle the high-resource language degradation problem by increasing model capacity.
For large-scale multilingual speech recognition too, the bigger the better
#stateofai | 34
Introduction | Research | Talent | Industry | Politics | Predictions
stateof.ai 2021
Speech generation usually requires training an Automatic Speech Recognition (ASR) system, which is resource-intensive and error-prone. Researchers introduce Generative Spoken Language Modeling (GSLM), the task of learning speech representations directly from raw audio without any labels or text.
Beyond ASR for speech generation: textless NLP
Some examples here: https://speechbot.github.io/pgslm/
#stateofai | 35
Introduction | Research | Talent | Industry | Politics | Predictions
stateof.ai 2021
Diffusion models’ training is more stable than GAN’s and outperforms them on several well-established datasets in image generation, audio synthesis, shape generation and music generation.
GANs have a serious new adversary: diffusion models
#stateofai | 36
Introduction | Research | Talent | Industry | Politics | Predictions
The canonical approach to applying deep computer vision to medical images is fine-tuning ImageNet pre-trained models or using rule-based label extraction from medical textual reports. In contrast, the ConVIRT method pre-trains directly on naturally occurring image-text pairs using a contrastive objective, without any supervision. ConVIRT outperforms all ImageNet-initialized models with only 10% as much labeled training data.
stateof.ai 2021
Learning medical image representations from text-image pairings
#stateofai | 37
Introduction | Research | Talent | Industry | Politics | Predictions
stateof.ai 2021
OpenAI’s CLIP uses 400M text-image pairs to learn image and text representations. It exhibits a solid performance across a wide variety of datasets without any fine-tuning.
Multimodal self-supervision plus scale equals a powerful representer
#stateofai | 38
Introduction | Research | Talent | Industry | Politics | Predictions
stateof.ai 2021
OpenAI’s DALL-E treats text-image pairs as a generative task and thus learns to generate believable images for a wide array of natural language prompts.
DALL-E draws what you want, but be sure to instruct it well
#stateofai | 39
Introduction | Research | Talent | Industry | Politics | Predictions
stateof.ai 2021
CLIP is already serving as a base model for downstream tasks: Researchers from Google use its zero-shot capabilities together with Mask R-CNN to create a zero-shot learning model (VLiD) that surpasses supervised models on zero-shot object detection.
Using CLIP’s learned representations for zero-shot object detection
#stateofai | 40
Introduction | Research | Talent | Industry | Politics | Predictions
stateof.ai 2021
OpenAI’s Codex system is a specialised offspring of GPT-3 that is focused on translating natural language into functional computer code in a dozen programming languages.
Codex for coders
User Instructions
Code generated by Codex
Outputs generated by Codex
#stateofai | 41
Introduction | Research | Talent | Industry | Politics | Predictions
Code generation models can generate snippets of code, but they struggle to generate entire programs.
Yet, code generation models still cannot crack the coding interview
stateof.ai 2021
#stateofai | 42
Introduction | Research | Talent | Industry | Politics | Predictions
Models do poorly on competition mathematics problems that test for reasoning and problem solving ability.
And don’t expect language models to help you with your math tests either
stateof.ai 2021
#stateofai | 43
Introduction | Research | Talent | Industry | Politics | Predictions
Researchers tested large language models on TruthfulQA, a new benchmark of questions spanning domains such as health, law, conspiracies and fiction. They showed that the best model was truthful on 58% of the questions, compared to the human baseline of 94%. More surprisingly, models of larger sizes were generally less truthful.
Big fat liars: large language models are less truthful than their smaller peers
stateof.ai 2021
Figure 2: Average truthfulness on control trivia questions
Figure 1 : Average truthfulness on Truthful QA
#stateofai | 44
Introduction | Research | Talent | Industry | Politics | Predictions
stateof.ai 2021
Researchers at CMU surveyed more than 60 papers to make sense of the ongoing progress in prompting research in NLP. They thoroughly document the shift from the “pre-train, fine-tune” procedure to the “pre-train, prompt and predict” one, which is especially relevant for zero-shot learning.
Pre-train, prompt, predict: a new paradigm for NLP models
#stateofai | 45
Introduction | Research | Talent | Industry | Politics | Predictions
stateof.ai 2021
Prompting has been shown to be one of the critical parts of zero/few-shot learning in NLP. As zero shot methods become more ubiquitous, effective problem framing through prompts becomes more relevant.
Prompting is key to zero-shot learning
From the ML at Berkeley blog: “Unreal Engine is a popular 3D video game engine created by Epic Games. CLIP likely saw lots of images from video games that were tagged with the caption “rendered in Unreal Engine”. So by adding this to our prompt, we’re effectively incentivizing the model to replicate the look of those Unreal Engine images.”
#stateofai | 46
Introduction | Research | Talent | Industry | Politics | Predictions
stateof.ai 2021
Choosing a bad prompt can result in massive performance degradations in NLP tasks. Users can avoid this choice altogether via prompt learning, where prompts are formulated as learnable vectors.
But prompting is also challenging and brittle
#stateofai | 47
Introduction | Research | Talent | Industry | Politics | Predictions
stateof.ai 2021
3 different teams from Baidu, Google and Microsoft all surpass human baselines on the SuperGLUE NLP tasks.
One year after General Language Understanding Evaluation (GLUE), SuperGLUE is solved
#stateofai | 48
Introduction | Research | Talent | Industry | Politics | Predictions
stateof.ai 2021
M6 is a 100B parameter model pre-trained on the largest dataset in Chinese for NLP and multimodal tasks.
CLIP, but now in Chinese
#stateofai | 49
Introduction | Research | Talent | Industry | Politics | Predictions
stateof.ai 2021
After the success of the (English pre-trained) GPT-3, large language models in multiple languages are emerging from private and public companies, academic research labs, and independent open-source initiatives.
The “democratization” of large language models
#stateofai | 50
Introduction | Research | Talent | Industry | Politics | Predictions
stateof.ai 2021
Researchers show that human evaluators are often in disagreement on Natural Language Generation (NLG) tasks. This calls into question the idea of beating current human baselines as the gold standard for NLP tasks.
New study suggests human evaluation should be re-evaluated
#stateofai | 51
Introduction | Research | Talent | Industry | Politics | Predictions
stateof.ai 2021
56 studies were published between 2015-19 that reported the training of a deep learning algorithm on at least one geographically identifiable patient cohort to perform an image-based diagnostic task vs. a human physician across 6 clinical disciplines. Of these studies, 71% used a patient cohort from one of three states: California, Massachusetts or New York. Thirty four states did not contribute data, point to huge patient underrepresentation.
Data deserts in biomedical AI research are likely to result in model bias in the clinic
#stateofai | 52
Introduction | Research | Talent | Industry | Politics | Predictions
stateof.ai 2021
Missing information and biases in demographic information are widespread in biomedical data that form the basis of the drug discovery process. ML solutions trained on these data need to understand and adapt for these biases to avoid perpetuating health inequities.
Measuring bias: a first step towards more inclusive health research outcomes
There is a significant lack of African American / Afro-Caribbean samples in older ages
Most samples are derived from Europeans
Samples from different ethnicities display different sex disparities
#stateofai | 53
Introduction | Research | Talent | Industry | Politics | Predictions
stateof.ai 2021
The UK National Screening Committee commissioned an investigation of the accuracy of AI systems for detecting breast cancer during routine screening. It found that studies published in the last ten years were of poor methodological quality and none were prospective studies that measured the accuracy in screening practice.
Beware of overstated claims: 94% of AI systems for breast cancer screening are less accurate than the original radiologist
#stateofai | 54
Introduction | Research | Talent | Industry | Politics | Predictions
stateof.ai 2021
There is a conundrum in medical imaging AI: While computer vision models trained on a patient’s medical imaging data of various modalities can accurately and trivially predict their race, clinicians attempting to do the same cannot. This implies that medical AI systems can potentially cause discriminatory harm and reproduce or exacerbate the racial disparities that already exist in medical practice.
Medical AI racism: models reliably identify the self-reported racial identity of patients
#stateofai | 55
Introduction | Research | Talent | Industry | Politics | Predictions
stateof.ai 2021
Last year’s Report drew attention to the lack of openness of AI research as measured by the percentage of arXiv papers that share the code required to reproduce their results. Methodology improvements from the Papers With Code project that make the openness metric more ML specific have resulted in an increase from 15% in last year’s Report to 26% today. However, when analysing the authors of the “hottest papers” in the last 30 days*, we find that only 17% shared a code repository. This might suggest that some authors do not prioritise its timely release.
26% of AI research papers make their code available and 60% make use of PyTorch
*Top socially shared papers on Twitter for 30 days until 8 September 2021
#stateofai | 56
Introduction | Research | Talent | Industry | Politics | Predictions
stateof.ai 2021
Google researchers define Data cascades as “compounding events causing negative, downstream effects from data issues”. Supported by a survey of 53 practitioners from the US, India, East and West African countries, they warn that current practices undervalue data quality and result in data cascades.
Data becomes more critical when the stakes are high
#stateofai | 57
Introduction | Research | Talent | Industry | Politics | Predictions
stateof.ai 2021
As Large Language Models (LLMs) become ever-more successful and ubiquitous, better documentation of the large training text corpora becomes critical. Researchers dissected C4, a 305 GB dataset that Google obtained by filtering a snapshot of Common Crawl. They found that the filtering disproportionately removed text about minority individuals.
Large language training datasets need better documentation
Figure: Proposed documentation methodology.
#stateofai | 58
Introduction | Research | Talent | Industry | Politics | Predictions
stateof.ai 2021
As data-hungry deep learning conquers more applications, better domain-specific datasets are needed. Legal NLP and malware exemplify this struggle as new pretraining datasets and benchmarks come to the rescue.
Better datasets for machine learning in production: legal documents and malware
#stateofai | 59
Introduction | Research | Talent | Industry | Politics | Predictions
Working with massive datasets is cumbersome and expensive. Carefully selecting examples mitigates the pain of big data by focusing resources on the most valuable examples, but classical methods often become intractable at-scale. Recent approaches address these computational costs, enabling data selection on modern datasets.
Careful data selection saves time and money by mitigating the pains of big data
stateof.ai 2021
#stateofai | 60
Introduction | Research | Talent | Industry | Politics | Predictions
stateof.ai 2021
With the explosion of papers submitted for consideration at major ML conference venues each year and the limited spots available, the ML community is calling attention to illicit collusion rings amongst reviewers.
Can you trust the quality of papers you read at academic conferences?
Credits: Michael Littman and Sergei Ivanov
#stateofai | 61
Introduction | Research | Talent | Industry | Politics | Predictions
stateof.ai 2021
Industry affiliated authors are less likely to provide access to their research code upon initial submission for conference review compared to academic affiliated authors. While industry authors enjoy a higher paper acceptance rate (right figure), academic authors release their code more frequently than industry authors, whether initially or once a paper is camera-ready (left figure).
Providing code alongside a research paper submission isn’t mandatory, but growing
#stateofai | 62
Introduction | Research | Talent | Industry | Politics | Predictions
stateof.ai 2021
Academic papers are disseminated via peer-reviewed journals and academic conferences. Today, researchers are creating Twitter threads and highly designed blog posts that resemble startup product launches to share and hype up their work.
The rise of research communications: Twitter threads drive citations 3-fold higher
#stateofai | 63
Introduction | Research | Talent | Industry | Politics | Predictions
stateof.ai 2021
GNN is the 4th most used keyword at ICLR’21 and the one with the largest increase in usage from 2019 to 2020.
Graph Neural Networks: From niche to one of the hottest fields of AI research
Credits: Xavier Bresson
#stateofai | 64
Introduction | Research | Talent | Industry | Politics | Predictions
stateof.ai 2021
Modeling physical systems dynamics often requires subdividing complex continuous spaces into simpler discrete cells, a process called mesh-generation. DeepMind researchers used GNNs to accelerate mesh-based simulations by 1 to 2 orders of magnitude compared to classical solvers.
Graph Neural Networks applications: mesh-based simulation
#stateofai | 65
Introduction | Research | Talent | Industry | Politics | Predictions
stateof.ai 2021
Accurately predicting the estimated time of arrival (ETA) for a given route requires a complex understanding of the spatiotemporal interactions taking place on the road. GNNs are well suited for this task because roads and their intersections naturally form a graph network. A GNN-based system reduced negative ETA outcomes between 16% and 51% around the world in live production.
Graph Neural Networks applications: improving ETA predictions in Google Maps
#stateofai | 66
Introduction | Research | Talent | Industry | Politics | Predictions
stateof.ai 2021
While very expressive and powerful, GNN model size doesn’t scale well alongside dataset size due to the complexity of modelling millions of nodes and billions of connections. This is problematic for real-world problems when deploying large GNNs for equally large graph datasets without sacrificing model parameters.
Graph Neural Networks: improving the memory and parameter efficiency of large models
Figure: RevGNNs outperform existing models with significantly less memory consumption.
#stateofai | 67
Introduction | Research | Talent | Industry | Politics | Predictions
stateof.ai 2021
By using a graph as the world model, the L3P agent is able to efficiently plan even over long time horizons.
Graphs for model-based reinforcement learning
Figure: Compared to existing methods, L3P has a smoother trajectory and doesn’t get stuck in search for the next goal.
#stateofai | 68
Introduction | Research | Talent | Industry | Politics | Predictions
stateof.ai 2021
Chinese industrial and academic labs win all 3 tasks of the Open Graph Benchmark Large Scale Challenge.
Chinese institutions also sweep a major Graph Neural Networks competition
#stateofai | 69
Introduction | Research | Talent | Industry | Politics | Predictions
stateof.ai 2021
Predicting rainfall at high-resolution with a short lead time (<2h, i.e. “nowcasting”) is important for businesses and people when making weather-dependent decisions. New deep generative model (DGM)-based methods bring added resolution and prediction accuracy beyond that of physics-based simulations and current ML methods.
Deep generative models offer highly accurate probabilistic predictions of precipitation
#stateofai | 70
Introduction | Research | Talent | Industry | Politics | Predictions
stateof.ai 2021
In this work, a digital biomarker is developed for idiopathic pulmonary fibrosis in mice. Diseased and healthy animals are treated with a drug and their behavior is continuously tracked and analysed using computer vision. Behavioral patterns are learned across animal studies and functionalized as digital biomarkers that relate to drug efficacy and adverse reactions as a study progresses. An example digital biomarker is breathing rate, which can map more directly to patient symptoms in a clinical study. This compares to traditional endpoints (e.g. lung histology) that can only be measured after the study.
Computer vision unlocks accurate and fast disease assessment using digital biomarkers for drug discovery
Continuous data capture (including video)
Digital Biomarkers
(including breathing rate)
Healthy
Disease
Disease + Treatment
Healthy + Treatment
Breathing Rate (AUC)
Digital Biomarkers detect disease and show drug efficacy without waiting for histology
#stateofai | 71
Introduction | Research | Talent | Industry | Politics | Predictions
COVID vaccines are shown to be highly effective from large-scale observational data collected with the ZOE COVID Study App and the use of causal methods.
Citizen science with 1.2M participants demonstrates real-world vaccine effectiveness
stateof.ai 2021
age
sex
healthcare worker
comorbidities
background infections
vaccinated
infected
Infection risk reduction
Months since vaccine
Infection risk reduction since end of May (Delta emergence)
AstraZeneca
Pfizer
Causal variables to account for
#stateofai | 72
Introduction | Research | Talent | Industry | Politics | Predictions
stateof.ai 2021
Major factors that drive the carbon emissions during model training are the choice of neural network (esp. dense or sparse), the geographic location of a datacenter, and the processors. Optimising these reduces emissions.
Reducing the carbon emissions of large neural network training by 100-1000x
#stateofai | 73
Introduction | Research | Talent | Industry | Politics | Predictions
stateof.ai 2021
Introduced by Google in late 2019, JAX is a python package that combines Autograd (a library for automatic differentiation) and XLA (a compiler for linear algebra) to accelerate computations for machine learning research.
Here comes a new framework challenger: JAX
#stateofai | 74
Introduction | Research | Talent | Industry | Politics | Predictions
Section 2: Talent
stateof.ai 2021
#stateofai | 75
Introduction | Research | Talent | Industry | Politics | Predictions
stateof.ai 2021
Brazil and India are hiring >3x more AI talent today than they were in 2017, matching or surpassing the hiring growth of Canada and the US. Meanwhile, almost 30% of scientific research papers from India include women authors compared to an average of 15% in the US and UK, and far greater than 4% in China.
India and China see significant growth of AI talent, and India’s AI research is most diverse
#stateofai | 76
Introduction | Research | Talent | Industry | Politics | Predictions
stateof.ai 2021
The Chinese Academy of Sciences, the national academy for the natural sciences in China, was founded in 1949. From having no AI publications in 1980, the institution went to the #1 institution publishing top 25% quality* AI research 30 years later. Tsinghua University and Peking University emulated its growth and are now competitive with the oldest and best universities in the world: Oxford, Cambridge, Harvard, Stanford et al.
白手起家: A Chinese institution publishes the largest volume of quality AI research today
*Microsoft Academic Graph measures quality by “using a dynamic eigencentrality measure that ranks a publication highly if that publication impacts highly ranked publications, is authored by highly ranked scholars from reputable institutions, or is published in a highly regarded venue and also considers the competitiveness of the field.”
#stateofai | 77
Introduction | Research | Talent | Industry | Politics | Predictions
stateof.ai 2021
China is projected to reach nearly double the number of STEM PhD students in the US by 2025.
China is outpacing the US in STEM PhD growth...
#stateofai | 78
Introduction | Research | Talent | Industry | Politics | Predictions
stateof.ai 2021
High count numbers can artificially hide a decrease in program quality, for example if driven by a rapid development of mediocre programs. Data shows that this is not the case in China, where 43% of PhD graduates in 2019 came from Double First Class Universities*, a slight drop from 46% in 2015.
…without sacrificing program quality
*Double First Class Universities are “a tertiary education development initiative designed by the People's Republic of China government, in 2015, which aims to comprehensively develop elite Chinese universities and their individual faculty departments into world-class institutions by the end of 2050.” - Wikipedia
#stateofai | 79
Introduction | Research | Talent | Industry | Politics | Predictions
stateof.ai 2021
An analysis of data from 2015-2019 examines the curricula of students from Tsinghua and Peking University. 70% of undergraduates continue to undertake postgraduate studies. Only 16% of all graduates (Bachelors, Masters, PhDs) choose to study abroad after graduation: their preferred destination is the US, followed by the UK. Domestically, Huawei holds firmly as their top employer.
Where do students from leading Chinese universities go?
Credits: Jeffrey Ding
2015-2019 Ranking of Employment Units for Tsinghua and Peking University Graduates | |||||
| Tsinghua University | ||||
Rank | 2015 | 2016 | 2017 | 2018 | 2019 |
1 | State Grid | Huawei | Huawei | Huawei | Huawei |
2 | Huawei | State Grid | State Grid | Tencent | Tencent |
| Peking University | ||||
1 | Huawei | Huawei | Huawei | Huawei | Peking U |
2 | Baidu | ICBC | ICBC | Tencent | Huawei |
Figure: Destinations of Tsinghua University’s 2019 graduates who go abroad for further study.
#stateofai | 80
Introduction | Research | Talent | Industry | Politics | Predictions
stateof.ai 2021
Since 2012, large technology companies have increasingly published either on their own or in collaboration primarily with elite universities as opposed to mid-tier and lower-tier universities. Counterfactual analysis suggests a causal divergence between large technology companies and non-elite universities that is driven by access to computing power as a form of de-democratisation. This results in a small set of actors creating a majority of the high-impact research output.
Elites work with elites: a compute divide drives the “de-democratization” of AI research
#stateofai | 81
Introduction | Research | Talent | Industry | Politics | Predictions
stateof.ai 2021
Researchers in deep learning with higher average impact papers from elite universities are more likely to transition into technology companies than their non-elite peers (middle chart). Early in their industry tenure, the citations of researchers increases and then steadily declines over the years (right chart). This suggests a depletion of academic impact (left chart).
Academia to industry transitioning is increasingly popular amongst top universities
#stateofai | 82
Introduction | Research | Talent | Industry | Politics | Predictions
stateof.ai 2021
In last year’s Report, we noted the significant efflux of Professors from North American universities into large technology companies (top 3 magnets were: Google/DeepMind, Amazon, Microsoft) from 2004-18. In 2019, the trend largely continued with 33 faculty members departing (right graph). It is notable that 85% of Professors that are hired are Tenured, meaning their level of seniority is such that they hold permanent employment at the university. CMU, Georgia Tech, Washington, and Berkeley lost the most faculty between 2004-19 (left graph).
The Great Academic Brain Drain...continued
#stateofai | 83
Introduction | Research | Talent | Industry | Politics | Predictions
stateof.ai 2021
In Germany, for example, student enrollment in applied sciences is growing >60% YoY (2018) whereas faculty growth in the same department and year is stagnating around 25% YoY. In absolute numbers, 2018 saw circa 230k students vs 2.5k professors, suggesting that 1 professor advises 90-100 students. This is untenable.
Depletion of academic faculty and the worsening of faculty:student ratios
Total university students
Students at universities of applied sciences
Professors at universities of applied sciences
Total university professors
Annual growth rate of students and professors (%)
#stateofai | 84
Introduction | Research | Talent | Industry | Politics | Predictions
stateof.ai 2021
In the Netherlands, for example, student enrollment in STEM programs has grown 68% between 2000 and 2017 but government funding for these resource-intensive programs has dropped 25% in the same time period on a per student basis. Academics fear for the livelihood of their programs. This is in stark contrast to China, where the government introduced AI courses for elementary and secondary school students in 2018 and has expanded its investment into STEM ever since.
Government funding cuts to higher education threatens more expensive STEM students
STEM student enrollment in the Netherlands
Government funding per student in the Netherlands
#stateofai | 85
Introduction | Research | Talent | Industry | Politics | Predictions
stateof.ai 2021
A Google researcher polled Twitter on the approximate annual compute budget for academic and industry AI labs. The responses suggested that grant bodies often reject the inclusion of compute budgets in grant applications and that most research groups work with very small numbers of GPUs. In response, some large cloud vendors are moving in to fill the gap.
Research groups struggle to compete given institutionally limited budgets
#stateofai | 86
Introduction | Research | Talent | Industry | Politics | Predictions
stateof.ai 2021
Unsurprisingly, therefore, Big Tech companies are a major source of academic research funding. This lets them indirectly craft a desirable public image and influence events, decisions, and research agendas of the universities they fund (particularly top tier institutions).
More money, more influence: 88% of top AI faculty have received funding from Big Tech
NeurIPS 2020 Platinum Sponsors
63% Big Tech // 21% Finance
% of CS faculty members who have at any point received funding or employment from Big Tech
#stateofai | 87
Introduction | Research | Talent | Industry | Politics | Predictions
stateof.ai 2021
Carnegie Mellon University partnered with Emerald Cloud Labs, to build the “world’s first cloud lab in an academic setting” as part of the university’s $250M investment into new science facilities. The project, costing $40M, will house 100 different scientific instruments for life science experiments on the CMU campus that are orchestrated via the cloud and executed by automated workflows. Another related academic-tech company relationship is the $240M partnership between IBM and MIT that formed the MIT-IBM Watson AI Lab in 2017.
Universities team up with private companies to fill research resources gap
#stateofai | 88
Introduction | Research | Talent | Industry | Politics | Predictions
stateof.ai 2021
The gender and racial diversity data radically differ between technical and non-technical teams. They show a massive lack of gender diversity in technical teams, while a better balance is achieved in product and commercial teams. African Americans and Hispanics constitute a lower share of the AI workforce than their share in the general workforce, with the severest drop coming from technical teams. These teams also have the highest share of Asian workers.
The US AI workforce: gender and racial diversity
#stateofai | 89
Introduction | Research | Talent | Industry | Politics | Predictions
stateof.ai 2021
Computer research scientists, software developers, mathematicians, statisticians and data scientists saw an evolution of their employment that is far ahead of the general employed population. To meet the increasing demand for technical talent, computer science and engineering were the fastest growing undergraduate degrees over 2015 to 2018, accounting for 10.2% of all 4-year degrees conferred in 2018. Their numbers increased by 34% and 25% respectively during the period, while the number of other awarded degrees increased 4.5% on average.
Market forces in action: supply of technical US AI talent grows 26.5% to meet demand
| 2019 Employment | 2015-2019 Employment Change |
Computer Research Scientists | 35,230 | 72.9% |
Mathematicians/ Statisticians/ Data scientists | 184,290 | 251.9% |
Software Developers | 1,651,990 | 38.9% |
Total Technical AI | 1,871,510 | 48% |
Total US Employed | 160,034,580 | 5.8% |
| Number of conferred degrees | |
| 2018 | 2015-2018 change |
Computer Science | 79,598 | 34% |
Engineering | 121,956 | 25% |
Mathematics/ Statistics | 25,256 | 15.6% |
Total technical AI degrees | 226,810 | 26.5% |
All degrees | 1,980,644 | 4.5% |
#stateofai | 90
Introduction | Research | Talent | Industry | Politics | Predictions
stateof.ai 2021
In the US, the tech sector is where remote work has been the most prevalent despite the loosening of pandemic rules in the spring of 2021. With the pandemic resurgence, Google, Apple, Facebook and Amazon announced that their offices would still be closed until at least January 2022. Twitter made the switch to remote work permanent.
Tech workers are staying home (for now)
#stateofai | 91
Introduction | Research | Talent | Industry | Politics | Predictions
Section 3: Industry
stateof.ai 2021
#stateofai | 92
Introduction | Research | Talent | Industry | Politics | Predictions
stateof.ai 2021
2020 Prediction: An AI-first drug discovery companies IPOs or is acquired for $1B
British AI-first drug discovery company, Exscientia, originated the world’s first 3 AI-designed drugs into Phase 1 human testing and IPO’d on the NASDAQ on 1 October 2021 at a >$3B valuation. Exscientia is now the UK’s largest biotech and the 3rd largest biopharma company in the UK next to GSK and AstraZeneca. The company has a further 4 more drug candidates currently undergoing advanced profiling for submission of investigational new drug applications, in addition to more than 25 active projects in total.
10x fewer synthesized compounds to deliver a candidate
12 months target-to-hit vs. 54 months industry average
#stateofai | 93
Introduction | Research | Talent | Industry | Politics | Predictions
Drug selection for cancer patients is highly inefficient: over 90% of patients do not respond to the therapy that is selected by their oncologist. Why? Selection methods such as mutation sequencing are too reductionist. By contrast, Allcyte’s AI (left figure) finds the most potent drug for a given patient. AI-based microscopy is used to measure how live cancer cells respond to 140 clinically-approved third-party anticancer drugs at the single cell level. In a prospective clinical trial of 56 blood cancer patients (right figure), those patients who received AI-guided therapy achieved a 55% overall response rate and a statistically significant improvement in progression-free survival over their respective prior line of therapy.
Computer vision identifies the most potent drug for each cancer patient to improve survival
stateof.ai 2021
#stateofai | 94
Introduction | Research | Talent | Industry | Politics | Predictions
stateof.ai 2021
Recursion Pharmaceuticals, a Utah-based AI-first company that makes use of high-throughput screening and computer vision-powered microscopy to discover drugs, raised $436M in its NASDAQ IPO in April 2021. The business has 37 internally-developed drug programs including 4 clinical-stage assets. By conducted targeted exploration of biological search space with compound and disease cell type combinations, the company is building a “map” of disease biology. With this map, the company is predicting tens of billions of relationships between disease models and therapeutic candidates. This includes relationships that are predictive of candidate mechanism of action, which expands the discovery funnel beyond hypothesized and human-biased targets.
2020 Prediction: An AI-first drug discovery companies IPOs or is acquired for $1B
#stateofai | 95
Introduction | Research | Talent | Industry | Politics | Predictions
stateof.ai 2021
DELs are composed of billions of small molecules with unique DNA barcodes attached. Previous ML applied to DELs coarsely aggregated data to smooth out noise. By adapting graph neural networks to reflect the DEL process, Anagenex lowers noise and designs novel libraries to complete wet lab-guided active learning loops.
Active learning using custom GNNs for improved drug discovery lead-finding with DEL data
DNA tag
Accessible molecules
Model: custom DEL tuned GNN
Synthesizable or purchasable compounds
Evolved library design
DEL library
Active learning loop
#stateofai | 96
Introduction | Research | Talent | Industry | Politics | Predictions
stateof.ai 2021
Treatments for inflammatory bowel diseases such as Crohn’s Disease and Ulcerative Colitis need not only inhibit inflammation, but must also survive while travelling through the gut. In order to achieve this, LabGenius simultaneously co-optimised potency and stability in the presence of protease. Their approach resulted in protein designs that had ~400 fold greater potency and a ~100 fold increase in protease stability in comparison to molecules designed by traditional methods.
Convolutional neural networks help design better protein therapeutics
#stateofai | 97
Introduction | Research | Talent | Industry | Politics | Predictions
stateof.ai 2021
Intenseye’s computer vision models are trained to detect over 35 types of employee health and safety (EHS) incidents that human EHS inspectors cannot possibly see in real-time. The system is live across over 15 countries and 30 cities, having already detected over 1.8M unsafe acts in 18 months.
Real-time computer vision protects employees from workplace injuries (or worse)
Heatmap of incidents
Employee not wearing PPE
Dangerous driving
#stateofai | 98
Introduction | Research | Talent | Industry | Politics | Predictions
stateof.ai 2021
Computer vision unlocks faster recovery from natural disasters
Climate change is increasing the severity of natural disasters, inflicting $190B of damage to homes worldwide in 2020, 4x more than in 1990. The global population exposed to natural disasters will increase 8x by 2080. Tractable's AI-augmented system allows homeowners to take photos of damage to their home after a natural disaster (e.g. hurricanes) to predict repair costs and unlock insurance claim payouts months faster.
a
b
c
a. Climate change causes property damage from natural disasters
b. Typhoon Mindulle about to strike Japan, Oct 2021
c. Tractable’s user-facing application
#stateofai | 99
Introduction | Research | Talent | Industry | Politics | Predictions
UK National Grid ESO halves error of electricity demand forecast using transformers
stateof.ai 2021
Forecast lead time | Reduction in mean absolute error (MAE) |
1 hour | 58% |
4 hours | 25% |
8 hours | 11% |
24 hours | 14 % |
Predicting demand is essential to achieving ESO’s ambition of running the grid on net-zero generation by 2025.
#stateofai | 100
Introduction | Research | Talent | Industry | Politics | Predictions
stateof.ai 2021
Improving the sustainability and carbon efficiency of farms using predictive models
Dairy cow farmers monitor their livestock to for health issues and the onset of calving. Using deep learning to analyse accelerometer data from a neck-worn sensor, Connecterra is able to predict health issues 2-3 days prior to human observation. They can also predict the onset of calving, which reduces the number of days that pregnant cows are treated with antibiotics by 50% (left graph). Connecterra can predict milk yield with <1% margin of error up to 200 days in the future (right graph, blue = less error), which could reduce CO2 emissions.
2018
2019
2020
0
1
2
3
4
5
AI-guided
Human-guided
Industry-standard
Average number of days animals are treated with antibiotics
AI milk predictor
#stateofai | 101
Introduction | Research | Talent | Industry | Politics | Predictions
Good and bad gut microbiome bacteria identified
Nutrition: Good and bad gut microbiome bacteria and their connections to food identified from metagenomic sequencing of 1,100 study participants
stateof.ai 2021
15 best and 15 worst bacteria by correlation against a broad range of health markers (personal health scores, fasted blood tests, post-meal blood tests and habitual diet).
Diet can change your gut microbiome
Successful prediction of whether a person drinks coffee based on bacteria present in their gut microbiome (UK-trained model performance on UK & US test sets).
#stateofai | 102
Introduction | Research | Talent | Industry | Politics | Predictions
Expert-level quantification of “dry” age-related macular degeneration (AMD) developed by a UK-based NHS team
Eye disease is a petri dish for medical AI development in the clinic
stateof.ai 2021
#stateofai | 103
Introduction | Research | Talent | Industry | Politics | Predictions
stateof.ai 2021
One of the few real-world deployments of AI that addresses the pandemic is the reinforcement learning (RL) system, Eva, which was developed in Greece. Given a specified fraction of travellers who could be tested, Eva selected which specific passengers to test at the Greek borders. Eva identified 1.5x - 4x more positive infections at a given testing fraction than random selection.
Reinforcement learning for an effective Covid testing strategy
#stateofai | 104
Introduction | Research | Talent | Industry | Politics | Predictions
stateof.ai 2021
Viz.ai’s stroke detection software helps 1 patient every 47 seconds in the US today
A stroke occurs when the brain is deprived of its blood supply. Within minutes, brain cells begin to die from a lack of oxygen and nutrients, which results in irrecoverable damage. Rapid detection of brain strokes is crucial, but clinically challenging. In 2021, a real-world multi-center study of 45 stroke patients tested a deep learning system from Viz.ai versus standard of care. It found that the AI-based approach reduced the transfer time for a patient post-imaging at a primary stroke center to a comprehensive stroke center by 39% on average.
#stateofai | 105
Introduction | Research | Talent | Industry | Politics | Predictions
stateof.ai 2021
With the increasing power and availability of ML models, gains from model improvements have become marginal. In this context, the ML community is growing increasingly aware of the importance of better data practices, and more generally better MLOps, to build reliable ML products.
Insights from ML in production nudge researchers from model-centric to data-centric AI
Improve data
Train model
Error analysis
Data-centric
Fixed model, evolving data
Improve model
Train model
Error analysis
Model-centric
Fixed data, evolving model
#stateofai | 106
Introduction | Research | Talent | Industry | Politics | Predictions
stateof.ai 2021
Due to the rapid progress in model development, beating benchmarks has become a matter of months. The high-performing models nonetheless often fail in real-world scenarios. Dynamic Benchmarking, where datasets are continuously updated by human users, are a solution to make benchmarks more useful.
Machine Learning in production: active benchmarking
#stateofai | 107
Introduction | Research | Talent | Industry | Politics | Predictions
stateof.ai 2021
Two new datasets to deal with distribution shifts: WILDS and Shifts.
Machine Learning in production: distribution shifts
#stateofai | 108
Introduction | Research | Talent | Industry | Politics | Predictions
stateof.ai 2021
A more pernicious problem in ML systems is underspecification: Models trained and tested successfully on the same data, but using different random seeds, can behave differently on real-world data.
Machine Learning in production: underspecification
#stateofai | 109
Introduction | Research | Talent | Industry | Politics | Predictions
stateof.ai 2021
Despite a loud call to arms and many willing participants, the ML community has had surprisingly little positive impact against Covid-19. One of the most popular problems - diagnosing coronavirus pathology from chest X-ray or chest computed tomography images using computer vision - has been a universal clinical failure.
Machine Learning in production: beware of bad data
#stateofai | 110
Introduction | Research | Talent | Industry | Politics | Predictions
stateof.ai 2021
Data-driven AI: training datasets grow with models in the loop
With automated labelling, and plateauing architecture performance, training data quantity and quality becomes the competitive metric for AI-first startups.
Unstructured documents
KYC w/ID
3D CT & MRI
Ultrasound video
Ratio of ground truth annotations made by AI vs. humans across computer vision teams
#stateofai | 111
Introduction | Research | Talent | Industry | Politics | Predictions
Since our 2020 Report, NVIDIA has faced mounting resistance from several angles over its planned $40B acquisition of Arm: industry players who compete with NVIDIA, customers of Arm, regulators and governments. In September, 2020, NVIDIA had laid out an 18 month plan to complete the deal. The company has now stated that the deadline will not be met and needs to be extended into September 2022.
2020 Prediction outcome: NVIDIA does not complete its acquisition of Arm
stateof.ai 2021
?
#stateofai | 112
Introduction | Research | Talent | Industry | Politics | Predictions
Europe and the US want to buy themselves semiconductor sovereignty. Is this realistic?
stateof.ai 2021
Over the last 30 years, the industry has been beneficiary of geographical specialisation across more than 50 different types of sophisticated wafer processing and testing equipment, and 300 different input materials. In a matter of months, the Covid-19 pandemic exposed 50+ points across the semiconductor supply chain where a single region accounts for 65% or more of the total global supply as key vulnerabilities. Despite earmarking $200B between the US and Europe, achieving semiconductor sovereignty could cost >$1T in upfront investment. This is 6x the combined R&D investment and capital expenditure of the total semiconductor value chain in 2019.
#stateofai | 113
Introduction | Research | Talent | Industry | Politics | Predictions
Europe woke up to its largest company, ASML, the linchpin to global semiconductors
stateof.ai 2021
The Netherland’s ASML provides chip makers with essential hardware, software and services to mass produce patterns on silicon using a method called lithography. The company is alone in offering extreme ultraviolet lithography (EUV) machines that unlocks the leading manufacturing nodes (e.g. 3nm and 5nm at TSMC). Each EUV machine, which has over 100,000 parts and costs $150M, ships in 40 freight containers (or 4 jumbo jets).
#stateofai | 114
Introduction | Research | Talent | Industry | Politics | Predictions
Manufacturers suffer from Covid-induced supply chain disruptions for semiconductors
stateof.ai 2021
Almost all electronic goods depend on semiconductors. Due to Covid lockdowns and rising demand for electronics, manufacturers are suffering from never-before-seen wait times of 4 months+ between ordering a chip and receiving it. Anecdotally, wait times today are more like 6-12 months with chip shortages into Q2 2022.
#stateofai | 115
Introduction | Research | Talent | Industry | Politics | Predictions
A semiconductor drought is costing the automotive sector upwards of $110B in lost sales
stateof.ai 2021
Halfway through 2021, global auto companies have produced 4 million cars less than expected, down 15% on average. Toyota signaled that it would cut production by 40% worldwide in September 2021. By contrast, large technology companies have not been complaining about semiconductor supply shortages, which suggests there is a bifurcation in the “haves” and “have nots”.
Global motor vehicle production: Q1 2021 vs Q4 2020
Global motor vehicle production over 20 years
#stateofai | 116
Introduction | Research | Talent | Industry | Politics | Predictions
Major semiconductor fabs commit $400B for new capacity as global market hits $551B
stateof.ai 2021
Intel’s new CEO, Pat Gelsinger, committed the company become a major contract chip maker. One month after his appointment, Gelsinger pledged $20B to build two new plants in Arizona. He followed with another $3.5B expansion into New Mexico, and in September 2021 said he plans to build $95B worth of new chip fabs in Europe. Intel’s stock price is up 21% since 1 Jan 2019.
TSMC’s CEO, C.C. Wei, said the company would invest $100B over the next three years to boost capacity, which is more than double the company’s expenditure in the last years. This includes TSMC’s planned chip fab in Arizona. Stock price is up 256%.
Samsung said it would invest $205B over the next three years across its chip manufacturing (Samsung Electronics) and its contract drug manufacturing businesses (Samsung Biologics). This includes a $17B chip fab based in either Texas, New York or Arizona. Stock price is up 114%.
#stateofai | 117
Introduction | Research | Talent | Industry | Politics | Predictions
$52B CHIPS for America Act gains support from Semiconductors in America Coalition
stateof.ai 2021
In 2019, our Report noted that “China is (slowly) ramping up on its semiconductor trade deficit.” In 2020, China imported $350B worth of chips, an increase of 14.6% vs. 2019, notably from US manufacturers. However, as the US-China trade war has dramatically escalated in the last 12 months, the US has taken the view that its eroding share of global semiconductor manufacturing capacity from 37% in 1990 to 12% today is no longer acceptable.
#stateofai | 118
Introduction | Research | Talent | Industry | Politics | Predictions
stateof.ai 2021
In the last 12 months, CrowdStrike has almost doubled its market capitalisation to $60B and reached $1.3B ARR. The company is demonstrating the platform potential of AI-first technology in cybersecurity: 53% of its 13,080 subscription customers purchase more than 5 products and 29% subscribe to more than 6 products. Meanwhile, SentinelOne (124%) and CrowdStrike (120%) are firmly in the high-growth net dollar retention segment of SaaS companies, suggesting that their customers expand their subscription spend year on year.
Public market investors favor AI-first cybersecurity players: CrowdStrike ($60B), Darktrace (£5B), SentinelOne ($18B), Riskified ($6B)
#stateofai | 119
Introduction | Research | Talent | Industry | Politics | Predictions
stateof.ai 2021
UiPath (robotic process automation), Snowflake (cloud data platform), and Confluent (Kafka-based data streaming) represent $138B of newly created public market value in 2021 with revenues growing 50-100% YoY at this scale. All three companies have best-in-class net dollar retention above 130% and 2% of their customer base spending over $1M per year. Snowflake became the largest software IPO of 2020, raising $3.35B.
The enterprise data and automation sector is on fire: Snowflake, UiPath, Confluent IPOs
#stateofai | 120
Introduction | Research | Talent | Industry | Politics | Predictions
stateof.ai 2021
Since launching its original data platform built on Apache Spark in 2015, Databricks has grown into a one-stop home for (un)structured data, automated ETL, collaborative data science notebooks, business intelligence using SQL, and full-stack machine learning built on open source MLflow. Interestingly, all three major cloud vendors - Amazon, Google, and Microsoft - invested in Databricks in February 2021.
Databricks: The enterprise data/AI juggernaut reaches $38B valuation and $600M ARR
#stateofai | 121
Introduction | Research | Talent | Industry | Politics | Predictions
stateof.ai 2021
AEye, Quanergy, Ouster, Innoviz, Aeva, Luminar, and Velodyne raised $1.3B in private markets, $2.9B via SPACs, and went public at a cumulative valuation of $12.4B. None of these companies had significant revenue going into their SPAC. Together, they project $2.9B in 2024 revenue even though they sell hardware and software products to overlapping autonomous driving customers and other nascent markets.
All seven major private LiDAR companies have SPAC’d and trade below their IPO price
Note: Gross proceeds = PIPE + cash in trust, stock price data from 30 Aug 2021
#stateofai | 122
Introduction | Research | Talent | Industry | Politics | Predictions
Survival of the fittest: Waymo, Cruise, Aurora rev up their balance sheets and trucks SPAC
+$2.5B
June 2021
+$2.75B
April 2021
Sold for $550M
July 2021
Sold for 26% of Aurora
+ $400M financing
December 2020
+$2B cash
$10.6B SPAC
July 2021
+$1.35B cash
$8.5B SPAC
April 2021
+$614M cash
$5.2B SPAC
July 2021
+$345M cash
$3.3B SPAC
May 2021
stateof.ai 2021
>$5B raised by Waymo and Cruise.
Lyft and Uber offload AV teams.
>$4B raised as trucking and consumer AVs SPAC to become public companies.
+
+
#stateofai | 123
Introduction | Research | Talent | Industry | Politics | Predictions
Modular
Complex sensors, HD Maps + Hand-coded rules
Today’s modularised approach struggles with brittle decision-making in prediction/planning. An alternative approach is one that uses end-to-end deep learning from cameras and GPS as a solution to decision complexity.
Learning to drive with a large network, trained end-to-end with perception
stateof.ai 2021
End-to-end
Deep Learning
#stateofai | 124
Introduction | Research | Talent | Industry | Politics | Predictions
Another approach makes heavy use of offline simulations learned from real-world observations and planners that learn from training datasets that are collected at scale using expensive camera sensors. This system has been successfully tested on self-driving vehicles in downtown San Francisco in 2021.
Learn to simulate, then train an RL driving system in the simulation
stateof.ai 2021
AV deployment
Offline system evaluation: Few hours are needed
Learn a simulator in which to train an RL agent
#stateofai | 125
Introduction | Research | Talent | Industry | Politics | Predictions
stateof.ai 2021
Chinese institutions won all first and second places in all tasks of the 2021 AI City Challenge.
Chinese institutions dominate research in Smart Cities
#stateofai | 126
Introduction | Research | Talent | Industry | Politics | Predictions
stateof.ai 2021
China’s SenseTime, a $12B facial recognition software company that powers surveillance on Uighur Muslim detainment camps and was blacklisted by the Trump administration in 2019, filed to list on the Hong Kong stock exchange. The company generated $525M of revenue in 2020.
Facial recognition: upcoming IPOs and fundraising despite controversy and lawsuits
#stateofai | 127
Introduction | Research | Talent | Industry | Politics | Predictions
stateof.ai 2021
In the US, Clearview AI has been sued by the American Civil Liberties Union over face scraping in Europe and by immigrant rights groups in California. Even so, the product has been widely trialed by law enforcement and governments in 24 countries and has continued to raise capital from private investors.
Clearview AI: despite lawsuits and bans in Europe and Canada, the company continues
#stateofai | 128
Introduction | Research | Talent | Industry | Politics | Predictions
stateof.ai 2021
Google infuses AI capabilities into more of its business and consumer applications
Beyond Gmail’s popular Smart Reply feature, the company’s AI-based grammar checker is now live across Sheets, Docs, and Slides. Sheets now also provides context-aware formula predictions and allows you to ask questions of your data in natural language. Maps is receiving over 100 new AI-first features, including indoor navigation with AR and a new routing option that optimises for lower fuel consumption and CO2 emissions. Google also open sourced MediaPipe, a cross platform toolkit for integrating fast inference computer vision functionality.
#stateofai | 129
Introduction | Research | Talent | Industry | Politics | Predictions
stateof.ai 2021
OpenAI GPT-3 integrations: Microsoft Power Apps, GitHub Copilot, and 300 other apps
Power Apps users can describe a programming goal in natural language and have GPT-3 automatically transform it into Power Fx code. Meanwhile, GitHub users can call on Codex (descendant of GPT-3) to generate whole lines or entire functions within from within their code editor. After surpassing 300 apps build with GPT-3, OpenAI launched a $100M fund to invest in startups that make use of their APIs.
#stateofai | 130
Introduction | Research | Talent | Industry | Politics | Predictions
stateof.ai 2021
Startups in the US, Canada, and Europe raise close to $375M in the last 12 months to bring large language model APIs and vertical software solutions to customers who cannot afford to directly compete with Big Tech. This is significant momentum in a single year when cast against the early acquisitions of NLP startups including Maluuba ($140M in 2017), Semantic Machines (rumored $150-250M in 2018) and SwiftKey ($250M in 2016).
Large language models for all: startups raise $375M to translate research into industry
#stateofai | 131
Introduction | Research | Talent | Industry | Politics | Predictions
stateof.ai 2021
Google’s Super-Resolution via Repeated Refinement (SR3) model iteratively refines a noisy 64x64 image into a high-quality 1024x1024 image that outperforms generative adversarial networks. Meanwhile, China’s SenseTime showcased its 30x super-resolution zoom that marries computer vision with a custom AI chipset.
Image super-resolution enables super-zoom on consumer grade smartphones
#stateofai | 132
Introduction | Research | Talent | Industry | Politics | Predictions
The rapid growth of consumers selling products online is supported by AI-first photo app
stateof.ai 2021
ClipDrop enables >11k subscribed online sellers to create beautiful product imagery with a single click. Computer vision-based scene understanding and segmentation enables the extraction of objects from real life settings without the need for photo studies or complex post-processing. This is powering a huge surge in secondhand good selling worth an estimated $27B in 2020, which according to market research is growing several times faster than primary retail.
Paying subscriber growth
#stateofai | 133
Introduction | Research | Talent | Industry | Politics | Predictions
stateof.ai 2021
Robotic picking and packing is helping retailers meet a growing demand for online deliveries. Leading online grocery technology company, Ocado, uses computer vision and proprietary grasping technology to efficiently pick and pack items for grocery orders. In e-commerce, robotic picking platform SORT will handle 300M+ items by the end of 2021. Reinforcement learning tool (RLScan) is a very early example of RL success in production environments of robotic systems at scale.
Deep reinforcement learning-enhanced picking robots support a surge in online grocery
#stateofai | 134
Introduction | Research | Talent | Industry | Politics | Predictions
stateof.ai 2021
A key differentiator for online grocers is breadth of their in-stock product range. This is challenging to achieve: order too little stock and customers won't be able to buy the items they want, but ordering too much would increase waste and hit margin.
Deep learning automates 98% of stock replenishment decisions for online grocers
Forecast lead time
Daily sales
#stateofai | 135
Introduction | Research | Talent | Industry | Politics | Predictions
AI-last: large scale first party data unlocks new AI product opportunities at Shopify
stateof.ai 2021
In April 2016, Shopify launched an ML-driven lending solution called Shopify Capital that preemptively offers working capital advances up to $2M that can be unlocked in 2-5 days by high performing merchants on their platform. Shopify Capital has grown to $2.3B in cumulative capital advanced since its launch and 137% YoY by Q2 2021. Interestingly, 76% of merchants who used this product returned for at least one additional round of funding and merchants averaged 36% higher sales in the first 6 months compared to their non-funded peers.
#stateofai | 136
Introduction | Research | Talent | Industry | Politics | Predictions
stateof.ai 2021
To detect Child Sexual Abuse Material (CSAM) while preserving user privacy, Apple intended to use NeuralHash, a hashing method for images based on neural networks. Apple claimed that this method enabled images to be compared on device with a known CSAM database while only having access to the photos if they contain CSAM. Faced with criticism from privacy advocates and technical experts, Apple delayed the launch of their system.
Apple faces the complex problem of AI-based privacy
#stateofai | 137
Introduction | Research | Talent | Industry | Politics | Predictions
stateof.ai 2021
With the regulation of third party cookies and the increasing public awareness of the importance of data privacy, browsers are compelled to find new privacy-preserving solutions for their advertising business.
Browser-based federated learning thrives in a post-cookie world
#stateofai | 138
Introduction | Research | Talent | Industry | Politics | Predictions
stateof.ai 2021
Critics worry that FLoC makes it easier for advertisers to track users across the web. All other browsers (e.g. Firefox, Brave, Edge) refused to integrate FLoC. DuckDuckGo even created a Chrome extension to block it.
But is Google’s Federated Learning of Cohorts a good alternative to third party data?
#stateofai | 139
Introduction | Research | Talent | Industry | Politics | Predictions
stateof.ai 2021
In a world-first, South Africa granted a patent to an AI system. The system, called Dabus, invented a method to better interlock food containers. Most countries, however, do not recognize a machine as an inventor.
AIs play Go. AIs paint. AIs make music. Now AIs… invent?
*
*These are not actual images of AIs.
#stateofai | 140
Introduction | Research | Talent | Industry | Politics | Predictions
Investing in AI: 182 active AI unicorns totaling $1.3T of combined enterprise value
The US outperforms other countries in the number of AI unicorns, followed by China, UK & Israel. US unicorns have reached a combined market value of over €800 billion.
stateof.ai 2021
| Number of AI unicorns | Total funding raised | Combined enterprise value | Examples |
United States | | | | |
China | | | | |
United Kingdom | | | | |
Israel | | | | |
Canada | | | | |
Germany | | | | |
Singapore | | | | |
Switzerland | | | | |
Hong Kong | | | | |
France | | | | |
South Korea | | | | |
Japan | | | | |
India | | | | |
Belgium | | | | |
Bermuda | | | | |
Taiwan | | | | |
Sweden | | | | |
103
35
10
8
4
3
3
3
3
2
2
1
1
1
1
1
1
€55B
€26B
€4B
€2B
€1B
€2B
€2B
€1B
€3B
€1B
€100M
€400M
€400M
€300M
€200M
€100M
n/a
€801B
€346B
€69B
€25B
€8B
€14B
€5B
€4B
€9B
€2B
€2B
€2B
€1B
€2B
€2B
€1B
€4B
#stateofai | 141
Introduction | Research | Talent | Industry | Politics | Predictions
Investing in AI: American AI startups attract the most money but EU+UK is growing fast
stateof.ai 2021
2019
2020
2021 EST
2016
2017
2018
2015
2014
2012
2013
2011
2010
▊ China
▊ USA
▊ European Union & UK
€75B
€50B
€100B
€25B
€0B
The US accounts for ⅔ of global AI investments and the EU+UK is on track to double its share by 2021.
€8B
€17B
2.1x
#stateofai | 142
Introduction | Research | Talent | Industry | Politics | Predictions
Investing in AI: mega rounds are now commonplace as AI startups mature globally
stateof.ai 2021
€250M rounds account for 48% of all capital invested in AI startups in 2021, up from 42% in 2020. We see the same trend for €100M-€250M rounds and Series C rounds, both of which are more represented in 2021.
2019
2020
2021 YTD
2016
2017
2018
2015
2014
2012
2013
2011
2010
€60B
€40B
€80B
€20B
€0B
▊ €0M-€1M (Pre-Seed)
▊ €1M-€4M (Seed)
▊ €4M-€15M (Series A)
▊ €15M-€40M (Series B)
▊ €40M-€100M (Series C)
▊ €100M-€250M
▊ €250M+
‘250M+’ rounds have accounted for 48% of all investment in 2021 so far
#stateofai | 143
Introduction | Research | Talent | Industry | Politics | Predictions
stateof.ai 2021
Combined enterprise value of private companies (AI vs SaaS)
2019
2020
2021 YTD
2016
2017
2018
2015
2014
2012
2013
2011
2010
€1.5T
€1.0T
€2.0T
€0.5T
€0B
▊ AI
▊ SaaS
Combined EV in EUR B: | 2010 | 2011 | 2012 | 2013 | 2014 | 2015 | 2016 | 2017 | 2018 | 2019 | 2020 | 2021 |
All AI companies | 5.4 | 7.3 | 11.6 | 17.5 | 35.3 | 70.1 | 102.6 | 164.7 | 334.3 | 517.2 | 715.9 | 1200 |
All SaaS companies | 21 | 26.9 | 39.1 | 51.6 | 85.4 | 157 | 201.8 | 279.5 | 487 | 712.3 | 1000 | 1800 |
Investing in AI: the combined enterprise value of private AI startups & scaleups is ⅔ that of private SaaS startups & scaleups
#stateofai | 144
Introduction | Research | Talent | Industry | Politics | Predictions
2019
2020
2021 YTD
2016
2017
2018
2015
2014
2012
2013
2011
2010
€600B
€400B
€800B
€200B
€0B
▊ €0M-€200M
▊ €200M-€800M
▊ €800M-€8.0B
▊ €8.0B+
stateof.ai 2021
� | �� |
� | � |
| � |
Cloud data platform
($38B valuation)
Process mining
($11B valuation)
Healthcare data analytics
($8.1B valuation)
Revenue intelligence
($7.25B valuation)
Training data platform
($7.3B valuation)
AI cloud platform
($6.3B valuation)
Combined enterprise value of private
AI SaaS companies
Investing in AI: over €600B of combined enterprise value of private AI-first SaaS startups & scaleups and SaaS startups & scaleups actively using AI
#stateofai | 145
Introduction | Research | Talent | Industry | Politics | Predictions
Investing in AI: enterprise software is the most invested category globally, 2010-2021
stateof.ai 2021
The data-rich domains of Health and Fintech are also particularly popular investing categories globally.
Enterprise software
Transportation
Fintech
Health
Food
Robotics
Marketing
Media
Security
Telecom
Education
Semiconductors
Energy
Travel
Real Estate
Home living
Gaming
Jobs recruitment
Legal
Fashion
Sports
Music
Hosting
Wellness beauty
Event tech
Kids
Dating
Amount invested
€105B
€89B
€71B
€46B
€45B
€37B
€32B
€31B
€28B
€16B
€14B
€11B
€10B
€7B
€6B
€6B
€6B
€3B
€3B
€3B
€2B
€1B
€1B
€36M
€3B
€34M
€8M
Number of rounds
Enterprise software
Health
Fintech
Marketing
Security
Media
Robotics
Education
Transportation
Energy
Food
Jobs recruitment
Real Estate
Semiconductors
Telecom
Legal
Travel
Fashion
Home living
Sports
Gaming
Music
Wellness beauty
Kids
Event tech
Hosting
Dating
7.5K
3.0K
2.9K
2.9K
1.7K
1.6K
1.4K
1.4K
1.4K
903
885
664
614
569
545
518
351
342
319
288
286
192
189
130
128
98
28
#stateofai | 146
Introduction | Research | Talent | Industry | Politics | Predictions
stateof.ai 2021
Software
Robotics
AI biotech
Defense
€155B
€37B
€22B
€0.6B
Amount invested (2010-21)
Number of rounds (2010-21)
Software
Robotics
AI biotech
Defense
10K
1K
753
77
Investing in AI: software leads while robotics, AI biotech and defense are growing
#stateofai | 147
Introduction | Research | Talent | Industry | Politics | Predictions
stateof.ai 2021
2019
2020
2021 YTD
2016
2017
2018
2015
2014
2012
2013
2011
2010
▊ China
▊ USA
▊ European Union & UK
150
100
200
50
0
250
Exits in AI: American AI startups consistently account for ⅔ of exits globally and EU+UK account for roughly ⅓ with the remainder to China
#stateofai | 148
Introduction | Research | Talent | Industry | Politics | Predictions
Exits in AI: almost 3-fold increase in enterprise value creation in the last 12 months
The sum of M&A exit value, secondaries, and the enterprise value of IPOs and SPACs is passed €750B in 2021.
stateof.ai 2021
�
�
2019
2020
2021 YTD
2016
2017
2018
2015
2014
2012
2013
2011
2010
€600B
€400B
€800B
€200B
€0B
� | �� |
� | � |
| � |
$19.7B acquisition�Apr 2021
$1.2B IPO�($8.9B valuation)�Jun 2021
$1.34B IPO�($35.8B valuation)�Apr 2021
$5.4B IPO
($150B valuation)�Feb 2021
$1.4B IPO�($8.5B valuation)�Apr 2021
$2B IPO�($28.6B valuation)�Apr 2021
▊ China
▊ USA
▊ European Union & UK
#stateofai | 148
Introduction | Research | Talent | Industry | Politics | Predictions
Exits in AI: $2.3T of enterprise value has been created by AI companies since 2010
stateof.ai 2021
(*) Based on the exits with a known exit amount. The deals included the enterprise value of: Acquisitions, Secondaries, IPOs, SPAC IPOs, Buyouts;
Enterprise software, fintech, media, transportation, and food categories account for $2T of value creation.
Combined exit value
Number of exits
Avg. exit amount(*)
Enterprise software
Fintech
Media
Transportation
Food
Security
Health
Semiconductors
Marketing
Gaming
Telecom
Robotics
Travel
Real Estate
Music
Home living
Fashion
Energy
Jobs recruitment
Education
Legal
Sports
Dating
Hosting
Wellness beauty
Kids
Event tech
Enterprise software
Marketing
Fintech
Media
Security
Health
Transportation
Education
Robotics
Telecom
Semiconductors
Energy
Legal
Food
Real Estate
Jobs recruitment
Gaming
Travel
Home living
Music
Fashion
Sports
Event tech
Wellness beauty
Hosting
Kids
Dating
Semiconductors
Music
Travel
Fintech
Dating
Security
Real Estate
Jobs recruitment
Transportation
Telecom
Media
Food
Health
Home living
Enterprise software
Gaming
Robotics
Legal
Fashion
Marketing
Energy
Hosting
Education
Sports
Kids
Wellness beauty
Event tech
€487B
€472B
€417B
€390
€335B
€142B
€137B
€91B
€91B
€77B
€72B
€43B
€43B
€42B
€40B
€37B
€26B
€24B
€19B
€12B
€6B
€4B
€4B
€2B
€9B
€32M
€20M
€4B
€2B
€2B
€2B
€2B
€2B
€2B
€1B
€1B
€1B
€1B
€1B
€989M
€922M
€863M
€785M
€683M
€443M
€346M
€343M
€298M
€199M
€129M
€114M
€323M
€103M
€18M
686
245
209
182
166
145
110
110
72
66
63
55
51
46
41
39
31
21
21
18
17
16
11
9
8
4
2
#stateofai | 150
Introduction | Research | Talent | Industry | Politics | Predictions
Exits in AI: almost 3.5-fold growth in AI-first SaaS enterprise value creation in 12 months
stateof.ai 2021
2019
2020
2021 YTD
2016
2017
2018
2015
2014
2012
2013
2011
2010
€300B
€200B
€400B
€100B
€0B
2019
2020
2021 YTD
2016
2017
2018
2015
2014
2012
2013
2011
2010
100
150
50
0
Combined enterprise value
Number of exits
#stateofai | 151
Introduction | Research | Talent | Industry | Politics | Predictions
stateof.ai 2021
Exits in AI: corporates show growing interest in companies that are actively using AI
2019
2020
2021 YTD
2016
2017
2018
2015
2014
2012
2013
2011
2010
150
100
200
50
0
250
The number of exits(1) driven by corporates in 2021 exceeds 200, breaking all yearly records.
� | �� |
� | � |
| � |
£1.1B buyout�Sep 2021
$500M acquisition�Jul 2021
$290M acquisition
Jul 2021
$3.8B acquisition�Jul 2021
$575M acquisition�Jul 2021
$19.7B acquisition�Apr 2021
(1) Counted are acquisitions, buyouts and secondaries
#stateofai | 152
Introduction | Research | Talent | Industry | Politics | Predictions
Section 4: Politics
stateof.ai 2021
#stateofai | 153
Introduction | Research | Talent | Industry | Politics | Predictions
stateof.ai 2021
Dr Gebru left Google after a substantial disagreement over a research paper which examined the risks of large language models, including bias and the carbon footprint associated with training these models.
AI Ethics: Timnit Gebru’s firing from Google shocks the AI community
#stateofai | 154
Introduction | Research | Talent | Industry | Politics | Predictions
stateof.ai 2021
TAI is defined as “AI that has an impact comparable to that of the industrial revolution.” The model predicts a median of 2052 for the year in which some actor would be willing and able to train a single transformative model.
AI Safety: new quantitative model extrapolates from current research and compute trends to estimate when ‘transformative AI’ (TAI) might be possible
#stateofai | 155
Introduction | Research | Talent | Industry | Politics | Predictions
stateof.ai 2021
A team from Cornell, Oxford and UPenn surveyed 524 researchers who published in top ML conferences and compared their views to that of the general public on subjects such as trust in international political and scientific organizations, military applications of AI, and more.
AI Safety: an overwhelming majority (68%) of machine learning researchers surveyed believe that AI Safety research should be prioritised more than at present
#stateofai | 156
Introduction | Research | Talent | Industry | Politics | Predictions
stateof.ai 2021
Within AI Safety, AI Alignment is the critical field of research exploring how we can ensure that increasingly powerful AI systems have goals that are aligned with humanity. If transformational AI might happen in the next 30 years, are too few researchers actively focused on making sure it goes well for humanity?
AI Safety: fewer than 100 researchers work on AI Alignment in 7 leading AI organisations
#stateofai | 157
Introduction | Research | Talent | Industry | Politics | Predictions
Source: primary research by State of AI team. Note, these numbers are for long-term AI Alignment research, which does not include broader AI Safety focused on nearer-term issues. Blue = industry labs, red = academic labs.
stateof.ai 2021
As a percentage of total headcount, Anthropic (42%) and HCAI (36%) are investing the most in this area.
AI Safety: if transformative AI might happen in the next 30 years, how many people are working on making sure it goes well for humanity?
#stateofai | 158
Introduction | Research | Talent | Industry | Politics | Predictions
Source: primary research by State of AI team. Note, these numbers are for long-term AI Alignment research, which does not include broader AI Safety focused on nearer-term issues. Blue = industry labs, red = academic labs.
stateof.ai 2021
AI Safety: new initiatives and organisations are cause for some optimism
Responding to the challenge, a number of smaller organisations and academic departments have sprung up led by talented researchers with an explicit focus on AI Alignment.
#stateofai | 159
Introduction | Research | Talent | Industry | Politics | Predictions
stateof.ai 2021
DeepMind had been negotiating with Google to shift its legal structure to that of a non-profit and to establish a clear governance structure that tackles the deep oversight challenges associated with developing AGI.
AI Governance: DeepMind fails to gain independence from Google
#stateofai | 160
Introduction | Research | Talent | Industry | Politics | Predictions
stateof.ai 2021
Many of OpenAI’s leading researchers leave to start a major new AI research lab.
AI Governance: enter Anthropic as a potential third pole for AGI research
#stateofai | 161
Introduction | Research | Talent | Industry | Politics | Predictions
stateof.ai 2021
The team cites AI Safety and governance as a primary goal.
AI Governance: enter Anthropic as a potential third pole for AGI research
#stateofai | 162
Introduction | Research | Talent | Industry | Politics | Predictions
stateof.ai 2021
A team of renegades have accomplished a huge amount since July 2020.
AI Governance: EleutherAI mounts an attempt to decentralise power via open source
#stateofai | 163
Introduction | Research | Talent | Industry | Politics | Predictions
stateof.ai 2021
A notable achievement of the project has been to create The Pile, a free and publicly released 800GB dataset of diverse English text for large language modelling.
AI Governance: EleutherAI mounts attempt to decentralise power via open source
#stateofai | 164
Introduction | Research | Talent | Industry | Politics | Predictions
stateof.ai 2021
BigScience, also known as the Summer of Language Models, is a one-year long workshop (started in May 2021) whose participants will create large multilingual LMs and datasets. Like EleutherAI, all the workshop’s outputs are open source, and the goal is to analyse the LMs and datasets from all scientific and societal aspects.
AI Governance: the BigScience workshop is an attempt at a hybrid alternative
#stateofai | 165
Introduction | Research | Talent | Industry | Politics | Predictions
stateof.ai 2021
The EU introduced a proposal for AI regulation (AI Act) in April 2021. The proposal aims to provide the necessary legal certainty to facilitate innovation while ensuring the protection of consumer rights. Like GDPR, the proposed law concerns any person or organization, even foreign, involved with an AI system placed or used in the EU. But the AI Act goes beyond GDPR by aiming to directly regulate the use of AI systems.
The EU continues to be the first (and heavy handed) mover in AI regulation
#stateofai | 166
Introduction | Research | Talent | Industry | Politics | Predictions
stateof.ai 2021
While all AI systems need to satisfy some minimal requirements under the AIA, high-risk AI systems are subject to more scrutiny and accountability.
The AI Act: regulatory requirements in Europe
#stateofai | 167
Introduction | Research | Talent | Industry | Politics | Predictions
stateof.ai 2021
Although the AIA is a step in the right direction, many feel the EU is rushing a legislation on technical issues even the scientific community doesn’t understand. As a result, it is not clear whether the EU and member states have the means to enforce it, nor that all companies have the means to comply with the legislation.
Regulating AI systems presents unique technical, economic and legal challenges
#stateofai | 168
Introduction | Research | Talent | Industry | Politics | Predictions
stateof.ai 2021
The Personal Information Protection Law (PIPL), China’s GDPR, will go into effect in November 2021. But Chinese regulators are moving fast. They are already proposing a legislation on a major subset of AI systems: recommendation algorithms. Chinese e-commerce giants and social networks, which are at the center of a regulatory crackdown, are heavy users of these systems.
In China, industrial policy and regulation go hand in hand
#stateofai | 169
Introduction | Research | Talent | Industry | Politics | Predictions
stateof.ai 2021
Chinese AI actors (government, academia, industry) have long been aware of AI ethics issues. In several papers and initiatives, they outlined principles for building ethical AI systems. But a practical application of these principles is still lacking, and AI ethics remain subordinated to higher political interests.
AI ethics in China: numerous initiatives, but to what end?
#stateofai | 170
Introduction | Research | Talent | Industry | Politics | Predictions
stateof.ai 2021
The Chinese Governance Committee for the New Generation Artificial Intelligence published a draft with a set of ethical norms that AI systems should respect. While this is a step in the right direction, the government’s use of AI for censorship and surveillance jumps to mind as a major infringement of the introduced norms.
AI ethics in China: will a new draft on ethics norms change the status quo?
#stateofai | 171
Introduction | Research | Talent | Industry | Politics | Predictions
stateof.ai 2021
The Algorithmic Accountability Act was proposed, and ignored, in 2019. Since then, the US hasn’t seen any attempt at a comprehensive national AI regulation or consumer data privacy law.
Meanwhile, in the US, there still isn’t a federal law protecting consumer data privacy
#stateofai | 172
Introduction | Research | Talent | Industry | Politics | Predictions
stateof.ai 2021
Virginia signed the Virginia Consumer Data Protection Act (VCDPA) into law in March 2021. But an examination of the law shows it is not as binding as California’s CPRA, and largely means “business as usual” for Big Tech.
At the state level, comprehensive data privacy laws are rare and differ in strength
#stateofai | 173
Introduction | Research | Talent | Industry | Politics | Predictions
stateof.ai 2021
The US Government Accountability Office (GAO), the supreme audit institution of the US federal government, examined the ownership and use of facial recognition technology by federal agencies, what activities it was used for, and whether agencies tracked how their employees used the technology.
20 of 42 US federal agencies own or use facial recognition systems for law enforcement
#stateofai | 174
Introduction | Research | Talent | Industry | Politics | Predictions
stateof.ai 2021
This is thought to be the first time a drone swarm has been used in combat.
Military AI moves into production: Israel uses AI guided drone swarm in Gaza attacks
#stateofai | 175
Introduction | Research | Talent | Industry | Politics | Predictions
stateof.ai 2021
This was the first time a U.S. Military System has been controlled by an AI system.
Military AI moves into production: US Air Force flew an AI copilot on a U-2 Spy Plane
#stateofai | 176
Introduction | Research | Talent | Industry | Politics | Predictions
stateof.ai 2021
Military AI moves into production: US Air Force Research Lab tests autonomous Skyborg
The Skyborg Vanguard program is aimed at integrating “full-mission autonomy with low-cost, attritable unmanned air vehicle technology to enable manned-unmanned teaming.”
#stateofai | 177
Introduction | Research | Talent | Industry | Politics | Predictions
stateof.ai 2021
Military AI: governments have doubled down on rhetoric and defense spending
#stateofai | 178
Introduction | Research | Talent | Industry | Politics | Predictions
stateof.ai 2021
Anduril’s valuation doubles in 12 months to $4.6B after raising a $450M Series D. It has now raised circa $700M.
Military AI: Anduril continues to gain momentum
#stateofai | 179
Introduction | Research | Talent | Industry | Politics | Predictions
stateof.ai 2021
Microsoft’s huge $22B contract for Hololens moves them closer to a defense prime.
Military AI: large tech companies scale up military contracts
#stateofai | 180
Introduction | Research | Talent | Industry | Politics | Predictions
stateof.ai 2021
In the face of slow adoption of AI legislation by the US Senate, legislators included some non-military AI provisions in the National Defense Authorization Act (NDAA), a bill which is all but guaranteed to pass every year.
Military AI: AI provisions are smuggled through military legislation
#stateofai | 181
Introduction | Research | Talent | Industry | Politics | Predictions
Section 5: Predictions
stateof.ai 2021
#stateofai | 182
Introduction | Research | Talent | Industry | Politics | Predictions
stateof.ai 2021
8 predictions for the next 12 months
2. ASML’s market cap reaches $500B.
1. Transformers replace recurrent networks to learn world models with which RL agents surpass human
performance in large and rich game environments .
3. Anthropic publishes on the level of GPT, Dota, AlphaGo to establish itself as a third pole of AGI research.
4. A wave of consolidation in AI semiconductors with at least one of Graphcore, Cerebras, SambaNova, Groq, or
Mythic being acquired by a large technology company or major semiconductor incumbent.
5. Small transformers + CNN hybrid models match current SOTA on ImageNet top-1 accuracy (CoAtNet-7,
90.88%, 2.44B params) with 10x fewer parameters.
6. DeepMind releases a major research breakthrough in the physical sciences.
7. The JAX framework grows from 1% to 5% of monthly repos created as measured by PapersWithCode.
#stateofai | 183
Introduction | Research | Talent | Industry | Politics | Predictions
8. A new AGI-focused research company is formed with significant backing and a roadmap that’s focused on a
sector vertical (e.g. developer tools, life science).
Section 6: Conclusion
stateof.ai 2021
#stateofai | 184
Introduction | Research | Talent | Industry | Politics | Predictions
Thanks!
Congratulations on making it to the end of the State of AI Report 2021! Thanks for reading.
In this report, we set out to capture a snapshot of the exponential progress in the field of artificial intelligence, with a focus on developments since last year’s issue that was published on 1st October 2020. We believe that AI will be a force multiplier on technological progress in our world, and that wider understanding of the field is critical if we are to navigate such a huge transition.
We set out to compile a snapshot of all the things that caught our attention in the last year across the range of AI research, talent, industry and the emerging politics of AI.
We would appreciate any and all feedback on how we could improve this Report further, as well as contribution suggestions for next year’s edition.
Thanks again for reading!
Nathan Benaich (@nathanbenaich) and Ian Hogarth (@soundboy)
stateof.ai 2021
#stateofai | 185
Introduction | Research | Talent | Industry | Politics | Predictions
The authors declare a number of conflicts of interest as a result of being investors and/or advisors, personally or via funds, in a number of private and public companies whose work is cited in this report.
Ian is an investor in: Anthropic, ClipDrop, Faculty AI, LabGenius.
Conflicts of interest
stateof.ai 2021
#stateofai | 186
Introduction | Research | Talent | Industry | Politics | Predictions
About the authors
Nathan is the General Partner of Air Street Capital, a venture capital firm investing in AI-first technology and life science companies. He founded RAAIS and London.AI (AI community for industry and research), the RAAIS Foundation (funding open-source AI projects), and Spinout.fyi (improving university spinout creation). He studied biology at Williams College and earned a PhD from Cambridge in cancer research.
Nathan Benaich
Ian Hogarth
Ian is an angel investor in 100+ startups. He is a Visiting Professor at UCL working with Professor Mariana Mazzucato. Ian was co-founder and CEO of Songkick, the concert service. He studied engineering at Cambridge where his Masters project was a computer vision system to classify breast cancer biopsy images. He is the Chair of Phasecraft, a quantum software company.
stateof.ai 2021
Introduction | Research | Talent | Industry | Politics | Predictions
#stateofai | 2
State of AI Report
October 12, 2021
#stateofai
stateof.ai
Ian Hogarth
Nathan Benaich