Deep Neural Networks and Brain Alignment: Brain Encoding and Decoding
Subba Reddy Oota1, Manish Gupta2,3, Raju S. Bapi2, Mariya Toneva4
1Inria Bordeaux, France; 2IIIT Hyderabad, India; 3Microsoft, India; 4MPI for Software Systems, Germany
subba-reddy.oota@inria.fr, gmanish@microsoft.com, raju.bapi@iiit.ac.in, mtoneva@mpi-sws.org
Agenda
IJCAI 2023: DL for Brain Encoding and Decoding
2
Agenda
IJCAI 2023: DL for Brain Encoding and Decoding
3
Agenda
IJCAI 2023: DL for Brain Encoding and Decoding
4
Agenda
IJCAI 2023: DL for Brain Encoding and Decoding
5
Neuroscience
IJCAI 2023: DL for Brain Encoding and Decoding
6
Brain encoding and decoding in cognitive neuroscience
Ivanova, Anna A., Martin Schrimpf, Stefano Anzellotti, Noga Zaslavsky, Evelina Fedorenko, and Leyla Isik. "Is it that simple? Linear mapping models in cognitive neuroscience." bioRxiv (2021).
IJCAI 2023: DL for Brain Encoding and Decoding
7
Brain encoding and decoding
IJCAI 2023: DL for Brain Encoding and Decoding
8
Techniques for studying the brain function
Vogel, Jörn, Sami Haddadin, Beata Jarosiewicz, John D. Simeral, Daniel Bacher, Leigh R. Hochberg, John P. Donoghue, and Patrick van der Smagt. "An assistive decision-and-control architecture for force-sensitive hand–arm systems driven by human–machine interfaces." The International Journal of Robotics Research 34, no. 6 (2015): 763-780.
IJCAI 2023: DL for Brain Encoding and Decoding
9
Single Micro-Electrode (ME), Micro-Electrode array (MEA), Electro-Cortico Graphy (ECoG), Positron emission tomography (PET), functional MRI (fMRI), Magneto-encephalography (MEG), Electro-encephalography (EEG), Near-Infrared Spectroscopy (NIRS)
fMRI
IJCAI 2023: DL for Brain Encoding and Decoding
10
An fMRI image with yellow areas showing increased activity compared with a control condition
Computational Cognitive Science Research goals
Ivanova, Anna A., Martin Schrimpf, Stefano Anzellotti, Noga Zaslavsky, Evelina Fedorenko, and Leyla Isik. "Is it that simple? Linear mapping models in cognitive neuroscience." bioRxiv (2021).
IJCAI 2023: DL for Brain Encoding and Decoding
11
Computational Cognitive Science Research goals
Ivanova, Anna A., Martin Schrimpf, Stefano Anzellotti, Noga Zaslavsky, Evelina Fedorenko, and Leyla Isik. "Is it that simple? Linear mapping models in cognitive neuroscience." bioRxiv (2021).
IJCAI 2023: DL for Brain Encoding and Decoding
12
Agenda
IJCAI 2023: DL for Brain Encoding and Decoding
13
Types of stimuli and popular datasets
IJCAI 2023: DL for Brain Encoding and Decoding
14
Forms of stimulus presentation and data collection
IJCAI 2023: DL for Brain Encoding and Decoding
15
Text Stimulus Datasets
IJCAI 2023: DL for Brain Encoding and Decoding
16
Dataset | Type | Language | Stimulus | #Subjects | Paradigm | Size | Task |
Wehbe et al., 2014 | fMRI | English | Chapter 9 of Harry Potter and the Sorcerer's Stone | 9 | Reading stories | 5000 word chapter was presented in 45 minutes. | Story understanding |
Handjaras et al., 2016 | fMRI | Italian | Verbal, pictorial or auditory presentation of 40 concrete nouns | 20 | Reading, viewing or listening | 40 nouns * 4 times. | Property Generation |
Anderson et al., 2017 | fMRI | Italian | 70 concrete and abstract nouns from law/music. | 7 | Reading | 70 nouns * 5 times. | Imagine a situation that they personally associate with the noun |
Zurich Cognitive Language Processing Corpus (ZuCo): Hollenstein et al., 2018 | EEG and eye-tracking | English | Sentences from movie reviews or Wikipedia | 12 | Reading natural sentences | 21,629 words in 1107 sentences and 154,173 fixations | Rate movie quality, answer control questions, check for existence of a relation |
Anderson et al., 2019 | fMRI | English | 240 active voice sentences describing everyday situations | 14 | Reading | 240 sentences seen 12 times (by 10 subjects) and 6 times (by 4 subjects) | Passive reading |
BCCWJ-EEG: Oseki and Asahara, 2020 | EEG | Japanese | 20 newspaper articles | 40 | Reading | 1 time reading for ~30-40 minutes | Passive reading |
Deniz et al., 2019 | fMRI | English | Subset of Moth Radio Hour. 11 stories | 9 | Reading | 11 10- to 15 min stories presented twice word by word | Passive reading and Listening |
Data for concrete nouns from sighted/blind subjects
Handjaras, Giacomo, Emiliano Ricciardi, Andrea Leo, Alessandro Lenci, Luca Cecchetti, Mirco Cosottini, Giovanna Marotta, and Pietro Pietrini. "How concepts are encoded in the human brain: a modality independent, category-based cortical organization of semantic knowledge." Neuroimage 135 (2016): 232-242.
IJCAI 2023: DL for Brain Encoding and Decoding
17
70 - Italian word stimuli fMRI data
Anderson, Andrew J., Douwe Kiela, Stephen Clark, and Massimo Poesio. "Visually grounded and textual semantic models differentially decode brain activity associated with concrete and abstract nouns." Transactions of the Association for Computational Linguistics 5 (2017): 17-30.
IJCAI 2023: DL for Brain Encoding and Decoding
18
Zurich Cognitive Language Processing Corpus (ZuCo)
Hollenstein, Nora, Jonathan Rotsztejn, Marius Troendle, Andreas Pedroni, Ce Zhang, and Nicolas Langer. "ZuCo, a simultaneous EEG and eye-tracking resource for natural sentence reading." Scientific data 5, no. 1 (2018): 1-13.
IJCAI 2023: DL for Brain Encoding and Decoding
19
Visual Stimulus Datasets
IJCAI 2023: DL for Brain Encoding and Decoding
20
Dataset | Type | Stimulus | #S | Paradigm | Size | Task |
Thirion et al., 2006 | fMRI | Rotating wedges, expanding/contracting rings, rotating Gabor filters, grid | 9 | Viewing visual patterns | Wedges/rings for 8 times, 36 Gabor filters for 4 times, grid 36 times | Passive viewing, imagine one of the 6 domino stimuli when prompted to. |
Vim-1: Kay et al., 2008 | fMRI | Sequences of natural photos | 2 | Viewing natural images | Each subject viewed 1750 (Stage 1)+ 120 (Stage 2) novel natural images | Passive viewing |
Horikawa et al., 2017 | fMRI | Object images | 5 | Viewing and Reading | Each subject: (1) Image presentation: 1,200 images from 150 object categories and 50 images from 50 object categories; (2) Imagery: 10 times. | One-back repetition detection task, imagine object images pertaining to the category |
BOLD5000: Chang et al., 2019 | fMRI | 5254 images depicting real-world scenes | 4 | Viewing natural images | ∼20 hours of MRI scans per each of four participants | Passive viewing |
Algonauts: Cichy et al., 2019 | fMRI (EVC and IT)/MEG (early and late in time) | Object images | 15 | Viewing object images | 92 silhouette object images and 118 images of objects on natural background | Passive viewing |
Natural Scenes Dataset: Allen et al., 2022 | fMRI | 73000 natural scenes | 8 | Viewing natural scenes | ~73000 distinct natural scene images from MSCOCO. | Passive viewing |
THINGS: Hebart et al., 2023 | fMRI/EEG | 31188 natural images across 1,854 object concepts. | 8 | Viewing natural images | fMRI: 3 Participants. 8,740 unique images. 720 objects. MEG: 4 Participants. 22,448 unique images. 1,854 objects | oddball detection task (synthetic image). |
Visual Binary Patterns
Thirion, Bertrand, Edouard Duchesnay, Edward Hubbard, Jessica Dubois, Jean-Baptiste Poline, Denis Lebihan, and Stanislas Dehaene. "Inverse retinotopy: inferring the visual content of images from brain activation patterns." Neuroimage 33, no. 4 (2006): 1104-1116.
IJCAI 2023: DL for Brain Encoding and Decoding
21
Seen and imagined objects
Horikawa, Tomoyasu, and Yukiyasu Kamitani. "Generic decoding of seen and imagined objects using hierarchical visual features." Nature communications 8, no. 1 (2017): 1-15.
IJCAI 2023: DL for Brain Encoding and Decoding
22
BOLD5000
Chang, Nadine, John A. Pyles, Austin Marcus, Abhinav Gupta, Michael J. Tarr, and Elissa M. Aminoff. "BOLD5000, a public fMRI dataset while viewing 5000 visual images." Scientific data 6, no. 1 (2019): 1-18.
IJCAI 2023: DL for Brain Encoding and Decoding
23
Algonauts
Cichy, Radoslaw Martin, Gemma Roig, Alex Andonian, Kshitij Dwivedi, Benjamin Lahner, Alex Lascelles, Yalda Mohsenzadeh, Kandan Ramakrishnan, and Aude Oliva. "The algonauts project: A platform for communication between the sciences of biological and artificial intelligence." arXiv preprint arXiv:1905.05675 (2019).
IJCAI 2023: DL for Brain Encoding and Decoding
24
Training and Testing Material.
Audio Stimulus Datasets
IJCAI 2023: DL for Brain Encoding and Decoding
25
Dataset | Type | Language | Stimulus | #S | Paradigm | Size | Task |
Handjaras et al., 2016 | fMRI | Italian | Verbal, pictorial or auditory presentation of 40 concrete nouns | 20 | Reading, viewing or listening | 40 nouns * 4 times. | Property Generation |
Huth et al., 2016 | fMRI | English | Eleven 10-minute stories | 7 | Listening | 2 hours of stories from The Moth Radio Hour | Passive Listening |
Brennan and Hale, 2019 | EEG | English | Chapter one of Alice’s Adventures in Wonderland as read by Kristen McQuillan | 33 | Listening | 2,129 words in 84 sentences. The entire experimental session lasted 1–1.5 h (including QA). | 8 MCQ Question answering concerning the contents of the story |
Anderson et al., 2020 | fMRI | English | One of 20 scenario names | 26 | Listening scenario name | 20 scenario prompts displayed 5 times. | Imagine themselves personally experiencing common scenarios |
Narratives: Nastase et al., 2021 | fMRI | English | 27 diverse naturalistic spoken stories | 345 | Listening | 891 functional scans, totaling ~4.6 hours of unique stimuli (~43,000 words) | Passive Listening |
Natural Stories: Zhang et al., 2020 | fMRI | English | Moth-Radio-Hour naturalistic spoken stories | 19 | Listening | 5 h 33 m (repeated twice). Each story is 6 m 48 s avg or 2492 words. | Passive Listening |
The Little Prince: Li et al., 2021 | fMRI | English, Chinese, French | Audiobook | 112 | Listening | English audiobook is 94 minutes long. Chinese: 99min. French: 97 min. | Passive Listening. 4 quiz questions. |
MEG-MASC: Gwilliams et al., 2022 | MEG | English | 4 English fictional stories: Cable spool boy, LW1, Black willow, Easy money. | 27 | Listening | Two hours of naturalistic stories. 208 MEG sensors. | Passive Listening |
Imagining common scenarios
Anderson, Andrew James, Kelsey McDermott, Brian Rooks, Kathi L. Heffner, David Dodell-Feder, and Feng V. Lin. "Decoding individual identity from brain activity elicited in imagining common experiences." Nature communications 11, no. 1 (2020): 1-14.
IJCAI 2023: DL for Brain Encoding and Decoding
26
Narratives
Nastase, Samuel A., Yun-Fei Liu, Hanna Hillman, Asieh Zadbood, Liat Hasenfratz, Neggin Keshavarzian, Janice Chen et al. "The “Narratives” fMRI dataset for evaluating models of naturalistic language comprehension." Scientific data 8, no. 1 (2021): 1-22.
IJCAI 2023: DL for Brain Encoding and Decoding
27
Video Stimulus Datasets
IJCAI 2023: DL for Brain Encoding and Decoding
28
Dataset | Type | Language | Stimulus | #Subjects | Paradigm | Size | Task |
BBC’s Doctor Who: Seeliger et al., 2019 | fMRI | English | Spatiotemporal visual and auditory naturalistic stimuli (30 episodes of BBC’s Doctor Who) | 1 | Viewing episode videos | 120.830 whole-brain volumes (approx. 23 h) of single-presentation data, and 1.178 volumes (11 min) of repeated narrative short episodes (22 repetitions) | Passive viewing |
Japanese Ads: Nishida et al., 2020 | fMRI | Japanese | 368 web and 2452 TV Japanese ad movies (15-30s) | 40 and 28 for web and TV ads. 16 were overlapped | Viewing Ads | 7200 train and 1200 test fMRIs for web; fMRIs from 420 ads. | Passive viewing |
Pippi Langkous: Berezutskaya et al., 2020 | ECoG | The movie was originally in Swedish but dubbed in Dutch | 30 s excerpts of a feature film (in total, 6.5 min long), edited together for a coherent story | 37 patients | Viewing | 6.5 min movie. | Passive viewing |
Algonauts: Cichy et al., 2021 | fMRI | English | 1000 short video clips | 10 | Viewing video clips | 1000 short video clips (3 sec each) | Passive viewing |
Natural Short Clips: Huth et al., 2022 | fMRI | English | Natural short movie clips | 5 | Watching natural short movie clips | 3870 responses per subject. | Passive viewing |
Japanese Ads
Nishida, Satoshi, Yusuke Nakano, Antoine Blanc, Naoya Maeda, Masataka Kado, and Shinji Nishimoto. "Brain-mediated transfer learning of convolutional neural networks." In Proceedings of the AAAI Conference on Artificial Intelligence, vol. 34, no. 04, pp. 5281-5288. 2020.
IJCAI 2023: DL for Brain Encoding and Decoding
29
Algonauts 2021
Cichy, Radoslaw Martin, Kshitij Dwivedi, Benjamin Lahner, Alex Lascelles, Polina Iamshchinina, M. Graumann, A. Andonian et al. "The Algonauts Project 2021 Challenge: How the Human Brain Makes Sense of a World in Motion." arXiv preprint arXiv:2104.13714 (2021).
IJCAI 2023: DL for Brain Encoding and Decoding
30
Other Multimodal Stimulus Datasets
IJCAI 2023: DL for Brain Encoding and Decoding
31
Dataset | Type | Language | Stimulus | #Subjects | Paradigm | Size | Task |
Mitchell et al., 2008 | fMRI | English | 60 different word-picture pairs from 12 categories. | 9 | Viewing word-picture pairs | 60 different word-picture pairs presented six times each | Passive viewing |
Sudre et al., 2012 | MEG | English | 60 concrete nouns along with line drawings | 9 | Reading | 60 stimuli × 20 questions = 1200 examples | Question answering |
Zinszer et al., 2017 | fNIRS | English | 8 concrete nouns (audiovisual word and picture stimuli): bunny, bear, kitty, dog, mouth, foot, hand, and nose | 24 | Viewing and listening | 12 blocks with the 8 stimuli per subject. | Passive viewing and listening |
Pereira et al., 2018 | fMRI | English | 180 Words with Picture, Sentences, word clouds; 96 text passages; 72 passages | 16 | Viewing WP, sentences or word clouds | 180 WP, S and WC per subject; 96+72 passages shown 3 times | Passive viewing |
Cao et al., 2021 | fNIRS | Chinese | 50 concrete nouns from 10 semantic categories | 7 | Viewing and listening | Each stimulus is presented 7 times. | Passive viewing and listening |
Courtois Neuromod | fMRI | full-length movies and TV show | 6 | Viewing and Listening | ~100 hours of data per participant | Passive viewing | |
Concrete nouns with line drawings
IJCAI 2023: DL for Brain Encoding and Decoding
32
Word+Picture, Sentences, Word Clouds, Passages
Pereira, Francisco, Bin Lou, Brianna Pritchett, Samuel Ritter, Samuel J. Gershman, Nancy Kanwisher, Matthew Botvinick, and Evelina Fedorenko. "Toward a universal decoder of linguistic meaning from brain activation." Nature communications 9, no. 1 (2018): 1-13.
IJCAI 2023: DL for Brain Encoding and Decoding
33
fNIRS with audio-visual stimuli
IJCAI 2023: DL for Brain Encoding and Decoding
34
Text Stimulus Datasets References
IJCAI 2023: DL for Brain Encoding and Decoding
35
Visual Stimulus Datasets References
IJCAI 2023: DL for Brain Encoding and Decoding
36
Audio Stimulus Datasets References
IJCAI 2023: DL for Brain Encoding and Decoding
37
Video Stimulus Datasets References
IJCAI 2023: DL for Brain Encoding and Decoding
38
Multimodal Stimulus Datasets References
IJCAI 2023: DL for Brain Encoding and Decoding
39
Agenda
IJCAI 2023: DL for Brain Encoding and Decoding
40
Agenda
IJCAI 2023: DL for Brain Encoding and Decoding
41
Stimulus Representations
IJCAI 2023: DL for Brain Encoding and Decoding
42
Agenda
IJCAI 2023: DL for Brain Encoding and Decoding
43
Text Stimulus Representations
IJCAI 2023: DL for Brain Encoding and Decoding
44
Basic NLP Representations for Word Stimuli
IJCAI 2023: DL for Brain Encoding and Decoding
45
Basic NLP Representations for Word Stimuli
Wehbe, Leila, Brian Murphy, Partha Talukdar, Alona Fyshe, Aaditya Ramdas, and Tom Mitchell. "Simultaneously uncovering the patterns of brain regions involved in different story reading subprocesses." PloS one 9, no. 11 (2014): e112575.
Wang, Jing, Vladimir L. Cherkassky, and Marcel Adam Just. "Predicting the brain activation pattern associated with the propositional content of a sentence: modeling neural representations of events and states." Human brain mapping 38, no. 10 (2017): 4865-4881.
IJCAI 2023: DL for Brain Encoding and Decoding
46
Discourse features (for Harry Potter dataset)
Wehbe, Leila, Brian Murphy, Partha Talukdar, Alona Fyshe, Aaditya Ramdas, and Tom Mitchell. "Simultaneously uncovering the patterns of brain regions involved in different story reading subprocesses." PloS one 9, no. 11 (2014): e112575.
Wang, Jing, Vladimir L. Cherkassky, and Marcel Adam Just. "Predicting the brain activation pattern associated with the propositional content of a sentence: modeling neural representations of events and states." Human brain mapping 38, no. 10 (2017): 4865-4881.
IJCAI 2023: DL for Brain Encoding and Decoding
47
DL Representations: Using embeddings for word stimuli
IJCAI 2023: DL for Brain Encoding and Decoding
48
DL Representations: Using longer context for word stimuli
Toneva, Mariya, and Leila Wehbe. "Interpreting and improving natural-language processing (in machines) with natural language-processing (in the brain)." Advances in Neural Information Processing Systems 32 (2019).
Jain, Shailee, and Alexander Huth. "Incorporating context into language encoding models for fMRI." Advances in neural information processing systems 31 (2018).
Jat, Sharmistha, Hao Tang, Partha Talukdar, and Tom Mitchell. "Relating simple sentence representations in deep neural networks and the brain." arXiv preprint arXiv:1906.11861 (2019).
IJCAI 2023: DL for Brain Encoding and Decoding
49
DL Representations: Using sentence embeddings
Toneva, Mariya, and Leila Wehbe. "Interpreting and improving natural-language processing (in machines) with natural language-processing (in the brain)." Advances in Neural Information Processing Systems 32 (2019).
Sun, Jingyuan, Shaonan Wang, Jiajun Zhang, and Chengqing Zong. "Towards sentence-level brain decoding with distributed representations." In Proceedings of the AAAI Conference on Artificial Intelligence, vol. 33, no. 01, pp. 7047-7054. 2019.
IJCAI 2023: DL for Brain Encoding and Decoding
50
DL Representations: Transformer-based methods for text stimuli (Layer #, context length, architecture)
Toneva, Mariya, and Leila Wehbe. "Interpreting and improving natural-language processing (in machines) with natural language-processing (in the brain)." Advances in Neural Information Processing Systems 32 (2019).
Sun, Jingyuan, Shaonan Wang, Jiajun Zhang, and Chengqing Zong. "Neural encoding and decoding with distributed sentence representations." IEEE Transactions on Neural Networks and Learning Systems 32, no. 2 (2020): 589-603.
IJCAI 2023: DL for Brain Encoding and Decoding
51
Transformer-XL is the only model that continues to increase performance as the context length is increased. In all networks, the middle layers perform the best for contexts longer than 15 words. The deepest layers across all networks show a sharp increase in performance at short-range context (fewer than 10 words), followed by a decrease in performance. [Toneva and Wehbe, 2019]
DL Representations: Transformer-based methods for text stimuli (NLP task finetuning and scrambled LM)
Gauthier, Jon, and Roger Levy. "Linking artificial and human neural representations of language." arXiv preprint arXiv:1910.01244 (2019).
IJCAI 2023: DL for Brain Encoding and Decoding
52
DL Representations: Transformer-based methods for text stimuli (NLP task finetuning)
Oota, Subba Reddy, Jashn Arora, Veeral Agarwal, Mounika Marreddy, Manish Gupta, and Bapi Raju Surampudi. "Neural Language Taskonomy: Which NLP Tasks are the most Predictive of fMRI Brain Activity?." arXiv preprint arXiv:2205.01404 (2022).
IJCAI 2023: DL for Brain Encoding and Decoding
53
Tasks
Paraphrase, Summarization, Question Answering, Sentiment Analysis, NER, Word Sense Disambiguation, Natural Language Inference, Semantic Role Labeling, Coreference Resolution, Shallow Syntax Parsing
Pereira dataset: CR, NER, and SS perform the best.
Dendrogram constructed using similarity on representations from task-specific Transformer encoder models with stimuli from the dataset passed as input.
DL Representations: Transformer-based methods for text stimuli (Multi-task setup)
Schwartz, Dan, Mariya Toneva, and Leila Wehbe. "Inducing brain-relevant bias in natural language processing models." Advances in neural information processing systems 32 (2019).
IJCAI 2023: DL for Brain Encoding and Decoding
54
DL Representations: Comparing Transformers and extracting syntax vs semantics
Caucheteux, Charlotte, Alexandre Gramfort, and Jean-Remi King. "Disentangling syntax and semantics in the brain with deep networks." In International Conference on Machine Learning, pp. 1336-1348. PMLR, 2021.
IJCAI 2023: DL for Brain Encoding and Decoding
55
Experiential attributes model for text stimuli
Anderson, Andrew James, Jeffrey R. Binder, Leonardo Fernandino, Colin J. Humphries, Lisa L. Conant, Rajeev DS Raizada, Feng Lin, and Edmund C. Lalor. "An integrated neural decoder of linguistic and experiential meaning." Journal of Neuroscience 39, no. 45 (2019): 8969-8987.
Anderson, Andrew James, Jeffrey R. Binder, Leonardo Fernandino, Colin J. Humphries, Lisa L. Conant, Mario Aguilar, Xixi Wang, Donias Doko, and Rajeev DS Raizada. "Predicting neural activity patterns associated with sentences using a neurobiologically motivated model of semantic representation." Cerebral Cortex 27, no. 9 (2017): 4379-4395.
Anderson, Andrew James, Kelsey McDermott, Brian Rooks, Kathi L. Heffner, David Dodell-Feder, and Feng V. Lin. "Decoding individual identity from brain activity elicited in imagining common experiences." Nature communications 11, no. 1 (2020): 1-14.
IJCAI 2023: DL for Brain Encoding and Decoding
56
Binary attribute representations
Handjaras, Giacomo, Emiliano Ricciardi, Andrea Leo, Alessandro Lenci, Luca Cecchetti, Mirco Cosottini, Giovanna Marotta, and Pietro Pietrini. "How concepts are encoded in the human brain: a modality independent, category-based cortical organization of semantic knowledge." Neuroimage 135 (2016): 232-242.
Wang, Jing, Vladimir L. Cherkassky, and Marcel Adam Just. "Predicting the brain activation pattern associated with the propositional content of a sentence: modeling neural representations of events and states." Human brain mapping 38, no. 10 (2017): 4865-4881.
IJCAI 2023: DL for Brain Encoding and Decoding
57
Agenda
IJCAI 2023: DL for Brain Encoding and Decoding
58
Visual Stimuli
IJCAI 2023: DL for Brain Encoding and Decoding
59
Visual Stimuli: Gabor wavelet pyramid
Kay, Kendrick N., Thomas Naselaris, Ryan J. Prenger, and Jack L. Gallant. "Identifying natural images from human brain activity." Nature 452, no. 7185 (2008): 352-355.
IJCAI 2023: DL for Brain Encoding and Decoding
60
a, Spatial frequency and position. Wavelets occur at five spatial frequencies. This panel depicts one wavelet at each of the first five spatial frequencies. At each spatial frequency f cycles/field-of-view (FOV), wavelets are positioned on an f × f grid, as indicated by the translucent lines.
b, Orientation and phase. At each grid position, wavelets occur at eight orientations and two phases. This panel depicts a complete set of wavelets for a single grid position. Dashed lines indicate the bounds of the mask associated with each wavelet.
Gabor wavelet pyramid model. Each image is projected onto the individual Gabor wavelets comprising the Gabor wavelet pyramid. Gabor wavelets differ in size, position, orientation, spatial frequency, and phase. The projections for each quadrature pair of wavelets are squared, summed, and square-rooted, yielding a measure of contrast energy. The contrast energies for different quadrature wavelet pairs are weighted and then summed. Finally, a DC offset is added. The weights are determined by gradient descent with early stopping.
Visual Stimuli: HMAX model
Riesenhuber, Maximilian, and Tomaso Poggio. "Hierarchical models of object recognition in cortex." Nature neuroscience 2, no. 11 (1999): 1019-1025.
Horikawa, Tomoyasu, and Yukiyasu Kamitani. "Generic decoding of seen and imagined objects using hierarchical visual features." Nature communications 8, no. 1 (2017): 1-15.
IJCAI 2023: DL for Brain Encoding and Decoding
61
Visual Stimuli: Convolutional Neural Networks (CNNs)
IJCAI 2023: DL for Brain Encoding and Decoding
62
Visual Stimuli: Object Recognition with Word embeddings
Berezutskaya, Julia, Zachary V. Freudenburg, Luca Ambrogioni, Umut Güçlü, Marcel AJ van Gerven, and Nick F. Ramsey. "Cortical network responses map onto data-driven features that capture visual semantics of movie fragments." Scientific reports 10, no. 1 (2020): 1-21.
IJCAI 2023: DL for Brain Encoding and Decoding
63
Visual Stimuli: Semi-supervised CNNs
Beliy, Roman, Guy Gaziv, Assaf Hoogi, Francesca Strappini, Tal Golan, and Michal Irani. "From voxels to pixels and back: Self-supervision in natural-image reconstruction from fMRI." Advances in Neural Information Processing Systems 32 (2019).
IJCAI 2023: DL for Brain Encoding and Decoding
64
Training phases & Architecture. (a) The first training phase: Supervised training of the Encoder with {Image, fMRI} pairs. (b) Second phase: Training the Decoder simultaneously with 3 types of data: {Image, fMRI} pairs (supervised examples), unlabeled natural images (self-supervision), and unlabeled test-fMRI (self-supervision). Note that the test-images are never used for training. The pretrained Encoder from the first training phase is kept fixed in the second phase. (c) Encoder and Decoder architectures. BN, US, and ReLU stand for batch normalization, up-sampling, and rectified linear unit, respectively.
Visual Stimuli: Convolutional LSTM Autoencoder
StepEncog, a convolutional LSTM autoencoder model trained on fMRI voxels.
Oota, Subba Reddy, Vijay Rowtula, Manish Gupta, and Raju S. Bapi. "StepEncog: A convolutional LSTM autoencoder for near-perfect fMRI encoding." In 2019 International Joint Conference on Neural Networks (IJCNN), pp. 1-8. IEEE, 2019.
IJCAI 2023: DL for Brain Encoding and Decoding
65
Latent Diffusion Models
Takagi, Yu, and Shinji Nishimoto. "High-resolution image reconstruction with latent diffusion models from human brain activity." In CVPR, pp. 14453-14463. 2023.
IJCAI 2023: DL for Brain Encoding and Decoding
66
Agenda
IJCAI 2023: DL for Brain Encoding and Decoding
67
Audio Stimuli
Huth, Alexander G., Wendy A. De Heer, Thomas L. Griffiths, Frédéric E. Theunissen, and Jack L. Gallant. "Natural speech reveals the semantic maps that tile human cerebral cortex." Nature 532, no. 7600 (2016): 453-458.
Nishida, Satoshi, Yusuke Nakano, Antoine Blanc, Naoya Maeda, Masataka Kado, and Shinji Nishimoto. "Brain-mediated transfer learning of convolutional neural networks." In Proceedings of the AAAI Conference on Artificial Intelligence, vol. 34, no. 04, pp. 5281-5288. 2020.
IJCAI 2023: DL for Brain Encoding and Decoding
68
Agenda
IJCAI 2023: DL for Brain Encoding and Decoding
69
Multimodal Stimulus Representations
Wang, Shaonan, Jiajun Zhang, Haiyan Wang, Nan Lin, and Chengqing Zong. "Fine-grained neural decoding with distributed word representations." Information Sciences 507 (2020): 256-272.
IJCAI 2023: DL for Brain Encoding and Decoding
70
Multimodal Stimuli: Visio-linguistic representations
Oota, Subba Reddy, Jashn Arora, Vijay Rowtula, Manish Gupta, and Raju S. Bapi. "Visio-Linguistic Brain Encoding." arXiv preprint arXiv:2204.08261 (2022).
IJCAI 2023: DL for Brain Encoding and Decoding
71
Agenda
IJCAI 2023: DL for Brain Encoding and Decoding
72
References
[1] Nicolas Affolter, Beni Egressy, Damian Pascual, and Roger Wattenhofer. Brain2word: Decoding brain activity for language generation. arXiv preprint arXiv:2009.04765, 2020.
[2] Andrew J Anderson, Douwe Kiela, Stephen Clark, and Massimo Poesio. Visually grounded and textual semantic models differentially decode brain activity associated with concrete and abstract nouns. Transactions of the Association for Computational Linguistics, 5:17–30, 2017.
[3] Andrew James Anderson, Jeffrey R Binder, Leonardo Fernandino, Colin J Humphries, Lisa L Conant, Mario Aguilar, Xixi Wang, Donias Doko, and Rajeev DS Raizada. Predicting neural activity patterns associated with sentences using a neurobiologically motivated model of semantic representation. Cerebral Cortex, 27(9):4379–4395, 2017.
[4] Andrew James Anderson, Jeffrey R Binder, Leonardo Fernandino, Colin J Humphries, Lisa L Conant, Rajeev DS Raizada, Feng Lin, and Edmund C Lalor. An integrated neural decoder of linguistic and experiential meaning. Journal of Neuroscience, 39(45):8969–8987, 2019.
[5] Andrew James Anderson, Kelsey McDermott, Brian Rooks, Kathi L Heffner, David Dodell-Feder, and Feng V Lin. Decoding individual identity from brain activity elicited in imagining common experiences. Nature communications, 11(1):1–14, 2020.
[6] Richard Antonello, Javier Turek, Vy Vo, and Alexander Huth. Low-dimensional structure in the space of language representations is reflected in brain responses. arXiv preprint arXiv:2106.05426, 2021.
[7] Roman Beliy, Guy Gaziv, Assaf Hoogi, Francesca Strappini, Tal Golan, and Michal Irani. From voxels to pixels and back: Self-supervision in naturalimage reconstruction from fmri. arXiv preprint arXiv:1907.02431, 2019.
[8] Julia Berezutskaya, Zachary V Freudenburg, Luca Ambrogioni, Umut Güçlü, Marcel AJ van Gerven, and Nick F Ramsey. Cortical network responses map onto data-driven features that capture visual semantics of movie fragments. Scientific reports, 10(1):1–21, 2020.
[9] Charlotte Caucheteux, Alexandre Gramfort, and Jean-Remi King. Disentangling syntax and semantics in the brain with deep networks. In International Conference on Machine Learning, pages 1336–1348. PMLR, 2021.
[10] Charlotte Caucheteux and Jean-Rémi King. Language processing in brains and deep neural networks: computational convergence and its limits. BioRxiv, 2020.
[11] Joshua S Cetron, Andrew C Connolly, Solomon G Diamond, Vicki V May, James V Haxby, and David JM Kraemer. Decoding individual differences in stem learning from functional mri data. Nature communications, 10(1):1–10, 2019.
IJCAI 2023: DL for Brain Encoding and Decoding
73
References
[12] Nadine Chang, John A Pyles, Austin Marcus, Abhinav Gupta, Michael J Tarr, and Elissa M Aminoff. Bold5000, a public fmri dataset while viewing 5000 visual images. Scientific data, 6(1):1–18, 2019.
[13] Radoslaw Martin Cichy, Kshitij Dwivedi, Benjamin Lahner, Alex Lascelles, Polina Iamshchinina, M Graumann, A Andonian, NAR Murty, K Kay, Gemma Roig, et al. The algonauts project 2021 challenge: How the human brain makes sense of a world in motion. arXiv preprint arXiv:2104.13714, 2021.
[14] Radoslaw Martin Cichy, Gemma Roig, Alex Andonian, Kshitij Dwivedi, Benjamin Lahner, Alex Lascelles, Yalda Mohsenzadeh, Kandan Ramakrishnan, and Aude Oliva. The algonauts project: A platform for communication between the sciences of biological and artificial intelligence. arXiv e-prints, pages arXiv–1905, 2019.
[15] Changde Du, Changying Du, Lijie Huang, and Huiguang He. Conditional generative neural decoding with structured cnn feature prediction. In Proceedings of the AAAI Conference on Artificial Intelligence, volume 34, pages 2629–2636, 2020.
[16] Michael Eickenberg, Alexandre Gramfort, Gaël Varoquaux, and Bertrand Thirion. Seeing it all: Convolutional network layers map the function of the human visual system. NeuroImage, 152:184–194, 2017.
[17] Jack Gallant. Human brain mapping and brain decoding, 2017.
[18] Jon Gauthier and Roger Levy. Linking artificial and human neural representations of language. arXiv preprint arXiv:1910.01244, 2019.
[19] Ian Goodfellow, Yoshua Bengio, and Aaron Courville. Deep learning. MIT press, 2016.
[20] Giacomo Handjaras, Emiliano Ricciardi, Andrea Leo, Alessandro Lenci, Luca Cecchetti, Mirco Cosottini, Giovanna Marotta, and Pietro Pietrini. How concepts are encoded in the human brain: a modality independent, category-based cortical organization of semantic knowledge. Neuroimage, 135:232–242, 2016.
[21] Nora Hollenstein, Antonio de la Torre, Nicolas Langer, and Ce Zhang. Cognival: A framework for cognitive word embedding evaluation. In Proceedings of the 23rd Conference on Computational Natural Language Learning (CoNLL), pages 538–549, 2019.
[22] Nora Hollenstein, Jonathan Rotsztejn, Marius Troendle, Andreas Pedroni, Ce Zhang, and Nicolas Langer. Zuco, a simultaneous eeg and eye-tracking resource for natural sentence reading. Scientific data, 5(1):1–13, 2018.
IJCAI 2023: DL for Brain Encoding and Decoding
74
References
[23] Alexander G Huth, Wendy A De Heer, Thomas L Griffiths, Frédéric E Theunissen, and Jack L Gallant. Natural speech reveals the semantic maps that tile human cerebral cortex. Nature, 532(7600):453–458, 2016.
[24] Shailee Jain and Alexander G Huth. Incorporating context into language encoding models for fmri. In Proceedings of the 32nd International Conference on Neural Information Processing Systems, pages 6629–6638, 2018.
[25] S Jat, H Tang, P Talukdar, and T Mitchel. Relating simple sentence representations in deep neural networks and the brain. In ACL 2019-57th Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference, pages 5137–5154. Association for Computational Linguistics (ACL), 2020.
[26] Marcel Adam Just, Vladimir L Cherkassky, Sandesh Aryal, and Tom M Mitchell. A neurosemantic theory of concrete noun representation based on the underlying brain codes. PloS one, 5(1):e8622, 2010.
[27] Kendrick N Kay, Thomas Naselaris, Ryan J Prenger, and Jack L Gallant. Identifying natural images from human brain activity. Nature, 452(7185):352–355, 2008.
[28] Jonas Kubilius, Martin Schrimpf, Kohitij Kar, Rishi Rajalingham, Ha Hong, Najib Majaj, Elias Issa, Pouya Bashivan, Jonathan Prescott-Roy, Kailyn Schmidt, et al. Brain-like object recognition with high-performing shallow recurrent anns. Advances in Neural Information Processing Systems, 32:12805–12816, 2019.
[29] Tom Mitchell. Neural representations of language meaning, 2014.
[30] Tom M Mitchell, Svetlana V Shinkareva, Andrew Carlson, Kai-Min Chang, Vicente L Malave, Robert A Mason, and Marcel Adam Just. Predicting human brain activity associated with the meanings of nouns. science, 320(5880):1191–1195, 2008.
[31] Thomas Naselaris, Ryan J Prenger, Kendrick N Kay, Michael Oliver, and Jack L Gallant. Bayesian reconstruction of natural images from human brain activity. Neuron, 63(6):902–915, 2009.
[32] Samuel A Nastase, Yun-Fei Liu, Hanna Hillman, Asieh Zadbood, Liat Hasenfratz, Neggin Keshavarzian, Janice Chen, Christopher J Honey, Yaara Yeshurun, Mor Regev, et al. Narratives: fmri data for evaluating models of naturalistic language comprehension. bioRxiv, pages 2020–12, 2021.
[33] Satoshi Nishida, Yusuke Nakano, Antoine Blanc, Naoya Maeda, Masataka Kado, and Shinji Nishimoto. Brain-mediated transfer learning of convolutional neural networks. In Proceedings of the AAAI Conference on Artificial Intelligence, volume 34, pages 5281–5288, 2020.
IJCAI 2023: DL for Brain Encoding and Decoding
75
References
[34] Shinji Nishimoto, An T Vu, Thomas Naselaris, Yuval Benjamini, Bin Yu, and Jack L Gallant. Reconstructing visual experiences from brain activity evoked by natural movies. Current biology, 21(19):1641–1646, 2011.
[35] Subba Reddy Oota, Vijay Rowtula, Manish Gupta, and Raju S Bapi. Stepencog: A convolutional lstm autoencoder for near-perfect fmri encoding. In 2019 International Joint Conference on Neural Networks (IJCNN), pages 1–8. IEEE, 2019.
[36] Francisco Pereira, Matthew Botvinick, and Greg Detre. Using wikipedia to learn semantic feature representations of concrete concepts in neuroimaging experiments. Artificial intelligence, 194:240– 252, 2013.
[37] Francisco Pereira, Bin Lou, Brianna Pritchett, Nancy Kanwisher, Matthew Botvinick, and Evelina Fedorenko. Decoding of generic mental representations from functional mri data using word embeddings. bioRxiv, page 057216, 2016.
[38] Francisco Pereira, Bin Lou, Brianna Pritchett, Samuel Ritter, Samuel J Gershman, Nancy Kanwisher, Matthew Botvinick, and Evelina Fedorenko. Toward a universal decoder of linguistic meaning from brain activation. Nature communications, 9(1):1–13, 2018.
[39] Martin Schrimpf, Idan Blank, Greta Tuckute, Carina Kauf, Eghbal A Hosseini, Nancy Kanwisher, Joshua Tenenbaum, and Evelina Fedorenko. The neural architecture of language: Integrative reverseengineering converges on a model for predictive processing. PNAS, Vol:To appear, 2021.
[40] Martin Schrimpf, Jonas Kubilius, Ha Hong, Najib J Majaj, Rishi Rajalingham, Elias B Issa, Kohitij Kar, Pouya Bashivan, Jonathan Prescott-Roy, Franziska Geiger, et al. Brain-score: Which artificial neural network for object recognition is most brain-like? BioRxiv, page 407007, 2020.
[41] Dan Schwartz, Mariya Toneva, and Leila Wehbe. Inducing brain-relevant bias in natural language processing models. Advances in Neural Information Processing Systems, 32:14123–14133, 2019.
[42] K Seeliger, RP Sommers, Umut Güçlü, Sander E Bosch, and MAJ Van Gerven. A large singleparticipant fmri dataset for probing brain responses to naturalistic stimuli in space and time. bioRxiv, page 687681, 2019.
[43] Vishwajeet Singh, Krishna P. Miyapuram, and Raju S. Bapi. Detection of cognitive states from fmri data using machine learning techniques. In Manuela M. Veloso, editor, IJCAI, pages 587–592, 2007.
[44] Jonathan Smallwood and Jonathan W Schooler. The science of mind wandering: empirically navigating the stream of consciousness. Annual review of psychology, 66:487–518, 2015.
IJCAI 2023: DL for Brain Encoding and Decoding
76
References
[45] Jingyuan Sun, Shaonan Wang, Jiajun Zhang, and Chengqing Zong. Towards sentence-level brain decoding with distributed representations. In Proceedings of the AAAI Conference on Artificial Intelligence, volume 33, pages 7047–7054, 2019.
[46] Jingyuan Sun, Shaonan Wang, Jiajun Zhang, and Chengqing Zong. Neural encoding and decoding with distributed sentence representations. IEEE Transactions on Neural Networks and Learning Systems, 32(2):589–603, 2020.
[47] Bertrand Thirion. Statistical inference in highdimension and application to brain imaging, 2019.
[48] Bertrand Thirion, Edouard Duchesnay, Edward Hubbard, Jessica Dubois, Jean-Baptiste Poline, Denis Lebihan, and Stanislas Dehaene. Inverse retinotopy: inferring the visual content of images from brain activation patterns. Neuroimage, 33(4):1104– 1116, 2006.
[49] Mariya Toneva, Otilia Stretcu, Barnabás Póczos, Leila Wehbe, and Tom M Mitchell. Modeling task effects on meaning representation in the brain via zero-shot meg prediction. Advances in Neural Information Processing Systems, 33, 2020.
[50] Mariya Toneva and Leila Wehbe. Interpreting and improving natural-language processing (in machines) with natural language-processing (in the brain). arXiv preprint arXiv:1905.11833, 2019.
[51] Aria Wang, Michael Tarr, and Leila Wehbe. Neural taskonomy: Inferring the similarity of task-derived representations from brain activity. Advances in Neural Information Processing Systems, 32:15501– 15511, 2019.
[52] Jing Wang, Vladimir L Cherkassky, and Marcel Adam Just. Predicting the brain activation pattern associated with the propositional content of a sentence: Modeling neural representations of events and states. Human brain mapping, 38(10):4865– 4881, 2017.
[53] Shaonan Wang, Jiajun Zhang, Haiyan Wang, Nan Lin, and Chengqing Zong. Fine-grained neural decoding with distributed word representations. Information Sciences, 507:256–272, 2020.
[54] Leila Wehbe, Brian Murphy, Partha Talukdar, Alona Fyshe, Aaditya Ramdas, and Tom Mitchell. Simultaneously uncovering the patterns of brain regions involved in different story reading subprocesses. in press, 2014.
[55] Daniel LK Yamins, Ha Hong, Charles F Cadieu, Ethan A Solomon, Darren Seibert, and James J DiCarlo. Performance-optimized hierarchical models predict neural responses in higher visual cortex. Proceedings of the national academy of sciences, 111(23):8619–8624, 2014.
[56] Boyle, Julie A., Basile Pinsard, A. Boukhdhir, S. Belleville, S. Bram-batti, J. Chen, J. Cohen-Adad et al. "The Courtois project on neuronal modelling: 2020 data release." In Presented at the 26th annual meeting of the Organization for Human Brain Mapping. 2020.
IJCAI 2023: DL for Brain Encoding and Decoding
77
Deep Neural Networks and Brain Alignment: Brain Encoding and Decoding
Subba Reddy Oota1, Manish Gupta2,3, Raju S. Bapi2, Mariya Toneva4
1Inria Bordeaux, France; 2IIIT Hyderabad, India; 3Microsoft, India; 4MPI for Software Systems, Germany
subba-reddy.oota@inria.fr, gmanish@microsoft.com, raju.bapi@iiit.ac.in, mtoneva@mpi-sws.org
Agenda
IJCAI 2023: DL for Brain Encoding and Decoding
79
Agenda
IJCAI 2023: DL for Brain Encoding and Decoding
80
Outline
IJCAI 2023: DL for Brain Encoding and Decoding
81
Encoding vs. Decoding
IJCAI 2023: DL for Brain Encoding and Decoding
82
Encoding
Decoding
Stimulus
Representation
Stimulus
Representation
fMRI
fMRI
What is Brain Decoding?
IJCAI 2023: DL for Brain Encoding and Decoding
83
Visual Task
Language Task
Smith et al., 2011, Wang et al. 2019
Linguistic Decoding
IJCAI 2023: DL for Brain Encoding and Decoding
84
input
output
Zou et al., 2022
Outline
IJCAI 2023: DL for Brain Encoding and Decoding
85
Linear Decoder Models
IJCAI 2023: DL for Brain Encoding and Decoding
86
Ridge / Logistic Regression
Stimulus Representation
Stimulus Classification
Horikawa et al. 2018
Non-Linear Decoder
IJCAI 2023: DL for Brain Encoding and Decoding
87
Vu et al. 2018
Deep CNNs
Evaluating Decoding Models: Pairwise Accuracy
IJCAIi-2023: DL for Brain Encoding and Decoding
88
ith Concept Word
jth Concept Word
Periera et al. 2018
Evaluating Decoding Models: Rank Accuracy
IJCAI-2023: DL for Brain Encoding and Decoding
89
Y1
Y2
Yn
Periera et al. 2018
ith Concept Word
Correaltion
rank = rsort(corr_scores).index(correlation)
All the correlation scores in descending order
Representational Similarity Matrix (RSM)
IJCAI-2023: DL for Brain Encoding and Decoding
90
corr(Scene1, Scen2)
Moussa et al. 2012
Representational Dissimilarity Matrix (RDM)
IJCAI-2023: DL for Brain Encoding and Decoding
91
Hamed et al. 2014
Representation Similarity Analysis
IJCAI-2023: DL for Brain Encoding and Decoding
92
Kriegeskorte et al. 2018
DSM = RDM
Outline
IJCAI 2023: DL for Brain Encoding and Decoding
93
Linguistic Brain Decoding
IJCAI 2023: DL for Brain Encoding and Decoding
94
Periera et al. 2018, Gauthier et al. 2019, Huth et al. 2023, Oota et al. 2022
Classical Decoders
IJCAI 2023: DL for Brain Encoding and Decoding
95
Mitchell et al. 2008
Toward a universal decoder
IJCAI 2023: DL for Brain Encoding and Decoding
96
Pereira et al. 2018
GloVE
Pennington et al. 2014
Dataset Details (Experiment-1)
IJCAI 2023: DL for Brain Encoding and Decoding
97
Concept + Sentence View
Concept Word
Concept + Picture View
Concept + Wordcloud View
Periera et al. 2018
Dataset Details (Experiment-1)
IJCAI 2023: DL for Brain Encoding and Decoding
98
Periera et al. 2018
Dataset Details (Experiments 2 and 3)
IJCAI 2023: DL for Brain Encoding and Decoding
99
Topic
Concept
Topic
Periera et al. 2018
Informative Voxel Selection
Cogsci-2022: DL for Brain Encoding and Decoding
100
Voxel + 26 neighbors in 3D
Input
Ridge Regression
Output
Stimulus:
Apartment
Present
GloVE
Present
Stimulus:
Apartment
Pearson Correlation (R) = Corr(Y, W(X))
Correlation across feature dimensions
V1 – R1
V2 – R2
….
Vn – R3
Select 5000 voxels based on top-5000 correlation scores
3D Image
X
Y
W
Pairwise and Rankwise Results
IJCAI 2023: DL for Brain Encoding and Decoding
101
Periera et al. 2018
Decoder built from Expt 1 could distinguish sentences at all levels of granularity
Universal Decoder!
Distribution of Informative Voxels
IJCAI 2023: DL for Brain Encoding and Decoding
102
Periera et al. 2018
Brain activation patterns consistent across 16 Ss
5000 informative voxels are roughly evenly distributed among the four networks
Overall, LN contains a relatively higher proportion of informative voxels, compared to its size!
Insights
IJCAI 2023: DL for Brain Encoding and Decoding
103
Periera et al. 2018
Linguistic Brain Decoding
IJCAI 2023: DL for Brain Encoding and Decoding
104
Periera et al. 2018, Gauthier et al. 2019, Huth et al. 2023, Oota et al. 2022
Linking artificial and human neural representations of language
IJCAI 2023: DL for Brain Encoding and Decoding
105
Ridge Regression
Gauthier et al. 2019
Cogsci-2022: DL for Brain Encoding and Decoding
106
Devlin et al. 2019
Pretrained vs. Task-specific language models
IJCAI 2023: DL for Brain Encoding and Decoding
107
Natural Language Understaning Tasks
Devlin et al. 2019, Bowon et al. 2020
Pretrained vs. Task-specific language models
Squad-2.0: Question Answering
Custom Tasks
IJCAI 2023: DL for Brain Encoding and Decoding
108
Fingers are used for grasping, writing, grooming and other activities. |
grasping are used for Fingers, grooming, writing and other activities. |
This is Los Angeles. And it's the height of summer. In a small bungalow off of La Cienega, Clara serves homemade chili and chips in red plastic bowls -- wine in blue plastic. |
This is Los Angeles. And the height it's of summer. In a bungalow off small of La Cienega, Clara serves homemade chili and chips in red plastic bowls -- wine in blue plastic. |
Gauthier et al. 2019
Brain decoding performance
IJCAI 2023: DL for Brain Encoding and Decoding
109
Scrambled language models have shown better performance!!
Gauthier et al. 2019
Brain decoding performance trajectories over fine-tuning time
IJCAI 2023: DL for Brain Encoding and Decoding
110
Gauthier et al. 2019
Summary
IJCAI 2023: DL for Brain Encoding and Decoding
111
Gauthier et al. 2019
Linguistic Brain Decoding
IJCAI 2023: DL for Brain Encoding and Decoding
112
Periera et al. 2018, Gauthier et al. 2019, Huth et al. 2023, Oota et al. 2022
IJCAI 2023: DL for Brain Encoding and Decoding
113
Continuous Language Decoder
Tang, LaBel, Jain & Huth (2023)
IJCAI 2023: DL for Brain Encoding and Decoding
114
Continuous Language Decoder
Tang, LaBel, Jain & Huth (2023)
IJCAI 2023: DL for Brain Encoding and Decoding
115
Continuous Language Decoder
Tang, LaBel, Jain & Huth (2023)
Summary
IJCAI 2023: DL for Brain Encoding and Decoding
116
Tang, LaBel, Jain & Huth (2023)
Linguistic Brain Decoding
IJCAI 2023: DL for Brain Encoding and Decoding
117
Periera et al. 2018, Gauthier et al. 2019, Huth et al. 2023, Oota et al. 2022
Multi-view and Cross-ViewBrain Decoding
IJCAI 2023: DL for Brain Encoding and Decoding
118
Oota et al. 2022
Multi-view decoding
IJCAI 2023: DL for Brain Encoding and Decoding
119
Wordcloud View
Train
Sentence View
Picture View
Wordcloud View
Oota et al. 2022
Picture View
Train
Sentence View
Train
Multi-view decoding results
IJCAI 2023: DL for Brain Encoding and Decoding
120
Picture View
Train
BERT Representaions
Shuffled the Target Concepts
Test
Sentence View
Train
WordCloud View
Train
Pictures Best Accuracy
Sentences Best Accuracy
Oota et al. 2022
Distribution of Informative Voxels
IJCAI 2023 : DL for Brain Encoding and Decoding
121
Oota et al. 2022
Cross-view Decoding
IJCAI 2023: DL for Brain Encoding and Decoding
122
Picture View
Train
Caption
Test
Picture View
Train
Visual words
Test
Wordcloud View
Train
Sentence
Test
Sentence View
Train
Keywords
Test
Oota et al. 2022
Cross-view Decoding results
IJCAI 2023: DL for Brain Encoding and Decoding
123
BERT Representaions
Shuffled the Target Concepts
Oota et al. 2022
Summary
IJCAI 2023: DL for Brain Encoding and Decoding
124
Oota et al. 2022
Linguistic Brain Decoding
IJCAI 2023: DL for Brain Encoding and Decoding
125
Periera et al. 2018, Gauthier et al. 2019, Huth et al. 2023, Oota et al. 2022
Agenda
IJCAI 2023: DL for Brain Encoding and Decoding
126
References
IJCAI 2023: DL for Brain Encoding and Decoding
127
References
IJCAI 2023: DL for Brain Encoding and Decoding
128
Deep Learning for Brain Encoding and Decoding
Subba Reddy Oota1, Manish Gupta2,3, Raju S. Bapi2, Mariya Toneva4
1Inria Bordeaux, France; 2IIIT Hyderabad, India; 3Microsoft, India; 4MPI for Software Systems, Germany
subba-reddy.oota@inria.fr, gmanish@microsoft.com, raju.bapi@iiit.ac.in, mtoneva@mpi-sws.org
129
Agenda
130
Agenda
131
Mechanistic understanding of information processing in the brain: 4 big questions
132
How
Where
When
What
IJCAI 2023: DL for Brain Encoding and Decoding
Encoding models have a causal interpretation
133
stimulus properties
corr( )
Ytest, Ytest
^
Evaluate:
“The problem is when the capsule moves from an elliptical orbit to a parabolic orbit.”
Reveal which brain areas are affected by stimulus properties [Weichwald et al. 2015]
ytrain
Train:
<0,1,...0>
latent brain-relevant
stimulus properties
= hypothesis for
stim. representation
stimulus representation
<0, 1, … 0>
Part of Speech: Noun
IJCAI 2023: DL for Brain Encoding and Decoding
Classic findings using encoding models
134
IJCAI 2023: DL for Brain Encoding and Decoding
Classic encoding model finding: Language
135
[Barsalou, 1999; Barsalou, 2008; Pecher et al., 2005]
figure from Kemmerer, 2014; adapted from Thompson-Schill et al. 2006
Empirical evidence for distributed organization for attributes related to:
bear
IJCAI 2023: DL for Brain Encoding and Decoding
Classic encoding model finding: Language
136
bear
Accurately predicts fMRI recordings for a novel word
Correspondences
between a semantic property (“push”) and the function of the cortical regions where the fMRI recordings are well predicted
IJCAI 2023: DL for Brain Encoding and Decoding
Classic encoding model finding: Vision
Kay, Kendrick N., Thomas Naselaris, Ryan J. Prenger, and Jack L. Gallant. "Identifying natural images from human brain activity." Nature 452, no. 7185 (2008): 352-355.
137
Encoding models estimated quantitative receptive fields for V1-V3 voxels
Identified which of a set of candidate natural image was viewed by a participant
IJCAI 2023: DL for Brain Encoding and Decoding
Classic encoding model finding: Audio
138
spatial
temporal
posterior/dorsal auditory: coarse spectral info & high temporal precision
anterior/ventral auditory: fine-grained spectral & low temporal precision
IJCAI 2023: DL for Brain Encoding and Decoding
Deep learning models enable data-driven encoding models for naturalistic stimuli
139
more stimulus properties that affect brain activity
more naturalistic stimuli
<0,1,...0>
simple stim. representations explain less variance in brain activity
DeepMind’s New AI Taught Itself to Be the World’s Greatest Go Player
Singularity Hub
Meet GPT-3. It Has Learned to Code (and Blog and Argue)
The New York Times
IJCAI 2023: DL for Brain Encoding and Decoding
Data-driven encoding models evaluate the relationships between brains and deep learning models
140
fMRI
A priori locations in DL system and brain
Deep learning system
how are they related?
Multimodal naturalistic stimulus
Data-driven encoding model
IJCAI 2023: DL for Brain Encoding and Decoding
Encoding: training and evaluation
141
function often modeled as linear
[Mitchell et al. 2008, Nishimoto et al., 2011;
Sudre et al., 2012; Wehbe et al., 2014]
Considerations for
Linear vs non-linear
IJCAI 2023: DL for Brain Encoding and Decoding
Encoding: training and evaluation
142
function often modeled as linear
[Mitchell et al. 2008, Nishimoto et al., 2011;
Sudre et al., 2012; Wehbe et al., 2014]
Training: cross validation (CV), regularization parameter chosen via nested CV
Evaluation: 1) make predictions for heldout data
2) compare predictions with true brain data
3) stringent statistical testing
IJCAI 2023: DL for Brain Encoding and Decoding
Encoding: training setup
143
Test how well predicts unseen brain recordings
Learn function
IJCAI 2023: DL for Brain Encoding and Decoding
Encoding: training independent models
144
P1
…
P2
PN
P1, v1
P1, v2
…
P1, vm
IJCAI 2023: DL for Brain Encoding and Decoding
Encoding: fMRI specifics
145
IJCAI 2023: DL for Brain Encoding and Decoding
Encoding: evaluation setup
146
Test how well predicts unseen brain recordings
Learn function
IJCAI 2023: DL for Brain Encoding and Decoding
Encoding: evaluation metrics
147
Pearson correlation
2v2 accuracy
IJCAI 2023: DL for Brain Encoding and Decoding
Encoding: statistical significance
148
IJCAI 2023: DL for Brain Encoding and Decoding
Encoding: performance visualization
149
fMRI
MEG/EEG
IJCAI 2023: DL for Brain Encoding and Decoding
Agenda
150
More recent work utilizing progress in DL for encoding
151
IJCAI 2023: DL for Brain Encoding and Decoding
Language: work utilizing DL progress
152
significant word-by-word alignment between MEG & representations of words and context from recurrent NLP systems
IJCAI 2023: DL for Brain Encoding and Decoding
Audio: work utilizing DL progress
153
alignment between fMRI & recurrent NLP representations w/ varying context;
best alignment with middle layer
IJCAI 2023: DL for Brain Encoding and Decoding
Language: work utilizing DL progress
154
across several types of large NLP systems, best alignment with fMRI in middle layers
IJCAI 2023: DL for Brain Encoding and Decoding
Language: work utilizing DL progress
155
best alignment with fMRI & MEG in middle layers
better performance at predicting next word -> better prediction of fMRI & MEg
IJCAI 2023: DL for Brain Encoding and Decoding
Language: work utilizing DL progress
156
some NLP systems can predict fMRI and ECoG up to 100% of estimated noise ceiling
IJCAI 2023: DL for Brain Encoding and Decoding
Language: work utilizing DL progress
157
NLP word representations predict ECoG recordings for upcoming words
IJCAI 2023: DL for Brain Encoding and Decoding
Recent work utilizing progress in DL for encoding
158
IJCAI 2023: DL for Brain Encoding and Decoding
Vision: work utilizing DL progress
Yamins, Daniel LK, Ha Hong, Charles F. Cadieu, Ethan A. Solomon, Darren Seibert, and James J. DiCarlo. "Performance-optimized hierarchical models predict neural responses in higher visual cortex." Proceedings of the national academy of sciences 111, no. 23 (2014): 8619-8624.
159
Highest layer in CNN model most predictive of IT; intermediate layers most predictive of V4
IJCAI 2023: DL for Brain Encoding and Decoding
Vision: work utilizing DL progress
160
A CNN tuned for object classification captures stages of human visual processing in both space and time
IJCAI 2023: DL for Brain Encoding and Decoding
Vision: work utilizing DL progress
161
Self-supervised deep models achieve parity with category-supervised models in predicting fMRI responses along visual hierarchy
IJCAI 2023: DL for Brain Encoding and Decoding
Vision: work utilizing DL progress
162
Self-supervised deep models produce brain-like representations even when trained solely with noisy data from child head-mounted cameras
IJCAI 2023: DL for Brain Encoding and Decoding
Recent work utilizing progress in DL for encoding
163
IJCAI 2023: DL for Brain Encoding and Decoding
Audio: work utilizing DL progress
164
Primary auditory responses predicted best by intermediate layers of task-optimized model;
non-primary responses predicted best by late layers
IJCAI 2023: DL for Brain Encoding and Decoding
Audio: work utilizing DL progress
165
Middle layers of self-supervised speech models predict auditory cortex the best
IJCAI 2023: DL for Brain Encoding and Decoding
Audio: work utilizing DL progress
166
Self-supervised speech models reveal specialization for native sounds in the STS and MTG;
IFG and AG show more general specialization for speech rather than native-language
IJCAI 2023: DL for Brain Encoding and Decoding
Agenda
167
Deep Learning for Brain Encoding and Decoding
Subba Reddy Oota1, Manish Gupta2,3, Raju S. Bapi2, Mariya Toneva4
1Inria Bordeaux, France; 2IIIT Hyderabad, India; 3Microsoft, India; 4MPI for Software Systems, Germany
subba-reddy.oota@inria.fr, gmanish@microsoft.com, raju.bapi@iiit.ac.in, mtoneva@mpi-sws.org
168
Agenda
169
Challenges in using DL for cognitive modeling
170
NLP systems: Designed to predict upcoming words
Harry never thought ???
Harry never thought he ???
Harry never thought he would ???
...
IJCAI 2023: DL for Brain Encoding and Decoding
Challenges in using DL for cognitive modeling
171
IJCAI 2023: DL for Brain Encoding and Decoding
Challenges in using DL for cognitive science
172
part-of-speech
semantic role
dependence on other words
...
+
+
+
?
IJCAI 2023: DL for Brain Encoding and Decoding
Challenges in using DL for cognitive science
173
IJCAI 2023: DL for Brain Encoding and Decoding
Challenges in using DL for cognitive science
174
IJCAI 2023: DL for Brain Encoding and Decoding
Training DL models using brain recordings
175
Brain-optimized NLP model predicts unseen fMRI recordings better, especially in canonical language regions
A priori locations in NLP system and brain
NLP system
Chapter of a book
𝑥 alignment
error propagation
fMRI
IJCAI 2023: DL for Brain Encoding and Decoding
Training DL models using brain recordings
176
Brain-optimized vision model trained entirely on fMRI recordings ~= task-optimized networks for predicting brain recordings in early and high-level ROI
IJCAI 2023: DL for Brain Encoding and Decoding
Training DL models using brain recordings
177
Brain-optimized vision model can predict brain signals corresponding to a category of stimuli that it was never trained on
IJCAI 2023: DL for Brain Encoding and Decoding
Training DL models using brain recordings
178
Brain-optimized vision model can learn representations that do not follow a strict hierarchy
IJCAI 2023: DL for Brain Encoding and Decoding
Challenges in using DL for cognitive modeling
179
IJCAI 2023: DL for Brain Encoding and Decoding
Tasks affect processing
180
Category-based attention during natural vision alters representation of both attended and unattended categories
IJCAI 2023: DL for Brain Encoding and Decoding
Tasks affect processing
181
bear
X
veg?
bear
X
tool?
800ms
306 sensors
800ms
306 sensors
Systematic difference due to different question tasks
Attention emphasizes task-relevant information
Mechanism?
Can we model as a function of the task AND stimulus?
IJCAI 2023: DL for Brain Encoding and Decoding
Tasks affect processing
182
question task effect word effect
significant prediction performance
The end of semantic processing of a word is task-dependent
IJCAI 2023: DL for Brain Encoding and Decoding
Tasks affect processing
183
Possible to predict whether a person is passively reading or performing a task with the text based on EEG recordings
IJCAI 2023: DL for Brain Encoding and Decoding
Tasks affect processing
184
Semantic Low-dim. Geometric 2D 3D
Vision tasks with higher transferability make similar predictions for brain responses from different regions
IJCAI 2023: DL for Brain Encoding and Decoding
Tasks affect processing
Oota, Subba Reddy, Jashn Arora, Veeral Agarwal, Mounika Marreddy, Manish Gupta, and Bapi Raju Surampudi. "Neural Language Taskonomy: Which NLP Tasks are the most Predictive of fMRI Brain Activity?." arXiv preprint arXiv:2205.01404 (2022).
185
Reading fMRI best explained by coref. resolution, NER, shallow syntax parsing
Listening fMRI best explained by paraphrasing, summarization, NLI
IJCAI 2023: DL for Brain Encoding and Decoding
Tasks affect processing
brain alignment (Pearson correlation)
Model trained with�language modeling
Model trained to�summarize narratives
input
input
activations
activations
book�chapter
Training language models to summarize narratives improves brain alignment, especially during important narrative elements (Characters, emotions, etc.)
IJCAI 2023: DL for Brain Encoding and Decoding
Challenges in using DL for cognitive modeling
187
IJCAI 2023: DL for Brain Encoding and Decoding
Disentangling contributions of different info sources to brain predictions
188
“Mary finished the apple”
supra-word meaning may contain concept of:
supra-word
meaning
Isolating supra-word meaning is a type of intervention
IJCAI 2023: DL for Brain Encoding and Decoding
Disentangling contributions of different info sources to brain predictions
189
full context
supra-word
Bilateral PTL and ATL process supra-word meaning
Word-level information important for prediction of most language regions
IJCAI 2023: DL for Brain Encoding and Decoding
Disentangling contributions of different info sources to brain predictions
190
Figures provided by Shailee Jain
Utilizing an NLP model that explicitly represents different timescale of information allows the voxel-wise estimation of the preferred timescales
IJCAI 2023: DL for Brain Encoding and Decoding
Disentangling contributions of different info sources to brain predictions
191
Syntactic structure-based features explain additional variance in language regions over complexity metrics
Regions predicted by syntactic and semantic are difficult to distinguish
IJCAI 2023: DL for Brain Encoding and Decoding
Disentangling contributions of different info sources to brain predictions
Caucheteux, Charlotte, Alexandre Gramfort, and Jean-Remi King. "Disentangling syntax and semantics in the brain with deep networks." In International Conference on Machine Learning, pp. 1336-1348. PMLR, 2021.
192
Compositional representations recruit a wider cortical network than word-level representations
Syntax and semantics not associated with separate modules
IJCAI 2023: DL for Brain Encoding and Decoding
Disentangling contributions of different info sources to brain predictions
193
Decomposing NLP embeddings into attention heads reveals correlations between syntactic computations and prediction of fMRI recordings
IJCAI 2023: DL for Brain Encoding and Decoding
Disentangling contributions of different info sources to brain predictions
fMRI
Naturalistic stimulus
This is Los Angeles. And it's the …
Language model
Linguistic property
Original brain alignment
Significant �difference ⇒Ling. prop. affects alignment
Residual
Residual brain alignment
Syntactic properties contribute the most to the brain alignment trend across layers of language models
IJCAI 2023: DL for Brain Encoding and Decoding
Complex stimulus representations make it difficult to infer the effect of a stimulus on multiple brain areas
195
“The problem is when the capsule moves from an elliptical orbit to a parabolic orbit.”
Variance in Brain area 1
Variance in Brain area 2
Variance in the stimulus
Variance in the stimulus representation
IJCAI 2023: DL for Brain Encoding and Decoding
Framework to determine whether a complex stimulus affects two brain areas in a similar way
196
IJCAI 2023: DL for Brain Encoding and Decoding
Framework reveals differences in processing across language network areas
197
Example of each type of effect in movie fMRI data
Encoding model perf. significant in all language areas
Framework reveals differences in processing across language network areas
IJCAI 2023: DL for Brain Encoding and Decoding
Challenges in using DL for cognitive modeling
198
IJCAI 2023: DL for Brain Encoding and Decoding
Deep Neural Networks and Brain Alignment: Brain Encoding and Decoding
Subba Reddy Oota1, Manish Gupta2,3, Raju S. Bapi2, Mariya Toneva4
1Inria Bordeaux, France; 2IIIT Hyderabad, India; 3Microsoft, India; 4MPI for Software Systems, Germany
subba-reddy.oota@inria.fr, gmanish@microsoft.com, raju.bapi@iiit.ac.in, mtoneva@mpi-sws.org
Agenda
IJCAI 2023: DL for Brain Encoding and Decoding
200
Outline
201
IJCAI 2023: DL for Brain Encoding and Decoding
Summary
202
IJCAI 2023: DL for Brain Encoding and Decoding
Summary
203
IJCAI 2023: DL for Brain Encoding and Decoding
Outline
204
IJCAI 2023: DL for Brain Encoding and Decoding
DNNs & The Brain: Multi-modal, Multi-task
Fei, Lu, Gao et al (2022). Towards artificial general intelligence via a multimodal foundation model. Nature Communications 13:3094
doi.org/10.1038/s41467-022-30761-2
DNNs & Brain Damage
Snowden, Harris, Thompson, Kobylecki, Jones, Richardson, Neary (2018). Semantic dementia and the left and right temporal lobes, Cortex, 107(188-203).
https://doi.org/10.1016/j.cortex.2017.08.024.
Rt Ant Temporal Lobe Damage (Patient 8)
Animal habitat task.
The patient is asked:
Where would you find this?
Do DL Models exhibit such degradation with damage to units?
207
Multilinguality
IJCAI 2023: DL for Brain Encoding and Decoding
DNNs & Brain: Multi-modal, Multi-task
Fei, Lu, Gao et al (2022). Towards artificial general intelligence via a multimodal foundation model. Nature Communications 13:3094 doi.org/10.1038/s41467-022-30761-2
A big thank you!
Tutorial, Code and Material:
Material from IJCAI 2023 Tutorial would be uploaded soon!
(Past): Deep Learning for Brain Encoding and Decoding, Cogsci-2022
(Past): Language and the Brain: Deep Learning for Brain Encoding and Decoding, IJCNN 2023
Thanks!
IJCAI 2023: DL for Brain Encoding and Decoding
210