| A | B | C | D | E | F | G | H | I | J | K | L | M | N | O | P | Q | R | S | T | U | V | W | X | Y | Z | ||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
1 | Reference Paper List | https://docs.google.com/spreadsheets/d/1qpPQI9rnHjR-xipPRkojuVgSAnJ1VBcv/edit#gid=1144707069 | |||||||||||||||||||||||||
2 | Date | Presenter(s) | Topic | Paper Title | Conference | Paper URL | Slides | ||||||||||||||||||||
3 | Apr 2 | Jiawei Zhang | Introduction | Course Introduction. | https://drive.google.com/file/d/18UttPNUjyBI4cW4MJRaoWjIazNNcaDUD/view?usp=sharing | ||||||||||||||||||||||
4 | Apr 4 | Xinhao Xiang | Multi-Modal | VideoPoet: A Large Language Model for Zero-Shot Video Generation | arxiv'24 | https://arxiv.org/pdf/2312.14125.pdf | https://drive.google.com/file/d/10riGM7XjWV00GSUnCDIfa-ai8O48q8V1/view?usp=share_link | ||||||||||||||||||||
5 | Apr 9 | Zizhong Li | NLP | Self-Rewarding Language Models | arxiv'24 | https://arxiv.org/pdf/2401.10020.pdf | https://drive.google.com/file/d/1VtPI9jl353H9kwL28JIT8BOLxuirM_nJ/view?usp=share_link | ||||||||||||||||||||
6 | Apr 11 | Zhuoheng Li | CV | EfficientSAM: Leveraged Masked Image Pretraining for Efficient Segment Anything | arxiv'23 | https://arxiv.org/pdf/2312.00863.pdf | https://drive.google.com/file/d/1A95K9nSN4UJkxBFU93xnUq_zieOgpLVd/view?usp=share_link | ||||||||||||||||||||
7 | Apr 16 | Rong Ching Chang | NLP | Are Emergent Abilities of Large Language Models a Mirage? | Neurips'23 | https://arxiv.org/pdf/2304.15004.pdf | https://drive.google.com/open?id=1vDmkUFq6DtJw2Phx9oQLky-ryHMoDDdJ&usp=drive_fs | ||||||||||||||||||||
8 | Apr 18 | Terry Tong | NLP | Long-form factuality in large language models | arxiv'24 | https://arxiv.org/pdf/2403.18802.pdf | https://drive.google.com/file/d/1_eZyervQDwfrh-ulfW7iZarJUo8cSTXI/view?usp=share_link | ||||||||||||||||||||
9 | Apr 23 | Yifang Ren | Multi-Modal | Language Model Beats Diffusion -- Tokenizer is Key to Visual Generation | arxiv'23 | https://arxiv.org/pdf/2310.05737.pdf | https://drive.google.com/file/d/13GkHz6RCPgfIoalM4efD1lzkw9bu7cI9/view?usp=share_link | ||||||||||||||||||||
10 | Apr 25 | Terry Tong | NLP | RAFT: Adapting Language Model to Domain Specific RAG | arxiv'24 | https://arxiv.org/pdf/2403.10131.pdf | https://drive.google.com/file/d/1S-xzfLNBFWRlqua-7PSgmg97-rYCdvin/view?usp=share_link | ||||||||||||||||||||
11 | Apr 30 | Halil Ozgur Demir | Multi-Modal | Synth2: Boosting Visual-Language Models with Synthetic Captions and Image Embeddings | arxiv'24 | https://arxiv.org/pdf/2403.07750.pdf | https://drive.google.com/file/d/1QzB979DirXXbmDNzpT6yfjy1wRadpPOw/view?usp=share_link | ||||||||||||||||||||
12 | May 2 | Zhuoheng Li | CV | Scaling Rectified Flow Transformers for High-Resolution Image Synthesis | arxiv'24 | https://arxiv.org/pdf/2403.03206.pdf | https://drive.google.com/file/d/1s86aJ1SDWiPYr62-RrQa-ghOokE-8pnr/view?usp=share_link | ||||||||||||||||||||
13 | May 7 | Zhuosheng Liu | CV | InstantID: Zero-shot Identity-Preserving Generation in Seconds | arxiv'24 | https://arxiv.org/pdf/2401.07519.pdf | https://drive.google.com/file/d/1IOTFaPtA7QzX9caTm-CvWKYsSNsksum4/view?usp=share_link | ||||||||||||||||||||
14 | May 9 | Tong Miao | CV | OPERA: Alleviating Hallucination in Multi-Modal Large Language Models via Over-Trust Penalty and Retrospection-Allocation | CVPR'24 | https://arxiv.org/pdf/2311.17911.pdf | https://drive.google.com/file/d/1JM3t9-m_KNeZM-gh1xg-0Qv-fiNBQ9z-/view?usp=share_link | ||||||||||||||||||||
15 | May 14 | Yifang Ren | CV | Generative Image Dynamics | arxiv'23 | https://arxiv.org/pdf/2309.07906.pdf | https://drive.google.com/file/d/1RRCkKyFAl3cF8uszpsRM0iPGacwf6ekM/view?usp=share_link | ||||||||||||||||||||
16 | May 16 | Anant Vishwakama | NLP | Mistral 7B && Mixtral of Experts | https://arxiv.org/pdf/2310.06825.pdf | https://arxiv.org/pdf/2401.04088.pdf | https://drive.google.com/file/d/1EqFCJdj-YBw-0y0ZKafnJxe1EtAk6NpB/view?usp=share_link | ||||||||||||||||||||
17 | May 21 | Halil Ozgur Demir | DL & Others | Mamba: Linear-Time Sequence Modeling with Selective State Spaces | arxiv'23 | https://arxiv.org/ftp/arxiv/papers/2312/2312.00752.pdf | https://drive.google.com/file/d/1dBwl6j3792ghfbzW45gKS8xKp9mxRSLG/view?usp=share_link | ||||||||||||||||||||
18 | May 23 | Tong Miao | DL & Others | Spike-driven Transformer V2: Meta Spiking Neural Network Architecture Inspiring the Design of Next-generation Neuromorphic Chips | ICLR'24 | https://openreview.net/pdf?id=1SIBN5Xyw7 | https://drive.google.com/file/d/1Vw-VSAEfLEADWczNzj3OdBVTfJAiZ3Nk/view?usp=share_link | ||||||||||||||||||||
19 | May 28 | Anant Vishwakama | DL & Others | QLORA: Efficient Finetuning of Quantized LLMs | arxiv'23 | https://arxiv.org/pdf/2305.14314.pdf | https://drive.google.com/file/d/1I8QARuVEG-HadjU_tb-XT1KDClm6iH7k/view?usp=share_link | ||||||||||||||||||||
20 | May 30 | Joe Zhu | Robotics | Voyager: An Open-Ended Embodied Agent with Large Language Models | arxiv'23 | https://arxiv.org/pdf/2305.16291.pdf | https://drive.google.com/file/d/17eobhln6a6tfGt6UQxcT_wyi1t0_41i4/view?usp=share_link | ||||||||||||||||||||
21 | Jun 4 | Rong Ching Chang | Robotics | LLM-Planner: Few-Shot Grounded Planning for Embodied Agents with Large Language Models | ICCV'23 | https://openaccess.thecvf.com/content/ICCV2023/papers/Song_LLM-Planner_Few-Shot_Grounded_Planning_for_Embodied_Agents_with_Large_Language_ICCV_2023_paper.pdf | https://drive.google.com/file/d/1lMOJTRqjuy9EiXsK_-MzlfbhyyPN8WTG/view?usp=share_link | ||||||||||||||||||||
22 | Jun 6 | Zhuosheng Liu | DL & Others | Accurate structure prediction of biomolecular interactions with AlphaFold 3 | Nature'24 | https://drive.google.com/file/d/1AKM-hk-SIDg9fgp7TAC0DqchV_48SSbY/view?usp=sharing | https://drive.google.com/file/d/11ZWmKn4gWuX-MOz38-GWfU7Ji2yvscn3/view?usp=share_link | ||||||||||||||||||||
23 | |||||||||||||||||||||||||||
24 | |||||||||||||||||||||||||||
25 | Extra | CV | One-step Diffusion with Distribution Matching Distillation | arxiv'23 | https://arxiv.org/pdf/2311.18828.pdf | ||||||||||||||||||||||
26 | Extra | CV | Learning and Leveraging World Models in Visual Representation Learning | ICLR'24 | https://arxiv.org/pdf/2403.00504.pdf | ||||||||||||||||||||||
27 | Extra | Multi-Modal | Stable Video Diffusion: Scaling Latent Video Diffusion Models to Large Datasets | arxiv'24 | https://arxiv.org/pdf/2311.15127.pdf | ||||||||||||||||||||||
28 | Extra | Multi-Modal | EMO: Emote Portrait Alive - Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions | arxiv'24 | https://arxiv.org/pdf/2402.17485.pdf | ||||||||||||||||||||||
29 | Extra | CV | DiffMorpher: Unleashing the Capability of Diffusion Models for Image Morphing | arxiv'23 | https://arxiv.org/pdf/2312.07409.pdf | ||||||||||||||||||||||
30 | Extra | NLP | The Unreasonable Ineffectiveness of the Deeper Layers | arxiv'24 | https://arxiv.org/pdf/2403.17887.pdf | ||||||||||||||||||||||
31 | Extra | Multi-Modal | CLIP as RNN: Segment Countless Visual Concepts without Training Endeavor | arxiv'23 | https://arxiv.org/pdf/2312.07661.pdf | ||||||||||||||||||||||
32 | Extra | CV | FoundationPose: Unified 6D Pose Estimation and Tracking of Novel Objects | CVPR'24 | https://arxiv.org/pdf/2312.08344.pdf | ||||||||||||||||||||||
33 | Extra | CV | Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction | arxiv'24 | https://arxiv.org/pdf/2404.02905.pdf | ||||||||||||||||||||||
34 | Extra | Robotics | Vid2Robot: End-to-end Video-conditioned Policy Learning with Cross-Attention Transformers | arxiv'24 | https://arxiv.org/pdf/2403.12943.pdf | ||||||||||||||||||||||
35 | Extra | NLP | Direct Preference Optimization: Your Language Model is Secretly a Reward Model | arxiv'23 | https://arxiv.org/pdf/2305.18290.pdf | ||||||||||||||||||||||
36 | Extra | CV | Generative Powers of Ten | arxiv'23 | https://arxiv.org/pdf/2312.02149.pdf | ||||||||||||||||||||||
37 | Robotics | BEHAVIOR-1K: A Human-Centered, Embodied AI Benchmark with 1,000 Everyday Activities and Realistic Simulation | arxiv'24 | https://arxiv.org/pdf/2403.09227.pdf | |||||||||||||||||||||||
38 | |||||||||||||||||||||||||||
39 | |||||||||||||||||||||||||||
40 | |||||||||||||||||||||||||||
41 | |||||||||||||||||||||||||||
42 | |||||||||||||||||||||||||||
43 | |||||||||||||||||||||||||||
44 | |||||||||||||||||||||||||||
45 | |||||||||||||||||||||||||||
46 | |||||||||||||||||||||||||||
47 | |||||||||||||||||||||||||||
48 | |||||||||||||||||||||||||||
49 | |||||||||||||||||||||||||||
50 | |||||||||||||||||||||||||||
51 | |||||||||||||||||||||||||||
52 | |||||||||||||||||||||||||||
53 | |||||||||||||||||||||||||||
54 | |||||||||||||||||||||||||||
55 | |||||||||||||||||||||||||||
56 | |||||||||||||||||||||||||||
57 | |||||||||||||||||||||||||||
58 | |||||||||||||||||||||||||||
59 | |||||||||||||||||||||||||||
60 | |||||||||||||||||||||||||||
61 | |||||||||||||||||||||||||||
62 | |||||||||||||||||||||||||||
63 | |||||||||||||||||||||||||||
64 | |||||||||||||||||||||||||||
65 | |||||||||||||||||||||||||||
66 | |||||||||||||||||||||||||||
67 | |||||||||||||||||||||||||||
68 | |||||||||||||||||||||||||||
69 | |||||||||||||||||||||||||||
70 | |||||||||||||||||||||||||||
71 | |||||||||||||||||||||||||||
72 | |||||||||||||||||||||||||||
73 | |||||||||||||||||||||||||||
74 | |||||||||||||||||||||||||||
75 | |||||||||||||||||||||||||||
76 | |||||||||||||||||||||||||||
77 | |||||||||||||||||||||||||||
78 | |||||||||||||||||||||||||||
79 | |||||||||||||||||||||||||||
80 | |||||||||||||||||||||||||||
81 | |||||||||||||||||||||||||||
82 | |||||||||||||||||||||||||||
83 | |||||||||||||||||||||||||||
84 | |||||||||||||||||||||||||||
85 | |||||||||||||||||||||||||||
86 | |||||||||||||||||||||||||||
87 | |||||||||||||||||||||||||||
88 | |||||||||||||||||||||||||||
89 | |||||||||||||||||||||||||||
90 | |||||||||||||||||||||||||||
91 | |||||||||||||||||||||||||||
92 | |||||||||||||||||||||||||||
93 | |||||||||||||||||||||||||||
94 | |||||||||||||||||||||||||||
95 | |||||||||||||||||||||||||||
96 | |||||||||||||||||||||||||||
97 | |||||||||||||||||||||||||||
98 | |||||||||||||||||||||||||||
99 | |||||||||||||||||||||||||||
100 |