CV Reading Group

	A	B	C	D	E
1		Title & Authors	Paper/Project/Video/Slides links	short description	Slides
2	8/24/2022
3	Dimitris Samaras	Introduction			https://docs.google.com/presentation/d/1wEmFhARFnilBW9-Ogt9NBqvpX_qnIKBnMiZIatuPXgY/edit?usp=sharing
4	8/31/2022
5	Jingwei	Gigapixel Whole-Slide Images Classification using Locally Supervised Learning	https://arxiv.org/pdf/2207.08267.pdf	MICCAI practice
6	Lei	Efficient Geometry-aware {3D} Generative Adversarial Networks	https://arxiv.org/abs/2112.07945		https://docs.google.com/presentation/d/1vSb7XY_Hpu6YjaIM8UwxL-U2dqkUZY9v6aZiApgzHBE/edit?usp=sharing
7	Shilin	FurryGAN: High Quality Foreground-aware Image Synthesis	https://arxiv.org/pdf/2208.10422.pdf		https://docs.google.com/presentation/d/1AgqFnVH248TtzdDhyH2X54UAHY0NH2dpL2Po7C0xYec/edit?usp=sharing
8	Ruyi	Neural Correspondence Field for Object Pose Estimation	https://arxiv.org/pdf/2208.00113.pdf	6DoF object pose estimation from single RGB image based on 3D-3D correspondence estimation with 3D implicit representation	https://docs.google.com/presentation/d/1pNHsATv4Smgrblsr0cZWdWI9exTFNc2QFnuwl9Ho3BU/edit?usp=sharing
9	9/7/2022
10	Yue	High-resolution Face Swapping via Latent Semantics Disentanglement	https://openaccess.thecvf.com/content/CVPR2022/papers/Xu_High-Resolution_Face_Swapping_via_Latent_Semantics_Disentanglement_CVPR_2022_paper.pdf	CVPR 2022	https://docs.google.com/presentation/d/1JW4FI0GrrFmEvfshnlCq4N506hQZGpc4alkL0xgktSg/edit?usp=sharing
11	Hritam	Addressing Class Imbalance in Semi-supervised Image Segmentation: A Study on Cardiac MRI		MICCAI22 paper	https://drive.google.com/file/d/1A4--_03ssu2yoGAQplPJrlLpsKBgQ5S_/view?usp=sharing
12	Haoyu	SparseNeuS: Fast Generalizable Neural Surface Reconstruction from Sparse views	https://arxiv.org/abs/2206.05737	ECCV 2022	https://docs.google.com/presentation/d/1SvayreRc42d0NyjH0P-4JgXoQXYAcpBdjOPl1oJ5nds/edit?usp=sharing
13	Moinak	GazeRadar: A Gaze and Radiomics-guided Disease Localization Framework		MICCAI22 paper	https://docs.google.com/presentation/d/1RayaYc-DDZSBwZ-z2hVdZZlpOzzGUKhvvvtiYtLDVx8/edit?usp=sharing
14	9/14/2022
15	Sounak	On Guiding Visual Attention with Language Specification	https://openaccess.thecvf.com/content/CVPR2022/papers/Petryk_On_Guiding_Visual_Attention_With_Language_Specification_CVPR_2022_paper.pdf	CVPR 2022	https://docs.google.com/presentation/d/19jlZb8ofrrxwLLmOUKz9dvSqtte-hLWm17BhrkOCdNc/edit?usp=sharing
16	Aishik	Learning What Not to Segment: A New Perspective on Few-Shot Segmentation	https://openaccess.thecvf.com/content/CVPR2022/papers/Lang_Learning_What_Not_To_Segment_A_New_Perspective_on_Few-Shot_CVPR_2022_paper.pdf	CVPR 2022	https://docs.google.com/presentation/d/1138Z9SorqjU6T6tW27oUG7s7AijmG6HSFh3Fmk23wzU/edit?usp=sharing
17	Alfredo	Neural Feature Fusion Fields: 3D Distillation of Self-Supervised 2D Image Representations	https://arxiv.org/abs/2209.03494	3DV2022	https://docs.google.com/presentation/d/194Gc_01IFO-jyMTq5MCEde7D5SkweDjNYUNfoYKJEmg/edit?usp=sharing
18
19	Peiyao	Frozen CLIP Models are Efficient Video Learners	https://arxiv.org/pdf/2208.03550.pdf	ECCV 2022	https://docs.google.com/presentation/d/1g8fW-CXqCdNi_CaRJjBY1VZa3gfh2UHk/edit?usp=sharing&ouid=103358536235384126888&rtpof=true&sd=true
20	9/21/2022
21	Mithilesh	EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks	https://arxiv.org/pdf/1905.11946	ICML 2019	https://docs.google.com/presentation/d/1QMADlSbdSjKdBFHLUeDQhfvGvGowJ6v1/edit?usp=sharing&ouid=103647986604749576791&rtpof=true&sd=true
22	Qiaomu	ESCNet: Gaze Target Detection with the Understanding of 3D Scenes	https://openaccess.thecvf.com/content/CVPR2022/papers/Bao_ESCNet_Gaze_Target_Detection_With_the_Understanding_of_3D_Scenes_CVPR_2022_paper.pdf	CVPR 2022	https://docs.google.com/presentation/d/19TeREXqgLnsIDmVHasSvpbhH4g01IUMxNSrOKiZB-5c/edit?usp=sharing
23	Dichang	Multi-View Depth Estimation by Fusing Single-View Depth Probability with Multi-View Geometry	https://arxiv.org/pdf/2112.08177.pdf	CVPR 2022	https://docs.google.com/presentation/d/1v8U3bL-QqQp-VPXU_krP3II9QVJHe37ayCPweF74_a8/edit?usp=sharing
24	Meilong	Scaling Vision Transformers to Gigapixel Images via Hierarchical Self-Supervised Learning	https://openaccess.thecvf.com/content/CVPR2022/papers/Chen_Scaling_Vision_Transformers_to_Gigapixel_Images_via_Hierarchical_Self-Supervised_Learning_CVPR_2022_paper.pdf	CVPR 2022	https://docs.google.com/presentation/d/1kWrRZCUO9wxnRVo4EHzuWDvSDcGaazy611zJu4Idzkc/edit?usp=sharing
25	9/28/2022
26	Saumya	Learning Topological Interactions for Multi-Class Medical Image Segmentation	Paper link : https://arxiv.org/abs/2207.09654	ECCV 2022	https://docs.google.com/presentation/d/1s4KmaGUR-H0MgNNAVxbUVJDFcH6vhbdpa5Rc_OQiZgU/edit?usp=sharing
27	Sagnik	Learning Isometric Surface Parameterization for Texture Unwrapping	https://drive.google.com/file/d/1evpXuRwlNCZc5CAYuFqKL5SpaV2kFa8Y/view	ECCV 2022	https://drive.google.com/file/d/1aOr7VKfm73LsI3atolze15JY927h2bqw/view?usp=sharing
28	Moinak	RadioTransformer: A Cascaded Global-Focal Transformer for Visual Attention-guided Disease Classification	https://arxiv.org/abs/2202.11781	ECCV 2022	https://drive.google.com/file/d/10v8KhX13v5hybemMS8HMfZDjSFk4_L-v/view?usp=sharing
29	Jinghuan	StARformer: Transformer with State-Action-Reward Representation for Visual Reinforcement Learning	Paper: https://arxiv.org/abs/2110.06206 Video: https://www3.cs.stonybrook.edu/~jishang/starformer/4447.mp4	ECCV 2022	https://docs.google.com/presentation/d/1nUCQyUZ8lmJTYoz-i4aA5xou5JkzXUIN/edit?usp=sharing&ouid=105032877246147260285&rtpof=true&sd=true
30	10/5/2022
31	Mithilesh	EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks	https://arxiv.org/pdf/1905.11946	ICML 2019	https://docs.google.com/presentation/d/1QMADlSbdSjKdBFHLUeDQhfvGvGowJ6v1/edit?usp=sharing&ouid=103647986604749576791&rtpof=true&sd=true
32	Khiem Phi	How Do Vision Transformers Work?	https://arxiv.org/pdf/2202.06709.pdf	ICLR 2022	https://docs.google.com/presentation/d/1TT9oPYmwU3-O7w8_i081i66E77dxKgVH/edit?usp=sharing&ouid=107110547759842100452&rtpof=true&sd=true
33	Shahira Abousamra	GroupViT: Semantic Segmentation Emerges from Text Supervision	https://arxiv.org/abs/2202.11094	CVPR 2022	https://docs.google.com/presentation/d/1s2roFFmX6BwfmCixMR55J3FNlBkVmgOqie5cFM_-lro/edit?usp=sharing
34	Othmane	Deep Reinforcement Learning for Sarcopenia Assessment	https://arxiv.org/abs/2107.12800	MLMI - MICCAI 2021	https://drive.google.com/file/d/1mlxfoh1XKGLVdDzYnDN7Djn6aHTfIbn6/view?usp=sharing
35	10/12/2022
36	Alexandros	Cold Diffusion: Inverting Arbitrary Image Transforms Without Noise	https://arxiv.org/abs/2208.09392	NeurIPS 2022	https://docs.google.com/presentation/d/1skoyssqXW35kx3jtct_z7hEdX9LFydCe3QU3ooBSVY4/edit?usp=sharing
37	Cristina	Learning State-Aware Visual Representations from Audible Interactions	https://arxiv.org/pdf/2209.13583.pdf	Neurips 2022	https://docs.google.com/presentation/d/1dpDY21btn3oPdOYaDSyvh0k6aWS-UvfSUPQ3Y-hVSk8/edit?usp=sharing
38	ShahRukh	DreamFusion: Text-to-3D using 2D Diffusion	https://dreamfusion3d.github.io/		https://docs.google.com/presentation/d/1_G-t-OaM6WCuF7dn8DCxXkSXwB2F8f_zz4WE7FXorTE/edit?usp=sharing
39	Xiang	Temporally Consistent Video Transformer for Long-Term Video Prediction	https://wilson1yan.github.io/teco/	Use transformer to predict the latent trajectory, it is also built on existing fancy methods like VQ-GAN and MaskGIT	https://docs.google.com/presentation/d/1VijoYW95iP7mNTCK2BdAdKd_tMYOQ91cnbJMMx4FxmA/edit?usp=sharing
40	10/19/2022
41	Kanchana	Perceptual Grouping in Vision-Language Models	https://arxiv.org/abs/2210.09996	ICLR 2023 submission	https://drive.google.com/file/d/1oAoO-WzaoifpABquHdHCNfFt-9NjAYzo/preview
42	Kumara	Socratic Models: Composing Zero-Shot Multimodal Reasoning with Language	https://arxiv.org/abs/2204.00598	arxiv	https://docs.google.com/presentation/d/1OCbgM32T7RMWHt-nDH_eRkgUnEN-NZEm60_2CbPncOA/edit?usp=sharing
43	Yunfan	Semantic-Aware Domain Generalized Segmentation	https://openaccess.thecvf.com/content/CVPR2022/papers/Peng_Semantic-Aware_Domain_Generalized_Segmentation_CVPR_2022_paper.pdf	CVPR 2022	https://docs.google.com/presentation/d/1KMbTMlPWMKmGilGq-l_SDxkM58lMW-RMDh43GZD46Ak/edit?usp=sharing
44
45	10/26/2022
46	Jongwoo	GRAFTING VISION TRANSFORMERS	https://openreview.net/pdf?id=48EwqCCosOO	ICLR 2023 submission	https://docs.google.com/presentation/d/15O8brbJJGwaiHAs0CTQ6p7r5TPi8UQrip5_j85hCk3Y/edit?usp=sharing
47	Aggelina	FaceFormer: Speech-Driven 3D Facial Animation with Transformers	https://arxiv.org/pdf/2112.05329.pdf	CVPR 2022	https://docs.google.com/presentation/d/1D8ujlvlevm21HU7r6h33d8U2fMt7tVQl_aC1sLofIsI/edit?usp=sharing
48	Saarthak	What do self-supervised vision transformers learn?	https://openreview.net/pdf?id=azCKuYyS74	ICLR 2023 submission	https://docs.google.com/presentation/d/1LVP5WXDTFL-gs-qhmaQpxH3sv0L5ux-rRZWtl6GmGWQ/edit?usp=sharing
49	Prantik	Semi-Supervised Semantic Segmentation via Adaptive Equalizaton Learning	https://arxiv.org/pdf/2110.05474.pdf	NeurIPS 2021	https://docs.google.com/presentation/d/1ZU4Hr0iWYZKlv8jPPo0U3D61Ho3Uqz93zwLslzXC6ec/edit?usp=sharing
50	11/2/2022
51	Sagnik	From In-the-Wild Images of Documents to High Quality Scans
52	11/16/2022
53	Georgi	Reading to Listen at the Cocktail Party: Multi-Modal Speech Separation	https://openaccess.thecvf.com//content/CVPR2022/papers/Rahimi_Reading_To_Listen_at_the_Cocktail_Party_Multi-Modal_Speech_Separation_CVPR_2022_paper.pdf	CVPR 2022	https://docs.google.com/presentation/d/11qjju47QNAXoCgQrlIYKIzvrE6n_eTMX0Uwn74mCfOw/edit?usp=sharing
54	Manal	Boundary IoU: Improving Object-Centric Image Segmentation Evaluation	https://arxiv.org/pdf/2103.16562.pdf	CVPR 2021	https://docs.google.com/presentation/d/14WSZ-XhiCTcnZLuLPEsmEiMPj7hyaW6GyPT7uOdnRSI/edit?usp=sharing
55	Saumya	Slim Scissors: Segmenting Thin Object from Synthetic Background	https://www.ecva.net/papers/eccv_2022/papers_ECCV/papers/136890375.pdf	ECCV 2022	https://docs.google.com/presentation/d/1GPEZlOL4giiAn1mZKcUCMgruybk-9CVNGqI0BJWSusM/edit?usp=sharing
56	Alexandros	NeurIPS Poster			https://drive.google.com/file/d/1xXgt7wL77YF7E_ay2ZjxmNHJ18YaGfCN/view?usp=share_link
57	11/30/2022
58	Zhilin	Efficient Deep Embedded Subspace Clustering	https://openaccess.thecvf.com/content/CVPR2022/papers/Cai_Efficient_Deep_Embedded_Subspace_Clustering_CVPR_2022_paper.pdf	CVPR 2022	https://docs.google.com/presentation/d/1RCXfOPFxxXIYVh2lpaIXPyUJzzJxQTbE-_-9B00Izes/edit?usp=sharing
59	Vu	Fast Text-Conditional Discrete Denoising on Vector-Quantized Latent Spaces	https://arxiv.org/pdf/2211.07292.pdf		https://docs.google.com/presentation/d/1PLXrIrmErh3z-Dvm00jxJesVjhYzOWoOGh8tiNtjFis/edit?usp=sharing
60	David	Weakly Supervised Semantic Segmentation by Pixel-to-Prototype Contrast	https://openaccess.thecvf.com/content/CVPR2022/papers/Du_Weakly_Supervised_Semantic_Segmentation_by_Pixel-to-Prototype_Contrast_CVPR_2022_paper.pdf	CVPR 2022	https://docs.google.com/presentation/d/1fVF3smDpfxFGwpQbYZpui6jlXPrXIFE3mQDMhVLKseI/edit?usp=sharing
61	ShahRukh	DreamFusion: Text-to-3D using 2D Diffusion	https://dreamfusion3d.github.io/		https://docs.google.com/presentation/d/1_G-t-OaM6WCuF7dn8DCxXkSXwB2F8f_zz4WE7FXorTE/edit?usp=sharing
62	Yash	Pollux: Co-adaptive Cluster Scheduling for Goodput-Optimized Deep Learning	https://www.usenix.org/system/files/osdi21-qiao.pdf	Optimising batch size and learning rate for a deep learning model in a cluster training setting	https://docs.google.com/presentation/d/1S6a6MOymNGw33K4FvOXOUJRtnnpPY_u-PH_tnrxOBKY/edit?usp=sharing
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100