ABCDEFGHIJKLMNOPQRSTUVWXYZ
1
2
PresenterTitlePaper/Project/Video/Slides linksshort descriptionSlides
3
instructorsIntroduction and scheduling
4
HieuGAN: a tutorial
5
2/5/2020
6
PresenterTitle (Generative models)Paper/Project/Video/Slides linksshort descriptionSlides
7
Cristina, VinhAuto-Encoding Variational Bayes Diederik P. Kingma Max Welling Mahttps://arxiv.org/pdf/1312.6114.pdfhttps://docs.google.com/presentation/d/1JwNBd8H_iwE1O-GPXF-eTN4kz6yWgWznC1av3NDHeJQ/edit?usp=sharing
8
Autoencoding beyond pixels using a learned similarity metric Anders Boesen Lindbo Larsen, Søren Kaae Sønderby, Hugo Larochelle, Ole Winther ICML 2016https://arxiv.org/pdf/1512.09300.pdf
9
Boyu, TaoGenerating Diverse High-Fidelity Images with VQ-VAE-2 Ali Razavi, Aaron van den Oord, Oriol Vinyals arxiv 2019https://arxiv.org/pdf/1906.00446.pdfhttps://docs.google.com/presentation/d/1NCQ5lm8mtprYN12AkEyUXDEHQkPwVrLL/edit
10
Viresh, PratyushCycleGAN: Unpaired Image-to-Image Translation using Cycle-Consistent Adversarial Networks Jun-Yan Zhu∗ Taesung Park∗ Phillip Isola Alexei A. Efros CVPR 2017https://arxiv.org/pdf/1703.10593.pdfhttps://drive.google.com/file/d/1zNYFgy9gWnM1lvOBXukGgkS65yJ82AqN/view?usp=sharing
11
Jingyi, HieuStyleGAN:A Style-Based Generator Architecture for Generative Adversarial Networks Tero Karras, Samuli Laine, Timo Aila CVPR 2019https://arxiv.org/pdf/1812.04948.pdfhttps://1drv.ms/p/s!AjfgrkEf5X5RgRtA93YkTeOBC_ju
12
Huidong, PavaniSinGAN: Learning a Generative Model from a Single Natural Image Tamar Rott Shaham Tali Dekel Tomer Michaeli ICCV 2019http://openaccess.thecvf.com/content_ICCV_2019/papers/Shaham_SinGAN_Learning_a_Generative_Model_From_a_Single_Natural_Image_ICCV_2019_paper.pdf
13
2/12/2020
14
PresenterTitle (Tracking)Paper/Project/Video/Slides linksshort descriptionSlides
15
Haibin LingIntroduction to tracking
16
Heng Fan, SupreethGoutam Bhat, Martin Danelljan, Luc Van Gool, Radu Timofte. Learning Discriminative Model Prediction for Tracking. ICCV 2019http://openaccess.thecvf.com/content_ICCV_2019/papers/Bhat_Learning_Discriminative_Model_Prediction_for_Tracking_ICCV_2019_paper.pdfhttps://docs.google.com/presentation/d/1pWe5xZ-ucxYxIPaedTo4lstzhfI52Iu6R2zv96gl6lo/edit#slide=id.p
17
Jiaxiang, Lei ZhouZiyuan Huang, Changhong Fu, Yiming Li, Fuling Lin, Peng Lu. Learning Aberrance Repressed Correlation Filters for Real-Time UAV Tracking. ICCV 2019https://arxiv.org/pdf/1908.02231.pdfhttps://docs.google.com/presentation/d/1eGi2J27muWixzjpmChcILMjBA5ln-2W4bGX9W99v0cA/edit?usp=sharing
18
Georgi, CristinaYaadhav Raaj, Haroon Idrees, Gines Hidalgo, Yaser Sheikh. Efficient Online Multi-Person 2D Pose Tracking With Recurrent Spatio-Temporal Affinity Fields. CVPR 2019https://arxiv.org/pdf/1811.11975.pdfhttps://docs.google.com/presentation/d/19nLsKMVxYXYPCWAw7l5wHeUVTYat_8ilCQVGGHHudko/edit?usp=sharing
19
Hieu, Ruyihttps://docs.google.com/spreadsheets/d/1Vbxrufq672N-fK6G-ppuu60nhGq57Fr1llhKfpWTAIM/edit?usp=sharinghttp://openaccess.thecvf.com/content_ICCV_2019/papers/Wiyatno_Physical_Adversarial_Textures_That_Fool_Visual_Object_Tracking_ICCV_2019_paper.pdffirst adversarial attack for sequential tracking systemshttps://docs.google.com/presentation/d/1T8hH7iqhE9Zgew4cSqSIIedBT4MBLg5DV6swjpaN3Iw/edit?usp=sharing
20
Vu, David PYaron Meirovitch, Lu Mi, Hayk Saribekyan, Alexander Matveev, David Rolnick, Nir Shavit. Cross-Classification Clustering: An Efficient Multi-Object Tracking Technique for 3-D Instance Segmentation in Connectomics. CVPR 2019.https://arxiv.org/pdf/1812.01157.pdfhttps://docs.google.com/presentation/d/1XLgiU-VnR_h0MiV18A-IRkcBRxYtbkjCzDm1WhjFYrs/edit?usp=sharing
21
2/19/2020
22
PresenterTitle (Video Representation)Paper/Project/Video/Slides linksshort descriptionSlides
23
Zhibo, SouradeepCarreira and Zisserman, Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset, 2017https://arxiv.org/pdf/1705.07750.pdfhttps://docs.google.com/presentation/d/1yLgIAnxbqnKd74j6GMIyV9otLj8sIz_sN8NQI9mSQao/edit?usp=sharing
24
Bingyao, VinhYeung et al., End-to-end Learning of Action Detection from Frame Glimpses in Videos, 2016https://arxiv.org/pdf/1511.06984.pdfhttps://docs.google.com/presentation/d/1GbyQ5HWPzNDoFNwZRh_rGymwo_Iy1rnQ28oCzadatjI/edit?usp=sharing
25
Qiaomu, ShahiraWang et al., Non-local Neural Networks, 2017https://arxiv.org/pdf/1711.07971v3.pdfhttps://docs.google.com/presentation/d/1bHnIz2L8Az29J5tgOw3fKos3ejrm8308otz2FXYAZaI/edit?pli=1#slide=id.g52ca06d2f6_0_182
26
David B, KumaraWang and Gupta, Videos as Space-Time Region Graphs, 2018 https://eccv2018.org/openaccess/content_ECCV_2018/papers/Xiaolong_Wang_Videos_as_Space-Time_ECCV_2018_paper.pdfhttps://docs.google.com/presentation/d/18ZplAT-k2K_HqJdeyWzVrxz77V7ZYA_BHUu-abHm6fs/edit?usp=sharing
27
Bo (Bryan), ZekunPiergiovanni and Ryoo, Temporal Gaussian Mixture Layer for Videos, 2019https://arxiv.org/pdf/1803.06316.pdf
A fully differentiable layer with few parameters to learn temporal structure in videos.
https://docs.google.com/presentation/d/1qk4ov4oil78MQfD7g8lCsg_jgDd5jSvZ-y1HOXlELWQ/edit?usp=sharing
28
2/26/2020
29
Presenter Paper/Project/Video/Slides linksshort descriptionSlides
30
Viresh, Bo (Bryan)Crowdcounting: Colab setup
31
Leveraging Unlabeled Data for Crowd Counting by Learning to Rank, CVPR 2018https://arxiv.org/pdf/1803.03095.pdf
32
Bayesian Loss for Crowd Count Estimation with Point Supervision, ICCV 2019https://arxiv.org/pdf/1908.03684.pdf
33
3/11/2020
34
PresenterTitle (Self-supervised learning)Paper/Project/Video/Slides linksshort descriptionSlides
35
Cristina, QiaomuDoersch et al., Unsupervised Visual Representation Learning by Context Prediction, 2015https://www.cv-foundation.org/openaccess/content_iccv_2015/papers/Doersch_Unsupervised_Visual_Representation_ICCV_2015_paper.pdfhttps://docs.google.com/presentation/d/18NTBaOoOQHZdGVI6VRfKOUs5NIivXsKy0RgdAgBD980/edit?ts=5e6678c4#slide=id.g81461fa5ad_0_0
36
Ruyi, KumaraPathak et al., Context Encoders: Feature Learning by Inpainting, 2016https://arxiv.org/pdf/1604.07379.pdf
an unsupervised visual feature learning algorithm based on filling in large missing areas of the image
https://docs.google.com/presentation/d/1vCAIEfK4yV5DK9_nMi8Fc-wpAc05XYbkBjzCjehZ0kI/edit?usp=sharing
37
Supreeth, David PMisra et al., Shuffle and Learn: Unsupervised Learning using Temporal Order Verification, 2016`https://docs.google.com/presentation/d/1rvghp_IRK_afCUJwEimxXHDZKW-mOJdivGD7XUtdm4A/edit#slide=id.g7f036733b3_0_11
38
Prantik, David BWei et al., Learning and Using the Arrow of Time, 2018https://www.robots.ox.ac.uk/~vgg/publications/2018/Wei18/wei18.pdfhttps://docs.google.com/presentation/d/18SWzTCt3WSGtvzV1YRVWOLx5RObZ4vUeGH0aPdSK0CE/edit?usp=drivesdk
39
Bo, XiaolingJang et al., Grasp2Vec: Learning Object Representations from Self-Supervised Grasping, 2018http://proceedings.mlr.press/v87/jang18a/jang18a.pdf
A self-supervised approach to generating labels for representation learning and object-centric instance grasping.
https://docs.google.com/presentation/d/1zSBj-5ZUUMnEhUflKxtwlMd1s_BY1XGpycdO4cSWtQc/edit?usp=sharing
40
(New optional reading)Piergiovanni et al., Evolving Losses for Unsupervised Video Representation Learning, 2020https://arxiv.org/pdf/2002.12177.pdf
Please check if you are interested in knowing more about self-supervised learning for videos
41
4/1/2020
42
PresenterTitle (Scene text recognition)Paper/Project/Video/Slides linksshort descriptionSlides
43
MinhConvolutional Character Networks. Linjie Xing, Zhi Tian, Weilin Huang and Matthew Scott. ICCV'19http://openaccess.thecvf.com/content_ICCV_2019/papers/Xing_Convolutional_Character_Networks_ICCV_2019_paper.pdf
Every student needs to read this paper and submit the paper summary
https://www.dropbox.com/s/4s4ou1dueqvzgs9/02_SceneTextRecognition.pdf?dl=0
44
MinhABCNet: Real-time Scene Text Spotting with Adaptive Bezier-Curve Network. Y. Liu, H. Chen, C. Shen, T. He, L. Jin, L. Wang. CVPR'20https://arxiv.org/pdf/2002.10200
Every student needs to read this paper and submit the paper summary
https://www.dropbox.com/s/4s4ou1dueqvzgs9/02_SceneTextRecognition.pdf?dl=0
45
MinhReal-time Scene Text Detection with Differentiable Binarization. Minghui Liao, Zhaoyi Wan, Cong Yao, Kai Chen, Xiang Bai. AAAI'20https://aaai.org/Papers/AAAI/2020GB/AAAI-LiaoM.578.pdf
All students do not need to read this paper, unless interested.
https://www.dropbox.com/s/4s4ou1dueqvzgs9/02_SceneTextRecognition.pdf?dl=0
46
Zekun, SouradeepScene Text Recognition via Transformer. Xinjie Feng, Hongxun Yao, Yuankai Yi, Jun Zhang, and Shengping Zhang. ArXiv 2020.https://arxiv.org/abs/2003.08077
Except for Zekun and Souradeep, other students don't need to read this paper.
47
SagnikColab 1https://github.com/MhLiao/DBColabhttps://docs.google.com/document/d/19Qm9Kh0nVBbKjgTt4T2n-THiYVAot-y3q7z2Te6sZ9E/edit?usp=sharing
48
MingzhenColab 2
https://github.com/MalongTech/research-charnet
Colabhttps://www.dropbox.com/s/mxvoku6rh8bzq06/CharNet.zip?dl=0
49
Georgi, PrantikY. Xu, Y. Wang, W. Zhou, Y. Wang, Z. Yang, and X. Bai. Textfield: Learning a deep direction field for irregular scene text detection. IEEE TIP 2019https://arxiv.org/abs/1812.01393
Except for Georgi and Prantik, other students don't need to read this paper.
https://docs.google.com/presentation/d/1ksaTgk2PGPX9fNQLmOuGyD3EfhmKw_3El-ymQQZ5vBM/edit?usp=sharing
50
4/8/2020
51
PresenterTitle (Visual SLAM)Paper/Project/Video/Slides linksshort descriptionSlides
52
Georgi, PrantikY. Xu, Y. Wang, W. Zhou, Y. Wang, Z. Yang, and X. Bai. Textfield: Learning a deep direction field for irregular scene text detection. IEEE TIP 2019https://arxiv.org/abs/1812.01393
Except for Georgi and Prantik, other students don't need to read this paper.
https://docs.google.com/presentation/d/1ksaTgk2PGPX9fNQLmOuGyD3EfhmKw_3El-ymQQZ5vBM/edit?usp=sharing
53
Bo, PratyushRaul Mur-Artal, J. M. M. Montiel, Juan D. Tardos ORB-SLAM: a Versatile and Accurate Monocular SLAM SystemIEEE T-Robotics, 2015https://arxiv.org/pdf/1502.00956.pdf
A complete system using the same ORB features for all SLAM tasks: tracking, mapping, relocalization, and loop closing.
https://docs.google.com/presentation/d/1GvqCLGvobM0bqFW6gnIzAxIrE2VyFplfgEkgNv34sao/edit?usp=sharing
54
Heng, ShahiraRichard A. Newcombe ; Steven J. Lovegrove ; Andrew J. Davison DTAM: Dense tracking and mapping in real-time ICCV 2011https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6126513
https://docs.google.com/presentation/d/1w9i1Gl0PvJKVJhiamgLdI07kDGbQJksOCHhYLRtfoy8/edit?ts=5e8dfd9f#slide=id.g830fadcdf6_1_19
55
Georgi, PavaniMichael Bloesch, Tristan Laidlow, Ronald Clark, Stefan Leutenegger, Andrew J. Davison. Learning Meshes for Dense Visual SLAM. ICCV 2019http://openaccess.thecvf.com/content_ICCV_2019/papers/Bloesch_Learning_Meshes_for_Dense_Visual_SLAM_ICCV_2019_paper.pdfhttps://docs.google.com/presentation/d/1cPGD03BdLTg-woRjDiRHm3-iXb10ojlBEcuzwGqMWCE/edit?usp=sharing
56
57
58
4/15/2020
59
Presenter Title (Optimal Transport)Paper/Project/Video/Slides linksshort descriptionSlides
60
HuidongA brief introduction to optimal transporthttps://docs.google.com/presentation/d/1jkrKsXFduCctNSJ6MpKWCFQ6BwEVwrPJYTFBVGijdLM/edit#slide=id.p
61
Kumara, LeiWasserstein Generative Adversarial Networks, ICML 2017https://arxiv.org/abs/1701.07875WGANhttps://docs.google.com/presentation/d/1jkrKsXFduCctNSJ6MpKWCFQ6BwEVwrPJYTFBVGijdLM/edit#slide=id.p
62
Tao, ZhiboImproved training of Wasserstein GANs, NeurIPS 2018https://arxiv.org/pdf/1704.00028.pdfWGAN-GPhttps://docs.google.com/presentation/d/1jkrKsXFduCctNSJ6MpKWCFQ6BwEVwrPJYTFBVGijdLM/edit#slide=id.p
63
David B, MingzhenSpectral Normalization for Generative Adversarial Networks, ICLR 2018https://arxiv.org/abs/1802.05957Spectral Normalizationhttps://docs.google.com/presentation/d/1jkrKsXFduCctNSJ6MpKWCFQ6BwEVwrPJYTFBVGijdLM/edit#slide=id.p
64
4/22/2020
65
PresenterTitle (Flow Models)Paper/Project/Video/Slides linksshort descriptionSlides
66
Jingyi, VuDiederik P. Kingma, and Prafulla Dhariwal. “Glow: Generative flow with invertible 1x1 convolutions.” NIPS 2018.https://arxiv.org/pdf/1807.03039.pdfhttps://drive.google.com/file/d/1EenQ7OqmYl2cdjR0aUQaUaj1PM2fMEj8/view?usp=sharing
67
Lei, SagnikC-Flow: Conditional Generative Flow Models for Images and 3D Point Cloudshttps://arxiv.org/pdf/1912.07009.pdfhttps://docs.google.com/presentation/d/1qtz777CROaMTy4MqOw_W2llu0iUm9MrHXIEUgGluJiM/edit?usp=sharing
68
Huidong, TaoImplicit Generation and Generalization Methods for Energy-Based Modelshttps://arxiv.org/pdf/1903.08689.pdfIGEBM: Good quality results on ImageNethttps://docs.google.com/presentation/d/1X3Z9S1TUhwwtmJuAvvlfmr-_xAh3Qy-UecHH669e5SA/edit?ts=5e9db9b2#slide=id.p
69
Prantik, XiaolingYour classifier is secretly an energy based model and you should treat it like one, ICLR2020https://openreview.net/pdf?id=Hkxzx0NtDB
Joint Energy Based Model: Creates a generative model using a standard image classifier. It unifies generative and discriminative models quite nicely and is an insightful read.
https://docs.google.com/presentation/d/1sgHldKbpNPAeCK1cjz7sVRHyZSPIfvOHIp54nRHc18w/edit?usp=sharing
70
Souradeep, RuyiDo Deep Generative Models Know What They Don't Know? https://arxiv.org/pdf/1810.09136.pdf
This is a good paper on the shortcomings of current generative models and can provide interesting research directions to look into.
https://docs.google.com/presentation/d/16XbERXrNXrDP5LRHVVRHTtlC5bE_EkDFIFv_2UvygFw/edit?usp=sharing
71
4/29/2020
72
PresenterTitle (Neural architecture search)Paper/Project/Video/Slides linksshort descriptionSlides
73
Pratyush, PavaniZoph and Le, Neural Architecture Search with Reinforcement Learning, 2017https://arxiv.org/pdf/1611.01578.pdfhttps://docs.google.com/presentation/d/1QZgbce781u9PP9FdFlf39TITBhPZ_Nid/edit#slide=id.p1
74
Xiaoling, ShahiraLiu et al., DARTS: Differentiable Architecture Search, ICLR 2019https://arxiv.org/pdf/1806.09055.pdfhttps://docs.google.com/presentation/d/1Wsz3Fs_kN_A3IZCFw0HItCT5wso60uK7T0kXElVO0J8/edit?usp=sharing
75
Vu, VinhTan and Le, EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks, 2019https://arxiv.org/pdf/1905.11946.pdfhttps://docs.google.com/presentation/d/1ZOhiFZO2Bt2Vy-iwEJQ4SNwJfvKdf88Ub1eQHM8rct0/edit?usp=sharing
76
Zekun, SupreethXie et al., Exploring Randomly Wired Neural Networks for Image Recognition, 2019
http://openaccess.thecvf.com/content_ICCV_2019/papers/Xie_Exploring_Randomly_Wired_Neural_Networks_for_Image_Recognition_ICCV_2019_paper.pdf
https://docs.google.com/presentation/d/17c7zK79i0LerJq2oI_ytq6Uk_vxqJDME9ClofkykNJ0/edit#slide=id.g7ff87733a4_5_29
77
Michael RyooRyoo et al., AssembleNet: Searching for Multi-Stream Neural Connectivity in Video Architectures, 2019https://arxiv.org/pdf/1905.13209.pdf
78
5/6/2020
79
PresenterTitle (Graph Neural Network)Paper/Project/Video/Slides linksshort descriptionSlides
80
Qiaomu, JingyiXinhong Ma, Tianzhu Zhang, Changsheng Xu. GCAN: Graph Convolutional Adversarial Network for Unsupervised Domain Adaptation. CVPR 2019.http://openaccess.thecvf.com/content_CVPR_2019/papers/Ma_GCAN_Graph_Convolutional_Adversarial_Network_for_Unsupervised_Domain_Adaptation_CVPR_2019_paper.pdfhttps://docs.google.com/presentation/d/1OFdE_ZVCHrObqWgHgx23H1sWn9GIw1U30g8CZeP9t2Q/edit?usp=sharing
81
Jiaxiang, SagnikJiwoong Park, Minsik Lee, Hyung Jin Chang, Kyuewang Lee, Jin Young Choi. Symmetric Graph Convolutional Autoencoder for Unsupervised Graph Representation Learning. ICCV 2019.https://arxiv.org/pdf/1908.02441.pdfhttps://docs.google.com/presentation/d/1BT-6HmiktHsQolxToaeem2IE3ajslluOKb1JplapMoQ/edit?usp=sharing
82
David P, SagnikLong Zhao, Xi Peng, Yu Tian, Mubbasir Kapadia, Dimitris N. Metaxas. Semantic Graph Convolutional Networks for 3D Human Pose Regression. CVPR 2019
http://openaccess.thecvf.com/content_CVPR_2019/papers/Zhao_Semantic_Graph_Convolutional_Networks_for_3D_Human_Pose_Regression_CVPR_2019_paper.pdf
https://docs.google.com/presentation/d/1hv_1pQpAZg8kbsVyArxwGgq6s0CRe6q64MSbJKkMOlg/edit?usp=sharing
83
Heng, JiaxiangPaul Swoboda, Dagmar Kainm"uller, Ashkan Mokarian, Christian Theobalt, Florian Bernard. A Convex Relaxation for Multi-Graph Matching. CVPR 2019.http://openaccess.thecvf.com/content_CVPR_2019/papers/Swoboda_A_Convex_Relaxation_for_Multi-Graph_Matching_CVPR_2019_paper.pdfhttps://docs.google.com/presentation/d/1r54RnwfWdF6Tjh2IRt6tZdX8NSv-z3D3O53SUnYl5yE/edit#slide=id.g84b878a7d3_0_72
84
Zhibo, MingzhenJongmin Kim, Taesup Kim, Sungwoong Kim, Chang D. Yoo. Edge-Labeling Graph Neural Network for Few-Shot Learning. CVPR 2019.https://arxiv.org/pdf/1905.01436.pdfhttps://docs.google.com/presentation/d/1U4nXV0C66Muu6zjsmsbG1VKFpKMLuvq_h9zeR-0lJ2Q/edit?usp=sharing
85
5.13
86
5.20th
87
88
89
90
91
92
93
94
95
96
97
98
99
100