| A | B | C | D | E | F | G | H | I | J | K | L | M | N | O | P | Q | R | S | T | U | V | W | X | Y | Z | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
1 | ORAL SESSIONS OF ICPR-2024. Here, R2 means papers accepted from round 2 submissions and COMP means competition papers. | |||||||||||||||||||||||||
2 | December 02, 2024 (11:00 - 12:30 IST) | |||||||||||||||||||||||||
3 | Oral Session 1 | |||||||||||||||||||||||||
4 | Session Number | Session Chair | Category | Paper IDs | Authors | Paper Title | ||||||||||||||||||||
5 | 1-A | Andreas Fisher | Classification | R1-903 | Akhmedova, Shakhnaz; Körber, Nils | Next Generation Loss Function for Image Classification | ||||||||||||||||||||
6 | R2-140 | Xia, Yulong; Zhang, Jianwei | ORA-Trans: Object region attention transformer based on key tokens selector with structure feature modeling for fine-grained visual classification | |||||||||||||||||||||||
7 | R2-640 | Srivastava, Adit; Ramagiri, Aravind; Gupta, Puneet; Gupta, Vivek | SANGAM: Synergizing Local and Global Analysis for simultaneous WBC Classification and Segmentation | |||||||||||||||||||||||
8 | R2-459 | Cao, Zongjing; Li, Yan; Shin, Byeong-Seok | Attention-Guided Energy-based Model for Out-of-Distribution Data Detection | |||||||||||||||||||||||
9 | 1-B | Subhasis Chaudhuri | Representation Learning | R1-529 | Yang, Chuhong; Li, Bin; Wu, Nan | DSparsE: Dynamic Sparse Embedding for Knowledge Graph Completion | ||||||||||||||||||||
10 | R1-229 | Chen, Shangyu; Yang, Xiaohao; Fang, Pengfei; Harandi, Mehrtash; Phung, Dinh Q; Cai, Jianfei | Stereographic Projection for Embedding Hierarchical Structures in Hyperbolic Space | |||||||||||||||||||||||
11 | R2-33 | Cheng, Zhen; Zhu, Fei; Zhang, Xu-Yao; Liu, Cheng-Lin | Delving into Feature Space: Improving Adversarial Robustness by Feature Spectral Regularization | |||||||||||||||||||||||
12 | R2-296 | Huang, Zhixin; He, Yujiang; Nivarthi, Chandana Priya; Sick, Bernhard; Gruhl, Christian | Time-Series Representation Learning via Heterogeneous Spatial-Temporal Contrasting for Remaining Useful Life Prediction | |||||||||||||||||||||||
13 | 1-C | Sebastiano Battiato | Human behavior from images and videos | R1-98 | Kim, Hye-Geun; Na, You-Kyoung; Joe, Hae-Won; Moon, Yong-Hyuk; Cho, Yeong-Jun | Re-identification Based on the Spatial-temporal Fusion Network | ||||||||||||||||||||
14 | R1-403 | Hasan, Kazi Reyazul; Adnan, Muhammad Abdullah | EMPATH: MediaPipe-aided Ensemble Learning with Attention-based Transformers for Accurate Recognition of Bangla Word-Level Sign Language | |||||||||||||||||||||||
15 | R1-883 | Lee, Ming-Han; Zhang, Yu Chen; WU, KUN-RU; Tseng, Yu-Chee | GolfPose: From Regular Posture to Golf Swing Posture | |||||||||||||||||||||||
16 | R1-260 | Xu, Jingwen; Wei, Xiaoge; Yuen, PongChi | Multi-Scale Value-Density Transformer with Medical Semantic Guidance for Disease Risk Prediction based on Clinical Time Series | |||||||||||||||||||||||
17 | 1-D | Larry Ogorman | Text detection and recognition | R1-55 | Muth, Markus; Sablatnig, Robert; Peer, Marco; Kleber, Florian | Advancing Handwritten Text Detection by Synthetic Text | ||||||||||||||||||||
18 | R1-1298 | Mathew, Minesh; Mondal, Ajoy; Jawahar, C.V. | Towards Deployable OCR Models for Indic Languages | |||||||||||||||||||||||
19 | R2-335 | Xu, Shuo; Zhuang, Zeming; Li, Mingjun; Su, Feng | Arbitrary-Shaped Scene Text Recognition with Deformable Ensemble Attention | |||||||||||||||||||||||
20 | R2-283 | Han, Zhiwang; Yadikar, Nurbiya; Xuebin, Xu; Aysa, Alimjan; Ubul, Kurban | Oracle Character Recognition Based on Attention Enhancement and Multi-scale Feature Fusion | |||||||||||||||||||||||
21 | December 02, 2024 (16:30 - 18:30 IST) | |||||||||||||||||||||||||
22 | Oral Session 2 | |||||||||||||||||||||||||
23 | Session Number | Session Chair | Category | Paper IDs | Authors | Paper Title | ||||||||||||||||||||
24 | 2-A | Joao Paulo Papa | Classification and adaptation | R1-41 | Vavilthota, Venkata R; Ramanathan, Ranjith; Aakur, Sathyanarayanan N | Capturing Temporal Components for Time Series Classification | ||||||||||||||||||||
25 | R1-234 | Gupta, Ravi Kant; Das, Shounak; Sethi, Amit | IDAL: Improved Domain Adaptive Learning for Natural Images Dataset | |||||||||||||||||||||||
26 | R1-699 | Gupta, Ravi Kant; P, Chirag; Wagle, Mukta G; Jeevan P, Pranav; Sethi, Amit | CHATTY: Coupled Holistic Adversarial Transport Terms with Yield for Unsupervised Domain Adaptation | |||||||||||||||||||||||
27 | R1-1150 | Makhija, Shradha; Mandal, Srimanta; Pandya, Utkarsh; Chirakkal, Sanid; Putrevu, Deepak | PolSAR Image Classification Using Complex-Valued Squeeze and Excitation Network | |||||||||||||||||||||||
28 | R1-1520 | Mahmud, Hasanul I; Desai, Kevin; Lama, Palden; Prasad, Sushil | EncodeNet: A Framework for Boosting DNN Accuracy with Entropy-driven Generalized Converting Autoencoder | |||||||||||||||||||||||
29 | R1-1533 | Jiu, Mingyuan; zhu, hailong; Sahbi, Hichem | Deep Multi-order Context-aware Kernel Network for Multi-label Classification | |||||||||||||||||||||||
30 | 2-B | Keiji Yanai | Neural networks and deep learning - I | R1-488 | chen, qiyun; diao, boyu; Yang, Yu; xu, yongjun | SCP: A Structure Combination Pruning method via Structured Sparse for Deep Convolutional Neural Networks | ||||||||||||||||||||
31 | R1-511 | jarraya, zakaria; Rousseau, Francois; Benaichouche, Simon; Drumetz, Lucas; Ben Salem, Douraied | On divergence-free neural ODE for classification | |||||||||||||||||||||||
32 | R1-219 | Becker, Marlon; Butz, Marco; Lemli, David; Schuck, Carsten; Risse, Benjamin | Learning Proposal Distributions in Simulated Annealing via Template Networks: A Case Study in Nanophotonic Inverse Design | |||||||||||||||||||||||
33 | R1-1498 | Bennabhaktula, Guru Swaroop; Alegre, Enrique; Strisciuglio, Nicola; Azzopardi, George | PushPull-Net: Inhibition-driven ResNet robust to image corruptions | |||||||||||||||||||||||
34 | R1-1605 | Zhou, Xuan; Kundu, Souvik; Chen, Dake; Huang, Jie; A. Beerel, Peter | What makes vision transformers robust towards bit-flip attack? | |||||||||||||||||||||||
35 | R2-221 | Liu, Yunfeng; Jung, Cheolkon | DWT-SALF: Subband Adaptive Neural Network Based In-Loop Filter for VVC Using Cyclic DWT | |||||||||||||||||||||||
36 | 2-C | In Kyu Park | Motion and video analysis | R1-93 | Benaglia, Riccardo; Porrello, Angelo; Buzzega, Pietro; CALDERARA, SIMONE; Cucchiara, Rita | Trajectory Forecasting through Low-Rank Adaptation of Discrete Latent Codes | ||||||||||||||||||||
37 | R1-225 | Hannan, Tanveer; Koner, Rajat; Bernhard, Maximilian; Shit, Suprosanna; Menze, Bjoern; Tresp, Volker; Schubert, Matthias; Seidl, Thomas | GRAtt-VIS: Gated Residual Attention for Video Instance Segmentation | |||||||||||||||||||||||
38 | R1-789 | Liao, Bor-Chen; Wu, Jie-Syuan; Hsu, Gee-Sern; Kang, Jiunn-Horng; Tang, Chen-Lung | 3D Pose-based Evaluation of the Risk of Sarcopenia | |||||||||||||||||||||||
39 | R1-891 | Zang, Han; Xu, Tianyang; Zhu, Xue-Feng; Song, Xiaoning; Wu, Xiao-Jun; Kittler, Josef | Attention-based Patch Matching and Motion-driven Point Association for Accurate Point Tracking | |||||||||||||||||||||||
40 | R1-1292 | Guo, Song; Liu, Rujie; Narishige, Abe | RTAT: A Robust Two-stage Association Tracker for Multi-Object Tracking | |||||||||||||||||||||||
41 | R1-171 | Roy, Debaditya; Fernando, Basura | Predicting the Next Action by Modeling the Abstract Goal | |||||||||||||||||||||||
42 | 2-D | Jayanta Mukhopadhyay | Image and video processing | R1-79 | Ko, Seonggwan; Cho, Donghyeon | CSSR: Cross-and Self-Feature Transformer with High-Frequency Feature Alignment for Reference-Based Super-Resolution | ||||||||||||||||||||
43 | R1-1230 | Li, Bing; Yu, Wei; Zheng, Naishan; Huang, Jie; Zhao, Feng | Unsupervised Low-light Image Enhancement via Spectral Consistency | |||||||||||||||||||||||
44 | R2-110 | Li, Tianyi; Tian, Ming; Gao, Changxin; Sang, Nong | Self-Distilled Dual-Network with Pixel Screening Loss for Blind Image Deblurring | |||||||||||||||||||||||
45 | R1-1264 | Mo, Fanbin; Huang, Yixiang; Wu, Ming; Zhu, Xun; Zhang, Chuang | MMSISP: A Satellite Image Sequence Prediction Network with Multi-Factor Decoupling and Multi-Modal Fusion | |||||||||||||||||||||||
46 | R2-253 | He, Runlin; Zhou, Gang; Xue, Tianhao; Liu, Zhaoxi; Jia, Zhenhong | Deformable Multi-Scale Network for Snow Removal in Video | |||||||||||||||||||||||
47 | R2-321 | Li, Mingjun; Zhuang, Zeming; Xu, Shuo; Su, Feng | MPGTSRN: Scene Text Image Super-Resolution Guided by Multiple Visual-Semantic Prompts | |||||||||||||||||||||||
48 | December 03, 2024 (10:00 - 11:00 IST) | |||||||||||||||||||||||||
49 | Oral Session 3 | |||||||||||||||||||||||||
50 | Session Number | Session Chair | Category | Paper IDs | Authors | Paper Title | ||||||||||||||||||||
51 | 3-A | Mayank Vatsa | Clustering | R1-648 | Cui, Jianbin; Chu, Lingyang | Interpretable Deep Graph-level Clustering: A Prototype-based Approach | ||||||||||||||||||||
52 | R1-1377 | Bangde, Yashwant; Saini, Naveen | Multi-view Ensemble Clustering-based Podcast Recommendation in Indian Regional Setting | |||||||||||||||||||||||
53 | R2-258 | Lin, Houshen; Hou, Jian; Yuan, Huaqiang | Adaptive Nearest Neighbor Density Peak Clustering Based On Fuzzy Logic | |||||||||||||||||||||||
54 | 3-B | Guoying Zhao | Online and continual learning | R1-836 | Kim, Minha; Bhaumik, Kishor Kumar; Ahsan Ali, Amin; Woo, Simon S | MIXAD: Memory-Induced Explainable Time Series Anomaly Detection | ||||||||||||||||||||
55 | R1-42 | Mosconi, Matteo; Sorokin, Andriy; Panariello, Aniello; Porrello, Angelo; Bonato, Jacopo; Cotogni, Marco; Sabetta, Luigi; CALDERARA, SIMONE; Cucchiara, Rita | Efficient Continual Human Action Recognition using Skeletons | |||||||||||||||||||||||
56 | R2-127 | Feillet, Eva; Popescu, Adrian; Hudelot, Céline | Recommendation of data-free class-incremental learning algorithms by simulating future data | |||||||||||||||||||||||
57 | 3-C | Arpan Pal | Action and behavior recognition | R1-384 | Zou, Yishan; Nugent, Chris; Burns, Matthew; Xi, Xiaoming; Liu, Meng | Towards Open-set Egocentric Action Recognition with Uncertainty Estimation | ||||||||||||||||||||
58 | R1-1090 | SURESH DASS, SHARANA DHARSHIKGAN; Barua, Hrishav Bakul; Krishnasamy, Ganesh; Paramesran, Raveendran; Phan, Raphael CW | ActNetFormer: Transformer-ResNet Hybrid Method for Semi-Supervised Action Recognition in Videos | |||||||||||||||||||||||
59 | R2-26 | Shin, Jongmin; Maiti, Abhishek; Zou, Yuliang; Choi, Jinwoo | Multi-Teacher Invariance Distillation for Domain-Generalized Action Recognition | |||||||||||||||||||||||
60 | 3-D | Shang Hong Lai | Computer aided diagnostics | R1-1779 | Yang, Ge; Qing, Linbo; Zhang, Yanteng; Gao, Feng; Gao, Li; He, Xiaohai; Peng, Yonghong | An attention transformer-based method for the modelling of functional connectivity and the diagnosis of autism spectrum disorder | ||||||||||||||||||||
61 | R2-13 | Wang, Shiyun; Xu, Yongchao | Auxiliary Information Guided Segmentation for the Clinical Target Volume of Cervical Cancer | |||||||||||||||||||||||
62 | R1-992 | Macias, Eric Macias; Morales, Aythami; Pruenza, Cristina; Fierrez, Julian | Privacy-Preserving Statistical Data Generation: Application to Sepsis Detection | |||||||||||||||||||||||
63 | December 03, 2024 (11:30 - 12:30 IST) | |||||||||||||||||||||||||
64 | Oral Session 4 | |||||||||||||||||||||||||
65 | Session Number | Session Chair | Category | Paper IDs | Authors | Paper Title | ||||||||||||||||||||
66 | 4-A | Michal Haindl | Graph models | R1-620 | Wang, Yongyu; Zhuang, Xiaotian | Mitigating the Impact of Noisy Edges on Graph-Based Algorithms via Adversarial Robustness Evaluation | ||||||||||||||||||||
67 | R1-87 | Yin, Naiyu; Yu, Yue; Gao, Tian; Ji, Qiang | Efficient Nonlinear DAG Learning under Projection Framework | |||||||||||||||||||||||
68 | R1-913 | Wu, Qi; Yang, Yingguang; He, Buyun; liu, hao; Yang, Renyu; Liao, Yong; Zhou, Pengyuan | BotSCL: Heterophily-aware Social Bot Detection with Supervised Contrastive Learning | |||||||||||||||||||||||
69 | 4-B | Vishal Patel | Few-shot and zero-shot learning | R1-323 | Nair, Nilah Ravi; Matei, Arthur; Krön, Dennis; Moya Rueda, Fernando; Reining, Christopher; Fink, Gernot A. | Augmentation of Human Activity Data: Convert, Generate, Transform | ||||||||||||||||||||
70 | R1-640 | Chen, Liangyuan; He, Zhenan; Zhang, Hai | Image Domain Translation for Few-Shot Learning | |||||||||||||||||||||||
71 | R1-705 | Sinha, Abhishek Kumar; Mishra, Deepak; S, Manthira Moorthi | Towards Adversarial Robustness and Reducing Uncertainty Bias through Expert Regularized Pseudo-Bidirectional Alignment in Transductive Zero Shot Learning | |||||||||||||||||||||||
72 | 4-C | Ross Whitaker | Fine-grained recognition | R1-1588 | Zhou, Beichen; Bi, Qi; Ding, Jian; Xia, Gui-Song | Boosting Fine-Grained Oriented Object Detection via Text Features | ||||||||||||||||||||
73 | R1-108 | Moratelli, Nicholas; Cornia, Marcella; Baraldi, Lorenzo; Cucchiara, Rita | Fluent and Accurate Image Captioning with a Self-Trained Reward Model | |||||||||||||||||||||||
74 | R1-438 | Frank, Hannah; Vetter, Karl; Varga, Leon; Wolff, Lars; Zell, Andreas | Hyperspectral Imaging for Characterization of Construction Waste Material in Recycling Applications | |||||||||||||||||||||||
75 | 4-D | Bob Fisher | 3D vision | R1-309 | Wan, Junkang; Miao, Yubin; Wu, Hang | Multimodal Point Cloud Completion via Residual Attention Feature Fusion | ||||||||||||||||||||
76 | R1-410 | Pathak, Stuti; McDonald, Thomas M; Sels, Seppe; Penne, Rudi | GP-PCS: One-shot Feature-Preserving Point Cloud Simplification with Gaussian Processes on Riemannian Manifolds | |||||||||||||||||||||||
77 | R1-352 | Nakano, Gaku | Inverse DLT method for One-sided Radially Distortion Homography | |||||||||||||||||||||||
78 | December 03, 2024 (16:30 - 18:30 IST) | |||||||||||||||||||||||||
79 | Oral Session 5 | |||||||||||||||||||||||||
80 | Session Number | Session Chair | Category | Paper IDs | Authors | Paper Title | ||||||||||||||||||||
81 | 5-A | Marwan Torki | Deep learning - I | R1-315 | Liu, Zhoufeng; li, bingrui; Ding, Shumin; Xi, Jiangtao; Li, Chunlei | Cross-Domain Calibration and Boundary Denoising Network for Weakly Supervised Semantic Segmentation | ||||||||||||||||||||
82 | R1-808 | Yang, Jie; Jing, LiWei; Xu, Yuanzhuo; Wu, Shaowu; Drew, Steve; Niu, Xiaoguang | Label-expanded Feature Debiasing for Single Domain Generalization | |||||||||||||||||||||||
83 | R1-1066 | Oehri, Sven; Ebert, Nikolas; Abdullah, Ahmed; Stricker, Didier; Wasenmüller, Oliver | GenFormer - Generated Images are All You Need to Improve Robustness of Transformers on Small Datasets | |||||||||||||||||||||||
84 | R1-319 | Lu, GuangTong; Du, Weidong; Li, Fanzhang | EFLLD-NET: Enhancing Few-Shot Learning With Local Descriptors | |||||||||||||||||||||||
85 | R1-542 | Bylander, Karl; Nyström, Ingela; Bengtsson Bernander, Karl | Equivariant Neural Networks for TEM Virus Images Improves Data Efficiency | |||||||||||||||||||||||
86 | R1-924 | Liu, Jie; LI, QILIN; An, Senjian; Ezard, Brad; Li, Ling | EdgeConvFormer: an unsupervised anomaly detection method for multivariate time series | |||||||||||||||||||||||
87 | 5-B | Santanu Chaudhuri | Weakly supervised and partial label learning | R1-559 | Iqbal, Owais; Chakraborty, Omprakash; Hussain, Aftab; Panda, Rameswar; Das, Abir | SITAR: Semi-Supervised Image Transformer for Action Recognition | ||||||||||||||||||||
88 | R1-725 | Hirner, Dominik; Fraundorfer, Friedrich | SAda-Net: A Self-Supervised Adaptive Stereo Estimation CNN for Remote Sensing Image Data | |||||||||||||||||||||||
89 | R1-1117 | Jiang, Xiaoheng; Xiao, Penghui; Yan, Feng; Lu, Yang; Jin, Shaohui; Xu, Mingliang | Context Mutual Evolution Network for Weakly Supervised Surface Defect Detection | |||||||||||||||||||||||
90 | R1-1196 | Borah, Parashjyoti; Dutta, Aparajita | Conditional Probability-based Feature Embedding for Genomic Sequence Data | |||||||||||||||||||||||
91 | R1-1263 | Xu, Fankang; Qian, Wenbin; Cai, Xingxing; Huang, Jintao; CHEUNG, Yiu-ming; Ding, Weiping | Label Disambiguation-based Feature Selection for Partial Multi-Label Learning | |||||||||||||||||||||||
92 | R2-168 | Wang, Naihao; Yang, Yukun; Yang, Haixin; Li, Ruirui | Enhancing Fairness and Robustness in Label-Noise Learning through Advanced Sample Selection and Adversarial Optimization | |||||||||||||||||||||||
93 | 5-C | Bertrand Kerautret | Object detection and recognition | R1-114 | Marichal, Henry; Passarella, Diego N.; Randall, Gregory | Automatic Wood Pith Detector: Local Orientation Estimation and Robust Accumulation | ||||||||||||||||||||
94 | R1-560 | Maglo, Adrien; Audigier, Romaric | Early Features Distributions Alignment in Visible-to-thermal Unsupervised Domain Adaptation for Object Detection | |||||||||||||||||||||||
95 | R1-1212 | Purkayastha, Kunal; Sarkar, Shashwat; Palaiahnakote, Shivakumara; Pal, Umapada; Ghosal, Palash | DATR: Domain Agnostic Text Recognizer | |||||||||||||||||||||||
96 | R1-1394 | Wang, Jinzhong; Tian, Xuetao; Dai, Shun; Zhuo, Tao; Zeng, Haorui; Liu, Hongjuan; Liu, Jiaqi; Zhang, Xiuwei; Zhang, Yanning | RGB-T Object Detection via Group Shuffled Multi-receptive Attention and Multi-modal Supervision | |||||||||||||||||||||||
97 | R2-56 | Zhong, WenLong; Zhang, Yunfei; Wu, Si | Environment-Independent Fusion for Robust Object Detection in Adverse Environments | |||||||||||||||||||||||
98 | R2-499 | Quan, Yitong; Kiefer, Benjamin; Messmer, Martin; Akupati, Charan; Graser, Rainer; Zell, Andreas | Robust Single-Cam Surround View Object Detection and Localization Using Memory Maps | |||||||||||||||||||||||
99 | 5-D | Ajay Kumar | Biometrics and benchmarking | R1-1265 | Pandey, Anurag; Singh, Pushap; Bhavsar, Arnav; Nigam, Aditya; Acharya, Dr. Divya; Verma, Basu | SIGN-Diffusion: Generating User Specific Online Signature For Digital Verification | ||||||||||||||||||||
100 | R1-62 | Chen, Hongxu; Xiong, Jianghao; Huang, YuHeng; Xie, Xiaohua; Lai, Jian-Huang | Visible-Infrared Person Search: A Novel Benchmark and Solution | |||||||||||||||||||||||