Hybrid CNN-Transformer Architecture for Efficient Large-Scale Video Snapshot Compressive Imaging Miao CaoLishun WangXin Yuan OriginalPaper 19 May 2024
Synthetic Data for Video Surveillance Applications of Computer Vision: A Review Rita DelussuLorenzo PutzuGiorgio Fumera OriginalPaper Open access 17 May 2024
Regional Adversarial Training for Better Robust Generalization Chuanbiao SongYanbo FanKun He OriginalPaper 17 May 2024
SA\(^3\)WT: Adaptive Wavelet-Based Transformer with Self-Paced Auto Augmentation for Face Forgery Detection Yihui LiYifan ZhangDi Huang OriginalPaper 16 May 2024
An Adaptive Correlation Filtering Method for Text-Based Person Search Mengyang SunWei SuoQi Wu OriginalPaper 16 May 2024
Open-Vocabulary Text-Driven Human Image Generation Kaiduo ZhangMuyi SunTieniu Tan OriginalPaper 15 May 2024
Benchmarking Object Detection Robustness against Real-World Corruptions Jiawei LiuZhijie WangZhenyu Chen OriginalPaper 15 May 2024
PRCL: Probabilistic Representation Contrastive Learning for Semi-Supervised Semantic Segmentation Haoyu XieChangqi WangBaigui Sun OriginalPaper 14 May 2024
Imbalance-Aware Discriminative Clustering for Unsupervised Semantic Segmentation Mingyuan LiuJicong ZhangWei Tang OriginalPaper 14 May 2024
Adaptive Discriminative Regularization for Visual Classification Qingsong ZhaoYi WangCairong Zhao OriginalPaper 13 May 2024
Exploring the Usage of Pre-trained Features for Stereo Matching Jiawei ZhangLei HuangEdwin Hancock OriginalPaper 11 May 2024
An Empirical Study on Multi-domain Robust Semantic Segmentation Yajie LiuPu GeYunhong Wang OriginalPaper 10 May 2024
Design and Analysis of Efficient Attention in Transformers for Social Group Activity Recognition Masato Tamura OriginalPaper 08 May 2024
3D-MuPPET: 3D Multi-Pigeon Pose Estimation and Tracking Urs WaldmannAlex Hoi Hang ChanFumihiro Kano OriginalPaper Open access 07 May 2024
Towards Diverse Binary Segmentation via a Simple yet General Gated Network Xiaoqi ZhaoYouwei PangLei Zhang OriginalPaper 07 May 2024
Physics-Driven Spectrum-Consistent Federated Learning for Palmprint Verification Ziyuan YangAndrew Beng Jin TeohYi Zhang OriginalPaper 07 May 2024
L3AM: Linear Adaptive Additive Angular Margin Loss for Video-Based Hand Gesture Authentication Wenwei SongWenxiong KangYitao Qiao OriginalPaper 06 May 2024
Meet JEANIE: A Similarity Measure for 3D Skeleton Sequences via Temporal-Viewpoint Alignment Lei WangJun LiuPiotr Koniusz OriginalPaper Open access 06 May 2024
Scaling Up Multi-domain Semantic Segmentation with Sentence Embeddings Wei YinYifan LiuAnton van den Hengel OriginalPaper 01 May 2024
A Causal Inspired Early-Branching Structure for Domain Generalization Liang ChenYong ZhangLingqiao Liu OriginalPaper 30 April 2024
Species-Agnostic Patterned Animal Re-identification by Aggregating Deep Local Features Ekaterina NepovinnykhIlia ChelakCharles V. Stewart OriginalPaper Open access 30 April 2024
Matching Compound Prototypes for Few-Shot Action Recognition Yifei HuangLijin YangYoichi Sato OriginalPaper Open access 29 April 2024
Domain-Agnostic Priors for Semantic Segmentation Under Unsupervised Domain Adaptation and Domain Generalization Xinyue HuoLingxi XieQi Tian OriginalPaper 27 April 2024
Light Flickering Guided Reflection Removal Yuchen HongYakun ChangBoxin Shi OriginalPaper 26 April 2024
PIE: Physics-Inspired Low-Light Enhancement Dong LiangZhengyan XuSongcan Chen OriginalPaper 25 April 2024
Guest Editorial: Special Issue on Traditional Computer Vision in the Age of Deep Learning Matteo PoggiFederica ArrigoniTomas Pajdla Editorial 24 April 2024
Guest Editorial: Special Issue on the British Machine Vision Conference 2022 Guang YangAngelica Aviles-RiveroConstantino Carlos Reyes-Aldasoro Editorial 24 April 2024
I2DFormer+: Learning Image to Document Summary Attention for Zero-Shot Image Classification Muhammad Ferjad NaeemYongqin XianFederico Tombari OriginalPaper 24 April 2024
Integrated Heterogeneous Graph Attention Network for Incomplete Multi-modal Clustering Yu WangXinjie YaoQinghua Hu OriginalPaper 24 April 2024
WildCLIP: Scene and Animal Attribute Retrieval from Camera Trap Data with Domain-Adapted Vision-Language Models Valentin GabeffMarc RußwurmAlexander Mathis OriginalPaper Open access 24 April 2024
An Open-World, Diverse, Cross-Spatial-Temporal Benchmark for Dynamic Wild Person Re-Identification Lei ZhangXiaowei FuXinbo Gao OriginalPaper 24 April 2024
Position, Padding and Predictions: A Deeper Look at Position Information in CNNs Md Amirul IslamMatthew KowalNeil D. B. Bruce OriginalPaper 24 April 2024
Descriptor Distillation: A Teacher-Student-Regularized Framework for Learning Local Descriptors Yuzhen LiuQiulei Dong OriginalPaper 24 April 2024
MutualFormer: Multi-modal Representation Learning via Cross-Diffusion Attention Xixi WangXiao WangBin Luo OriginalPaper 24 April 2024
Multimodal Machine Learning in Image-Based and Clinical Biomedicine: Survey and Prospects Elisa WarnerJoonsang LeeArvind Rao OriginalPaper Open access 23 April 2024
VNAS: Variational Neural Architecture Search Benteng MaJing ZhangDacheng Tao OriginalPaper 23 April 2024
Augmenting the Softmax with Additional Confidence Scores for Improved Selective Classification with Out-of-Distribution Data Guoxuan XiaChristos-Savvas Bouganis OriginalPaper Open access 23 April 2024
On Finite Difference Jacobian Computation in Deformable Image Registration Yihao LiuJunyu ChenJerry Prince OriginalPaper Open access 18 April 2024
Ensemble Quadratic Assignment Network for Graph Matching Haoru TanChuang WangCheng-Lin Liu OriginalPaper 13 April 2024
Error-Aware Conversion from ANN to SNN via Post-training Parameter Calibration Yuhang LiShikuang DengShi Gu OriginalPaper 08 April 2024
CRetinex: A Progressive Color-Shift Aware Retinex Model for Low-Light Image Enhancement Han XuHao ZhangJiayi Ma OriginalPaper 08 April 2024
FSODv2: A Deep Calibrated Few-Shot Object Detection Network Qi FanWei ZhuoYu-Wing Tai OriginalPaper 04 April 2024
EATFormer: Improving Vision Transformer Inspired by Evolutionary Algorithm Jiangning ZhangXiangtai LiDacheng Tao OriginalPaper 02 April 2024
MMoT: Mixture-of-Modality-Tokens Transformer for Composed Multimodal Conditional Image Synthesis Jianbin ZhengDaqing LiuDacheng Tao OriginalPaper 02 April 2024
InterGen: Diffusion-Based Multi-human Motion Generation Under Complex Interactions Han LiangWenqian ZhangLan Xu OriginalPaper 26 March 2024
Hyperbolic Deep Learning in Computer Vision: A Survey Pascal MettesMina Ghadimi AtighSerena Yeung OriginalPaper Open access 26 March 2024
Pictorial and Apictorial Polygonal Jigsaw Puzzles from Arbitrary Number of Crossing Cuts Peleg HarelOfir Itzhak ShaharOhad Ben-Shahar OriginalPaper Open access 22 March 2024