样式: 排序: IF: - GO 导出 标记为已读
-
Softmax-Free Linear Transformers Int. J. Comput. Vis. (IF 19.5) Pub Date : 2024-03-13 Jiachen Lu, Junge Zhang, Xiatian Zhu, Jianfeng Feng, Tao Xiang, Li Zhang
-
One-Shot Neural Face Reenactment via Finding Directions in GAN’s Latent Space Int. J. Comput. Vis. (IF 19.5) Pub Date : 2024-03-13
Abstract In this paper, we present our framework for neural face/head reenactment whose goal is to transfer the 3D head orientation and expression of a target face to a source face. Previous methods focus on learning embedding networks for identity and head pose/expression disentanglement which proves to be a rather hard task, degrading the quality of the generated images. We take a different approach
-
PL $${}_{1}$$ P: Point-Line Minimal Problems under Partial Visibility in Three Views Int. J. Comput. Vis. (IF 19.5) Pub Date : 2024-03-10
Abstract We present a complete classification of minimal problems for generic arrangements of points and lines in space observed partially by three calibrated perspective cameras when each line is incident to at most one point. This is a large class of interesting minimal problems that allows missing observations in images due to occlusions and missed detections. There is an infinite number of such
-
Deep Learning Technique for Human Parsing: A Survey and Outlook Int. J. Comput. Vis. (IF 19.5) Pub Date : 2024-03-09 Lu Yang, Wenhe Jia, Shan Li, Qing Song
-
Unsupervised Point Cloud Representation Learning by Clustering and Neural Rendering Int. J. Comput. Vis. (IF 19.5) Pub Date : 2024-03-08 Guofeng Mei, Cristiano Saltori, Elisa Ricci, Nicu Sebe, Qiang Wu, Jian Zhang, Fabio Poiesi
-
Adaptive Multi-Source Predictor for Zero-Shot Video Object Segmentation Int. J. Comput. Vis. (IF 19.5) Pub Date : 2024-03-07 Xiaoqi Zhao, Shijie Chang, Youwei Pang, Jiaxing Yang, Lihe Zhang, Huchuan Lu
-
Open Set Recognition in Real World Int. J. Comput. Vis. (IF 19.5) Pub Date : 2024-03-07 Zhen Yang, Jun Yue, Pedram Ghamisi, Shiliang Zhang, Jiayi Ma, Leyuan Fang
-
Does Confusion Really Hurt Novel Class Discovery? Int. J. Comput. Vis. (IF 19.5) Pub Date : 2024-03-07
Abstract When sampling data of specific classes (i.e., known classes) for a scientific task, collectors may encounter unknown classes (i.e., novel classes). Since these novel classes might be valuable for future research, collectors will also sample them and assign them to several clusters with the help of known-class data. This assigning process is known as novel class discovery (NCD). However, category
-
Domain Generalization with Small Data Int. J. Comput. Vis. (IF 19.5) Pub Date : 2024-03-06 Kecheng Chen, Elena Gal, Hong Yan, Haoliang Li
-
A Survey on Global LiDAR Localization: Challenges, Advances and Open Problems Int. J. Comput. Vis. (IF 19.5) Pub Date : 2024-03-06 Huan Yin, Xuecheng Xu, Sha Lu, Xieyuanli Chen, Rong Xiong, Shaojie Shen, Cyrill Stachniss, Yue Wang
-
CBNet: A Plug-and-Play Network for Segmentation-Based Scene Text Detection Int. J. Comput. Vis. (IF 19.5) Pub Date : 2024-03-05 Xi Zhao, Wei Feng, Zheng Zhang, Jingjing Lv, Xin Zhu, Zhangang Lin, Jinghe Hu, Jingping Shao
-
Automated Detection of Cat Facial Landmarks Int. J. Comput. Vis. (IF 19.5) Pub Date : 2024-03-05
Abstract The field of animal affective computing is rapidly emerging, and analysis of facial expressions is a crucial aspect. One of the most significant challenges that researchers in the field currently face is the scarcity of high-quality, comprehensive datasets that allow the development of models for facial expressions analysis. One of the possible approaches is the utilisation of facial landmarks
-
PanAf20K: A Large Video Dataset for Wild Ape Detection and Behaviour Recognition Int. J. Comput. Vis. (IF 19.5) Pub Date : 2024-03-04
Abstract We present the PanAf20K dataset, the largest and most diverse open-access annotated video dataset of great apes in their natural environment. It comprises more than 7 million frames across \(\sim \) 20,000 camera trap videos of chimpanzees and gorillas collected at 18 field sites in tropical Africa as part of the Pan African Programme: The Cultured Chimpanzee. The footage is accompanied by
-
Cross-Modal Fusion and Progressive Decoding Network for RGB-D Salient Object Detection Int. J. Comput. Vis. (IF 19.5) Pub Date : 2024-03-02 Xihang Hu, Fuming Sun, Jing Sun, Fasheng Wang, Haojie Li
-
Uncertainty Modeling for Group Re-Identification Int. J. Comput. Vis. (IF 19.5) Pub Date : 2024-03-01 Quan Zhang, Jianhuang Lai, Zhanxiang Feng, Xiaohua Xie
-
SplatFlow: Learning Multi-frame Optical Flow via Splatting Int. J. Comput. Vis. (IF 19.5) Pub Date : 2024-02-29
Abstract The occlusion problem remains a crucial challenge in optical flow estimation (OFE). Despite the recent significant progress brought about by deep learning, most existing deep learning OFE methods still struggle to handle occlusions; in particular, those based on two frames cannot correctly handle occlusions because occluded regions have no visual correspondences. However, there is still hope
-
A Survey on Adaptive Cameras Int. J. Comput. Vis. (IF 19.5) Pub Date : 2024-02-28 Julien Ducrocq, Guillaume Caron
-
Estimation of Near-Instance-Level Attribute Bottleneck for Zero-Shot Learning Int. J. Comput. Vis. (IF 19.5) Pub Date : 2024-02-27 Chenyi Jiang, Yuming Shen, Dubing Chen, Haofeng Zhang, Ling Shao, Philip H. S. Torr
-
Correction to: Deep Unpaired Blind Image Super-Resolution Using Self-supervised Learning and Exemplar Distillation Int. J. Comput. Vis. (IF 19.5) Pub Date : 2024-02-26 Jiangxin Dong, Haoran Bai, Jinhui Tang, Jinshan Pan
-
Training Object Detectors from Scratch: An Empirical Study in the Era of Vision Transformer Int. J. Comput. Vis. (IF 19.5) Pub Date : 2024-02-26 Weixiang Hong, Wang Ren, Jiangwei Lao, Lele Xie, Liheng Zhong, Jian Wang, Jingdong Chen, Honghai Liu, Wei Chu
-
Robust Heterogeneous Model Fitting for Multi-source Image Correspondences Int. J. Comput. Vis. (IF 19.5) Pub Date : 2024-02-23 Shuyuan Lin, Feiran Huang, Taotao Lai, Jianhuang Lai, Hanzi Wang, Jian Weng
-
FunnyNet-W: Multimodal Learning of Funny Moments in Videos in the Wild Int. J. Comput. Vis. (IF 19.5) Pub Date : 2024-02-23 Zhi-Song Liu, Robin Courant, Vicky Kalogeiton
-
Learning to Generalize over Subpartitions for Heterogeneity-Aware Domain Adaptive Nuclei Segmentation Int. J. Comput. Vis. (IF 19.5) Pub Date : 2024-02-22 Jianan Fan, Dongnan Liu, Hang Chang, Weidong Cai
-
UniMod1K: Towards a More Universal Large-Scale Dataset and Benchmark for Multi-modal Learning Int. J. Comput. Vis. (IF 19.5) Pub Date : 2024-02-22 Xue-Feng Zhu, Tianyang Xu, Zongtao Liu, Zhangyong Tang, Xiao-Jun Wu, Josef Kittler
-
Semantic-Aligned Matching for Enhanced DETR Convergence and Multi-Scale Feature Fusion Int. J. Comput. Vis. (IF 19.5) Pub Date : 2024-02-20 Gongjie Zhang, Zhipeng Luo, Jiaxing Huang, Shijian Lu, Eric P. Xing
-
Cross-Architecture Knowledge Distillation Int. J. Comput. Vis. (IF 19.5) Pub Date : 2024-02-19 Yufan Liu, Jiajiong Cao, Bing Li, Weiming Hu, Jingting Ding, Liang Li, Stephen Maybank
-
Hugs Bring Double Benefits: Unsupervised Cross-Modal Hashing with Multi-granularity Aligned Transformers Int. J. Comput. Vis. (IF 19.5) Pub Date : 2024-02-18
Abstract Unsupervised cross-modal hashing (UCMH) has been commonly explored to support large-scale cross-modal retrieval of unlabeled data. Despite promising progress, most existing approaches are developed on convolutional neural network and multilayer perceptron architectures, sacrificing the quality of hash codes due to limited capacity for excavating multi-modal semantics. To pursue better content
-
Annotation-Free Human Sketch Quality Assessment Int. J. Comput. Vis. (IF 19.5) Pub Date : 2024-02-17 Lan Yang, Kaiyue Pang, Honggang Zhang, Yi-Zhe Song
-
MixStyle Neural Networks for Domain Generalization and Adaptation Int. J. Comput. Vis. (IF 19.5) Pub Date : 2024-03-01 Kaiyang Zhou, Yongxin Yang, Yu Qiao, Tao Xiang
-
Background Activation Suppression for Weakly Supervised Object Localization and Semantic Segmentation Int. J. Comput. Vis. (IF 19.5) Pub Date : 2024-03-01 Wei Zhai, Pingyu Wu, Kai Zhu, Yang Cao, Feng Wu, Zheng-Jun Zha
-
3D Adversarial Augmentations for Robust Out-of-Domain Predictions Int. J. Comput. Vis. (IF 19.5) Pub Date : 2024-03-01 Alexander Lehner, Stefano Gasperini, Alvaro Marcos-Ramiro, Michael Schmidt, Nassir Navab, Benjamin Busam, Federico Tombari
-
ReliTalk: Relightable Talking Portrait Generation from a Single Video Int. J. Comput. Vis. (IF 19.5) Pub Date : 2024-02-16
Abstract Recent years have witnessed great progress in creating vivid audio-driven portraits from monocular videos. However, how to seamlessly adapt the created video avatars to other scenarios with different backgrounds and lighting conditions remains unsolved. On the other hand, existing relighting studies mostly rely on dynamically lighted or multi-view data, which are too expensive for creating
-
A New Dataset and a Distractor-Aware Architecture for Transparent Object Tracking Int. J. Comput. Vis. (IF 19.5) Pub Date : 2024-02-16 Alan Lukežič, Žiga Trojer, Jiří Matas, Matej Kristan
-
Learning Adaptive Spatio-Temporal Inference Transformer for Coarse-to-Fine Animal Visual Tracking: Algorithm and Benchmark Int. J. Comput. Vis. (IF 19.5) Pub Date : 2024-02-12 Tianyang Xu, Ze Kang, Xuefeng Zhu, Xiao-Jun Wu
-
Benchmarking the Robustness of LiDAR Semantic Segmentation Models Int. J. Comput. Vis. (IF 19.5) Pub Date : 2024-02-12 Xu Yan, Chaoda Zheng, Ying Xue, Zhen Li, Shuguang Cui, Dengxin Dai
-
Are Multi-view Edges Incomplete for Depth Estimation? Int. J. Comput. Vis. (IF 19.5) Pub Date : 2024-02-12 Numair Khan, Min H. Kim, James Tompkin
-
Relative Norm Alignment for Tackling Domain Shift in Deep Multi-modal Classification Int. J. Comput. Vis. (IF 19.5) Pub Date : 2024-02-09 Mirco Planamente, Chiara Plizzari, Simone Alberto Peirone, Barbara Caputo, Andrea Bottino
-
Focus for Free in Density-Based Counting Int. J. Comput. Vis. (IF 19.5) Pub Date : 2024-02-09 Zenglin Shi, Pascal Mettes, Cees G. M. Snoek
-
HVDistill: Transferring Knowledge from Images to Point Clouds via Unsupervised Hybrid-View Distillation Int. J. Comput. Vis. (IF 19.5) Pub Date : 2024-02-07 Sha Zhang, Jiajun Deng, Lei Bai, Houqiang Li, Wanli Ouyang, Yanyong Zhang
-
InterCap: Joint Markerless 3D Tracking of Humans and Objects in Interaction from Multi-view RGB-D Images Int. J. Comput. Vis. (IF 19.5) Pub Date : 2024-02-06 Yinghao Huang, Omid Taheri, Michael J. Black, Dimitrios Tzionas
-
Deep Learning Based Prediction of Pulmonary Hypertension in Newborns Using Echocardiograms Int. J. Comput. Vis. (IF 19.5) Pub Date : 2024-02-06 Hanna Ragnarsdottir, Ece Ozkan, Holger Michel, Kieran Chin-Cheong, Laura Manduchi, Sven Wellmann, Julia E. Vogt
-
HSCNet++: Hierarchical Scene Coordinate Classification and Regression for Visual Localization with Transformer Int. J. Comput. Vis. (IF 19.5) Pub Date : 2024-02-06 Shuzhe Wang, Zakaria Laskar, Iaroslav Melekhov, Xiaotian Li, Yi Zhao, Giorgos Tolias, Juho Kannala
-
Robust Object Re-identification with Coupled Noisy Labels Int. J. Comput. Vis. (IF 19.5) Pub Date : 2024-02-05 Mouxing Yang, Zhenyu Huang, Xi Peng
-
Geometric Prior Guided Feature Representation Learning for Long-Tailed Classification Int. J. Comput. Vis. (IF 19.5) Pub Date : 2024-02-05 Yanbiao Ma, Licheng Jiao, Fang Liu, Shuyuan Yang, Xu Liu, Puhua Chen
-
HCLR-Net: Hybrid Contrastive Learning Regularization with Locally Randomized Perturbation for Underwater Image Enhancement Int. J. Comput. Vis. (IF 19.5) Pub Date : 2024-02-04 Jingchun Zhou, Jiaming Sun, Chongyi Li, Qiuping Jiang, Man Zhou, Kin-Man Lam, Weishi Zhang, Xianping Fu
-
Spatially-Varying Illumination-Aware Indoor Harmonization Int. J. Comput. Vis. (IF 19.5) Pub Date : 2024-01-30 Zhongyun Hu, Jiahao Li, Xue Wang, Qing Wang
-
Multi-dataset Detection with Transformers Int. J. Comput. Vis. (IF 19.5) Pub Date : 2024-01-30 Bo Ke, Ruizhi Qiao, Xing Sun
-
Weakly Supervised Training of Universal Visual Concepts for Multi-domain Semantic Segmentation Int. J. Comput. Vis. (IF 19.5) Pub Date : 2024-01-30 Petra Bevandić, Marin Oršić, Josip Šarić, Ivan Grubišić, Siniša Šegvić
-
Oriented R-CNN and Beyond Int. J. Comput. Vis. (IF 19.5) Pub Date : 2024-01-29 Xingxing Xie, Gong Cheng, Jiabao Wang, Ke Li, Xiwen Yao, Junwei Han
-
Towards Robust Monocular Depth Estimation: A New Baseline and Benchmark Int. J. Comput. Vis. (IF 19.5) Pub Date : 2024-01-20 Ke Xian, Zhiguo Cao, Chunhua Shen, Guosheng Lin
-
Deep Learning-Based Image and Video Inpainting: A Survey Int. J. Comput. Vis. (IF 19.5) Pub Date : 2024-01-19
Abstract Image and video inpainting is a classic problem in computer vision and computer graphics, aiming to fill in the plausible and realistic content in the missing areas of images and videos. With the advance of deep learning, this problem has achieved significant progress recently. The goal of this paper is to comprehensively review the deep learning-based methods for image and video inpainting
-
View-Invariant Skeleton Action Representation Learning via Motion Retargeting Int. J. Comput. Vis. (IF 19.5) Pub Date : 2024-01-16 Di Yang, Yaohui Wang, Antitza Dantcheva, Lorenzo Garattoni, Gianpiero Francesca, François Brémond
-
GyroFlow+: Gyroscope-Guided Unsupervised Deep Homography and Optical Flow Learning Int. J. Comput. Vis. (IF 19.5) Pub Date : 2024-01-14 Haipeng Li, Kunming Luo, Bing Zeng, Shuaicheng Liu
-
Delving into Identify-Emphasize Paradigm for Combating Unknown Bias Int. J. Comput. Vis. (IF 19.5) Pub Date : 2024-01-13 Bowen Zhao, Chen Chen, Qian-Wei Wang, Anfeng He, Shu-Tao Xia
-
S $$^{2}$$ P $$^{3}$$ : Self-Supervised Polarimetric Pose Prediction Int. J. Comput. Vis. (IF 19.5) Pub Date : 2024-01-12
Abstract This paper proposes the first self-supervised 6D object pose prediction from multimodal RGB + polarimetric images. The novel training paradigm comprises (1) a physical model to extract geometric information of polarized light, (2) a teacher–student knowledge distillation scheme and (3) a self-supervised loss formulation through differentiable rendering and an invertible physical constraint
-
ToTem NRSfM: Object-Wise Non-rigid Structure-from-Motion with a Topological Template Int. J. Comput. Vis. (IF 19.5) Pub Date : 2024-01-12 Agniva Sengupta, Adrien Bartoli
-
Generative Adversarial Network Applications in Industry 4.0: A Review Int. J. Comput. Vis. (IF 19.5) Pub Date : 2024-01-12 Chafic Abou Akar, Rachelle Abdel Massih, Anthony Yaghi, Joe Khalil, Marc Kamradt, Abdallah Makhoul
-
Probabilistic-Based Feature Embedding of 4-D Light Fields for Compressive Imaging and Denoising Int. J. Comput. Vis. (IF 19.5) Pub Date : 2024-01-12
Abstract The high-dimensional nature of the 4-D light field (LF) poses great challenges in achieving efficient and effective feature embedding, that severely impacts the performance of downstream tasks. To tackle this crucial issue, in contrast to existing methods with empirically-designed architectures, we propose a probabilistic-based feature embedding (PFE), which learns a feature embedding architecture
-
Reliability-Adaptive Consistency Regularization for Weakly-Supervised Point Cloud Segmentation Int. J. Comput. Vis. (IF 19.5) Pub Date : 2024-01-12 Zhonghua Wu, Yicheng Wu, Guosheng Lin, Jianfei Cai
-
Learning by Asking Questions for Knowledge-Based Novel Object Recognition Int. J. Comput. Vis. (IF 19.5) Pub Date : 2024-01-12 Kohei Uehara, Tatsuya Harada