当前位置:
X-MOL 学术
›
Pattern Recognit. Image Anal.
›
论文详情
Our official English website, www.x-mol.net, welcomes your
feedback! (Note: you will need to create a separate account there.)
Jointly Image Annotation and Classification Based on Supervised Multi-Modal Hierarchical Semantic Model
Pattern Recognition and Image Analysis ( IF 0.7 ) Pub Date : 2020-03-31 , DOI: 10.1134/s1054661820010058 Chun-yan Yin , Yong-Heng Chen , Wan-li Zuo
中文翻译:
基于监督多模态分层语义模型的联合图像标注与分类
更新日期:2020-03-31
Pattern Recognition and Image Analysis ( IF 0.7 ) Pub Date : 2020-03-31 , DOI: 10.1134/s1054661820010058 Chun-yan Yin , Yong-Heng Chen , Wan-li Zuo
Abstract
A lot of applications involve capturing correlations from multi-modality data, where available information spans multiple modalities, such as text, images or speech. In this paper, we pay attention to the specific case in which images are both labeled with a category and annotated with free text, and develop a supervised multi-modal hierarchical semantic model (smHSM), where we incorporate image classification into the joint modeling of visual and textual information, for the tasks of image annotation and classification. To evaluate the effectiveness of our model, we experiment our model on two datasets, and compare with other traditional models. The results demonstrate the effectiveness and advantages of our model in caption perplexity, classification accuracy and image annotation accuracy.中文翻译:
基于监督多模态分层语义模型的联合图像标注与分类