当前位置: X-MOL 学术IEEE Trans. Vis. Comput. Graph. › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
Sketch-R2CNN: An RNN-Rasterization-CNN Architecture for Vector Sketch Recognition
IEEE Transactions on Visualization and Computer Graphics ( IF 4.7 ) Pub Date : 2020-04-15 , DOI: 10.1109/tvcg.2020.2987626
Lei Li , Changqing Zou , Youyi Zheng , Qingkun Su , Hongbo Fu , Chiew-Lan Tai

Sketches in existing large-scale datasets like the recent QuickDraw collection are often stored in a vector format, with strokes consisting of sequentially sampled points. However, most existing sketch recognition methods rasterize vector sketches as binary images and then adopt image classification techniques. In this article, we propose a novel end-to-end single-branch network architecture RNN-Rasterization-CNN (Sketch-R2CNN for short) to fully leverage the vector format of sketches for recognition. Sketch-R2CNN takes a vector sketch as input and uses an RNN for extracting per-point features in the vector space. We then develop a neural line rasterization module to convert the vector sketch and the per-point features to multi-channel point feature maps, which are subsequently fed to a CNN for extracting convolutional features in the pixel space. Our neural line rasterization module is designed in a differentiable way for end-to-end learning. We perform experiments on existing large-scale sketch recognition datasets and show that the RNN-Rasterization design brings consistent improvement over CNN baselines and that Sketch-R2CNN substantially outperforms the state-of-the-art methods.

中文翻译:


Sketch-R2CNN:用于矢量草图识别的 RNN-光栅化-CNN 架构



现有大型数据集中的草图(例如最近的 QuickDraw 集合)通常以矢量格式存储,笔画由顺序采样点组成。然而,大多数现有的草图识别方法将矢量草图光栅化为二值图像,然后采用图像分类技术。在本文中,我们提出了一种新颖的端到端单分支网络架构 RNN-Rasterization-CNN(简称 Sketch-R2CNN),以充分利用草图的矢量格式进行识别。 Sketch-R2CNN 将矢量草图作为输入,并使用 RNN 提取矢量空间中的每点特征。然后,我们开发了一个神经线光栅化模块,将矢量草图和每点特征转换为多通道点特征图,随后将其馈送到 CNN 以提取像素空间中的卷积特征。我们的神经线光栅化模块是以可微分的方式设计的,用于端到端学习。我们对现有的大规模草图识别数据集进行了实验,结果表明 RNN-光栅化设计相对于 CNN 基线带来了一致的改进,并且 Sketch-R2CNN 大大优于最先进的方法。
更新日期:2020-04-15
down
wechat
bug