当前位置: X-MOL 学术Wirel. Commun. Mob. Comput. › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
An Encoder-Decoder Network Based FCN Architecture for Semantic Segmentation
Wireless Communications and Mobile Computing ( IF 2.146 ) Pub Date : 2020-07-07 , DOI: 10.1155/2020/8861886
Yongfeng Xing 1, 2 , Luo Zhong 1 , Xian Zhong 1
Affiliation  

In recent years, the convolutional neural network (CNN) has made remarkable achievements in semantic segmentation. The method of semantic segmentation has a desirable application prospect. Nowadays, the methods mostly use an encoder-decoder architecture as a way of generating pixel by pixel segmentation prediction. The encoder is for extracting feature maps and decoder for recovering feature map resolution. An improved semantic segmentation method on the basis of the encoder-decoder architecture is proposed. We can get better segmentation accuracy on several hard classes and reduce the computational complexity significantly. This is possible by modifying the backbone and some refining techniques. Finally, after some processing, the framework has achieved good performance in many datasets. In comparison with the traditional architecture, our architecture does not need additional decoding layer and further reuses the encoder weight, thus reducing the complete quantity of parameters needed for processing. In this paper, a modified focal loss function is also put forward, as a replacement for the cross-entropy function to achieve a better treatment of the imbalance problem of the training data. In addition, more context information is added to the decode module as a way of improving the segmentation results. Experiments prove that the presented method can get better segmentation results. As an integral part of a smart city, multimedia information plays an important role. Semantic segmentation is an important basic technology for building a smart city.

中文翻译:

基于编码器-解码器网络的FCN语义分割架构

近年来,卷积神经网络(CNN)在语义分割方面取得了令人瞩目的成就。语义分割方法具有理想的应用前景。如今,这些方法大多使用编码器-解码器体系结构作为逐像素分割预测生成像素的方法。编码器用于提取特征图,解码器用于恢复特征图分辨率。提出了一种基于编码器-解码器架构的改进语义分割方法。我们可以在几个硬类上获得更好的分割精度,并显着降低计算复杂度。这可以通过修改主干和一些改进技术来实现。最后,经过一些处理,该框架在许多数据集中都取得了良好的性能。与传统建筑相比,我们的架构不需要额外的解码层,并且可以进一步重用编码器权重,从而减少了处理所需参数的完整数量。本文还提出了一种改进的焦点损失函数,作为交叉熵函数的替代,可以更好地处理训练数据的不平衡问题。另外,更多的上下文信息被添加到解码模块,以改善分割结果。实验证明,该方法能取得较好的分割效果。作为智慧城市不可或缺的一部分,多媒体信息扮演着重要角色。语义分割是建设智慧城市的重要基础技术。因此减少了处理所需参数的完整数量。本文还提出了一种改进的焦点损失函数,作为交叉熵函数的替代,可以更好地处理训练数据的不平衡问题。另外,更多的上下文信息被添加到解码模块,以改善分割结果。实验证明,该方法能取得较好的分割效果。作为智慧城市不可或缺的一部分,多媒体信息扮演着重要角色。语义分割是建设智慧城市的重要基础技术。因此减少了处理所需参数的完整数量。本文还提出了一种改进的焦点损失函数,作为交叉熵函数的替代,可以更好地处理训练数据的不平衡问题。另外,更多的上下文信息被添加到解码模块,以改善分割结果。实验证明,该方法能取得较好的分割效果。作为智慧城市不可或缺的一部分,多媒体信息扮演着重要角色。语义分割是建设智慧城市的重要基础技术。更多的上下文信息被添加到解码模块,以改善分割结果。实验证明,该方法能取得较好的分割效果。作为智慧城市不可或缺的一部分,多媒体信息扮演着重要的角色。语义分割是建设智慧城市的重要基础技术。更多的上下文信息被添加到解码模块,以改善分割结果。实验证明,该方法能取得较好的分割效果。作为智慧城市不可或缺的一部分,多媒体信息扮演着重要角色。语义分割是建设智慧城市的重要基础技术。
更新日期:2020-07-07
down
wechat
bug