Malware Detection Using CNN via Word Embedding in Cloud Computing Infrastructure,Scientific Programming

当前位置： X-MOL 学术 › Sci. Program. › 论文详情

Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)

Malware Detection Using CNN via Word Embedding in Cloud Computing Infrastructure
Scientific Programming Pub Date : 2021-09-13 , DOI: 10.1155/2021/8381550
Rong Wang ₁ , Cong Tian ₁ , Lin Yan ₁

Affiliation

The Internet of Things (IoT), cloud, and fog computing paradigms provide a powerful large-scale computing infrastructure for a variety of data and computation-intensive applications. These cutting-edge computing infrastructures, however, are nevertheless vulnerable to serious security and privacy risks. One of the most important countermeasures against cybersecurity threats is intrusion detection and prevention systems, which monitor devices, networks, and systems for malicious activity and policy violations. The detection and prevention systems range from antivirus software to hierarchical systems that monitor the traffic of whole backbone networks. At the moment, the primary defensive solutions are based on malware feature extraction. Most known feature extraction algorithms use byte N-gram patterns or binary strings to represent log files or other static information. The information taken from program files is expressed using word embedding (GloVe) and a new feature extraction method proposed in this article. As a result, the relevant vector space model (VSM) will incorporate more information about unknown programs. We utilize convolutional neural network (CNN) to analyze the feature maps represented by word embedding and apply Softmax to fit the probability of a malicious program. Eventually, we consider a program to be malicious if the probability is greater than 0.5; otherwise, it is a benign program. Experimental result shows that our approach achieves a level of accuracy higher than 98%.

中文翻译：

在云计算基础设施中通过词嵌入使用 CNN 进行恶意软件检测

物联网 (IoT)、云和雾计算范式为各种数据和计算密集型应用程序提供了强大的大规模计算基础设施。然而，这些尖端的计算基础设施仍然容易受到严重的安全和隐私风险的影响。针对网络安全威胁的最重要对策之一是入侵检测和预防系统，它监控设备、网络和系统是否存在恶意活动和策略违规。检测和预防系统的范围从防病毒软件到监控整个骨干网流量的分层系统。目前，主要的防御解决方案是基于恶意软件特征提取。大多数已知的特征提取算法使用字节 N-gram 模式或二进制字符串来表示日志文件或其他静态信息。从程序文件中获取的信息使用词嵌入（GloVe）和本文提出的新特征提取方法表示。因此，相关向量空间模型 (VSM) 将包含更多有关未知程序的信息。我们利用卷积神经网络 (CNN) 来分析词嵌入表示的特征图，并应用 Softmax 来拟合恶意程序的概率。最终，如果概率大于 0.5，我们认为程序是恶意的；否则，它是一个良性程序。实验结果表明，我们的方法达到了高于 98% 的准确率。从程序文件中获取的信息使用词嵌入（GloVe）和本文提出的新特征提取方法表示。因此，相关向量空间模型 (VSM) 将包含更多有关未知程序的信息。我们利用卷积神经网络 (CNN) 来分析词嵌入表示的特征图，并应用 Softmax 来拟合恶意程序的概率。最终，如果概率大于 0.5，我们认为程序是恶意的；否则，它是一个良性程序。实验结果表明，我们的方法达到了高于 98% 的准确率。从程序文件中获取的信息使用词嵌入（GloVe）和本文提出的新特征提取方法表示。因此，相关向量空间模型 (VSM) 将包含更多有关未知程序的信息。我们利用卷积神经网络 (CNN) 来分析词嵌入表示的特征图，并应用 Softmax 来拟合恶意程序的概率。最终，如果概率大于 0.5，我们认为程序是恶意的；否则，它是一个良性程序。实验结果表明，我们的方法达到了高于 98% 的准确率。我们利用卷积神经网络 (CNN) 来分析词嵌入表示的特征图，并应用 Softmax 来拟合恶意程序的概率。最终，如果概率大于 0.5，我们认为程序是恶意的；否则，它是一个良性程序。实验结果表明，我们的方法达到了高于 98% 的准确率。我们利用卷积神经网络 (CNN) 来分析词嵌入表示的特征图，并应用 Softmax 来拟合恶意程序的概率。最终，如果概率大于 0.5，我们认为程序是恶意的；否则，它是一个良性程序。实验结果表明，我们的方法达到了高于 98% 的准确率。

更新日期：2021-09-13

点击分享查看原文

点击收藏

公开下载

阅读更多本刊最新论文本刊介绍/投稿指南11