当前位置: X-MOL 学术J. Syst. Softw. › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
Testing and validating machine learning classifiers by metamorphic testing
Journal of Systems and Software ( IF 3.7 ) Pub Date : 2011-04-01 , DOI: 10.1016/j.jss.2010.11.920
Xiaoyuan Xie 1 , Joshua W K Ho , Christian Murphy , Gail Kaiser , Baowen Xu , Tsong Yueh Chen
Affiliation  

Machine Learning algorithms have provided core functionality to many application domains - such as bioinformatics, computational linguistics, etc. However, it is difficult to detect faults in such applications because often there is no "test oracle" to verify the correctness of the computed outputs. To help address the software quality, in this paper we present a technique for testing the implementations of machine learning classification algorithms which support such applications. Our approach is based on the technique "metamorphic testing", which has been shown to be effective to alleviate the oracle problem. Also presented include a case study on a real-world machine learning application framework, and a discussion of how programmers implementing machine learning algorithms can avoid the common pitfalls discovered in our study. We also conduct mutation analysis and cross-validation, which reveal that our method has high effectiveness in killing mutants, and that observing expected cross-validation result alone is not sufficiently effective to detect faults in a supervised classification program. The effectiveness of metamorphic testing is further confirmed by the detection of real faults in a popular open-source classification program.

中文翻译:

通过变形测试测试和验证机器学习分类器

机器学习算法为许多应用领域提供了核心功能——例如生物信息学、计算语言学等。然而,在这些应用中很难检测到错误,因为通常没有“测试预言机”来验证计算输出的正确性。为了帮助解决软件质量问题,在本文中,我们提出了一种用于测试支持此类应用程序的机器学习分类算法的实现的技术。我们的方法基于“变形测试”技术,该技术已被证明可以有效缓解预言机问题。还介绍了一个关于真实世界机器学习应用程序框架的案例研究,以及关于程序员如何实现机器学习算法可以避免我们研究中发现的常见陷阱的讨论。我们还进行了突变分析和交叉验证,这表明我们的方法在杀死突变体方面具有很高的效率,并且仅观察预期的交叉验证结果不足以有效地检测监督分类程序中的错误。在流行的开源分类程序中检测真实故障进一步证实了变形测试的有效性。
更新日期:2011-04-01
down
wechat
bug