Abstract—
Finding graph edit distances (determining the similarity of graph models) is an important task in various areas of computer science, such as image analysis, machine learning, and chemoinformatics. In recent years, due to the development of process mining techniques, it has become necessary to adapt the existing graph matching methods to be applied to the analysis of process models (annotated graphs) discovered from event logs of information systems. In particular, methods for finding the minimum graph edit distance can be used to reveal patterns (subprocesses) and to compare discovered process models. As was shown experimentally and theoretically substantiated, exact methods for finding the minimum edit distance between the discovered process models (and graphs in the general case) have a great time complexity and can be applied only to small-sized process models. In this paper, we estimate the accuracy and time performance characteristics of a genetic algorithm applied to find distances between process models discovered from the event logs. In particular, we find distances between BPMN (Business Process Model and Notation) models discovered from the event logs by using different synthesis algorithms. It is shown that the genetic algorithm proposed in the paper allows us to significantly reduce the computation time and produces results close to the optimal solutions (the minimum edit distances).
Similar content being viewed by others
REFERENCES
Van der Aalst, W.M.P., Process Mining: Data Science in Action, Springer, 2016, 2nd ed.
Van der Aalst, W.M.P., Weijters, T., and Maruster, L., Workow mining: Discovering process models from event logs, IEEE Trans. Knowl. Data Eng., 2004, vol. 16, no. 9, pp. 1128–1142.
Leemans, S.J.J., Fahland, D., and van der Aalst, W.M.P., Discovering block-structured process models from incomplete event logs, in Application and Theory of Petri Nets and Concurrency, Springer; Lect. Notes Comput. Sci., 2014, vol. 8489, pp. 91–110.
Kalenkova, A.A., Ageev, A.A., Lomazova, I.A., and van der Aalst, W.M.P., E-government services: Comparing real and expected user behavior, in Business Process Management Workshops, Springer International Publishing, 2018, pp. 484–496.
Garey, M.R. and Johnson, D.S., Computers and Intractability: A Guide to the Theory of NP-Completeness, W.H. Freeman & Co., 1990.
Ivanov, S.Y., Kalenkova, A.A., and van der Aalst, W.M.P., BPMNDiffViz: A tool for BPMN models comparison, Proceedings of the BPM Demo Session 2015 Co-located with the 13th International Conference on Business Process Management (BPM 2015) (Innsbruck, Austria, September 2, 2015), 2015, pp. 35–39.
Hart, P.E., Nilsson, N.J., and Raphael, B., A formal basis for the heuristic determination of minimum cost paths, IEEE Trans. Syst. Sci. Cybern., 1968, vol. 4, no. 2, pp. 100–107.
Business Process Model and Notation (BPMN), Object Management Group, formal/2013-12-09, 2013.
Cross, A.D.J., Wilson, R.C., and Hancock, E.R., Inexact graph matching using genetic search, Pattern Recognit., 1997, vol. 30, no. 6, pp. 953–970.
Riesen, K., Fischer, A., and Bunke, H., Improving approximate graph edit distance using genetic algorithms, in Structural, Syntactic, and Statistical Pattern Recognition, Springer Berlin Heidelberg, 2014, pp. 63–72.
Kalenkova, A.A., van der Aalst, W.M.P., Lomazova, I.A., and Rubin, V.A., Process mining using BPMN: Relating event logs and process models, Software Syst. Model., 2017, vol. 16, no. 4, pp. 1019–1048.
Levenshtein, V.I., Binary codes capable of correcting deletions, insertions and reversals, Sov. Phys. Dokl., 1966, vol. 10, p. 707.
Gladkov, L.A., Kureichik, V.V., and Kureichik, V.M., Geneticheskie algoritmy (Genetic Algorithms), Moscow: Fizmatlit, 2010.
Funding
The research was supported by the Grant of the President of the Russian Federation for young Russian scientists — Candidates of Sciences no. MK-4188.2018.9.
Author information
Authors and Affiliations
Corresponding authors
Ethics declarations
The authors declare that they have no conflicts of interest.
Additional information
Translated by I. Tselishcheva
About this article
Cite this article
Kalenkova, A.A., Kolesnikov, D.A. Application of a Genetic Algorithm for Finding Edit Distances between Process Models. Aut. Control Comp. Sci. 53, 617–627 (2019). https://doi.org/10.3103/S0146411619070071
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.3103/S0146411619070071