Skip to main content
Log in

Application of a Genetic Algorithm for Finding Edit Distances between Process Models

  • Published:
Automatic Control and Computer Sciences Aims and scope Submit manuscript

Abstract

Finding graph edit distances (determining the similarity of graph models) is an important task in various areas of computer science, such as image analysis, machine learning, and chemoinformatics. In recent years, due to the development of process mining techniques, it has become necessary to adapt the existing graph matching methods to be applied to the analysis of process models (annotated graphs) discovered from event logs of information systems. In particular, methods for finding the minimum graph edit distance can be used to reveal patterns (subprocesses) and to compare discovered process models. As was shown experimentally and theoretically substantiated, exact methods for finding the minimum edit distance between the discovered process models (and graphs in the general case) have a great time complexity and can be applied only to small-sized process models. In this paper, we estimate the accuracy and time performance characteristics of a genetic algorithm applied to find distances between process models discovered from the event logs. In particular, we find distances between BPMN (Business Process Model and Notation) models discovered from the event logs by using different synthesis algorithms. It is shown that the genetic algorithm proposed in the paper allows us to significantly reduce the computation time and produces results close to the optimal solutions (the minimum edit distances).

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Fig. 1.
Fig. 2.
Fig. 3.
Fig. 4.
Fig. 5.
Fig. 6.
Fig. 7.
Fig. 8.
Fig. 9.
Fig. 10.

Similar content being viewed by others

Notes

  1. http://www.processmining.org.

REFERENCES

  1. Van der Aalst, W.M.P., Process Mining: Data Science in Action, Springer, 2016, 2nd ed.

    Book  Google Scholar 

  2. Van der Aalst, W.M.P., Weijters, T., and Maruster, L., Workow mining: Discovering process models from event logs, IEEE Trans. Knowl. Data Eng., 2004, vol. 16, no. 9, pp. 1128–1142.

    Article  Google Scholar 

  3. Leemans, S.J.J., Fahland, D., and van der Aalst, W.M.P., Discovering block-structured process models from incomplete event logs, in Application and Theory of Petri Nets and Concurrency, Springer; Lect. Notes Comput. Sci., 2014, vol. 8489, pp. 91–110.

  4. Kalenkova, A.A., Ageev, A.A., Lomazova, I.A., and van der Aalst, W.M.P., E-government services: Comparing real and expected user behavior, in Business Process Management Workshops, Springer International Publishing, 2018, pp. 484–496.

    Google Scholar 

  5. Garey, M.R. and Johnson, D.S., Computers and Intractability: A Guide to the Theory of NP-Completeness, W.H. Freeman & Co., 1990.

    MATH  Google Scholar 

  6. Ivanov, S.Y., Kalenkova, A.A., and van der Aalst, W.M.P., BPMNDiffViz: A tool for BPMN models comparison, Proceedings of the BPM Demo Session 2015 Co-located with the 13th International Conference on Business Process Management (BPM 2015) (Innsbruck, Austria, September 2, 2015), 2015, pp. 35–39.

  7. Hart, P.E., Nilsson, N.J., and Raphael, B., A formal basis for the heuristic determination of minimum cost paths, IEEE Trans. Syst. Sci. Cybern., 1968, vol. 4, no. 2, pp. 100–107.

    Article  Google Scholar 

  8. Business Process Model and Notation (BPMN), Object Management Group, formal/2013-12-09, 2013.

  9. Cross, A.D.J., Wilson, R.C., and Hancock, E.R., Inexact graph matching using genetic search, Pattern Recognit., 1997, vol. 30, no. 6, pp. 953–970.

    Article  Google Scholar 

  10. Riesen, K., Fischer, A., and Bunke, H., Improving approximate graph edit distance using genetic algorithms, in Structural, Syntactic, and Statistical Pattern Recognition, Springer Berlin Heidelberg, 2014, pp. 63–72.

    Google Scholar 

  11. Kalenkova, A.A., van der Aalst, W.M.P., Lomazova, I.A., and Rubin, V.A., Process mining using BPMN: Relating event logs and process models, Software Syst. Model., 2017, vol. 16, no. 4, pp. 1019–1048.

    Article  Google Scholar 

  12. Levenshtein, V.I., Binary codes capable of correcting deletions, insertions and reversals, Sov. Phys. Dokl., 1966, vol. 10, p. 707.

    MathSciNet  Google Scholar 

  13. Gladkov, L.A., Kureichik, V.V., and Kureichik, V.M., Geneticheskie algoritmy (Genetic Algorithms), Moscow: Fizmatlit, 2010.

Download references

Funding

The research was supported by the Grant of the President of the Russian Federation for young Russian scientists — Candidates of Sciences no. MK-4188.2018.9.

Author information

Authors and Affiliations

Authors

Corresponding authors

Correspondence to A. A. Kalenkova or D. A. Kolesnikov.

Ethics declarations

The authors declare that they have no conflicts of interest.

Additional information

Translated by I. Tselishcheva

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Kalenkova, A.A., Kolesnikov, D.A. Application of a Genetic Algorithm for Finding Edit Distances between Process Models. Aut. Control Comp. Sci. 53, 617–627 (2019). https://doi.org/10.3103/S0146411619070071

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.3103/S0146411619070071

Keywords:

Navigation