Abstract
Decision Model and Notation (DMN) has become a relevant topic for organizations since it allows users to control their processes and organizational decisions. The increasing use of DMN decision tables to capture critical business knowledge raises the need for supporting analysis tasks such as the extraction of inputs, outputs and their relations from natural language descriptions. In this paper, we create a stepping stone towards implementing a Natural Language Processing framework to model decisions based on the DMN standard. Our proposal contributes to the generation of decision rules and tables from a single sentence analysis. This framework comprises three phases: (1) discourse and semantic analysis, (2) syntactic analysis and (3) decision table construction. To the best of our knowledge, this is the first attempt devoted to automatically discovering decision rules according to the DMN terminology from natural language descriptions. Aiming at assessing the quality of the resultant decision tables, we have conducted a survey involving 16 DMN experts. The results have shown that our framework is able to generate semantically correct tables. It is convenient to mention that our proposal does not aim to replace analysts but support them in creating better models with less effort.
Similar content being viewed by others
Notes
The Penn Treebank is a corpus of manually parsed newspaper articles.
The description of each dependency tag appears in the Stanford typed dependencies manual (Marneffe and Manning 2010).
amod, aux, auxpass, case, compound, cop, dep, det, dobj, mark, mwe, nmod, nummod, xcomp and or.
DecisionRuleMiner, the developed prototype tool that implements the main stages of our framework, will be described in Sect. 6.
References
Bajwa IS, Lee MG, Bordbar B (2011) SBVR business rules generation from natural language specification. In: Proceedings of the AAAI Spring symposium—AI for Business Agility, Palo Alto, California, United States, vol SS-11-03, pp 2–8
Boufrida A, Boufaida Z (2014) Automatic rules extraction from medical texts. In: Proceedings of the international workshop on advanced information systems for enterprises (IWAISE), pp 29–33
Califf ME, Mooney RJ (1999) Relational learning of pattern-match rules for information extraction. In: Proceedings of the 16th national conference on artificial intelligence, Orlando, Florida, USA. ACM, pp 328–334
Calvanese D, Dumas M, Laurson U, Maggi FM, Montali M, Teinemaa I (2018) Semantics, analysis and simplification of DMN decision tables. Inf Syst 78:112–125
Caporale T (2016) A tool for natural language oriented business process modeling. In: Hochreiner C, Schulte S (eds) CEUR 8th ZEUS workshop proceedings, Vienna, Austria, 27–28 January 2016, vol 1562, pp 49–52
Ciravegna F (1999) Adaptive information extraction from text by rule induction and generalisation. Nat Lang Eng 10:145–165
Corradini F, Ferrari A, Fornari F, Gnesi S, Polini A, Re B, Spagnolo GO (2018) A guidelines framework for understandable BPMN models. Data Knowl Eng 113:129–154
De Marneffe M-C, MacCartney B, Manning CD (2006) Generating typed dependency parses from phrase structure parses. In: Proceedings of the 5th international conference on language resources and evaluation (LREC 2006), Genoa, Italy, 22–28 May 2016. European Language Resources Association (ELRA), pp 449–454
De Smedt J, De Weerdt J, Serral E, Vanthienen J (2018) Discovering hidden dependencies in constraint-based declarative process models for improving understandability. Inf Syst 74:40–52
Boyer J, Mili H (2011) Agile business rule development: process, architecture, and JRules examples. Springer, Berlin
De Smedt J, Hasic F, vanden Broucke SKLM, Vanthienen J, (2017) Business Process Management—15th international conference (BPM 2017), Barcelona, Spain, 10–15 September 2017, Proceedings, volume 10445 of Lecture notes in computer science, chapter Towards a Holistic Discovery of Decisions in Process-Aware Information Systems. Springer, pp 183–199
Demner-Fushman D, Chapman WW, McDonald CJ (2009) What can natural language processing do for clinical decision support? J Biomed Inform 42(5):760–772
Dragoni M, Governatori G, Villata S (2015) Automated rules generation from natural language legal texts. In: Proceedings of the workshop on automated detection, extraction and analysis of semantic information in legal texts (ICAIL 2015), San Diego, USA, pp 1–6
Dragoni M, Villata S, Rizzi W, Governatori G (2016) Combining NLP approaches for rule extraction from legal documents. In: Proceedings of the 29th international conference on legal knowledge and information systems, pp 1–13
Figl K, Mendling J, Tokdemir G, Vanthienen J (2018) What we know and what we do not know about DMN. Enterp Model Inf Syst Archit 13(2):1–16
Fortineau V, Paviot T, Guissé A, Lamouri S (2013) A transformation model to express business rules from natural language to formal execution: an application to nuclear power plant. IFAC Proc Vol 46(9):1096–1101
Friedrich F, Mendling J, Puhlmann F (2011) 23rd international conference on advanced information systems engineering (CAiSE 2011), London, UK, 20–24 June 2011, Proceedings, volume 6741 of Lecture notes in computer science, chapter Process model generation from natural language text. Springer, pp 482–496
Garza D (2014) Automated business rule harvesting with abstract syntax tree transformation
Ghose A, Koliadis G, Chueng A (2007) Conceptual Modeling—ER 2007, 26th international conference on conceptual modeling, Auckland, New Zealand, 5–9 Nov 2007, Proceedings, volume 4801 of Lecture notes in computer science, chapter Rapid Business Process Discovery (R-BPD). Springer, pp 391–406
Goldberg L, Bv Halle (2009) The decision model. Taylor & Francis Group, Milton Park
Gonçalves JC, Santoro FM, Baião FA (2009) Business process mining from group stories. In Proceedings of the 13th international conference on computer supported cooperative work in design (CSCWD 2009), number September. IEEE, pp 161–166
Hassanpour S, O’Connor MJ, Das AK (2011) Rule-based reasoning, programming, and applications, volume 6826 of Lecture notes in computer science, chapter A framework for the automatic extraction of rules from online text. Springer, pp 266–280
Hays DG (1964) Dependency theory: a formalism and some observations. Technical report, Santa Monica, California
Huffman SB (1996) Connectionist, statistical and symbolic approaches to learning for natural language processing, volume 1040 of Lecture notes in computer science, chapter Learning information extraction patterns from examples. Springer, pp 246–260
IIBA (2009) A guide to the business analysis body of knowledge (BABOK Guide), Version 2.0. International Institute of Business Analysis
Kuss E, Leopold H, van der Aa H, Stuckenschmidt H, Reijers HA (2018) A probabilistic evaluation procedure for process model matching techniques. Data Knowl Eng 117:393–406
Lévy F, Nazarenko A (2013) Theory, practice, and applications of rules on the web. RuleML 2013, volume 8035 of Lecture notes in computer science, chapter Formalization of natural language regulations through SBVR Structured English. Springer, pp 19–33
Liddy E (1998) Enhanced text retrieval using natural language processing. Bull Assoc Inf Sci Technol 24(4):14–16
Lima R, Freitas F, Espinasse B (2016) Relation extraction from texts with symbolic rules induced by inductive logic programming. In: Proceedings of the IEEE international conference on tools with artificial intelligence (ICTAI), Vietri sul Mare, Italy. IEEE, pp 194–201
Lin D, Pantel P (2001) DIRT—Discovery of inference rules from text. In: Proceedings of ACM conference on knowledge discovery and data mining (KDD-01), San Francisco, CA. ACM, pp 323–328
Liu Q, Gao Z, Liu B, Zhang Y (2015) Automated rule selection for aspect extraction in opinion mining. In: Proceedings of the international joint conference on artificial intelligence (IJCAI), volume January, pp 1291–1297
Marcus MP, Santorini B, Marcinkiewicz MA (1993) Building a large annotated corpus of English: The Penn Treebank. Comput Linguist 19(2):313–330
Marneffe M-CD, Manning CD (2010) Stanford typed dependencies manual. 20090110 Httpnlp Stanford 40(September):1–22
Melkuc IA (1988) Dependency syntax: theory and practice. State University of New York Press, Albany
Moldovan DI (1995) Acquisition of linguistic patterns for knowledge-based information extraction. IEEE Trans Knowl Data Eng 7(5):713–724
Ono T, Hishigaki H, Tanigami A, Takagi T (1999) Automatic extraction of information on protein–protein interaction from scientific literature. Genome Inform 1999:296–297
Papanikolaou N (2012) On the move to meaningful internet systems: OTM 2012, volume 7566 of Lecture notes in computer science, chapter Natural language processing of rules and regulations for compliance in the cloud. Springer, pp 620–627
Riefer M, Ternis SF, Thaler T (2016) Mining process models from natural language text: a state-of-the-art analysis. In Nissen V, Stelzer D, Straßburger S, Fischer D (eds) Proceedings of the Multikonferenz Wirtschaftsinformatik (MKWI 2016). Springer, pp 1–12
Riloff E (1993) Automatically constructing a dictionary for information extraction tasks. In: Proceedings of the 11th national conference on artificial intelligence, Washington DC, pp 811–816
Satyal S, Weber I, Paik H-y, Di Ciccio C, Mendling J (2019) Business process improvement with the AB-BPM methodology. Inf Syst 84:283–298
Silver B (2016) DMN method & style: the practitioner’s guide to decision modeling with business rules. Cody-Cassidy Press, Altadena
Sinha A, Paradkar A (2010) Use cases to process specifications in business process modeling notation. In: Proceedings of the 8th international conference on web services (ICWS 2010). IEEE, pp 473–480
Soderland SG (1997) Learning text analysis rules for domain-specific Natural Language Processing. Ph.D. thesis, University of Massachusetts
Soderland S (1999) Learning information extraction rules for semi-structured and free text. Mach Learn 34(1):233–272
Sorgente A, Vettigli G, Mele F (2013) Automatic extraction of cause-effect relations in natural language text. In: Proceedings of the 7th international workshop on information filtering and retrieval, Turin, Italy, vol 1109, pp 37–48. CEUR Workshop Proceedings
Taylor J, Fish A, Vanthienen J, Vincent P (2013) Intelligent BPM systems: impact and opportunity, chapter Emerging standards in decision modeling—an introduction to decision model & notation, pp 133–146. BPM and Workflow Handbook Series. Future Strategies, Incorporated
van der Aa H, Leopold H, del Río-Ortega A, Resinas M, Reijers HA (2017) Transforming unstructured natural language descriptions into measurable process performance indicators using hidden markov models. Inf Syst 71:27–39
Van Der Aalst W (2011) Process mining: discovery, conformance and enhancement of business processes, vol 2. Springer, Berlin
Wang X, Sun J, Yang X, He Z, Maddineni S (2004) Business rules extraction from large legacy systems. In: 8th European conference on software maintenance and reengineering, 2004. CSMR 2004. Proceedings, pp 249–258
Wyner A, Peters W (2011) Legal knowledge and information systems, volume 235 of Frontiers in Artificial Intelligence and Applications, chapter On rule extraction from regulations. IOS Press, pp 113–122
Xia R, Ding Z (2019) Emotion-cause pair extraction: a new task to emotion analysis in texts. In: Proceedings of the 57th annual meeting of the association for computational linguistics, Florence, Italy. ACL, pp 1003–1012
Yarahmadi A (2018) Enhanced machine learning approaches in text analysis for business intelligence: the appealing story of documents. Ph.D. thesis, University of Hasselt
Acknowledgements
This research was supported by the special research fund for incoming mobility of Hasselt University, Belgium. The authors gratefully acknowledge Veronika Boyanova and Aziz Yarahmadi for providing useful descriptions in our experiments, as well as the experts who kindly answered the survey.
Author information
Authors and Affiliations
Corresponding author
Additional information
Responsible editor: Hendrik Blockeel
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Arco, L., Nápoles, G., Vanhoenshoven, F. et al. Natural language techniques supporting decision modelers. Data Min Knowl Disc 35, 290–320 (2021). https://doi.org/10.1007/s10618-020-00718-4
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10618-020-00718-4