A novel periodic learning ontology matching model based on interactive grasshopper optimization algorithm

doi:10.1016/j.knosys.2021.107239

Knowledge-Based Systems

Volume 228, 27 September 2021, 107239

https://doi.org/10.1016/j.knosys.2021.107239

Abstract

Ontology matching is a significant task for data integration and semantic interoperability. Although a large number of effective, fully automated ontology matching methods have been proposed, real-world applications still need user involvement during the matching process. User involvement is recognized as an effective way to further improve matching quality, especially when very precise matching is required. However, involving users in a complex matching process raises new challenges: how to reduce the burden on users and how to make the interaction effective. In this paper, we propose a novel periodic learning ontology matching model based on an interactive grasshopper optimization algorithm to address these issues. The model collects user feedback periodically during the optimization process rather than in every generation, and a roulette wheel method selects only the most problematic candidate mappings, rather than all of them, to present to users, which reduces the burden on users. To ensure that the interaction is effective, a reward and punishment mechanism is applied to candidate mappings to propagate user feedback and to guide the search direction of the algorithm. Experiments conducted on two interactive tracks from the Ontology Alignment Evaluation Initiative (OAEI) show that the proposed model significantly improves the quality of matching. Compared with other state-of-the-art matching systems, our model outperforms them in almost all cases under different error rates, which places it among the leading systems. Finally, a typical data integration case is studied to show how the proposed approach can help enterprises harmonize product catalogs.

Introduction

With the development of intelligent systems, ontologies have been widely used in many domains and applications, for example, product lifecycle management [1], document spanning systems [2], cognitive and robotic systems [3], modern early warning systems [4], intelligent transportation systems [5], and smart manufacturing systems [6]. As a result, a large number of heterogeneous ontologies have been developed within the same domain. This heterogeneity hinders communication between application systems or humans. Ontology matching addresses this problem by integrating heterogeneous ontologies, and many researchers have proposed ontology matching methods with different capabilities [7], [8], [9], [10]. The purpose of these methods is to find a set of mappings between entity pairs of different ontologies. However, finding a high-quality alignment remains a challenge.

Although many effective ontology matching systems have been developed in a fully automated way, such methods are considered to have certain limitations in some knowledge domains. One effective approach is to involve users in the matching process [11], [12], which is considered necessary in many real-world applications. Specifically, users interactively contribute their own knowledge to the mapping suggestions generated by the system during the ontology alignment process, which further improves the quality of matching. The experiments in [13] show that user involvement is beneficial even when users make mistakes.

However, the challenge of interactive ontology matching is how to design an effective way to interact with users so that they can help improve the quality of the matching results. It is necessary and meaningful to design an interaction scheme that is not burdensome for users. Further, a good interaction design should be both natural and complete [14]. In addition, the visualization of user interaction is also a challenge. User involvement can be divided into three categories: providing an initial alignment in advance, selecting matching components and weights, and providing feedback to the system during the automatic matching process. In the first category, an initial alignment is provided as input, and the matching system only needs to search for a suboptimal alignment based on it. In effect, the user controls the behavior of the system. However, the construction of the initial alignment may affect the quality of the final alignment. In the second category, the strategy and parameters are tuned through user feedback. For example, a tuned threshold is used to find problematic mappings about which to query the user [15], [16]. Selecting the threshold may be onerous for users, especially as ontology sizes increase. When a precise solution that suits the actual needs and preferences of users is required, the third category is suitable. It allows users to provide feedback on intermediate correspondences during the matching process to improve the quality of matching. Technically, the challenge is how an interactive matching system can interact effectively with users while minimizing the number of interactions.

In recent years, meta-heuristic algorithms have been widely applied to a variety of problems, such as image segmentation [2], feature selection [3], financial stress prediction [4], medical image fusion [17], and signal processing [18]. Meta-heuristics, also known as high-level heuristics or nature-inspired algorithms, can tackle problems that traditional optimization algorithms cannot solve. Recently, meta-heuristic algorithms for ontology matching have attracted wide attention. Some researchers have proposed ontology matching models based on meta-heuristic algorithms, which have proved effective at matching complex ontologies and can enhance the ontology alignment process and improve alignment quality. However, most optimization models solve the ontology matching problem in a completely automatic manner. To incorporate user involvement, we use a recently proposed meta-heuristic, the grasshopper optimization algorithm (GOA), to construct an interactive optimization model. GOA is a nature-inspired algorithm that simulates the repulsion and attraction forces observed in the behavior of grasshoppers in nature. The repulsion force drives grasshoppers to explore the search space extensively by avoiding each other, which gives the algorithm a strong ability to avoid local optima. The attraction force drives the swarm to exploit promising regions and converge towards the best target. In particular, the algorithm balances exploration and exploitation by adapting the coefficient c of the comfort zone. These characteristics give GOA better adaptability and search ability than other meta-heuristic algorithms in practical applications. Additionally, interactive evolutionary computation can incorporate user knowledge and preferences into the evaluation of individuals [19].

This motivated us to propose a simple and efficient ontology matching model based on an interactive grasshopper optimization algorithm. The difference between interactive and non-interactive grasshopper optimization is that user feedback is used to guide the search direction.
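The repulsion and attraction forces and the comfort-zone coefficient c mentioned above can be illustrated with a minimal sketch. It assumes the social force function and the linearly decreasing coefficient as given in the original GOA paper by Saremi et al. (2017); the default values `f=0.5`, `l=1.5`, `c_max=1.0` and `c_min=1e-5` are commonly reported settings and are assumptions here, not values stated in this text. The function names are hypothetical.

```python
import math


def social_force(r, f=0.5, l=1.5):
    """Social force s(r) = f * exp(-r / l) - exp(-r) between two grasshoppers.

    r is the distance between them. A negative value means repulsion,
    a positive value means attraction, and zero marks the comfort distance.
    f (attraction intensity) and l (attractive length scale) are assumed defaults.
    """
    return f * math.exp(-r / l) - math.exp(-r)


def comfort_coefficient(t, max_iter, c_max=1.0, c_min=1e-5):
    """Coefficient c, decreased linearly from c_max to c_min over the run.

    A large c early on favours exploration; a small c late favours exploitation.
    """
    return c_max - t * (c_max - c_min) / max_iter


if __name__ == "__main__":
    for distance in (1.0, 3.0 * math.log(2.0), 3.0):
        print(f"r={distance:.3f}  s(r)={social_force(distance):+.4f}")
    print("c at start:", comfort_coefficient(0, 100))
    print("c at end:  ", comfort_coefficient(100, 100))
```

With these assumed defaults, close grasshoppers repel each other, distant ones attract, and the force vanishes at a distance of about 2.079, which is the comfort distance.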

However, introducing user intervention into the optimization process of ontology matching raises new challenges: how to allow user involvement during the optimization process while reducing user fatigue, and how to interact with users effectively to further improve the quality of matching. To address these issues, we propose in this paper a novel periodic learning ontology matching model based on an interactive grasshopper optimization algorithm. To reduce the burden on users, users are asked to give feedback every t generations instead of every generation. A roulette wheel technique is employed to select the most problematic candidate mappings to present to the user, instead of all of them. To enhance the effectiveness of the interaction, a reward and punishment mechanism is used to propagate user feedback and to guide the search direction of the swarm. The main contributions of this work are as follows:

• A novel periodic learning optimization model based on an interactive grasshopper optimization algorithm is proposed;

• A novel roulette wheel approach is introduced into the model to select the most problematic mappings;

• Reward and punishment mechanisms are proposed to propagate user feedback to the evolving population;

• The effectiveness of the proposed model is studied on two interactive tracks from the Ontology Alignment Evaluation Initiative. Finally, a case study further demonstrates the significance of the proposed method in practical applications.
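The roulette wheel selection and the reward and punishment mechanism listed above can be sketched as follows. This is an illustrative sketch only, not the authors' implementation: the text does not say how "problematic" is scored or how large the reward and punishment are, so the uncertainty weight (highest when a similarity is near 0.5) and the `reward` and `penalty` step sizes are assumptions, and all function names are hypothetical. Similarities are assumed to lie in [0, 1].

```python
import random


def roulette_select(candidates, k, rng=random):
    """Pick up to k distinct mappings by roulette wheel selection.

    candidates maps each mapping to a similarity in [0, 1].
    Assumption: a mapping is more "problematic" the closer its similarity
    is to 0.5, so its slice of the wheel is larger.
    """
    pool = dict(candidates)
    chosen = []
    while pool and len(chosen) < k:
        # Small constant keeps every slice positive, so the wheel never degenerates.
        weights = {m: 1.0 - abs(2.0 * s - 1.0) + 1e-9 for m, s in pool.items()}
        spin = rng.random() * sum(weights.values())
        cumulative = 0.0
        for mapping, weight in weights.items():
            cumulative += weight
            if spin <= cumulative:
                break
        chosen.append(mapping)
        del pool[mapping]  # sample without replacement
    return chosen


def apply_feedback(similarity, feedback, reward=0.2, penalty=0.2):
    """Return a copy of similarity with user feedback applied.

    Confirmed mappings are rewarded and rejected ones are punished;
    values are clipped to [0, 1]. The step sizes are assumed, not from the text.
    """
    updated = dict(similarity)
    for mapping, accepted in feedback.items():
        if accepted:
            updated[mapping] = min(1.0, updated[mapping] + reward)
        else:
            updated[mapping] = max(0.0, updated[mapping] - penalty)
    return updated


if __name__ == "__main__":
    sims = {("Author", "Writer"): 0.52, ("Paper", "Article"): 0.95, ("Chair", "Seat"): 0.45}
    asked = roulette_select(sims, k=2, rng=random.Random(0))
    answers = {m: True for m in asked}  # pretend the user confirms both
    print(asked)
    print(apply_feedback(sims, answers))
```

Sampling without replacement keeps the user from being asked about the same mapping twice in one round, which fits the stated goal of reducing the burden on users.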

The remainder of this paper is organized as follows. Section 2 reviews related work on interactive ontology alignment methods. Section 3 briefly introduces the ontology matching problem and the original GOA. Section 4 presents the periodic learning optimization model based on the interactive grasshopper optimization algorithm. The time and space complexity of the proposed algorithm is analyzed in Section 5. The experimental results for the interactive and non-interactive settings are presented and analyzed in Section 6. Section 7 summarizes the paper and outlines future work.

Section snippets

Related works

In recent years, interactive ontology matching methods have attracted the attention of many researchers. They fall into two major categories: matching methods based on concrete techniques and global matching methods based on evolutionary algorithms.

Preliminaries

In this subsection, the basic concepts involved in the ontology matching problem, adopted from [15], [26], [27], are described.

Proposed model

In this section, we present a novel periodic learning ontology matching model based on an interactive grasshopper optimization algorithm, called PLGOM, to enhance ontology matching, as illustrated in Fig. 1. The model consists of three steps:

1. Construct the basic optimization model based on the grasshopper optimization algorithm.
2. Calculate the similarity matrix using basic matchers.
3. Learn the ontology matching periodically using the interactive grasshopper optimization algorithm.
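Steps 2 and 3 above can be illustrated with a small sketch. The snippet does not name the basic matchers PLGOM uses, so this example substitutes a single string-based matcher built on Python's standard `difflib.SequenceMatcher` as a stand-in. The helper `is_feedback_generation` is a hypothetical illustration of asking the user every t generations. Neither function is taken from the paper.

```python
from difflib import SequenceMatcher


def similarity_matrix(source_labels, target_labels):
    """Build a |source| x |target| matrix of label similarities in [0, 1].

    Assumption: one case-insensitive string matcher stands in for the
    paper's unspecified set of basic matchers.
    """
    return [
        [SequenceMatcher(None, s.lower(), t.lower()).ratio() for t in target_labels]
        for s in source_labels
    ]


def is_feedback_generation(generation, period):
    """True when the user should be queried, that is, every `period` generations."""
    return generation > 0 and generation % period == 0


if __name__ == "__main__":
    matrix = similarity_matrix(["Author", "Paper"], ["author", "Article"])
    for row in matrix:
        print([round(v, 2) for v in row])
    print([g for g in range(1, 21) if is_feedback_generation(g, 5)])
```

With a period of 5 and 20 generations, the user would be consulted four times rather than twenty, which is the kind of saving the periodic scheme aims for.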

Time and space complexity analysis

In this section, we analyze the performance of the proposed PLGOM algorithm in depth. Running an algorithm incurs time and memory overhead on a computer, so the performance of PLGOM is analyzed in terms of time and space complexity. The time complexity of PLGOM includes two components: the similarity matrix calculation and the interactive grasshopper optimization algorithm. The time and space complexity of calculating the similarity matrix mainly includes

Experiments and results

In this experiment, we performed exhaustive experiments on the interactive anatomy track and conference track to verify the performance of the proposed algorithm. According to the official description of OAEI, the reference alignment of each track is used as an oracle to simulate the user [39]. Specifically, when the system activates the user interaction procedure, the most problematic mappings are sent to the oracle, which gives a confirmation for each mapping. In order to reflect a more realistic
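The oracle setup described above can be sketched as a simulated user that answers from a reference alignment. The `error_rate` parameter is an assumption that follows the abstract's mention of different error rates: with that probability the oracle returns the wrong answer. The class and attribute names are hypothetical and this is not the OAEI evaluation code.

```python
import random


class SimulatedOracle:
    """Simulated user that confirms or rejects mappings using a reference alignment."""

    def __init__(self, reference_alignment, error_rate=0.0, seed=None):
        self.reference = set(reference_alignment)
        self.error_rate = error_rate      # probability of a wrong answer (assumed model)
        self.rng = random.Random(seed)    # private generator for reproducible runs
        self.queries = 0                  # number of questions asked so far

    def confirm(self, mapping):
        """Return True if the mapping is judged correct, False otherwise."""
        self.queries += 1
        truth = mapping in self.reference
        if self.rng.random() < self.error_rate:
            return not truth  # the simulated user makes a mistake
        return truth


if __name__ == "__main__":
    reference = {("Author", "Writer"), ("Paper", "Article")}
    oracle = SimulatedOracle(reference, error_rate=0.1, seed=42)
    for candidate in [("Author", "Writer"), ("Chair", "Seat")]:
        print(candidate, oracle.confirm(candidate))
    print("queries:", oracle.queries)
```

Counting queries alongside the answers makes it possible to report how much user effort a matching run consumed.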

Conclusions and future work

In this paper, we proposed a novel periodic learning ontology matching model based on an interactive grasshopper optimization algorithm. We analyzed the challenges of interactive ontology matching methods based on meta-heuristic algorithms from two aspects: reducing the burden on users and further improving the quality of alignment. We then conducted an experimental investigation on two interactive tracks. The experimental results show that the proposed approach outperforms all

CRediT authorship contribution statement

Zhaoming Lv: Conceptualization, Data curation, Formal analysis, Investigation, Methodology, Software, Writing - original draft. Rong Peng: Supervision, Writing - review & editing.

Declaration of Competing Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Acknowledgment

We are very grateful to the anonymous reviewers for their insightful comments on our paper. This work was financially supported by the National Key Research and Development Plan of China under Grant No. 2017YFB0503702.

References (48)

Dinh P-H., A novel approach based on grasshopper optimization algorithm for medical image fusion, Expert Syst. Appl. (2021)

Brajovic M. et al., Post-processing of time-frequency representations in instantaneous frequency estimation based on ant colony optimization, Signal Process. (2017)

Xue X. et al., Interactive ontology matching based on partial reference alignment, Appl. Soft Comput. (2018)

Xue X. et al., Collaborative ontology matching based on compact interactive evolutionary algorithm, Knowl. Based Syst. (2017)

Acampora G. et al., Enhancing ontology alignment through a memetic aggregation of similarity measures, Inform. Sci. (2013)

Saremi S. et al., Grasshopper optimisation algorithm: Theory and application, Adv. Eng. Softw. (2017)

Bock J. et al., Discrete particle swarm optimisation for ontology alignment, Inform. Sci. (2012)

Marini F. et al., Particle swarm optimization (PSO), a tutorial, Chemometr. Intell. Lab. Syst. (2015)

Geng Q. et al., Cross-domain ontology construction and alignment from online customer product reviews, Inform. Sci. (2020)

Ali M.M. et al., Ontology-based approach to extract product’s design features from online customers’ reviews, Comput. Ind. (2020)

Lembo D. et al., Ontology-based document spanning systems for information extraction, Int. J. Semant. Comput. (2020)

Azevedo H. et al., Using ontology as a strategy for modeling the interface between the cognitive and robotic systems, J. Intell. Robot. Syst. (2020)

Phengsuwan J. et al., Ontology-based discovery of time-series data sources for landslide early warning system, Computing (2020)

Sobral T. et al., An ontology-based approach to knowledge-assisted integration and visualization of urban mobility data, Expert Syst. Appl. (2020)

Smirnov A.V. et al., Ontology-based modelling of state machines for production robots in smart manufacturing systems, Int. J. Embed. Real Time Commun. Syst. (2020)

Ochieng P. et al., Large-scale ontology matching: State-of-the-art analysis, ACM Comput. Surv. (2018)

M. Abubakar, H. Hamdan, N. Mustapha, T.N.M. Aris, Instance-based ontology matching: a literature review, in: Proc....

Thiéblin É. et al., Survey on complex ontology matching, Semant. Web (2020)

Annane A. et al., GBKOM: A generic framework for BK-based ontology matching, J. Web Semant. (2020)

Li H. et al., User validation in ontology alignment: functional assessment and impact, Knowl. Eng. Rev. (2019)

Dragisic Z. et al., User validation in ontology alignment, Int. Semant. Web Conf. (2016)

E. Jiménez-Ruiz, B.C. Grau, Y. Zhou, I. Horrocks, Large-scale interactive ontology matching: algorithms and...

Shvaiko P. et al., Ontology matching: State of the art and future challenges, IEEE Trans. Knowl. Data Eng. (2013)

Euzenat J. et al., Ontology Matching (2013)

