Language model based interactive estimation of distribution algorithm

doi:10.1016/j.knosys.2020.105980

Knowledge-Based Systems

Volume 200, 20 July 2020, 105980

https://doi.org/10.1016/j.knosys.2020.105980 Get rights and content

Highlights

•
The presented IEDA employs a language model to encode candidate searched items.
•
The language model introduces social intelligence and reduces information loss.
•
The IEDA adopts Dirichlet-Multinomial distribution as its probabilistic model.
•
The probabilistic model is updated with Bayesian learning to track variable.
•
Then, a faster personalized search can be expected.

Abstract

It is very hard, if not impossible to use analytical objective functions for optimization of personalized search due to the difficulties in mathematically describing qualitative problems. To solve such optimization problems, interactive evolutionary algorithms, which can make use of human preferences, are highly desirable. However, due to the lack of effective encoding methods, interactive evolutionary algorithms have been limited to numerically encoded optimization problems. In practice, however, linguistic terms (words) are the most natural expression of human preferences, and they are also commonly used to describe items in personalized search or E-commerce; therefore, language models better suit encoding, and the optimization of personalized search is converted into a dynamic document matching problem. To optimize word-described personalized search, we propose a novel interactive estimation of distribution algorithm. This algorithm combines a language model-based encoding approach, a Dirichlet-Multinomial compound distribution-based preference expression, and a Bayesian inference mechanism. The proposed algorithm is applied to two personalized search cases to demonstrate the capability of the algorithm in ensuring a more efficient and accurate search with less user fatigue.

Graphical abstract

Introduction

Objective functions of personalized optimization problems, e.g. product design, personalized search, and information retrieval, are impossible to precisely define using mathematical expressions since they are highly dependent on user preferences. Although evolutionary computation (EC) has been proven to be a powerful tool for solving complex optimization [1], [2], [3] problems, the requirement of precise mathematical definitions cannot be fulfilled in personalized search. In such scenarios, interactive evolutionary computation (IEC) [4], [5] is more feasible and efficient as it involves a human user in the evaluation process; they have been developed and successfully applied to various practical problems, such as product design, web page layout design, and anti-collision design of vehicles [5], [6], [7], [8]. This paper considers the optimization problems that occur in, for example, the following scenario: A person is searching the web for a particular movie, beginning the search with a query of a few words. The search engine presents a few movie candidates. The user clicks on some of the candidates and saves some of them. Based on the user’s actions, the search engine presents some new candidates. This process continues until the user is satisfied with the result. The purpose of this proposed novel method is to speed up the search process and reduce the need for user interference.

IEC requires human evaluations, which can inevitably cause user fatigue given a complex problem. The restriction on population size and evolutionary generation prevents the use of IEC in tackling a wide range of problems [5]. Accordingly, much more attention has been paid to alleviating user fatigue and improving explorations from the following three aspects [9]: (1) the design of friendly human–computer interfaces or novel evaluation modes to reduce the user burden, e.g. evaluating individuals with discrete, fuzzy, or interval numbers [9], [10], [11]; (2) the use of a preference surrogate, with a small number of evaluated individuals and then apply it to help with the assessment. With surrogates, IEC can upscale the population size and generation as conventional EC approaches [12], [13], which greatly improves the explorations of IEC. (3) the use of knowledge from evolution to modify evolutionary operators to accelerate search and reduce fatigue [14], [15]. These studies have greatly enhanced IEC. Although the above-mentioned studies have greatly enhanced IEC , the application of those methods in resolving preference-related complex problems remains a challenging task. Particularly in word-described ones such as online personalized search in E-commerce.

The main reason is that the information covered by the numerical representations for these word-described problems is insufficient to model the preference-related objectives, leading to inefficient or even wrong searches. Sun et al. [9], [16] used a limited number of numerical values to encode these word-described items in the framework of an interactive genetic algorithm for the personalized search, which made the traditional evolutionary operators easier to implement. Wang et al. [17] modeled TV programs with five attributes in their experiments when studying preference recommendations for personalized search.

These studies are easier to understand and implement; however, such numerical encoding loses a considerable amount of implied semantic information contained in the words. In addition, IEC depends on the evaluations assigned by the users, who have got used to thinking and evaluating with words instead of numbers. The gap between the users’ evaluations and numerical encoding results in a need for additional human–computer interactions and inevitably causes more user fatigue. Accordingly, designing a non-numerical encoding method which minimizes loss of semantic information and developing corresponding evolutionary operators becomes essential as this will help to enhance the performance of IEC in solving more practical and complex problems. Furthermore, user preferences or decisions will be influenced greatly by other user comments, and social or group comments should be integrated with IEC so that the current user is able to precisely evaluate the searched solutions. Motivated by the above, we focus on developing an enhanced IEC by integrating social comments, designing a novel encoding and corresponding evolutionary operators for solving problems described with words in the personalized search.

Applying IEC, short for Interactive Evolutionary Computation, to the word/document-described optimization problems relies on establishing a bridge between the textual phenotype evaluated by the user and the numerical genotype operated by EC. The language model Doc2Vec [18], [19] is a good choice to convert textual phenotypes into numerical vectors by preserving most of the semantic relationships among the words. Therefore, we employ this model to represent the genotype of a document, i.e., a searched item, including the word description and social comments. Clearly, both are naturally combined in the Doc2Vec based representation. Given this, a new IEC must be developed to gain the advantages both from itself and the model, i.e., Doc2Vec-assisted initialization and interactive evolutionary operators.

This work develops a language model based interactive estimation of distribution algorithm (LMIEDA) to perform the evolutionary optimization in personalized search. In LMIEDA, the Doc2Vec is applied to convert the word/document-described problem into a dynamic document matching one by encoding the word frequency as individuals. A preference function is approximately constructed based on user interactions and is used to estimate the individual’s fitness. Based on the fitness, the Dirichlet-Multinomial compound distribution and a Bayesian inference involving the Dirichlet-Multinomial compound distribution is designed to track the user’s preference on the variables. With these, the probabilistic model of estimation of distribution algorithm (EDA) and the corresponding sampling are presented. The user’s burden here can be greatly reduced since our algorithm is able to estimate the fitness of all individuals without the user.

The main contributions of this study are as follows. (1) To the best of our knowledge, language models have not been used in EC/IEC. As an encoding method, it helps to reduce information loss and naturally introduces social intelligence. (2) Since the encoded variables are discrete (with finite states), the Dirichlet-Multinomial compound distribution is adopted as the probabilistic model of IEDA to be compatible with encoded candidates. (3) The probabilistic model is updated with the help of Bayesian learning to directly track variable distribution, and most conventional EDAs employ Bayesian networks to depict the dependencies between variables. (4) The proposed algorithm is applied to some personalized search for books and movies to prove its effectiveness and efficiency.

The remainder of this paper is organized as follows. Section 2 describes the related work on the personalized search assisted with evolutionary algorithms, the estimation of distribution algorithms (EDAs), and the basic concept of Doc2Vec. The details of the proposed algorithm, including the definition of the word-described personalized search, the framework, the critical encoding, preference expression, and the IEDA, are presented in Section 3. Section 4 addresses the application of the proposed algorithm together with the experimental results and analyses. Conclusions are drawn in Section 5.

Section snippets

Personalized search assisted with evolutionary algorithms

The task of the personalized search is to find the items that give the user the most satisfaction; therefore, it is an optimization problem in nature. However, what distinguishes personalized search from typical optimization problems is that users, rather than mathematical functions, play the role of the fitness functions. Although it is hardly possible to solve this problem involving cognitive processes only with tractable mathematical calculations, researchers can still describe some

Definitions of word/document-described optimization problems

$\{\begin{matrix} max f (d o c u m e n t) \\ s.t. d o c u m e n t \in H \end{matrix}$

For the word-described personalized search, by naturally combining social intelligence from comments, an item can be expressed as $d o c u m e n t = \{d e s c r i p t i o n\} \cup \{c o m m e n t s\}$ , in which the first part comes from its seller, and $\{c o m m e n t s\}$ with specific meaning on the item come from users. Supposing a user’s preference on a $d o c u m e n t$ is $f (d o c u m e n t)$ , the search can be formulated as Eq. (1), where $H$ is the feasible space of searched items. Evolutionary algorithms will be powerful for solving

Experimental setting

Comparisons on the personalized search of two different fields, i.e., movies and books, among the proposed algorithm and other IECs are conducted to prove its effectiveness and efficiency. Movies and books, which are commonly described with text, are chosen as the search target because they cannot be well modeled with the key–value pattern. The data for movies and books (updated in March 2018) are from imdb.com and Douban.com. IMDb is an online database of information related to films, TV

Conclusions

For solving word/document-described problems that cannot be well encoded with the structural numerical methods, LMIEDA is proposed by integrating the mixture of unigrams, LDA, and Doc2Vec into the EDA framework. From the viewpoint of optimization, language model based encoding, a novel preference expression by use of Dirichlet-Multinomial compound distribution, and a Bayesian inference-enhanced interactive version of EDA (estimation of the distribution algorithm) have been studied to

CRediT authorship contribution statement

Yang Chen: Conceptualization, Methodology, Software, Writing - original draft. Yaochu Jin: Conceptualization, Supervision, Validation, Writing - review & editing. Xiaoyan Sun: Conceptualization, Supervision, Software, Validation, Writing - review & editing.

Declaration of Competing Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Acknowledgment

This work is supported by the National Natural Science Foundation of China with Grant No. 61876184 and 61473298.

References (63)

GongD. et al.
A novel hybrid multi-objective artificial bee colony algorithm for blocking lot-streaming flow shop scheduling problems
Knowl.-Based Syst.
(2018)
WangH. et al.
Preference recommendation for personalized search
Knowl.-Based Syst.
(2016)
TianX. et al.
Sequential funding the venture project or not? a prospect consensus process with probabilistic hesitant fuzzy preference information
Knowl.-Based Syst.
(2018)
AhnH.J.
Evaluating customer aid functions of online stores with agent-based models of customer behavior and evolution strategy
Inform. Sci.
(2010)
XieH. et al.
Incorporating sentiment into tag-based user profiles and resource profiles for personalized search in folksonomy
Inf. Process. Manage.
(2016)
WangY. et al.
Word sense disambiguation: A comprehensive knowledge exploitation framework
Knowl.-Based Syst.
(2020)
EspositoM. et al.
Hybrid query expansion using lexical resources and word embeddings for sentence retrieval in question answering
Inform. Sci.
(2020)
ZhangW. et al.
Unsupervised language identification based on latent Dirichlet allocation
Comput. Speech Lang.
(2016)
ElmanJ.
Finding structure in time* 1
Cogn. Sci.
(1990)
HanY. et al.
Evolutionary multi-objective blocking lot-streaming flow shop scheduling with machine breakdowns
IEEE Trans. Cybern.
(2019)

SunX. et al.

Indicator-based set evolution particle swarm optimization for many-objective problems

Soft Comput.

(2016)

ChenY. et al.

Federated learning assisted interactive eda with dual probabilistic models for personalized search

TakagiH.

Interactive evolutionary computation: fusion of the capabilities of EC optimization and human evaluation

Proc. IEEE

(2001)

ChenY. et al.

DPM-IEDA: Dual probabilistic model assisted interactive estimation of distribution algorithm for personalized search

IEEE Access

(2019)

FukumotoM. et al.

A proposal for user’s intervention in interactive evolutionary computation for optimizing fragrance composition

Commun. Comput. Inf. Sci.

(2014)

A. Oliver, O. Regragui, N. Monmarch, G. Venturini, Genetic and interactive optimization of web sites, in: Proceedings...

SunX. et al.

Interactive genetic algorithm with CP-nets preference surrogate and application in personalized search

Control Decis.

(2015)

GongD. et al.

Neural network surrogate models of interactive genetic algorithms with individual’s interval fitness

Control Decis.

(2009)

GongD. et al.

Interactive genetic algorithms with individual’s uncertain fitness

Chinese J. Electron.

(2009)

Y. Li, Adaptive learning evaluation model for evolutionary art, in: 2012 Ieee Congress on Evolutionary Computation...

R. Kamalian, E. Yeh, Y. Zhang, A.M. Agogino, H. Takagi, Reducing human fatigue in interactive evolutionary computation...

ChughT. et al.

An interactive simple indicator-based evolutionary algorithm (I-SIBEA) for multiobjective optimization problems

Lecture Notes in Comput. Sci.

(2015)

LuqueM. et al.

An interactive evolutionary multiobjective optimization method based on the WASF-GA algorithm

SunX. et al.

Interactive genetic algorithm with group intelligence articulated possibilistic condition preference model

MikolovT. et al.

Efficient estimation of word representations in vector space

(2013)

LeQ.V. et al.

Distributed representations of sentences and documents

(2014)

SunY. et al.

Research development of user interest modeling in China

J. Intell.

(2013)

GuoxiaW. et al.

Survey of personalized recommendation systems

Comput. Eng. Appl.

(2012)

CapuanoN. et al.

Fuzzy rankings for preferences modeling in group decision making

Int. J. Intell. Syst.

(2018)

LiaoH. et al.

Hesitant fuzzy linguistic preference utility set and its application in selection of fire rescue plans

Int. J. Environ. Res. Public Health

(2018)

KassakO. et al.

User preference modeling by global and individual weights for personalized recommendation

Acta Polytech. Hung.

(2015)

Cited by (8)

Identification of emerging business areas for business opportunity analysis: An approach based on language model and local outlier factor
2022, Computers in Industry
Citation Excerpt :
This pioneering method introduced Negative sampling, which learns more accurate vectors for frequent words—instead of the hierarchical SoftMax—to facilitate both faster training and better representation of uncommon words. The success of Word2Vec ignited the continuous development of semantic language models (Chen et al., 2020) that has led to the evolution of such models based on the attention mechanism (Bahdanau et al., 2014) or transformer (Vaswani et al., 2017). Recently, universal language models such as Bidirectional Encoder Representations from Transformers (BERT) (Devlin et al., 2018), a robustly optimised BERT training approach (RoBERTa) (Liu et al., 2019), and text-to-text transfer transformer (T5) (Raffel et al., 2019) have been developed.
Emerging business areas are early indicators of potential business opportunities, which are considered key to formulating new business strategies and envisioning near-future business environments. However, existing methods for analysing business opportunities solely depend on the opinion and knowledge of experts, which are time-consuming and labour-intensive. In academia, recent years have witnessed a significant increase in attempts to identify emerging business areas as near-future business opportunities with data-driven approaches. Although successful innovation requires sources of novelty, how to measure the novelty of business areas has barely been investigated in the literature. As a solution, we propose an approach to identifying emerging business areas with high novelty with a systematic process and quantitative outcomes. At the heart of the proposed approach is the composite use of the language model and local outlier factor (LOF). The meaning of business opportunities become more explicit by identifying emerging business areas composed of novel goods and services, with implications for the business operation stage. Finally, business opportunity maps are developed based on recency and visibility values, thereby investigating the implications as business opportunities. A case study of the trademarks related to scientific apparatus is presented to illustrate the proposed approach. The systematic process and quantitative results are expected to be employed in practice as a complementary tool, serving as a cornerstone for analysing business opportunities using trademarks.
A two-stage approach for multicast-oriented virtual network function placement
2021, Applied Soft Computing
Citation Excerpt :
Running a stochastic search algorithm 20 times is meaningful from the point of view of statistics, i.e., the results collected to some extent reflect that algorithm’s optimization performance [63,64]. All results are collected by running each algorithm 20 times [65,66] on a machine with Intel(R) Xeon(R) CPU E5-2620 v4 @ 2.10 GHz and 16 GB RAM. Fig. 12 illustrates the best fitness curves obtained by the six algorithms in 18 test instances.
Network function virtualization (NFV) is an emerging network paradigm that decouples softwarized network functions from proprietary hardware. Nowadays, resource allocation has become one of the hot topics in the NFV domain. In this paper, we formulate a service function chain (SFC) mapping problem in the context of multicast, which is also referred to as the multicast-oriented virtual network function placement (MVNFP) problem. The objective function considers end-to-end delay as well as compute resource consumption, with bandwidth requirements met. A two-stage approach is proposed to address this problem. In the first stage, Dijkstra’s algorithm is adopted to construct a multicast tree. In the second stage, a novel estimation of distribution algorithm (nEDA) is developed to map a given SFC over the multicast tree. Simulation results show that the proposed two-stage approach outperforms a number of state-of-the-art evolutionary, approximation, and heuristic algorithms, in terms of the solution quality.
Multi-scale Self-Organizing Map assisted Deep Autoencoding Gaussian Mixture Model for unsupervised intrusion detection
2021, Knowledge-Based Systems
Citation Excerpt :
For instance, support vector machines and random forests are common options. Recently, deep models become competitive because more complex algorithms coupled with greater computational capacity can be obtained [2,3]. In industries and academia, deep anomaly detection has received a wide range of applications [4], e.g., deep models based on self-taught learning [5] and DAGMM [6].
In an age when the Internet has become the backbone of communications, a robust and safe network environment is critical. Intrusion detection techniques are thus valuable for IT infrastructure. The state of the art (SOTA) solution, Deep Autoencoding Gaussian Mixture Model (DAGMM), outperforms those approaches relying on decoupled two-stage training and the standard Expectation–Maximization optimization algorithm. However, DAGMM suffers from the failure in preserving the input topology, caused by the bottleneck layer of the adopted deep autoencoder as well as the method of constructing the input for the follow-up density estimation. This research first presents a Self-Organizing Map assisted Deep Autoencoding Gaussian Mixture Model (SOM-DAGMM) for overcoming the above-mentioned shortcoming of DAGMM, through well balancing the low-dimensional demand of Gaussian Mixture Model (GMM) and the topology-preserving requirement. The proposed SOM-DAGMM employs a self-organizing map to extract features as a supplement with well-preserved input space topology for better network intrusion detection. The paper also studies the superiority of multi-scale topology over the single-scale one in improving the performance of DAGMM. The better performance of the SOM-DAGMM is empirically proven by extensive experiments involving six datasets. Experimental results show that single/multi-scale SOM-DAGMMs outperform the SOTA DAGMM on all tests and achieve up to 110.38% improvement in F1 score and with better stability.
Review on personalized search and recommendation algorithms for multi-source heterogeneous data
2024, Kongzhi Lilun Yu Yingyong/Control Theory and Applications
Marine Goal Optimizer Tuned Deep BiLSTM-Based Self-Configuring Intrusion Detection in Cloud
2024, Journal of Grid Computing
An Interactive Estimation of the Distribution Algorithm Integrated with Surrogate-Assisted Fitness
2023, Symmetry

View all citing articles on Scopus

View full text

Language model based interactive estimation of distribution algorithm

Highlights

Abstract

Graphical abstract

Introduction

Section snippets

Personalized search assisted with evolutionary algorithms

Definitions of word/document-described optimization problems

Experimental setting

Conclusions

CRediT authorship contribution statement

Declaration of Competing Interest

Acknowledgment

Knowl.-Based Syst.

Knowl.-Based Syst.

Knowl.-Based Syst.

Inform. Sci.

Inf. Process. Manage.

Knowl.-Based Syst.

Inform. Sci.

Comput. Speech Lang.

Cogn. Sci.

Evolutionary multi-objective blocking lot-streaming flow shop scheduling with machine breakdowns

IEEE Trans. Cybern.

Indicator-based set evolution particle swarm optimization for many-objective problems

Soft Comput.

Federated learning assisted interactive eda with dual probabilistic models for personalized search

Interactive evolutionary computation: fusion of the capabilities of EC optimization and human evaluation

Proc. IEEE

DPM-IEDA: Dual probabilistic model assisted interactive estimation of distribution algorithm for personalized search

IEEE Access

A proposal for user’s intervention in interactive evolutionary computation for optimizing fragrance composition

Commun. Comput. Inf. Sci.

Interactive genetic algorithm with CP-nets preference surrogate and application in personalized search

Control Decis.

Neural network surrogate models of interactive genetic algorithms with individual’s interval fitness

Control Decis.

Interactive genetic algorithms with individual’s uncertain fitness

Chinese J. Electron.

An interactive simple indicator-based evolutionary algorithm (I-SIBEA) for multiobjective optimization problems

Lecture Notes in Comput. Sci.

An interactive evolutionary multiobjective optimization method based on the WASF-GA algorithm

Interactive genetic algorithm with group intelligence articulated possibilistic condition preference model

Efficient estimation of word representations in vector space

Distributed representations of sentences and documents

Research development of user interest modeling in China

J. Intell.

Survey of personalized recommendation systems

Comput. Eng. Appl.

Fuzzy rankings for preferences modeling in group decision making

Int. J. Intell. Syst.

Hesitant fuzzy linguistic preference utility set and its application in selection of fire rescue plans

Int. J. Environ. Res. Public Health

User preference modeling by global and individual weights for personalized recommendation

Acta Polytech. Hung.