Elsevier

Theoretical Computer Science

Volume 820, 8 June 2020, Pages 26-44
Theoretical Computer Science

Markov chain analysis of evolutionary algorithms on OneMax function – From coupon collector's problem to (1 + 1) EA

https://doi.org/10.1016/j.tcs.2020.03.007Get rights and content

Abstract

The theoretical investigation of Evolutionary Algorithms (EAs) has increased our understanding of the computational mechanism of algorithms. OneMax is a test function most frequently and deeply studied in the field of EAs. In this work, a method is presented for describing the runtime properties of (1+1) EA on OneMax. This method is motivated by the work of Erdös and Rényi treating the coupon collector's problem. They showed that the success probability of the coupon collector's problem is given by a function of double exponential form, and that the number of uncollected coupons follows the Poisson distribution. Today, the double exponential function is called Gumbel function, which is one of three fundamental functions in extreme value statistics. We introduce an algorithm that is a variant of the (1+1) EA, First Order Evolutionary Algorithm (FO-EA). FO-EA takes into account only the effect of single-bit mutations in the (1+1) EA, which in general includes multiple-bit mutations. We modified the method of Erdös and Rényi to apply FO-EA. We apply the Gumbel distribution for calculating the success probability of the (1+1) EA on OneMax. This method turns out to give a sufficiently reliable estimation for success probabilities, even in the tail region.

Introduction

The theoretical study of randomized search heuristics is now a growing research domain, and many interesting results have been obtained [1]. This class of approaches contains Randomized Local Search (RLS) and Evolutionary Algorithms (EAs). EAs have been investigated by many researchers whose main interests are their convergence properties. Especially, computational time analysis is a major challenge and hot spot in the study of the basic theory of EAs [2].

The computational time of an EA refers to the number of iterations required by the algorithm to find the optimal solution, also known as runtime, which is an important index to measure the algorithm's advantages and disadvantages. If T is a random variable measuring the time to obtain the optimum solution, the main task of runtime analysis is estimating E(T), the expected value of T. For the convenience of analysis, many studies have been conducted on the (1+1) EA. Although the (1+1) EA is a very simple form of EA, it has a value of theoretical analysis and is a basic model for runtime analysis. By studying the (1+1) EA deeply, we can reveal the dynamic behavior of EAs and provide a useful tool for analyzing more complex algorithms. A detailed overview of the (1+1) EAs is presented in [3].

Droste et al. [4] carried out a rigorous probabilistic analysis of the (1+1) EA for pseudo-Boolean functions such as OneMax, LeadingOnes and BinVal functions. They proved that the average running time of the (1+1) EA for solving linear functions is Θ(log), where is the length of binary string. He and Yao proposed another type of mathematical model, usually called as drift analysis, for the study of EAs [5], [6]. Drift analysis is based on the drift theorem, which was introduced as a widely applicable mathematical model for the computation time analysis of EAs. Later, a new drift theorem (multiplicative drift theorem) was introduced by Doerr et al. [7], [8]. Multiplicative drift analysis allows easier analyses if the optimization progress is roughly proportional to the current distance to the optimum. Hwan et al. performed their probabilistic analysis on the runtime of the (1+1) EA for OneMax problem [9]. They reported a new asymptotic approximation for the expected hitting time m with pm=1/. The result is given bym=elog+c1+12elog+c2+O(1log), where c11.8925 and c20.5979.

A framework of Markov chains for analyzing the average hitting times of EAs was presented He and Yao [10]. The behavior of this Markov chain is completely determined by the transition matrix P of an absorbing Markov chain [11]. Mülenbein carried out a pioneering study by using Markov chain for analyzing the time evolution of states in the (1+1) EA [12]. He derived the expected hitting time m of the optimum of OneMax function. Jägersküpper [13] combined the Markov chain with drift analysis and reanalyzed the upper bound of running time of the (1+1) EA on the linear function to obtain a more accurate estimate. These results show that the model making full use of Markov property of EA is a powerful tool for analyzing the dynamical behavior of EA. Furutani et al. also analyzed the expected hitting time of the (1+1) EAs by applying Markov chain, and the explicit form of absorbing Markov chain for OneMax problem was presented [14].

Most of above mentioned studies treat the mean and median of the runtime distribution. There is another important performance measure, the runtime distribution in the tail region of large t. The study of tail distribution of runtime provides useful information that can contribute to the design of better algorithms. However, only a small number of studies have investigated the tail distribution in evolutionary algorithms. There is a branch in statistics and probability theory, which deals with the distribution of (exceptionary) large or small values. This field, usually called extreme value statistics, has been applied to problems of natural disaster, insurance, engineering, and so on. There are three types in the extreme value distribution, and the Gumbel distribution is one of them [15]. In ref. [9], it is proven that the success probability of OneMax function asymptotically approaches the Gumbel distribution.

The main objective of this paper is to study the success probability for the (1+1) EA on OneMax function. The success probability is defined as F(t)=Pr(Tt) given a certain number of steps t. For this end, we take into account of theoretical achievements in the study of coupon collector's problem. As Doerr stated in his paper [16], “understanding its solution pays off when analyzing randomized search heuristics, because very similar situations are encountered here”. Especially, we were interested in the results of Erdős and Rényi [17]. They treated the urn problem, which is equivalent to the coupon collector's problem. Their main result is summarized;

  • (1)

    the success probability of the coupon collector's problem is given by a function of double exponential type;

  • (2)

    the number of uncollected coupons obeys the Poisson distribution.

In this paper, we demonstrate that these two findings can be applicable to the (1+1) EA on OneMax function in the case of large enough .

For an intermediate step to the goal, we formalize the Markov chain model for RLS on OneMax function. Since the transition matrix of RLS chain has a triangular form [3], its diagonal elements are also eigenvalues of the matrix. Their corresponding eigenvectors are given in terms of binomial coefficients. These results can be used to verify the findings of Erdős and Rényi.

As a simplified model for the application of Markov chain, we introduce a modified form of the (1+1) EA, named as FO-EA short for First Order EA. FO-EA is designed as a generalization of RLS for the mathematical analysis of the (1+1) EA on OneMax. The transition matrix for the Markov chain model of FO-EA only contains the first order terms, which are proportional to pm(1pm)1. This makes the Markov chain analysis tractable.

Section snippets

Markov chain analysis of OneMax function

This study considers the maximization of OneMax function. OneMax function f(y) is defined byf(y)=i=1yi,yi{0,1} where y=(y1,,y) is a binary string of length . In the maximization of OneMax function, the optimum is yopt={1}, and f(yopt)=. Algorithm 1 presents the pseudo-code of the (1+1) EA on OneMax.

Since many studies suggested that the mutation rate of pm=1/ may be the best one, we consider this value as a standard.

We consider the discrete time Markov chains X(t),t{0,1,,} with

RLS algorithm

In this section, we consider the most simple form of evolutionary algorithm RLS. We define RLS in Algorithm 2.

The evolution process of RLS can be formulated in the framework of Markov chain with finite state space S. In this model, S is divided into +1 states S0,S1,,S, as given in the (1+1) EA for OneMax problem.

The transition matrix of RLS Markov chain is given by the sparse upper triangular matrixP=(0100001/(1)/00002/(2)/0000(1)/1/00001) The matrix IQ is(110000(1

Markov chain model of FO-EA

In this section, we consider the optimization of OneMax function by the (1+1) EA, and apply the Markov chain model as an extension of the coupon collector's problem and RLS. Since it seems that the current state of the art does not allow us to make a full treatment of the problem by Markov chain, we introduce a simplified version of the (1+1) EA, FO-EA.

The algorithm of FO-EA model is presented in Algorithm 3.

Here ha(x,y) is the Hamming distance of strings x and y.

The transition matrix

Numerical result

This section treats numerical aspects of the (1+1) EA on OneMax. As we have shown in Subsection 2.1, the expected hitting time of the (1+1) EA is given by eq. (9). Thus we are not able to obtain a closed form solution of the expected runtime m. If FO-EA provides a good approximation for runtime properties of (1+1) EA, we may use FO-EA as the first choice for analyzing the (1+1) EA.

Discussion and conclusion

In this study, we considered the success probability of the (1+1) EA on OneMax function, and applied the Gumbel distribution for investigating the runtime behavior of the (1+1) EA. We introduced FO-EA model to describe how the (1+1) EA works for the optimization of OneMax function. This model gives the simple forms of the expected runtime and its variance. Erdös and Rényi showed that the success probability of the coupon collector's problem is given by a function, the same as Gumbel function,

Declaration of Competing Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Acknowledgement

This work is supported by Science and Technology Project in Qinghai Province (No. 2017-ZJ-717).

References (20)

There are more references available in the full text version of this article.

Cited by (0)

View full text