Operational union-complexity

doi:10.1016/j.ic.2021.104692

Information and Computation

Volume 284, March 2022, 104692

https://doi.org/10.1016/j.ic.2021.104692 Get rights and content

Abstract

Union-free languages are described by regular expressions using only concatenation and Kleene-star. Every regular language can be given as a union of finitely many union-free languages. By the minimal number of union-free languages needed in such union-free decompositions of a regular language, its union-complexity is defined. In this paper, the union-complexity of the languages obtained by various operations is studied, e.g., having two languages with union-complexities n and m, respectively, what could be the union-complexity of their union/concatenation/shuffle. Particularly, it is shown that the Kleene-star of any regular language has union-complexity 1. In some cases, e.g., at union and concatenation, the resulting language has a bounded union-complexity. In some other cases, e.g., at complement, the resulted language can have arbitrarily large union-complexity. At (k-th) power of a language, the case of the unary alphabet and the general case (alphabet with at least two symbols) have different upper bounds. While, in case of shuffle, there is an unbounded growth in the general case, while for languages over the unary alphabet the growth is bounded. Tight upper bounds are shown for all of the above mentioned cases (whenever the growth is bounded). At intersection we also show an unbounded growth in the general case, especially, over a binary alphabet.

Introduction

Descriptional complexity of formal systems, especially of formal languages is an interesting and fruitful branch of theoretical computer science. Regular languages are the most studied, as they are very well known and applied in various places. The family of regular languages is the smallest, the simplest class of the Chomsky-hierarchy. Their descriptions by regular expressions are widely used. They are generated by regular, by left-linear and also by right-linear grammars. They are accepted by finite state automata: both nondeterministic and deterministic variants characterize this class of languages. Recently various classes of subregular languages play also importance [11]. The most known measure of the descriptional complexity of regular languages is the number of states of the minimal deterministic finite automaton accepting them (in several cases, completely defined finite automata are used, which may contain a sink state) [5]. A similar measure based on nondeterministic finite automata is also studied [9]. Also, another measure that could be more interesting if the finite automata are only partially defined, namely, the transition-complexity [8], [16]. These measures are somehow connected to some global properties of the finite automata. A more recent measure is based on the number of final (or accepting) states of the automata [7], [13], comparing it to the previous measure, it seems to be a measure which reflects only part of the automata, not the whole, but a crucial part. Descriptional complexity of regular languages may also be based on regular grammars, however, since the concept of grammars and automata are closely related to each other with very simple constructions from one to the other, these measures do not seem really different than the previously mentioned ones, e.g., the state complexity is closely related to the number of nonterminals needed in the grammar, while the transition complexity is closely related to the number of rewriting rules applied in the grammar.

Measures of regular languages can also be defined based on their regular expressions. A kind of “global measure” could be the length or the number of regular operators used in an expression, and for each regular language there is a regular expression (or there are some regular expressions) having the smallest such value, which can be assigned to the described language. However, we may also give some measures which are not measuring the whole expression, but some, yet significant, details of them. For example, such a measure is the star-height, the number of nested Kleene-stars needed to describe the language [10]. Here, another, a relatively new measure is studied which is also based on the possible regular expressions of a regular language. The union-free languages are defined by regular expressions without the union. They were first mentioned as star-dot regular languages in [3]. Later on, in [6], their description by equations were examined, and it was shown that this class cannot be axiomatized by a finite set of equations. Automata theoretical characterisation of this language class was given in [18]: nondeterministic finite automata with the property that there is exactly one cycle-free accepting path from each of their states accept these languages. This class of automata allowed to define the deterministic counterpart of the class, the family of deterministic union-free languages [4], [14], [15]. On the other hand, every regular language is a finite union of union-free languages [3], [17], [21]. The union-complexity of the regular languages is defined subsequently based on minimal decompositions [17]. In this paper, we use union-complexity to measure for the complexity of regular languages.

While the invited talk and the paper appeared in the proceedings of the DCFS [20] were more about a general view about union-free languages, the corresponding 1-cycle-free-path automata and some known results about union-complexity, here, in this paper, we focus (see Section 3) on the operational union-complexity of regular languages. More specifically, we present and prove theorems about the possible values of the union-complexity of the resulting languages after applying some basic (regular, set theoretical and other) operations on the languages with known union-complexity. As far as we know, this is the first study of union-complexity from the view of descriptional complexity. In the next section we formally define the union-complexity and we give some preliminaries. In Section 3, we present our main results, while some further thought about deterministic union-complexity in Section 4. Finally, summary and conclusions close the paper.

Section snippets

Preliminaries

In this section first we recall the definition of union-complexity of regular languages [17], [19] and we also recall some known results. We assume that the reader is familiar with the basic concepts of formal languages and regular expressions, thus for each unexplained concepts she/he is referred to any standard textbook on the topic, e.g., to [12] or the Handbook chapter [22]. We also fix our notation here. The empty word is denoted by λ, V is a finite alphabet, while $+, \cdot,^{⁎}$ are the regular

Main results

As the name of the operation union is involved in the term union-complexity, we start our studies by analysing how the union-complexity may change if the union of two languages is considered.

Theorem 3.1

Let $L_{1}$ and $L_{2}$ be two regular languages with union-complexities n and m, respectively. Then the union of them, i.e., $L = L_{1} \cup L_{2}$ could have the union-complexity at most $n + m$ . Moreover, this bound is tight, i.e., for any two positive integers $n, m$ , there are languages $L_{1}$ and $L_{2}$ with union-complexities n and m, such

On deterministic union-complexity

We start this section by recalling the definition of deterministic union-free languages, see, e.g., [15], [20]. Since this language class is defined by a specific class of automata, we present definitions of automata first.

Definition 4.1

A 5-tuple $A = (Q, S, V, δ, F)$ is a non-deterministic finite automaton, with the finite set of states Q. Further, $S \in Q$ is the initial state, V is the (input) alphabet and $F \subset Q$ is the set of final (or accepting) states. The function $δ : Q \times (V \cup {λ}) \to 2^{Q}$ is the transition function.

A path $Q_{0} a_{1} Q$

Summary and conclusions

Every regular language is a finite union of union-free languages, that type of description of a regular language is called its union normal-form [17]. Based on the union normal-form of regular expressions, union-complexity of languages has been defined (see, e.g., [17], [19], [20]). Consequently, the class of union-free regular languages plays an important role studying union-complexity. In this paper, the union-complexity as a complexity measure of the regular languages was considered.

Declaration of Competing Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

References (22)

Siniša Crvenković et al.
On equations for union-free regular languages
Inf. Comput.
(2001)
Kosaburo Hashiguchi
Algorithms for determining relative star height and star height
Inf. Comput.
(1988)
Eva Maia et al.
Incomplete operational transition complexity of regular languages
Inf. Comput.
(2015)
Sergey Afonin et al.
Minimal union-free decompositions of regular languages
J. Ramírez Alfonsín
The Diophantine Frobenius Problem
(2005)
Janusz A. Brzozowski
Regular Expression Techniques for Sequential Circuits
(June 1962)
Janusz A. Brzozowski et al.
Most complex deterministic union-free regular languages
Cezar Câmpeanu et al.
State complexity of regular languages: finite versus infinite
Jürgen Dassow
On the number of accepting states of finite automata
J. Autom. Lang. Comb.
(2016)
Yuan Gao et al.
Transition complexity of incomplete DFAs
Fundam. Inform.
(2011)

Yuan Gao et al.

A survey on operational state complexity

J. Autom. Lang. Comb.

(2016)

Cited by (2)

Union-Complexities of Kleene Plus Operation
2022, Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Union-Freeness Revisited-Between Deterministic and Nondeterministic Union-Free Languages
2021, International Journal of Foundations of Computer Science

View full text

Operational union-complexity

Abstract

Introduction

Section snippets

Preliminaries

Main results

On deterministic union-complexity

Summary and conclusions

Declaration of Competing Interest

Inf. Comput.

Inf. Comput.

Inf. Comput.

Minimal union-free decompositions of regular languages

The Diophantine Frobenius Problem

Regular Expression Techniques for Sequential Circuits

Most complex deterministic union-free regular languages

State complexity of regular languages: finite versus infinite

On the number of accepting states of finite automata

J. Autom. Lang. Comb.

Transition complexity of incomplete DFAs

Fundam. Inform.

A survey on operational state complexity

J. Autom. Lang. Comb.