1 Introduction

The Travelling Salesman problem (TSP) is undoubtedly one of the most famous combinatorial optimisation problems. In its standard version, we are given an edge-weighted (undirected) graph and the goal is to find a closed tour with a minimum cost that visits each vertex at least once. This is equivalent to the Graphic Travelling Salesman problem (where the cost between any two vertices corresponds to their shortest path in a graph) and where exactly one visit per vertex is allowed.

The Graphic TSP plays an important role in understanding complexity of more general the Metric TSP problem where cost function \(c: V \times V \rightarrow \mathbb {R}^+\) is defined by a metric. The approximability of the Metric TSP is a long-standing open problem, Christofides’s approximation algorithm with ratio 1.5 (Christofides 1976) hasn’t been improved for more than four decades. It is generally believed that the approximation ratio can be close to 4/3 due to known integrality gap for the Held-Karp LP relaxation (Held and Karp 1970).

In the last decade, some significant progress has been done in the Graphic TSP. Gharan et al. (2011) made first breakthrough with a \((1.5-\varepsilon )\)-approximation algorithm where \(\varepsilon \) being of the order of \(10^{-12}\). Following that, Mömke and Svensson (2011) obtained a significantly better approximation factor of \(\frac{14(\sqrt{2} -1)}{12\sqrt{2} -13}\approx 1.461\), which was improved further to \(\frac{13}{9}\approx 1.444\) by Mucha (2014). To our best knowledge, currently the best known approximation ratio is 1.4 due to Sebő and Vygen (2014). The overview about this recent development can also be found in Svensson (2013).

However, there is still a significant gap between the ratio of the best approximation algorithm and the approximation ratio that provably can’t be achieved unless P = NP. The first APX-hardness result showed the NP-hardness to approximate the TSP problem within \(1+\varepsilon \) without any explicit value for \(\varepsilon \) (Papadimitriou and Yannakakis 1993). The first explicit value 5381/5380 was set by Engebretsen (1998), further improved to 3813/3812 by Böckenhauer et al. (2000) and 220/219 by Papadimitriou and Vempala (2006). The further progress in the reductions and amplifiers increased the threshold to 185/184 by Lampis (2014) and to our best knowledge the currently best value is 123/122 by Karpinski et al. (2015).

Main contribution The main novelty of this paper is using weighted amplifiers and weighted low occurrence of Constraint Satisfaction problems (CSP) as intermediate steps in the NP-hard gap reductions to the Travelling Salesman problem. Allowing the weights in intermediate problems to TSP (or the Steiner Tree problem) is rather natural, as the problems themselves are using edge weights. We demonstrate the technique for TSP and use the parametrised weighted amplifiers in the gap reductions to allow more flexibility in fine-tuning their expanding parameters. In this paper we don’t aim to optimise the parameters of amplifiers that provably exist, but show that already slight improvement of known values modestly improve the hardness of approximation for TSP from the current best value \(\frac{123}{122}\) (Karpinski et al. 2015) to the new value \(\frac{117}{116}\). This provides a new motivation for study of expanding properties of random graphs in order to improve approximation lower bounds of TSP and other edge-weighted optimisation problems.

1.1 Preliminaries

All graphs in this paper are undirected and connected. Let \(G=(V, E)\) be an edge-weighted graph with cost edge-function \(c :E\rightarrow \mathbb {R}^+\). For an edge \(e=\{u, v\}\in E\) we also use the notation uv as an shorthand. In a narrow sense tour in the graph \(G=(V, E)\) is an alternating sequence of vertices and edges, starting and ending at a vertex, where each vertex is incident with the previous and the following edge in the sequence. If a starting and ending vertex is the same, the tour is closed. Any solution of TSP is a closed tour spanning V. The multiset T of its edges is itself called a ’tour’ (in a slightly broader sense adopted in this paper) even if it doesn’t capture the order in which the edges are traversed by that tour. This multiset creates an Eulerian multigraph (edges are taken with their multiplicities if they are used multiple times) spanning V. It is well known (from the theory of Eulerian multigraphs) that from a multiset of edges of a tour one (or possibly many) such traversing the edges can be easily constructed.

A quasi-tour \(T_0\) in G can be obtained from a tour T removing edges of a finite set of closed tours. It is always viewed as a multgraph with the original vertex set V, and now with edge multiset \(T_0\). Each connected component of this multigraph is an Eulerian multigraph or an isolated vertex. A multiset \(T_0\) of edges from E corresponds to a quasi-tour if and only if when all vertices V are balanced with respect to \(T_0\) (each vertex from V is incident with even number of edges from \(T_0\), possibly 0).

Our inapproximability results for the Travelling Salesman problem use reductions from Håstad’s NP-hard gap type result for MAX-E3-LIN-2, the Maximum Satisfiability problem for linear equations modulo 2 with exactly 3 variables per equation Håstad (2001). In fact, Håstad’s tight inapproximability results can be stated in the form in which every variable occurs the same number of times in the system of equations, see e.g. Chlebík and Chlebíková (2003).

Theorem 1

For every \(\varepsilon \in \left( 0, \frac{1}{4}\right) \) and every fixed sufficiently large integer \(k \ge k(\varepsilon )\), the following partial decision subproblem \(Q(\varepsilon , k)\) of MAX-E3-LIN-2 is NP-hard: given an instance of MAX-E3-LIN-2 with m equations and exactly k occurrences of each variable, to decide if at least \((1-\varepsilon )m\) or at most \((\frac{1}{2}+\varepsilon )m\) equations are satisfied by the optimal assignment.

The results of such form were already used to prove the inapproximability results for other optimisation problems, e.g., the Steiner Tree problem (Chlebík and Chlebíková 2008).

For some optimisation problems it is more convenient to use reductions if all equations of MAX-E3-LIN-2 have the same right hand side. The NP-hard gap results in such a case can be easily enforced if we allow flipping some occurrences of variables, so also the literal \({\overline{x}} :=1-x\) can be used for a variable x. The canonical gap versions \(Q_b(\varepsilon , 2k)\), for any fixed \(b=0\) or \(b=1\), of MAX-E3-LIN-2 are as follows:

The \(Q_b(\varepsilon , 2k)\) problem, \(b\in \{0, 1\}\)

Input An instance of MAX-E3-LIN-2 with m equations of the form \(x\oplus y\oplus z =b\), each variable occurring exactly k times as unnegated and k times negated.

Task To decide if at least \((1-\varepsilon )m\) or at most \((\frac{1}{2}+\varepsilon )m\) equations are satisfied by the optimal assignment.

The corresponding ‘fixed occurrence’ NP-hard gap result reads as follows [see Chlebík and Chlebíková (2003) for the details of the following theorem]:

Theorem 2

For every \(\varepsilon \in (0, \frac{1}{4})\) and every sufficiently large integer k, \(k\ge k(\varepsilon )\), the partial decision subproblems \(Q_0(\varepsilon , 2k)\) and \(Q_1(\varepsilon , 2k)\) of MAX-E3-LIN-2 are \(\mathrm{NP}\)-hard.

Weighted amplifiers

Amplifier graphs are useful in proving inapproximability results for CSPs in which every variable appears a bounded (and, typically, very low) number of times. Such CSPs are often used as intermediate steps in proving approximation hardness results for many combinatorial optimisation problems. For problems like Travelling Salesman, or Steiner Tree which are based on edge weights, it is natural to consider the intermediate low degree CSPs with their edge weights as well.

For a graph \(G=(V,E)\), a cut is a partition of V into two subsets U and \(\overline{U}:= V{\setminus }U\). The cut set \(E(U, \overline{U})\) is defined as \(E(U, \overline{U})=\{uv\in E, u\in U \text { and } v\in \overline{U}\}\) and the cut size as \(|E(U, \overline{U})|\). If edges are weighted with \(p:E\rightarrow \mathbb {R}^+\), then \(p(E(U, \overline{U}))\) is weight of the cut set \(E(U, \overline{U})\), hence \(p(E(U, \overline{U}))=\sum _{uv\in E, u\in U, v\in \overline{U}} p(uv)\).

Definition 1

Let \(G=(V, E)\) be a graph with edge weights \(p:E \rightarrow \mathbb {R}^+\), and \(D \subseteq V\), \(|D|\ge 2\). We say that a weighted graph (Gp) is an amplifier for D if for every vertex set \(A \subseteq V\)

$$\begin{aligned}p(E(A, \overline{A}))\ge \min \{|D\cap A|, |D \cap \overline{A}|\}.\end{aligned}$$

The vertices of the given set D are called the contacts, the rest of the vertices (\(=V{\setminus }D\)) is the set of checkers. We say that an amplifier (Gp) for the set D is a d-regular amplifier if, additionally, all contacts have degree \((d-1)\) and all checkers have degree d (in G).

In full generality, one could also allow distinct weights for vertices of D to replace the sizes \(|D\cap A|\), \(|D\cap \overline{A}|\) with their weighted version, but for our purposes the vertices of D are uniformly weighted each with weight 1.

Note If \(G=(V,E)\) is a connected graph and \(D\subseteq V\) with \(|D|\ge 2\) then we can define edge weights \(p:E \rightarrow \mathbb {R}^+\) in such a way that the graph (Gp) is an amplifier. Indeed, if we set

$$\begin{aligned} \beta = \min \left\{ \frac{|E(A, \overline{A})|}{|D \cap A|}: A \subseteq V, \ 1\le |D \cap A| \le \frac{1}{2} |D|\right\} \end{aligned}$$

then clearly, \(\beta >0\), and any weights \(p:E\rightarrow \mathbb {R}^+\) such that \(p(e)\ge \frac{1}{\beta }\), \(\forall e\in E\), produce an amplifier (Gp) for D.

2 Intermediate weighted CSPs

In this section we extend the NP-hard gap results from a system of linear equations with exactly 3 variables to a low occurrence version of \(w\)-MAX-3-LIN-2, a weighted hybrid system of linear equations over \(\mathbb {Z}_2\) with either 2 or 3 variables. Similarly to MAX-E3-LIN-2, the task of the \(w\)-MAX-3-LIN-2 problem is to find an assignment that maximizes weight of the satisfied equations in the hybrid system.

To prove the NP-hard gap results for the \(w\)-MAX-3-LIN-2 problem, we extend Håstad’s results for MAX-E3-LIN-2 using the amplifiers defined in Sect. 1.

Reduction from \(Q(\varepsilon , k)\) to \(w\)-MAX-3-LIN-2

Let \(\varepsilon \in (0, \frac{1}{4})\), and \(k>0\) be an integer such that the problem \(Q(\varepsilon , k)\) is NP-hard. Let an instance I of \(Q(\varepsilon , k)\) be given, denote by \(\nu (I)\) the set of variables of I, \(\nu :=| \nu (I)|\). Let’s assume that \(G=(V, E)\) with the edge weights \(p:E \rightarrow \mathbb {R}^+\) be an amplifier for a set \(D \subseteq V\) with \(|D|=k\).

Now we describe a gap preserving reduction from \(Q(\varepsilon , k)\) to the \(w\)-MAX-3-LIN-2 problem with an amplifier (Gp) as a parameter. The instance I of \(Q(\varepsilon , k)\) is transformed to a weighted hybrid instance J of \(w\)-MAX-3-LIN-2.

  • For each variable \(x\in \nu (I)\) take a copy of the amplifier (Gp), let \((G_x, p)\) denote that copy:

    • Inside \((G_x, p)\) the vertices correspond to the variables in J and each edge \(vv'\) represents the equation \(v\oplus v'=0\) with weight \(p(vv')\) in J.

      The contact vertices of \((G_x, p)\) represent k occurrences of the variable x in the equations of I. Distinct occurrences of a variable x in I are represented by the distinct contact vertices in \(G_x\).

  • Every equation \(x \oplus y \oplus z=b\) from I, \(b\in \{0, 1\}\), also belongs to J with weight 1.

Remark 1

Observe that the above reduction from an instance I of \(Q(\varepsilon , k)\) to an instance J of \(w\)-MAX-3-LIN-2 preserves the \(\mathrm{NP}\)-hard gap of \(Q(\varepsilon , k)\). Indeed, there is a simple dependence of an optimal value for J on that of I.

In the following we show that if we look at these problems as Minimum Unsatisfiability problems, where \(\text {OPT}'\) is the corresponding minimum weight of unsatisfied equations over all assignments, then \(\text {OPT}'(I)=\text {OPT}'(J)\). Clearly, any assignment to variables from \(\nu (I)\) generate an assignment to variable of J in a natural way; the value of a variable \(x \in \nu (I)\) is assigned to all variables of \(G_x\). Such assignments to variables of J are called standard. Hence, obviously \(\text {OPT}'(J)\le \text {OPT}'(I)\).

The observation that the optimum \(\text {OPT}'(J)\) is achieved on standard assignments is based on the amplifier’s properties. Any assignment \(\varphi \) to the variables of J can be converted to a standard one in such a way that the weight of unsatisfied equations doesn’t increase as follows: consider a variable x from \(\nu (I)\). Assign to all variables in \(G_x\) the same value as it is assigned to the majority of contact vertices in \(G_x\) by the assignment \(\varphi \). The fact that \((G_x, p)\) is the amplifier ensures that the weight of unsatisfied equations in J doesn’t increase. Now if we repeat the same operation for each variable from \(\nu (I)\), one after another, the result will be a standard assignment without increase of the weight of unsatisfied equations in J. Consequently, \(\text {OPT}'(J)\) is achieved on the standard assignments. But for every standard assignment the weight of unsatisfied equations of J is the same as the number of unsatisfied equations of I by that assignment, hence \(\text {OPT}'(I)=\text {OPT}'(J)\).

Reduction from \(Q_b(\varepsilon , 2k)\) to \(w\)-MAX-3-LIN-2

Now we slightly modify the previous reduction from \(Q(\varepsilon , k)\) to deal with the instances of \(Q_b(\varepsilon , 2k)\) for any fixed \(b=0\) or \(b= 1\).

Let \(\varepsilon \in (0, \frac{1}{4})\) and \(k>0\) be an integer such that \(Q_b(\varepsilon , 2k)\) is \(\mathrm{NP}\)-hard. Assume that \(G=(V, E)\) with edge weights \(p:E\rightarrow \mathbb {R}^+\) is an amplifier for a set \(D \subseteq V\) with \(|D|= 2k\). Let \(\{V^u, V^n\}\) be a partition of V balanced in D, namely \(| D \cap V^u|= |D \cap V^n| =k\). Denote further \(G^u\) and \(G^n\) the induced subgraph of G with the vertex sets \(V^u\) and \(V^n\), respectively. In what follows we describe the reduction from \(Q_b(\varepsilon , 2k)\) to \(w\)-MAX-3-LIN-2 parametrised by an amplifier (Gp) for \(D\subseteq V\) with \(|D|=2k\) and with chosen balanced partition \(\{V^u, V^n\}\) of V.

Let an instance I of \(Q_b(\varepsilon , 2k)\) be given, \(\nu (I)\) be the set of variables of I, \(\nu =|\nu (I)|\).

  • For each variable x from \(\nu (I)\) take a copy of an amplifier (Gp), let \(G_x\) denote such a copy.

    • Any edge \(vv'\) inside either \(G^u_x\) or \(G^n_x\) represents the cycle equation \(v\oplus v'=0\) taken with weight \(p(vv')\).

    • Any edge between \(v\in V^u_x\) and \(v' \in V^n_x\) in \(G_x\) represents the matching equation \(v\oplus v'=1\) taken with weight \(p(vv')\).

  • The contact vertices of \(G^u_x\) (resp. \(G^n_x\)) represent k occurrences of unnegated (resp. negated) variable x in the equations of I. Every equation \(x \oplus y \oplus z=b\) from I, \(b\in \{0, 1\}\), also belongs to J with weight 1.

This way we produce an instance J of the \(w\)-MAX-3-LIN-2 problem. Any assignment to variables from \(\nu (I)\) generates an assignment to variables of J in a natural way: the value of a variable x is assigned to all variables of \(G^u_x\), and the value opposite to x, \(\overline{x} = 1-x\), is assigned to all vertices of \(G^n_x\). Such assignment to the variables of J is called standard. Any assignment to variables of J can be converted to a standard one without increasing the weight of unsatisfied equations as it follows from properties of an amplifier. The arguments from Remark 1 applied to the problem \(Q(\varepsilon , 2k)\) can be transformed to the \(Q_b(\varepsilon , 2k)\) by reversing all variables from \(V^n_x\) to their opposite.

3 The weighted bi-wheel amplifiers

The previous reductions were based on a general theoretical model of weighted amplifiers. In this section we introduce a class of weighted graphs with such expanding properties that generalise the bi-wheel amplifiers from Karpinski et al. (2015). Further we describe in the details the properties of the instances of the subproblem of \(w\)-MAX-3-LIN-2, called the Hybrid bi-wheel instances.

Definition 2

Let an integer \(k>0\) and a rational number \(\tau >1\) be such that \(\tau k\) is an integer. The weighted \((2k, \tau )\)-bi-wheel amplifier \(W_{k,\tau }=(V, E)\), \(p:E \rightarrow \mathbb {R}^+\), is a (weighted) 3-regular amplifier with a specific balanced partition constructed as follows: Take two disjoint cycles, each on \(\tau k\) vertices (connected in consecutive order), \(V^u=\{1^{u}, 2^{u}, \dots , (\tau k)^{u}\}\) and \(V^n=\{1^{n}, 2^{n}, \dots , (\tau k)^{n}\}\), respectively. Select the sets of k contacts \(D^u\subseteq V^u\) and \(D^n\subseteq V^n\) as \(D^u=\{c_1^{u}, c_2^{u}, \dots , c_k^{u}\}\), \(D^n=\{c_1^{n}, c_2^{n}, \dots , c_k^{n}\}\). The remaining vertices of both cycles, \(V^u{\setminus }D^u\) and \(V^n{\setminus }D^n\), are checkers.

To complete the construction, consider a perfect matching between the checkers of these two cycles where each matching edge has one vertex in the first cycle \(V^u{\setminus }D^u\) and another one in the second cycle \(V^n{\setminus }D^n\).

We assume that in each cycle of the bi-wheel consecutive contacts are separated by a chain of several (at least 1) checkers. Hence, in particular, \(\tau \ge 2\).

Remark 2

Let us denote by \({\mathscr {C}}^u\) (\({\mathscr {C}}^n\), resp.) the set of edges contained in the first (the second, resp.) cycle in \(W_{k, \tau }\), so \({\mathscr {C}}^u=\{\{i^{u}, (i+1)^{u}\}: i=1, 2, \dots , \tau k\}\) and \({\mathscr {C}}^n=\{\{i^{n}, (i+1)^{n}\}\}: i = 1, 2, \dots , \tau k \}\) (the vertex \(\tau k+1\) is the vertex 1), and by \({\mathscr {M}}\subseteq E\) the associated perfect matching on the set of checkers. Clearly, \(|{\mathscr {C}}^u|=|{\mathscr {C}}^n|=\tau k\), \(|{\mathscr {M}}| =| V^u{\setminus }X^u| = | V^n{\setminus }X^n|= (\tau -1)k\).

In this paper we consider only bi-wheel amplifiers \((W_{k, \tau }, p)\) whose weights have uniform cycle weight \(p_c\) for all cycle edges of both \({\mathscr {C}}^u\) and \({\mathscr {C}}^n\), and another uniform matching weight \(p_m\) for all matching edges from \(\mathcal M\).

Now we are ready to describe the specific properties of the Hybrid bi-wheel instances of \(w\)-MAX-3-LIN-2 based on a fixed \((2k, \tau )\)-bi-wheel amplifier \(W_{k, \tau }\) with weights \(p_c\) and \(p_m\).

Theorem 3

For every \(\varepsilon \in (0, \frac{1}{4})\) and \(b\in \{0, 1\}\) there exist instances of \(w\)-MAX-3-LIN-2, called \(\text {Hybrid}(W_{k, \tau }, p)\), with the following properties:

  1. (i)

    each variable of the system equations \(\text {Hybrid}(W_{k, \tau }, p)\) occurs exactly 3 times;

  2. (ii)

    m equations are of the form \(x\oplus y\oplus z=b \), each of weight 1;

  3. (iii)

    \(3\tau m\) equations are of the form \(x\oplus y=0\) each of weight \(p_c\);

  4. (iv)

    \(\frac{3}{2}m(\tau -1) \) equations are of the form \(x\oplus y=1\) each of weight \(p_m\),

for which it is \(\mathrm{NP}\)-hard to decide whether there is an assignment to the variables that leaves unsatisfied equations of weight at most \(\varepsilon m\), or every assignment to the variables leaves unsatisfied equations of weight at least \((0.5-\varepsilon ) m\).

Proof

Let \(\varepsilon \in \left( 0, \frac{1}{4}\right) \), \(b\in \{0, 1\}\), and \(k>0\) be an integer such that \(Q_b(\varepsilon , 2k)\) is \(\mathrm{NP}\)-hard. Let an instance I of \(Q_b(\varepsilon , 2k)\) be given, \(\nu (I)\) be its set of \(\nu =|\nu (I)|\) variables, so \(m:=|I|=\frac{2k\nu }{3}\).

For any fixed variable \(x \in \nu (I)\) we construct a \((2k, \tau )\)-bi-wheel \(W_x\) (a copy of \(W_{k, \tau }\)) with \(2\tau k\) variables \(Var(x)=\{x^{u}_j, x_j^{n}\}^{\tau k}_{j=1}\). Following the notation from Definition 2, each \(j^{u}\) is mapped to the variable \(x^{u}_j\), \(j^{n}\) to \(x_j^{n}\). Let \({\mathscr {C}}^u(W_x)\), \({\mathscr {C}}^n(W_x)\) denote the cycle edges, \(\mathscr {M}(W_x)\) the corresponding matching edges.

  • For each matching edge \(j^{u} l^{n}\in \mathscr {M}(W_x)\) we create the equation \(x_j^{u}\oplus x_l^{n}=1\) weighted by \(p_m\). Therefore, we have \(\nu (\tau -1)k=\frac{3}{2} m(\tau -1)\) such equations.

  • For each cycle edge \(\{j^q, (j+1)^q\}\in {\mathscr {C}}^q(W_x)\), \(j=1, 2, \dots , \tau k\) (the vertex \(\tau k+1\) is the vertex 1) with \(q\in \{u, n\}\) we introduce the equation \(x^{q}_j\oplus x^{q}_{j+1}=0\) weighted by \(p_c\). Hence we have \(2k\tau \nu = 3\tau m\) such equations.

Finally, we replace the l-th unnegated appearance of x in I by the contact variable \( x^{u}_{c^u(l)}:= c^{u}_l\), whereas the l-th negated appearance is replaced by \(x^{n}_{c^n(l)}:= c^{n}_l\), for \(l=1, \dots , k\). This yields m equations of the form \(x\oplus y\oplus z=b\), each weighted by 1.

Obviously, each of the variables \(Var(x)=\{x^{u}_j, x_j^{n}\}^{\tau k}_{j=1}\) where \(x\in \nu (I)\) occurs exactly 3 times and the hardness results follows directly from a properties of the reduction discussed in Sect. 2.\(\square \)

The reduction from Hybrid\((W_{k,\tau },p)\), presented later in Sect. 4, is a gap preserving reduction to TSP parametrised by a \((2k, \tau )\)-bi-heel amplifier with cycle weights \(p_c\) and matching weights \(p_m\). The trade-off between parameters \(p_c\), \(p_m\) and \(\tau \) is crucial for quality of approximation lower bounds.

Definition 3

We call the triple \((p_c, p_m, \tau )\) admissible if for every \(k_0\) there exists \(k\ge k_0\) and a \((2k, \tau )\)-bi-wheel that is an amplifier with cycle weights \(p_c\) and matching weights \(p_m\).

The bi-wheel amplifiers introduced by Karpinski et al. (2015) are based on the fact that the triple (\(p_c=1\), \(p_m=1\), \(\tau = 7\)) is admissible. This leads to NP-hardness to approximate TSP to within any constant approximation ratio less than \(\frac{123}{122}\). They also observed (Karpinski et al. 2015) that their proof (of amplification properties) doesn’t seem to work with \(\tau =6\) instead \(\tau =7\). However, there is an opportunity for fine-tuning here if we allow non-integral \(\tau \). If, e.g., 90% of pairs of consecutive contacts in bi-wheel cycles are separated by 6 checkers, and 10% of such pairs are separated by a chain of 5 checkers only, then the proof of required amplification properties still works. The detailed explanation together with all computations for wheel amplifiers can be found in the paper Chlebík and Chlebíková (2003). The proof for bi-wheels is very similar, so along these lines one can argue that the triple (\(p_c=1\), \(p_m=1\), \(\tau = 6.9\)) is admissible. This itself would (very modestly) improve on the lower approximation bound for TSP given in Karpinski et al. (2015).

Introducing weighted amplifier graph constructions seems to have paid off even more compared to improvement of parameters for unweighted amplifiers. In this case we have more freedom in fine-tuning the approximation hardness lower bounds obtained in parametric way, if we can prove that bi-wheel amplifiers with certain parameters \((p_c, p_m, \tau )\) exist.

Let us explain trade-off between parameters (\(p_c, p_m, \tau \)) of bi-wheels in a simple scenario with \(p_m=1\) fixed. Our contribution allows to use weighted amplifiers with \(p_c< 1\) (strengthening of amplifiers) or with \(p_c>1\) (relaxing of amplifiers). One can achieve amplifiers with \(p_c<1\) by increasing \(\tau \) from \(\tau =7\). On the other hand, to relax to \(p_c>1\) can be achieved with \(\tau <7\). These ideas indicate importance to better understand the exact trade-off between \((p_c, p_m, \tau )\) triples for bi-wheel amplifiers that provably exist.

Based on the definition of the admissible triples, the main result of this paper presented as Theorem 4 in Sect. 4 can be formulated as:

Theorem

If \((p_c, p_m, \tau )\) is an admissible triple then it is \(\mathrm{NP}\)-hard to approximate the Travelling Salesman problem to within any constant approximation ratio less than

$$\begin{aligned}1+ \frac{1}{3(\tau -1)(4p_c+\max \{1, p_m\})+12p_c+20 \max \{1, p_m\}}.\end{aligned}$$

In this paper we only want to demonstrate effectiveness of weighted parametrised amplifiers and address the question of fine-tuning in (\(p_c, p_m, \tau \)) triples for bi-wheel amplifier. This can be summarises as [see Theorem 5 in Sect. 5 for the proof]:

Theorem

The triple (\(p_c=\frac{1}{2}, p_m=1, \tau =11 \)) is admissible, hence for every large enough \(k\ge k_0\) there is a (2k, 11)-bi-wheel that is an amplifier with cycle weights \(p_c=\frac{1}{2}\) and matching weights \(p_m=1\).

Therefore we can conclude

Corollary

It is NP-hard to approximate the Travelling Salesman problem within any constant approximation ratio less than \(\frac{117}{116}\).

4 Gap preserving reduction from Hybrid\((W_{k,\tau },p)\) to TSP

In this section we describe a gap preserving reduction from the system of equations Hybrid\((W_{k,\tau },p)\) to the Travelling Salesman problem. In the reduction we suppose that all equations of Hybrid\((W_{k,\tau },p)\) with three variables are of the form \(x\oplus y\oplus z=0\) to simplify a discussion later (hence Hybrid\((W_{k,\tau },p)\) was obtained via reduction from \(Q_0(\varepsilon , 2k)\)). We also introduce a real parameter \(\theta >0\) set to \(\theta =\frac{1}{\max \{1, p_m\}}\), in order to simultaneously capture different scenarios \(p_m\le 1\) and \(p_m>1\).

The gap preserving reduction is similar to the reduction presented in Karpinski et al. (2015), the main difference is in using a parametrised weighted \((2k, \tau )\)-bi-wheel amplifier \((W_{k,\tau },p)\) introduced in Sect. 3. We use the concept of forced edges introduced by Lampis (2014) (used also in Karpinski et al. (2015)). The idea is based on the observation that we are able to stipulate that some edges, called forced edges, are to be used at least once in any valid tour. It can be achieved by replacing such an edge with a path of many edges of the same total weight. With this trick we may assume without loss of generality that we can force some edges to be used at least once [see Karpinski et al. (2015) for the details]. If u and v are vertices that are connected by a forced edge e, we write \(\{u,v\}_F\) or simply \(uv_F\). The construction contains some forced edges, all other edges in the constructed graph are unforced edges with edge weight 1.

We start with an instance I of \(Q_0(\varepsilon , 2k)\) with \(\nu \) variables, m equations of the form \(x\oplus y\oplus z=0\) and use the reduction from Sect. 2 to create an instance J of Hybrid\((W_{k,\tau },p)\). Using the same notation as in Theorem 3 we construct an instance G[J] of TSP in the following way: for each copy \(W_j:= \) \((W_{k,\tau },p)\), \(1\le j\le \nu \), of a \((2k, \tau )\)-bi-wheel we construct a subgraph of G[J]:

  1. (i)

    each variable x of the bi-wheel \(W_j\), corresponds to a vertex x in the subgraph,

  2. (ii)

    for each cycle equation \(x\oplus y=0\), we create an unforced edge xy with weight 1.

Now we add the edges among the vertices of ‘bi-wheel’ subgraphs using two types of gadgets:

  • a 3-variable gadget \(H^{3Q}\):

    for each equation j, \(1\le j\le m\), of the form \(x\oplus y\oplus z=0\) we add a 3-variable gadget \(H^{3Q}_{j}\) connecting the contacts x, y, z, where each contact vertex x, y, and z is part of its own \((2k, \tau )\)-bi-wheel. Each gadget \(H^{3Q}_{j}\) contains two new vertices \(\gamma ^l\), \(\gamma ^r\) for every vertex \(\gamma \in \{x, y, z\}\) and two additional vertices \(e_j^l\) and \(e^r_j\), see Fig. 1 how the vertices are connected. All edges \(\{\gamma ^\alpha , \gamma \}_F\) with \(\alpha \in \{r, l\} \) and \(\gamma \in \{x, y, z\}\) are forced edges with weight \(w(\{\gamma ^\alpha , \gamma \}_F)= 0.5+p_c \theta \). All remaining edges of \(H^{3Q}_{j}\) are unforced with weight 1.

  • a matching gadget \(H^{2M}\):

    for each equation \(x_t^u\oplus x_q^n=1\) we add a matching gadget \(H^{2M}\) connecting the checkers \(x^u_t\) and \(x^n_q\) via two forced edges \(\{x^u_t, x^n_q\}^1_F\) and \(\{x^u_t, x^n_q\}^2_F\), each of the same weight \(2p_c\theta \) (Fig. 2).

At the end of the construction, we add a new central vertex s that is connected to every gadget \(H^{3Q}_{j}\) with two forced edges \(\{e_j^l, s\}_F\) and \(\{e^r_j, s\}_F\), both with weight 0.5, \(w(\{e_j^\alpha , s\}_F)=0.5\) for both \(\alpha \in \{r, l\}\).

Observe that the construction doesn’t need gadgets for the cycle edges, the connections between the matching edge gadgets are sufficient to encode these constraints.

Fig. 1
figure 1

An example of a 3-variable gadget \(H^{3Q}_{j}\) including the central vertex s, which is not part of the gadget. Thick lines represent forced edges.

Fig. 2
figure 2

A gadget \(H^{2M}\) inside the bi-wheel \(G_x\) for the equations \(x_t^u\oplus x_q^n=1\) contains only two forced edges, represented as thick lines.

Now in the following we describe in the details the properties of the gap preserving reduction from the \(\text {Hybrid}\) \((W_{k,\tau },p)\) to the Travelling Salesman problem.

Local edge cost To count the cost c(T) of a tour T, we use the local edge cost counting based on the ideas from Karpinski et al. (2015): the cost w(uv) of any edge uv of T is split into two nonnegative parts, one attached to u and the second one to v. If an edge uv doesn’t contain s then cost is split equally with contribution 0.5w(uv) for each vertex, but for edges of the form us, the full cost contributes to u, and none to s.

Let T be a multi-set of edges from E that defines a quasi-tour in G[J](VE). Then for a set \(V'\subseteq V\), the local edge cost of \(V'\) is formally defined as

$$\begin{aligned} c_{T}(V')=\sum _{u\in V'{\setminus }\{s\}} \sum _{uv\in T} 0.5 w(uv)+ \sum _{e_j^\alpha \in V'} \sum _{e_j^\alpha s\in T} 0.5w(e^\alpha _js).\end{aligned}$$

Note that for two vertex sets \(V_1\), \(V_2\) we have \(c_T(V_1 \cup V_2)\le c_T(V_1) + c_T(V_2)\) (with equality for disjoint sets), and \(c_T(V)=\sum _{e\in T} w(e)\).

In Sect. 4.2 we also use the full local cost of the quasi-tour T for the set \(V'\), \(c^*_{T}(V')\), which is defined as follows: if \(\#_{T}(V')\) is the number of connected components induced by T which are fully contained in \(V'\), then

$$\begin{aligned} c^*_{T}(V')= c_{T}(V') + 2 \#_{T}(V').\end{aligned}$$

Intuitively, \(c^*_{T}(V')\) captures the cost of the full tour restricted to \(V'\): it includes the local edge cost and the cost of a connection of the components on \(V'\) of the lowest possible price (using two unforced edges), to the rest of the tour. Note that for two disjoint vertex sets \(V_1\), \(V_2 \subseteq V\) we have \(c^*_T(V_1 \cup V_2)\ge c^*_T(V_1) + c^*_T(V_2)\).

4.1 How to construct a tour from an assignment

Given an instance J of the Hybrid\((W_{k,\tau },p)\) and an assignment \(\varphi \) to its variables, we describe a construction of a tour T in G[J] with cost related to \(\varphi \).

Lemma 1

Let J be an instance of Hybrid\((W_{k,\tau },p)\) from Theorem 3. If there exists an assignment \(\varphi \) to the variables of J with unsatisfied equations of total weight \(\varDelta \), then there exists a tour in G[J] with cost at most

$$\begin{aligned}\biggl (\frac{3}{2}(\tau -1)(4p_c\theta +1)+6p_c\theta +10\biggr )m+2\nu +\varDelta \,.\end{aligned}$$

Proof

Step 1 First we describe a construction of a quasi-tour T that may have several connected components, but uses all forced edges, each exactly once. For each vertex the selection of edges chosen for T is based on type of the gadget to which the vertex belongs to, either a 3-variable gadget \(H^{3Q}\) or a matching gadget \(H^{2M}\).

Part A: A matching gadget \(H^{2M}\)

Let \(\{W_x, x \in \nu (I)\}\) be the associated set of bi-wheels of J. For a fixed \(x\in \nu (I)\}\), let \(\{x_j^{u}\}^{\tau k}_{j=1}\), \(\{x_j^{n}\}^{\tau k}_{j=1}\) be the sets of the bi-wheel variables of \(W_x\). Due to the amplifier properties, a given assignment \(\varphi \) can be converted into a standard one satisfying the same upper bound on the weight of unsatisfied equations [see Sect. 2 for the details]. Hence, we can assume that all equations with two variables are satisfied by the given assignment \(\varphi \). It means, \(x^{u}_t=x^{u}_q\), \(x_t^{n}=x_q^{n}\), \(x_t^{u}\ne x_q^{n}\) for all \( t, q = 1, 2, \dots , \tau k\). Assuming \(x^\alpha _t=1\) for some \(\alpha \in \{u, n\}\), we include in T all unforced edges \(x^\alpha _t x_{t+1}^\alpha \) with \(t= 1, 2, \dots , \tau k-1\) (each edge once), the edge \(x^\alpha _{\tau k} x^\alpha _1\), and both adjacent forced edges for each checker corresponding to matching equations (a copy of matching gadget \(H^{2M}\)). Informally, for each bi-wheel we select the cycle that corresponds to the assignment 1 and use all the unforced cycle edges including both forced edges from each checker. This creates a component of the quasi-tour T that contains all checkers from both cycles, and all contacts from one cycle.

The contribution of the vertices and chosen incident edges to the local edge cost of the quasi-tour T is \(4p_c\theta \) for two forced edges of matching, and 1 for two unforced edges of one (checker) vertex of matching edge (one with \(x^\alpha _t=1\) for \(\alpha \in \{u, n\}\)), hence \(4p_c\theta +1\) for each matching edge \(x^{u}_t x_q^{n}\in {\mathscr {M}}\).

Part B: A 3-variable gadget \(H^{3Q}\)

In this part we discuss how the quasi-tour T can traverse through the all vertices of the gadget \(H^{3Q}_{j}\), \(j = 1, 2, \dots , m\). Following the construction of G[J] each \(H^{3Q}_{j}\) simulates the equation \(x\oplus y \oplus z=0\). A selection of edges belonging to T is based on a given standard assignment \(\varphi \) to the contact variables x, y, and z. Depending on their values, we consider the following cases:

  • Case \(x+y+z=2\): Suppose \(y=z=1\), \(x=0\), the other cases are symmetrical. Then we add the edges \(z^lz^r\), \(y^ly^r\), \(e^l_jx^l\), and \(e^r_jx^r\) to T which contributes 4 to the local edge cost of T restricted to the gadget \(H^{3Q}_{j}\). The unforced edges adjacent to y and z contributes 2, hence 6 together for unforced edges.

  • Case \(x+y+z=1\) or \(x+y+z=0\): In both cases we add for each vertex \(\gamma \in \{x, y, z\}\) the edges \(\gamma ^le^l_j\), \(\gamma ^re^r_j\) to T with local unforced edge cost contribution of 6. If \(x+y+z=1\), we have to add 1 for edges adjacent to a vertex with assignment 1.

  • Case \(x+y+z=3\): This case is similar to \(x+y+z=2\). We add the edges \(z^lz^r\), \(y^ly^r\), \(e^l_jx^l\), and \(e^r_jx^r\) to T with local unforced edge cost contribution 4 for the chosen edges inside \(H^{3Q}_{j}\), 3 for edges adjacent to x, y, and z, hence 7 together.

The local edge cost contribution of the gadget \(H^{3Q}_{j}\) to T can be split into contribution of forced and unforced edges. The forced edges contribute \(6(p_c \theta +0.5)+1 = 6p_c\theta +4\) (our local count method allow to omit the vertex s, as weight of its adjacent edges is fully counted at the vertices \(e^l_j\), \(e^r_j\)) and following the detailed discussion above, if the equation is satisfied the local unforced edge contribution to cost of T is 6, otherwise 7.

To summarise, in this way we construct a quasi-tour T for which we can count cost summing all local edge costs through all the gadgets (except s):

  • there are \(\frac{3}{2}(\tau -1)m\) gadgets of size two (matching gadget \(H^{3Q}\)) with local edge cost \(4p_c\theta +1\), and

  • there are m copies of 3-variable gadget \(H^{3Q}\), each contributes \(6p_{c}\theta + 10\) if satisfied, \(6p_{c}\theta +11\) otherwise.

The total edge cost of the quasi-tour T is thus at most

$$\begin{aligned} \biggl (\frac{3}{2}(\tau -1)(4p_c\theta +1)+ 6p_c\theta +10\biggr )m+\varDelta . \end{aligned}$$

Step 2 Observe that the quasi-tour T constructed in Step 1 may have at most \(\nu +1\) connected components (one for each bi-wheel and one containing s), which needs to be connected to create a tour.

Since all unforced edges have cost 1, we can connect components by double using an unforced edge that connects them (this is possible as G[J] was a connected graph). This doesn’t increase the cost of quasi-tour but decreases the number of components. Repeating this will result in a connected tour T of the cost

$$\begin{aligned} \biggl (\frac{3}{2}(\tau -1)(4p_c\theta +1)+6p_c\theta +10\biggr )m+2\nu +\varDelta .\qquad \qquad \qquad \end{aligned}$$

\(\square \)

4.2 How to define an assignment from a tour

Now we need to prove the opposite direction of the gap preserving reduction: given a tour in G[J] the task is to define an assignment to the variables of the system equations I of Hybrid\((W_{k,\tau },p)\) such that weight of unsatisfied equations is in a correlation with cost of a given tour.

Lemma 2

If there is a tour in G[J] with cost

$$\begin{aligned} \biggl (\frac{3}{2}(\tau -1)(4p_c\theta +1)+6p_c\theta +10\biggr )m+\varDelta -2, \end{aligned}$$

then there is an assignment to the instance J that leaves unsatisfied equations of weight at most \(\varDelta \cdot \max \{1, p_m\}=\frac{\varDelta }{ \theta }\), where \(\theta =\frac{1}{\max \{1, p_m\}}\).

The high-level idea of the proof is to partition the vertex set of G[J] into the gadget-based subgraphs similarly as in the proof of Lemma 1. For each such subgraph we give a lower bound on the local edge cost of any quasi-tour restricted to it, which in fact corresponds to cost of the tour constructed in Lemma 1. If a given quasi-tour behaves inside a gadget differently, its cost must be obviously higher. The difference between the full tour’s local cost and the lower bound is called the credit of the gadget. Based on the tour we define an assignment for J and show that the total sum of credits can be used to bound from above the weight of unsatisfied equation, where the total sum of credits is at most \(\varDelta \).

Proof

Let G[J](VE) be the graph constructed at the beginning of Sect. 4. Obviously, we can suppose that T contains at most two copies of each edge. Otherwise we can simply remove two copies of the same edge from T and decrease the cost of the initial tour. If T contains two copies of the same unforced edge, removing both copies from the tour results in a quasi-tour, where the number of the connected component may increase by 1. If this is the case, then the cost of 2, which corresponds to the cost of removed copies of unforced edges, is paid to reconnect the tour. Therefore removing two copies of the same edge doesn’t increase the cost of the tour, but may result in a quasi-tour.

Thus, we can transform the initial tour T without increasing its initial cost into a quasi-tour without multiple unforced edges and with at most two copies of forced edges.

Consider the following partition of V: (A) \(\frac{3}{2}(\tau -1)m\) sets of size 2 vertices (‘matching pairs’) corresponding to the gadgets \(H^{2M}\) covering all checker vertices, (B) m sets of size 11 vertices corresponding to the gadgets \(H^{3Q}\) for size-three equations, (C) a single vertex \(\{s\}\).

Obviously, local edge cost of the subgraph from part (C) is 0 due to the method how we calculate the local cost of T (cost of all edges adjacent to s is fully contributed to the neighbourhoods of s), therefore part (C) doesn’t need to be considered further.

In the following we analyse how the quasi-tour T can locally traverse the vertices inside the subgraphs in (A) and (B) and based on that we define an assignment to J.

As it follows from the construction, only the vertices from the bi-wheels correspond to the variables of J. Observe that each vertex of a bi-wheel is adjacent to exactly two forced and two unforced edges. For a given quasi-tour T, a bi-wheel vertex v (at the same time also variable) of G[J] is called honest if the quasi-tour T uses both forced edges adjacent to v exactly once, otherwise the vertex is called dishonest. Now we define the honest assignment used for all honest variables (checkers and contacts): if the quasi-tour T for the honest vertex v also contains both unforced edges adjacent to it, then we assign 1 to v, and 0 otherwise.

Case A: The matching gadgets \(H^{2M}\)

In this part we define the assignment to the checker variables of the bi-wheels. Let \(W:=\) \((W_{k,\tau },p)\) be a fixed copy of the bi-wheel gadget \(H^{2M}\) for a variable \(x\in \nu (I)\) and \(M:= \{x^u_t, x^n_q\}\) be a fixed matching pair from W.

The variables from M appear in the following equations: a matching equation \(x^u_t\oplus x^n_q = 1\) and four incident cycle equations \(x^u_{t-1}\oplus x_t^u=0\), \(x^u_t\oplus x^u_{t+1}=0\), \(x^n_{q-1}\oplus x^n_q=0\), \(x^n_{q}\oplus x^n_{q+1}=0\) (indices are modulo \(k\tau \)).

The full local cost of any quasi-tour on M is \(c^*_T(M) \ge 4p_c\theta +1\): \(4p_c\theta \) for two forced edges and 1 for two unforced edges, hence

$$\begin{aligned} cr_{T}(M)=c^*_{T}(M)- (4p_c\theta +1).\end{aligned}$$
(1)

Clearly, \(x^u_t\) is the honest vertex if and only if \(x_q^n\) is the honest vertex, therefore we split the discussion into two independent cases: honest matching pairs and dishonest matching pairs.

Case A.1 Honest checker variables

We use the honest assignment for all honest checker vertices: it means if the quasi-tour T contains both unforced edges adjacent to \(x^u_t\) (resp. \(x_q^n\)), then we assign 1 to \(x^u_t\) (resp. \(x_q^n\)), 0 otherwise.

If we use such assignment for the honest checkers in all matching pairs in W, the cycle equation \(x^u_t \oplus y=0\) (resp. \(x_q^n \oplus y=0\)) is satisfied, when the adjacent vertex y is a honest checker. In other two cases, when y is a dishonest checker or a contact vertex, such equations are included in case of dishonest checkers (Case A.2) or contacts (case B).

Therefore we only need to discuss a matching equation \(x^u_t\oplus x^n_q = 1\) in this part.

Obviously, if T selects two unforced edges for only one variable, hence one checker variable is set to 1 and the second to 0, \(cr_{T}(M)=0\), but the matching equation is satisfied, so no credit is needed.

Otherwise, a quasi-tour on M can either contain two unforced edges adjacent to both \(x^u_t\), \(x^n_q\) or to be a connected component without any unforced edges. The full local cost of the quasi-tour T is in both cases \(c^*_{T}(M)= 4p_c\theta +2 \), hence \(cr_{T}(M)=1 \). In both these cases the honest assignment sets the same value to both variables, therefore the equation \(x^u_t\oplus x^n_q = 1\) is not satisfied, but there is enough credit to pay for it.

To summarise, the cycle equations incident to M are either satisfied (if a neighbourhood is a honest checker) or they are considered later. If the matching equation is satisfied, then \(cr_{T}(M)=0 \), if not then \(cr_{T}(M)= 1 \):

  • If \(p_m\le 1\) the credit \(cr_T(M)\) is sufficient to pay for the unsatisfied matching equation,

  • If \(p_m>1\) then the credit is not sufficient, the weight of an unsatisfied equation is \(p_m \cdot cr_T(M)\). That is why we aim in this scenario to bound from above the weight of unsatisfied equations by \(\max \{1, p_m\}\cdot cr_T\).

Case A.2 Dishonest checker variables

If \(M:=\{x^u_t, x_q^n\}\) is a dishonest matching pair, then the quasi-tour must use one of the incident forced edges twice. Therefore, the full local cost of T is at least \(6p_c\theta +1\), hence the credit is \(cr_T(M)\ge 2p_c \theta \) due to (1).

In this case we define an assignment to the variables from M in such a way that the matching equation \(x^u_t\oplus x^n_q=1\) is satisfied, and the credit is used for unsatisfied incident cycle equations.

Now we consider an assignment to the all dishonest matching pairs in W all it once and show that there is an assignment for dishonest checker variables for which at most half of the cycle equations incident to the dishonest checker variables is not satisfied.

Let \(\mathscr {M}_{DH}\) be a set of dishonest matching pairs in W. Clearly,

$$\begin{aligned} \sum _{e\in \mathscr {M}_{DH}} cr_T(e)\ge 2p_c \theta | \mathscr {M}_{DH}|. \end{aligned}$$

Let us denote \({\mathscr {C}_{DH}}\) the union of quadruples of cycle edges incident to all \(e\in \mathscr {M}_{DH}\). Obviously, \(| {\mathscr {C}_{DH}}| \le 4 | \mathscr {M}_{DH}|\).

Now consider random assignments to all pairs \(\mathscr {M}_{DH}\) satisfying \(x^u_t \oplus x_q^n=1\) for every \(x_t^u x^n_q\in \mathscr {M}_{DH}\). The expected number of unsatisfied cycle equations in \({\mathscr {C}_{DH}}\) is at most \( \frac{1}{2} | {\mathscr {C}_{DH}}| \le 2 | \mathscr {M}_{DH}|\) so the weight of unsatisfied equations is at most

$$\begin{aligned} 2p_c|\mathscr {M}_{DH}|\le \frac{1}{\theta }\cdot \sum _{e\in \mathscr {M}_{DH}} cr_T(e)=\max \{1, p_m\}\sum _{e\in \mathscr {M}_{DH}} cr_T(e).\end{aligned}$$

Hence similarly to Case A.1, the weight of unsatisfied equations is bounded from above by \(\max \{1, p_m\}\cdot cr_T\).

Case B: The 3-variable gadget \(H^{3Q}\)

In this part we define the assignment to the contact variables of the bi-wheels.

Let \(H^{3Q}:= H^{3Q}_j\), \(j\in \{1, \dots , m\}\), be a fixed 3-variable gadget on the vertex set \(V^{3Q}=\{x, y, z, x^r, x^l, y^r, y^l, z^r, z^l, e^r, e^l\}\). We’ll discuss the individual cases depending on how many vertices from \(\{x, y, z\}\) are dishonest in \(H^{3Q}\) with regard to T.

Case B.1. All three vertices x, y, zare honest Any unforced edge is contained in the quasi-tour T at most once, therefore to keep the degree of the vertices x, y, z balanced in T, the quasi-tour T either contains both unforced edges or none for every x, y, z.

To balance the degree of the vertices \(\gamma ^l, \gamma ^r\) in T, where \(\gamma \in \{x, y, z\}\), either the edges \(\gamma ^le^l\), \(\gamma ^re^r\) are in T, or T contains the edge \(\gamma ^l\gamma ^r\). In the latter case the additional cost at least 1 must be added for connecting the quasi-tour on the vertex set \(\gamma \), \(\gamma ^l\), \(\gamma ^r\) to the rest of the tour either using two different unforced edges incident to \(\gamma \) or using a multiple unforced edge. Hence, the contribution of the vertex set \(\gamma ^l\), \(\gamma ^r\), \(\gamma \) to the cost of any tour is at least \(3 + 2p_c\theta \), where the full cost of the edges \(\gamma ^le^l\), \(\gamma ^re^r\) is added to the vertices \(\gamma ^l\), \(\gamma ^r\) to simplify counting. This holds for any vertex \(\gamma \in \{x, y, z\}\), therefore the vertices x, y, z contribute together to the cost of T with at least \((9 + 6p_c\theta )\).

The degree of the vertices \(e^l\), \(e^r\) may be balanced in T through the edges \(\gamma ^le^l\), \(\gamma ^re^r\), whose cost is already counted, but cost at least 1 needs to be added for forced edges \(e^ls\), \(e^rs\). (If the degree of the vertices \(e^l\), \(e^r\) is not balanced in T through the edges \(\gamma ^le^l\), \(\gamma ^re^r\), then multiple used of forced edges \(e^ls\), \(e^rs\) contributes at least 2.)

To summarise, \(c_T^*(V^{3Q})\ge 6p_c\theta +10\), and the credit of \(H^{3Q}_{}\) with respect to the quasi-tour T is

$$\begin{aligned} cr_T (V^{3Q})=c^*_T(V^{3Q}) - (6p_c\theta +10).\end{aligned}$$
(2)

In this subcase we use the honest assignment for the contact honest variables: for each \(\gamma \in \{x, y, z\}\), we set \(\gamma =1\) if the quasi-tour T uses both unforced edges incident on \(\gamma \), and 0 otherwise.

To recall, each variable \(\gamma \) appears in four cycle equations and the size-three equation. Due to the properties of the construction any contact vertex in the cycle is incident only to a checker: (i) if a honest checker is incident with a contact variable in the bi-wheel, the honest assignment sets the same value to both honest variables (1, if they are adjacent by an edge, 0 otherwise), therefore the corresponding cycle equations are always satisfied; (ii) if a dishonest checker is incident with a contact variable in the bi-wheel, weight of all such unsatisfied cycle equations is considered in Case A.2 for dishonest checkers.

Therefore we need to prove that the credit \(cr_T (V^{3Q})\) is enough to pay only for unsatisfied the size-three equation using the honest assignment to its contacts x, y, z. We’ll discuss the possible options separately.

  • If \(x+y + z =0\) or \(x+y+z =2\) then the equation \(x\oplus y\oplus z=0\) is satisfied. As it has been proved before, the full local cost of any tour T on the vertex \(V^{3Q}\) is at least \((6p_c\theta +10)\), and therefore \(cr_T(V^{3Q})\ge 0\), but no credit is needed to pay for the size-three equation.

  • If \(x+y+z=1\), then the assignment failed to satisfy the size-three equation of weight 1. We assume that \(x=y=0\ne z\) holds, the other cases are symmetrical. Now we want to prove that under such constraints the full local cost of the tour T inside \(H^{3Q}_{\ }\)is always at least \(c^*_T(V^{3Q})\ge 6p_c\theta +11\), hence \(cr_T(V^{3Q})\ge 1\) to pay for the unsatisfied size-three equation. Let us discuss two subcases:

    • If there is an edge \(\gamma ^l\gamma ^r\in T\) for some \(\gamma \in \{x, y\}\), then we need to increase the cost about 2 for connecting a connected component on the vertex set \(\{\gamma , \gamma ^l, \gamma ^r\}\) to the rest of the tour. Hence \(c^*_T(V^{3Q})\ge 6p_c \theta +11\), therefore \(cr_T(V^{3Q})\ge 1\) in this case as it follows from (2).

    • If this is not a case, then the only way how the vertices \(x^r, x^l, y^r, y^l\) are balances with respect to T is \(e^l \gamma ^l, e^r\gamma ^r\in T\) for both \(\gamma \in \{x, y\}\). Because the vertices \(e^l\), \(e^r\) must also be balanced in T, we obtain that either \(e^r z^r, e^l z^l \in T\) or alternatively, the forced edges \({e^rs}_F\), \({e^ls}_F\) were used more than once together with \(z^r z^l\). Again, \(c^*_T(V^{3Q})\ge 6p_c\theta +11\), hence \(cr_T(V^{3Q})\ge 1\).

  • If \(x=y=z=1\), then similarly, the assignment failed to satisfy the size-three equation with weight 1 and we need to prove that \(c^*_T(V^{3Q})\ge 6p_c\theta +11\) for the tour T which includes both unforced edges for each vertex x, y, z.

If T contains all three edges \(x^lx^r\), \(y^ly^r\), \(z^lz^r\), then all the vertices \(\gamma \), \(\gamma ^l\), \(\gamma ^r\), for each \(\gamma \in \{x, y, z\}\) are balanced in T, therefore the edges \(\gamma ^re^r\), \(\gamma ^le^l\) can’t be in the quasi-tour T and the degree of \(e^r\), \(e^l\) must be balanced with multi-use of the forced edges \(e^rs_F\), \(e^ls_F\). Therefore, \(c^*_T(V^{3Q})\ge 6p_c\theta +11\).

If there exists \(\gamma \in \{x, y, z\}\) such that \(\gamma ^le^l\) or \(\gamma ^re^r\) are in T, then both \(\gamma ^le^l\), \(\gamma ^re^r\) must be in T and the vertex set \(\gamma , \gamma ^l, \gamma ^r\) (including the full cost of edges \(\gamma ^le^l\), \(\gamma ^re^r\)) contributes at least \(4+2p_c\theta \) to the cost of T, therefore \(c^*_T(V^{3Q})\ge 6p_c\theta +11\).

Summary for the case of x, y, z being honest contacts: if size-three equation is not satisfied then \( cr_T(V^{3S})\ge 1\); all incident cycle equations are either satisfied or are considered in Case A.2 of dishonested checkers.

Case B.2. At least one of the vertices x , y , z is dishonest

We start with the following observation.

Observation Suppose that a quasi-tour T is dishonest at a vertex \(\gamma \in \{x, y, z\}\). Then it is possible to extend the initial honest assignment defined for the set of honest variables from \(\{x, y, z\}\) (this can also be an empty set) such that the size-three equation \(x\oplus y\oplus z=0\) is satisfied, and there is sufficient credit to pay for one cycle equation incident to \(\gamma \). \(\square \)

Proof of Observation

If \(\gamma \) is incident to at least one dishonest checker then the corresponding cycle equation is covered in Case A.2, hence at most one incident cycle equation needs to be discussed here

If both cycle neighbours adjacent to \(\gamma \) are honest, as \(\gamma \) is the dishonest vertex, due to the balanced degree at \(\gamma \) in T, only one of the unforced edges from \(\gamma \) can be used in the quasi-tour T. Consequently, one of cycle neighbours of \(\gamma \) is set to 1 and the other to 0. Therefore we can chose any assignment for \(\gamma \) (e.g. the one satisfying the size-three equation) and show that we have enough credit to pay for one cycle equation associated with \(\gamma \). \(\square \)

In what follows we explain that for each dishonest variable \(\gamma \in \{x, y, z\}\) there is always sufficient credit to pay for one cycle equation. The discussion will depend on the number of the dishonest variables from \(\{x, y, z\}\).

Obviously, for any dishonest variable \(\gamma \in \{x, y, z\}\) only one of the two incident forced edges is used twice in a quasi-tour T. If this is not a case then one copy of the edges \(\gamma ^r \gamma _F\), \(\gamma ^l \gamma _F\) can be replaced by \(\gamma ^r \gamma ^l\) reducing the cost of T by \(2p_c\theta >0\).

  • Suppose all three vertices x, y, z use a forced edge twice in T. The cost of forced edges inside \(H^{3Q}\) is at least \(9(p_c\theta +0.5)+1\), and the unforced edges incident to x, y, z contribute 1.5 to the local edge cost in order to balance the degree at these vertices in T. Three vertices of the form \(\gamma ^\alpha \), for \(\gamma \in \{x, y, z\}\) and \(\alpha \in \{l, r\}\) have odd degree in T and any quasi-tour must use three unforced edges inside \(H^{3Q}\) to balance their degrees. Hence, \(c^*_T(V^{3Q})\ge 9p_c\theta +10\), and \(cr_T(V^{3Q})\ge 3p_c\theta \) due to (2). Therefore, if we chose an assignment satisfying the size-three equation, there is sufficient credit \(cr_T(V^{3Q})\ge 3p_c\theta \) to pay for 3 cycle equations which is enough due to the observation above.

  • Suppose exactly two of the vertices x, y, z use a forced edge twice in T, let’s assume x, y (the other cases are symmetrical). For the honest variable z we use the honest assignment: 1, if both unforced edges incident to z are used in T; 0 otherwise. Then obviously, the cycle equations for edges incident with z are either satisfied or are considered in Case A.2. The contribution of the forced edges inside \(H^{3Q}\) to the cost of T is at least \(8(p_c\theta +0.5)+1\), and unforced edges incident to x, y contributes to the local edge cost by 1. One of the vertices \(x^l\), \(x^r\) and one of \(y^l\), \(y^r\) have odd degree in T, therefore any quasi-tour needs to use two unforced edges from \(H^{3Q}\) to balance the degrees in T which increases the cost by 2.

Finally, the degree of the vertices \(z^l\), \(z^r\) must be balanced in the quasi-tour T. If two unforced edges incident to \(z^l\), \(z^r\) are in T, we have the full local cost \(c_T^*(V^{3Q})\ge 8p_c\theta +10\), hence \(cr_T(V^{3Q})\ge 2p_c\theta \). Otherwise if the edge \(z^lz^r\) is in T, then either two unforced edges incident on z are used or the quasi-tour pays 2 extra for a component, so again \(c^*_T(V^{3Q})\ge 8p_c \theta +10\) and \(cr_T(V^{3Q})\ge 2p_c\theta \).

Therefore we can select an assignment for x, y that together with the honest assignment for z satisfies the size-three equation in such a way that there is sufficient credit to pay for at most 2 cycle equations incident on x, y, which is enough following the observation above.

  • Suppose that only one of x, y, z has an incident forced edge used twice in T, let’s assume x (the other cases are symmetrical). For y and z we set the honest assignment and similarly to the previous case, the cycle equations for edges incident to y and z are either satisfied or are considered in Case A.2.

The contribution of the forced edges to the full local cost of T is at least \(7(p_c\theta +0.5)+1\), and the local edge cost 0.5 for one unforced edge incident to x.

Also, at least one unforced edge incident on \(x^l\) or \(x^r\) is used, since one of them has odd degree in T. For \(y^l\), \(y^r\) either two unforced edges are used to balance their degree in T, or if \(y^ly^r\in T\), then either unforced edges incident on y contribute 1 (or the quasi-tour pays extra for a component), hence the contribution to the cost to balance the degree of the vertices \(y^l\), \(y^r\) is at least 2. With the same reasoning for \(z^l\), \(z^r\), we get that the full local cost is \(c_T^*(V^{3Q}_j)\ge 7p_c\theta +10\), hence \(cr_T(V^{3Q})\ge p_c\theta \).

Now selecting an assignment for x that satisfies the size-three equation, we have sufficient credit to pay for one cycle equation incident on x which is enough due to observation above.

Summary for the quasi-tour with a dishonest vertex: we have an option to choose an assignment for x, y, z that satisfies the size-three equation; and the weight of unsatisfied incident cycle equations that are not considered in Case A.2 can be bounded from above by \(\frac{1}{\theta }cr_T(V^{3Q})=\max \{1, p_m\}\cdot cr_T(V^{3Q})\).

Let us now conclude our analysis. To calculate the cost of the quasi-tour T, the vertex set is partitioned into two types of gadget-based subgraphs covering the entire set V which were considered separately:

  • Case A: the size of 2 gadgets \(H^{2M}\) (‘matching pairs’), there are \(\frac{3}{2}(\tau -1)m\) such gadgets;

  • Case B: the size of 11 gadgets \(H^{3Q}\) (‘size-three equation’), there are m such gadgets; the edges incident to s are counted in this case even if s is not part of the gadget.

Following the discussion in both cases, the cost of any quasi-tour must be at least \( \bigl (\frac{3}{2} (\tau -1)(4p_c\theta +1)+6p_c\theta +10\bigr )m\) and following the assumption of the theorem, the sum of full local costs must be

$$\begin{aligned}c^*_T(V)\le \biggl (\frac{3}{2} (\tau -1)(4p_c\theta +1)+6p_c\theta +10\biggr )m +\varDelta ,\end{aligned}$$

where the additional cost 2 may need to be added to connect the component with the singleton s to the rest of the tour. Therefore the sum of all credits \(cr_T\) is at most \(\varDelta \). Since we have already argued that the \(\max \{1, p_m\}\sum cr_T\) bounds from above the weight of all equations unsatisfied by the defined assignment in both cases locally, this concludes the proof. \(\square \)

Theorem 4

If \((p_c, p_m, \tau )\) is an admissible triple then it is \(\mathrm{NP}\)-hard to approximate the Travelling Salesman problem to within any constant approximation ratio less than

$$\begin{aligned}1+ \frac{1}{3(\tau -1)(4p_c+\max \{1, p_m\})+12p_c+20 \max \{1, p_m\}}.\end{aligned}$$

Proof

Let \(\varepsilon \in (0, \frac{1}{4})\). Consider a \((2k, \tau )\)-bi-wheel with large enough k, which is an amplifier with cycle weights \(p_c\) and matching weights \(p_m\). We have instances of Hybrid\((W_{k,\tau },p)\) with \(\nu \) copies of a bi-wheel \((W_{k,\tau },p)\), m equations of the form \(x\oplus y\oplus z=0\) each of weight 1, \(3\tau m\) equations of the form \(x\oplus y=1\) each of weight \(p_m\) with the following \(\mathrm{NP}\)-hard gap results: It is \(\mathrm{NP}\)-hard to decide whether there is an assignment to the variables that leaves unsatisfied equations of weight at most \(\varepsilon m\), or every assignment to the variables leaves unsatisfied equations of weight at least \((0.5 -\varepsilon )m\). Due to Lemma 1 and 2 we now know that for produced instances G[J] of TSP it is NP-hard to decide whether there is a tour with cost at most \(\bigl (\frac{3}{2}(\tau -1)(4p_c\theta +1)+6p_c\theta +10\bigr )m +2\nu +\varepsilon m\), where \(\theta =\frac{1}{\max \{1, p_m\}}\) or all tours have cost at least \(\bigl (\frac{3}{2}(\tau -1)(4p_c\theta +1)+6p_c\theta +10\bigr )m +(0.5-\varepsilon )m\cdot \theta -2\).

The ratio between these two cases can get arbitrarily close to

$$\begin{aligned}1+\frac{1}{3(\tau -1)(4p_c+\max \{1, p_m\})+12p_c+20\max \{1, p_m\}}\end{aligned}$$

by appropriate choices of \(\varepsilon >0\) and large enough k.\(\square \)

5 Random construction of bi-wheel amplifiers

As it follows directly from Theorem 4, the better bounds for the random constructions of bi-wheel amplifiers can directly improve lower bounds for approximability of the TSP. In this section we only focus on case \(p_c<1\) which will be enough to improve known lower bounds for TSP.

Theorem 5

The triple \((p_c=\frac{1}{2}, p_m=1, \tau =11 )\) is admissible, hence for every large enough \(k\ge k_0\) there is a (2k, 11)-bi-wheel that is an amplifier with cycle weights \(p_c=\frac{1}{2}\) and matching weights \(p_m=1\).

Proof

Let an integer \(k>0\) be large enough. Consider a bi-wheel \(W_{k, 11}\) with two disjoint cycles, each on 11k vertices,

$$\begin{aligned}V^u=\{1^u, 2^u, \dots , (11k)^u\} \text { and } V^n=\{1^n, 2^n, \dots , (11k)^n\}\end{aligned}$$

with sets of k contacts \(D=D^u\cup D^n\), where

$$\begin{aligned}D^u=\{11^u, (11\cdot 2)^u, \dots , (11\cdot k)^u\} \text { and }D^n=\{11^n, (11\cdot 2)^n, \dots , (11\cdot k)^n\},\end{aligned}$$

the remaining vertices \(V^u{\setminus }D^u\) and \(V^n{\setminus }D^n\) are checkers. That means, that any two consecutive contacts in the bi-wheel cycles are separated by a chain of 10 checkers. For a given subset \(A \subseteq V\), the fragments of A are all connected components of A within cycles of \(V^u\) and \(V^n\) together.

Now select uniformly at random a perfect matching from checkers \(V^u{\setminus }D^u\) to checkers \(V^n{\setminus }D^n\). We claim that with a high probability a bi-wheel created in this way is a weighted amplifier with weights \(p_c=\frac{1}{2}\) and \(p_m=1\).

Such a randomly constructed bi-wheel fails to have required amplification properties if the system of so-called bad sets,

$$\begin{aligned}{ \mathscr {B}}:=\{B\subseteq V: p(E(B, \overline{B})) < \min \{|D\cap B|, |D \cap \overline{B}| \}\end{aligned}$$

is nonempty. If \({ \mathscr {B}}\ne 0\), it also contains a minimal bad set, where by minimal we mean an element of \({ \mathscr {B}} \) that is minimal with respect to the partial order \(\preceq \) defined for any \(A, B\subseteq V\) as:

$$\begin{aligned}A \preceq B \text { iff } | (|(A \triangle B) \cap D| \le p(E(B, \overline{B})) - p(E(A, \overline{A}))\end{aligned}$$

and if \(A \cap D=B\cap D\) and \(p(E(B, \overline{B}))=p(E(A, \overline{A}))\) then the number of fragments of A is at most the number of fragments of B.

These minimal bad sets have certain more restrictive properties that allow better estimation of the upper bound of probability that such sets B exist in a randomly constructed bi-wheel. These results are derived in details in Chlebík and Chlebíková (2003) for the case of wheel-amplifiers. The methods and results can be extended in straightforward way to the weighted bi-wheel case as well.

Let us mention some of these properties: given a bad set B, then

  • Fact 1: Any fragment of a minimal bad set B (and of \(\overline{B}\)) contains at least 2 vertices.

  • Fact 2: If an end vertex of a fragment of a minimal bad set B is a checker x, then this checker is matched with a vertex from B as well. The same is true for fragments of \(\overline{B}\).

In our strategy to prove that probability that a minimal bad set exists is \(<1\), we need to understand how for a given subset \(S \subseteq V {\setminus }D\) of checkers to estimate from above the probability (over the random matchings of \(V^u{\setminus }D^u\) to \(V^n{\setminus }D^n\)) that S is the trace of a minimal bad set B in the checker vertices \(V{\setminus }D\), i.e., \(S= B{\setminus }D\).

We want to take the sum of these probabilities over all potentially minimal bad sets and prove that the sum is \(<1\). It follows that with high probability, no set is a minimal bad set and therefore a randomly constructed bi-wheel is a weighted amplifier with parameters (\(p_c=\frac{1}{2}\), \(p_m=1\), \(\tau =11\)).

Now suppose that \(S\subseteq V{\setminus }D\) with \(|S|=w\), what is the probability that there are exactly c matching edges with exactly one endpoint in S?

We can observe that the most interesting case is the balanced one, namely when S contains equal (or nearly equal) number of checkers from each side \(V^u\), \(V^n\), as follows:

Suppose that S contains \(\frac{w}{2}+l\) checkers on one side, and \(\frac{w}{2}-l\) checkers on the side, \(l\ge 0\). The probability that exactly c matching edges have exactly one endpoint in S is

$$\begin{aligned} P&(w, c, l)=\\&= \left( {\begin{array}{c}\frac{w}{2} +l\\ \frac{c}{2} +l\end{array}}\right) \left( {\begin{array}{c}\frac{w}{2} -l\\ \frac{c}{2} -l\end{array}}\right) \left( {\begin{array}{c}10k-\frac{w}{2} -l\\ \frac{c}{2} -l\end{array}}\right) \left( {\begin{array}{c}10k-\frac{w}{2} +l\\ \frac{c}{2} +l\end{array}}\right) \times \\&\qquad \qquad \times \left( \frac{c}{2}+l\right) !\left( \frac{c}{2}-l\right) ! \frac{\left( \frac{w}{2} -\frac{c}{2}\right) !\left( 10k-\frac{w}{2}-\frac{c}{2}\right) !}{(10k)!} \end{aligned}$$

We observe that in such scenario to create exactly c matching edges between S and \(\overline{S}\) we need to select more endpoints, \(\frac{c}{2} +l\), on the side where S is larger, since after we remove checkers matched from S to \(\overline{S}\) we must have a perfect matching for the rest of S (and \(\overline{S}\)).

Interestingly, P(wcl) achieves maximum for \(l=0\). To see that, one can simplify \(\frac{P(w, c, l+1)}{P(w, c, l)}\) as \(\left( 1-\frac{2l+1}{\frac{c}{2} +l+1}\right) \left( 1+\frac{2l+1}{\frac{11}{2} -l}\right) \left( 1+\frac{2l+1}{10k-\frac{w}{2} -l}\right) \) and to prove that this quotient is for relevant range of parameters less than \(<1\) (using the fact \(1+x\le e^x\)).

We consider several parameters for a potentially minimal bad set such as \(\min \{|D \cap B|, |D \cap \overline{B}|\} \); the number of fragments of B in cycles \(V^u\) and \(V^n\) together (this is the same as the number of fragments of \(\overline{B}\)). As \(p(E(B, \overline{B}))< \min \{|D \cap B|, |D \cap \overline{B}| \}\), it also gives estimate \(f< \min \{|D \cap B|, |D \cap \overline{B}|\}\).

Also, with \(S:=B{\setminus }D\) we have an upper bound on the possible c for which we have exactly c matching edges with exactly one endpoint in S, namely

$$\begin{aligned}c+f < \min \{|D \cap B|, |D \cap \overline{B}|\}.\end{aligned}$$

For any feasible vector of parameters we have in hand the probability P(wcl) computed above. Using restrictions of a minimal bad set we estimate from above the number of ways how such a bad set can be generated. We use simple sum of probabilities for the union bound. Using Stirling’s formula and the binary entropy function for bounding \(\frac{1}{k}\log \) of that probability, we can see that it stays negative and bounded away from zero.

This will complete the proof that with high probability we constructed bi-wheel above amplifiers with parameters (\(p_c=\frac{1}{2}\), \(p_m=1\), \(\tau =11\)).\(\square \)

6 Conclusion

The methods of this paper provide a new motivation for the study of expanding properties of random graphs. As we have demonstrated, introducing the parametrised weighted amplifiers and weighted low occurrence Constraint Satisfaction problems as intermediate steps in the NP-hard gap reductions, allows more flexibility in fine-tuning their expanding parameters. We show that already slight improvement of known expander values modestly improve the hardness of approximation for TSP from the current best value \(\frac{123}{122}\) (Karpinski et al. 2015) to the new value \(\frac{117}{116}\). The introduced method of weighted amplifiers (or expanders) can be of independent interest. Such technique could be used in the gap preserving reductions for other edge-weighted optimisation problems to improve their approximation hardness results.