1 Introduction

The rapid growth of innovation applications such as smart agriculture, smart factories, and intelligent traffic monitoring system has triggered an explosive growth of internet-of-things (IoT) devices [1, 2]. Nevertheless, IoT devices are usually limited by batteries and computing power, or even no computing power, which makes IoT devices unable to process data. To solve these problems, mobile edge computing (MEC) has been proposed as a promising solution [3,4,5,6]. The main idea of MEC is to complete the computation-intensive and energy-intensive tasks of IoT devices at the edge nodes, such as access points (APs) or base stations, through task offloading. After the MEC node completes the data processing, the calculation results (e.g., working or operation instructions to the IoT terminals) will be returned to the devices. In this regard, the energy consumption of the devices can be significantly reduced, which can reduce the device maintenance costs in actual IoT usage scenarios. MEC enabled mission-critical IoT system has been taken as a key enabler for the sustainability of wireless cellular networks.

Extensive work has been done to study the joint task offloading and radio resource optimization for MEC-enabled IoT systems [7,8,9]. Specifically, in [7], the total summation of the hovering energy and the computation energy minimization problem for the MEC system assisted by unmanned aerial vehicle (UAV) was studied. Therein, the hovering time, scheduling and resource allocation for IoT users are jointly optimized. In addition, the authors in [8] solved the problem of offloading decision for multiple users from the perspective of game theory. By designing the optimal response algorithm, it is proved that there is a Nash equilibrium among users. In addition, in the literature [9], the offloading of the MEC network supporting Orthogonal Frequency Division Multiple Access (OFDMA) and the joint uplink and downlink resource allocation problems were studied. Therein, several sub-optimal solutions that achieve different leverages between the system performance and the time complexity are explored. Recently, machine learning aware methods have been widely utilized to make task offloading decisions on MEC networks [10,11,12]. More precisely, Yang et al. studied the joint communication and computation resource allocation for non-orthogonal multiple access (NOMA) assisted MEC systems in [10]. A multi-agent Q-learning empowered algorithm is developed to make offloading decisions for multiple NOMA users. Moreover, Huang et al. in [11] explored a deep reinforcement learning algorithm to perform the online computation offloading for wireless powered MEC systems. It was explicitly shown that the designed learning assisted scheme makes the real-time and optimal offloading become a reality. Furthermore, Wu et al. [12] investigated the combination between the optimization method and the Q-learning strategy to optimize the joint offloading and resource allocation problem in MEC systems, while considering the time-varying channel and unknown users’ channel state information.

However, the aforementioned work [7,8,9,10,11,12] mainly focused on the conventional packet transmission scheme, in which the Shannon capacity formula was applied. Nevertheless, for mission-critical IoT, enabling the transmission mechanism of the conventional block length mechanism may cause a longer delay, which further shortens the battery life of IoT devices [13]. To meet the stringent requirements of latency-sensitive applications, a MEC network assisted by short packet transmission is proposed. For which, several remarkable attempts are worth to be introduced [14,15,16]. More specifically, the authors in [14] studied the offloading decision for MEC systems supporting single-carrier time division multiple access. Meanwhile, in [15, 16], the resource allocation for MEC systems supporting OFDMA was analyzed with short packet transmission consideration. In particular, a deep learning based approach was utilized in [15] to allocate online resources for MEC users to minimize the maximum normalized energy consumption of each user. In addition, the optimal resource management of OFDMA-MEC system for ultra-reliable low-latency communication (URLLC) was studied in [16]. The goal in [16] was to minimize the end-to-end latency of the network. Thereof, the optimal joint optimization solution based on branch and bound method and two time-efficient suboptimal solutions are designed, respectively. Furthermore, the interplay between URLLC and enhanced mobile broadband (eMBB) has been studied by Yang et al. in [17], where the energy efficiency and the sum of received SNRs are maximized for eMBB users and URLLC users, respectively. The formulated multiple timescales problem has been solved by a sample average approximation (SAA) oriented technique. Similar models to that in [17] have been investigated by Tang et al. [18] and Anand et al. [19] to minimize the URLLC users’ latency.

Since the next generation of wireless cellular networks has strict requirements on the energy efficiency and transmission efficiency (low latency) [20,21,22,23], in this paper, we investigate the short-packet transmission enabled data offloading between IoT devices and access points, aiming at minimizing the total energy consumption. Our main contributions are summarized as follows:

  • We formulated the energy cost minimization problem for the short packets transmission for mission-critical IoT in MEC system, and revealed the effect of short packet transmission on radio resource management. The minimization problem is a mixed integer non-linear programming (MINLP) problem, and it is challenging to solve this problem in the best way. The difficulty is essentially derived from the coupling of offloading strategy and resource optimization.

  • For ease of tractability, we decoupled the original MINLP problems into two sub-problems: 1) the offloading decision-making sub-problem and 2) the resource allocation sub-problem. It clearly demonstrated that the resource allocation sub-problem of IoT devices under the fixed offloading policy is a convex optimization problem, and its optimal solution can be realized by various convex optimization tools.

  • Through the previous analysis, a time-saving algorithm that works in an iterative manner is developed to jointly optimize the offloading strategy and the resource (i.e., computation, power and bandwidth), so as to have a provable convergence guarantee.

Apart from the above contributions, Monte-Carlo simulations were also conducted to verify the effiectiveness of our proposed joint optimization scheme from various aspects, such as the convergence and the energy consumption, under extensive and comprehensive system settings.

The rest of this paper is summarized as follows: in Sect. 2, we introduce the considered MEC-enabled mission-critical IoT system model and discuss the total energy consumption of the system. In Sect. 3, the problem of minimizing energy cost is formulated, which considers the joint optimization of the offloading decision, bandwidth, and transmit power. In Sect. 4, we propose a time-efficient near optimal algorithm to solve the non-tractable MINLP problem. Monte-Carlo simulation is performed in Sect. 5 to evaluate the performance of our developed strategy in terms of convergence performance and total energy consumption. Finally, we conclude this paper and reveal the future research direction.

2 System description and energy consumption

In this section, we first introduce the system model of the mobile edge computing (MEC) enabled mission-critical internet of things (IoT) network. Whereafter, we elaborate on the achievable data rate of each IoT device and the system’s energy cost.

2.1 System model

We consider the MEC based IoT network that consists of N APs serving K devices, as illustrated by Fig. 1. Let \({\mathcal {N}}=\{1,2,\ldots ,N\}\) and \({\mathcal {K}}=\{1,2,\ldots ,K\}\) be the index sets of all the APs and the IoT devices, respectively. Each of the IoT devices has a latency-critical task to process. Denote by \(T_k\) the delay requirement of the kth IoT terminal, where \(k\in {\mathcal {K}}\). The APs provide computing services to all the devices as the IoT devices have no data processing capability in the context of this article.

Fig. 1
figure 1

MEC-enabled mission-critical IoT system model

In addition, it is assumed that both the APs and the devices are equipped with a single antenna. For \(k\in {\mathcal {K}}\), we use a three-dimension tuple to characterize the task of the kth device, referred to as \(({\mathbb {U}}_k,{\mathbb {F}}_k,{\mathbb {D}}_k)\). Specifically, \({\mathbb {U}}_k\) indicates the data volume of device k that needs to be offloaded/computed. In addition, \({\mathbb {F}}_k\) (in CPU cycles) denotes the computation resource required by the kth device to execute its data. Moreover, \({\mathbb {D}}_k\) represents the output data size of device k after the processing is completed at its selected AP.

We stipulate that each device chooses one of the APs to offload its task for further process.Footnote 1 To characterize this feature, we define an auxiliary variable, referred to as \(a_{k,n} \in \{0,1\}\), as the indicator to show whether the kth device uploads its data to AP n or not, where \(k\in {\mathcal {K}}\) and \(n\in {\mathcal {N}}\). In detail, \(a_{k,n}=1\) if device k is associated with the nth AP and \(a_{k,n}=0\) otherwise. With aforementioned definitions, we have the following constraint,

$$\begin{aligned} {\sum _{n\in {\mathcal {N}}} a_{k,n}=1,} \end{aligned}$$
(1)

where \(k\in {\mathcal {K}}\). Before ending this subsection, we define \(\tau _{\text {U}}\) and \(\tau _{\text {D}}\) as the offloading transmission time and the downloading transmission time of IoT devices, respectively.

2.2 Achievable capacity

As mentioned, we focus on the short packet transmission between the IoT devices and the APs. The upload and download transmission rates of the kth device are denoted by \(R_{k,n}^{\text {U}}\) and \(R_{k,n}^{\text {D}}\), respectively, therein we assume that device k is associated with AP n. In addition, for \(k\in {\mathcal {K}}\) and \(n\in {\mathcal {N}}\), define \(g_{k,n}\) as the channel gain between IoT device k and AP n. Based on [24, 25], the maximum achievable rate in short packet regime can be accurately approximated by

$$\begin{aligned} R_{k,n}^{\text {U}} \approx \frac{W_{k,n}^{\text {U}}}{\ln 2} \left[ \ln (1+\frac{g_{k,n}p_{k,n}^{\text {U}}}{\delta N_0W_{k,n}^{\text {U}}} )-\sqrt{\frac{V_{k,n}^{\text {U}}}{\tau _{\text {U}}W_{k,n}^{\text {U}}}}Q^{-1}_G(\varepsilon _k)\right] , \end{aligned}$$
(2)

where \(W_{k,n}^{\text {U}}\) and \(p_{k,n}^{\text {U}}\) indicate the assigned bandwidth and transmit power to device k for offloading data to AP n, respectively. \(\delta\) represents the signal-to-noise-ratio (SNR) loss due to imperfect channel state information (CSI) at the transmitter [26], \(N_0\) depicts the single-side noise power spectrum density. Moreover, \(\varepsilon _k\) is the decoding error probability of device k. Furthermore, \(Q^{-1}_G(\cdot )\) expresses the inverse of Gaussian Q-function and \(V_{k,n}^{\text {U}}\) is the channel dispersion of device k [27], which is quoted as follows:

$$\begin{aligned} {V_{k,n}^{\text {U}}=1-\frac{1}{(1+\frac{g_{k,n}p_{k,n}^{\text {U}}}{\delta N_0W_{k,n}^{\text {U}}})^2}.} \end{aligned}$$
(3)

Note that \(V_{k,n}^{\text {U}}\) is approximated to 1 in the following. Such an approximation is very accuracy, when the received SNR is higher than 5 dB, which has been precisely validated by the authors in [25]. Details are omitted here to avoid redundancy.

2.3 Energy consumption analysis

In this subsection, we elaborate on the energy consumption induced by the IoT devices’ data offloading and the computation results downloading, as well as the data processing at the APs, respectively.

2.3.1 Energy consumption of offloading and downloading

We assume that device k selects AP n to do the data offloading. With aforementioned definitions, we have

$$\begin{aligned} R_{k,n}^{\text {U}}\tau _{\text {U}} \ge {\mathbb {U}}_k. \end{aligned}$$
(4)

Substituting (2) into (4), the transmit power of device k, i.e., \(p_{k,n}^{\text {U}}\), should satisfy

$$\begin{aligned} p_{k,n}^{\text {U}}&\ge \frac{\delta N_0W_{k,n}^{\text {U}}}{g_{k,n}} \left\{ \exp ( \frac{{\mathbb {U}}_k \ln 2}{\tau _U W_{k,n}^{\text {U}}}+\frac{Q^{-1}_G(\varepsilon _k)}{\sqrt{\tau _{\text {U}} W_{k,n}^{\text {U}}}} ) -1 \right\} \\&=f_k^{\text {U}}(W_{k,n}^{\text {U}}). \end{aligned}$$
(5)

For simplicity, we use \(f_k^{\text {U}}(W_{k,n}^{\text {U}})\) to represent the lower bound of \(p_{k,n}^{\text {U}}\). It is noteworthy that \(f_k^{\text {U}}(W_{k,n}^{\text {U}})\) is non-convex with respect to (w.r.t.) \(W_{k,n}^{\text {U}}\) [25]. In a similar way, the constraint for the downlink phase can be expressed as follows:

$$\begin{aligned} R_{k,n}^{\text {D}}\tau _{\text {D}} \ge {\mathbb {D}}_k, \end{aligned}$$
(6)

from which the allocated transmission power to device k in the download phase can be obtained, namely,

$$\begin{aligned} p_{k,n}^{\text {D}}&\ge \frac{\delta N_0W_{k,n}^{\text {D}}}{g_{k,n}} \left\{ \exp ( \frac{{\mathbb {D}}_k \ln 2}{\tau _{\text {D}} W_{k,n}^{\text {D}}}+\frac{Q^{-1}_G(\varepsilon _k)}{\sqrt{\tau _{\text {D}} W_{k,n}^{\text {D}}}} ) -1 \right\} \\&= f_k^{\text {D}}(W_{k,n}^{\text {D}}), \end{aligned}$$
(7)

where \(W_{k,n}^{\text {D}}\) and \(p_{k,n}^{\text {D}}\) express the allocated bandwidth and transmit power for device k to download the computation results from AP n, respectively. It is not hard to check that \(f_k^{\text {D}}(W_{k,n}^{\text {D}})\) is also a non-convex function w.r.t. \(W_{k,n}^{\text {D}}\).

Although \(f_k^{\text {U}}(W_{k,n}^{\text {U}})\) and \(f_k^{\text {D}}(W_{k,n}^{\text {D}})\) are non-convex w.r.t. \(W_{k,n}^{\text {U}}\) and \(W_{k,n}^{\text {D}}\), they have some interesting properties as quoted below [13]:

Property 1

There is a unique solution \({\tilde{W}}_{k,n}^{U}\) (\({\tilde{W}}_{k,n}^{D}\)) that minimizes \(f_{k}^U(W_{k,n}^U)\) (\(f_{k}^D(W_{k,n}^D)\)). In addition, \(f_{k}^U(W_{k,n}^U)\) (\(f_{k}^D(W_{k,n}^D)\)) is strictly convex w.r.t. \(W_{k,n}^{U}\) (\(W_{k,n}^{D}\)) when \(0<W_{k,n}^{U} \le {\tilde{W}}_{k,n}^{U}\) (\(0<W_{k,n}^{D} \le {\tilde{W}}_{k,n}^{D}\)).

Based on the foregoing property, we have the following Lemma:

Lemma 1

The global optimal solution of problem:

$$\begin{aligned} \min _{W_{k,n}^U,W_{k,n}^D,p_{k,n}^U,p_{k,n}^D} {\mathscr {F}}(W_{k,n}^U,W_{k,n}^D,p_{k,n}^U,p_{k,n}^D) \end{aligned}$$

with constraints \((3),\,(4)\,\text {and}\,W_{k,n}^m>0,\, p_{k,n}^m>0, \,m\in \{U, D\}\), can be obtained via solving the optimization problem with objective function

$$\begin{aligned} \min _{W_{k,n}^{\text {U}},W_{k,n}^{\text {D}},p_{k,n}^U,p_{k,n}^D} {\mathscr {F}}(W_{k,n}^U,W_{k,n}^D,p_{k,n}^U,p_{k,n}^D), \end{aligned}$$

subjecting to

$$\begin{aligned} (3),\,(4),\,W_{k,n}^m>0,\, p_{k,n}^m>0,\, W_{k,n}^m \le {\tilde{W}}_{k,n}^{m}, \,m\in \{U, D\}, \end{aligned}$$

given that the objective function \({\mathscr {F}}\) is increases with bandwidth \(W_{k,n}^m\) and transmit power \(p_{k,n}^m\), where \(m\in \{U, D\}\).

The aforementioned conclusions have been proved by [13]. Details are omitted here for brevity. Denote by \(E_{k,n}^{\text {U}}\) the energy cost of device k in terms of offloading data to the nth AP. In addition, let \(E_{k,n}^{\text {D}}\) be the energy consumption of the nth AP to feedback the computation results to device k. As a consequence, we have

$$\begin{aligned} E_{k,n}^{\text {U}}=a_{k,n}p_{k,n}^{\text {U}}\tau _{\text {U}}, \,k\in {\mathcal {K}}, \end{aligned}$$
(8)

and

$$\begin{aligned} E_{k,n}^{\text {D}}=a_{k,n}p_{k,n}^{\text {D}}\tau _{\text {D}}, \,k\in {\mathcal {K}}. \end{aligned}$$
(9)

2.3.2 Energy cost of APs for data processing

Define \(L_{k,n}\) as the assigned computation resource for IoT device k by AP n. Thereby, the execution delay of device k, denoted by \(\tau _{k,n}\), is obtained as follows:

$$\begin{aligned} \tau _{k,n} ={\mathbb {F}}_k/{L_{k,n}}, \,k\in {\mathcal {K}}. \end{aligned}$$
(10)

It is assumed that the AP executes its associated devices’ tasks in a sequential manner.Footnote 2 Thereby, the following latency requirement should be met:

$$\begin{aligned} a_{k,n} (\tau _{\text {U}}+\tau _{\text {D}}+\sum _{k\in {\mathcal {K}}}\tau _{k,n} )\le T_k,\,n\in {\mathcal {N}},\,k\in {\mathcal {K}}. \end{aligned}$$
(11)

In addition, define \(L_n\) (in cycles) as the computation capacity of AP n, where \(n \in {\mathcal {N}}\). With aforementioned definitions, we have

$$\begin{aligned} \sum _{k\in {\mathcal {K}}}a_{k,n}L_{k,n} \le L_n,\,n\in {\mathcal {N}}. \end{aligned}$$
(12)

The energy consumed by data processing for device k at AP n, denoted by \(E_{k,n}^{\text {P}}\), can be expressed as follows:

$$\begin{aligned} E_{k,n}^{\text {P}}=a_{k,n}\alpha _n {\mathbb {F}}_k(L_{k,n})^2, \end{aligned}$$
(13)

where \(\alpha _n\) indicates the effective switched capacitance, which is a hardware architecture dependent parameter [7]. Following the system setup in [7], we let \(\alpha _n=10^{-27}\) for \(n\in {\mathcal {N}}\).

Define \(E_{\text {Total}}\) as the total energy expenditure of the whole system. With the foregoing analysis, we have

$$\begin{aligned} E_{\text {Total}}=\sum _{k\in {\mathcal {K}}}\sum _{n\in {\mathcal {N}}}(E_{k,n}^{\text {U}}+E_{k,n}^{\text {D}}+E_{k,n}^{\text {P}}), \end{aligned}$$
(14)

where \(E_{k,n}^{\text {U}}\), \(E_{k,n}^{\text {D}}\), and \(E_{k,n}^{\text {P}}\) have been given in (8), (9), and (13), respectively.

3 Problem formulation

In this section, we discuss the considered optimization problem formulation. Some definitions are given at first. Denote by \({\varvec{W}}=(W_{k,n}^{\text {U}},W_{k,n}^{\text {D}})_{k\in {\mathcal {K}}, n\in {\mathcal {N}}}\) the bandwidth allocation for all the devices. In addition, define \({\varvec{p}}=(p_{k,n}^{\text {U}},p_{k,n}^{\text {D}})_{k\in {\mathcal {K}}, n\in {\mathcal {N}}}\) as the power allocation for all the IoT devices. Besides, denote \({\varvec{L}}=(L_{k,n})_{k\in {\mathcal {K}}, n\in {\mathcal {N}}}\) as the computation resource allocation strategy of the system. Moreover, let \({\varvec{a}}_k=(a_{k,n})_{n\in {\mathcal {N}}}\) and \({\varvec{a}}=({\varvec{a}}_k)_{k\in {\mathcal {K}}}\) be the offloading decision vector of device k and the system, respectively. Furthermore, denote by W the total system bandwidth, while let \(P_n\) be the transmit power budget of AP n, where \(n\in {\mathcal {N}}\). In this work, we aim to minimize the total energy cost via jointly optimizing the offloading decision, computation resource, bandwidth, as well as the transmit power, taking into account the total computation capability, bandwidth and power constraints. The minimization problem, referred to as \(\mathbf{P }(0)\), is mathematically formulated as follows:

$$\begin{aligned} \mathbf{P }(0):\,\min _{{\varvec{W}},{\varvec{p}}, {\varvec{L}},{\varvec{a}}} E_{\text {Total}} \end{aligned}$$
(15)

s.t.

$$\begin{aligned} &\text {C1}:\,p_{k,n}^{\text {U}} \ge a_{k,n}f_{k}^{\text {U}} (W_{k,n}^{\text {U}} ),\, k\in {\mathcal {K}},\,n\in {\mathcal {N}}, \\&\text {C2}:\,p_{k,n}^{\text {D}} \ge a_{k,n}f_{k}^{\text {D}} (W_{k,n}^{\text {D}} ),\, k\in {\mathcal {K}},\,n\in {\mathcal {N}}, \\&\text {C3}:\,a_{k,n}( \tau _{\text {U}}+\tau _{\text {D}}+\sum _{k\in {\mathcal {K}}}\tau _{k,n}) \le T_k,\,n\in {\mathcal {N}},\,k\in {\mathcal {K}},\\&\text {C4}:\,\sum _{k\in {\mathcal {K}}}a_{k,n}W_{k,n}^{\text {U}} \le W,\,n\in {\mathcal {N}}\\&\text {C5}:\,\sum _{k\in {\mathcal {K}}}a_{k,n}W_{k,n}^{\text {D}} \le W,\,n\in {\mathcal {N}}\\&\text {C6}:\,\sum _{k\in {\mathcal {K}}}a_{k,n}p_{k,n}^\text {D}\le P_{n}, \,n\in {\mathcal {N}},\\&\text {C7}:\,\sum _{k\in {\mathcal {K}}}a_{k,n}L_{k,n} \le L_n,\,n\in {\mathcal {N}},\\&\text {C8}:\,p_{k,n}^{\text {U}} \ge 0,\,k\in {\mathcal {K}},\,n\in {\mathcal {N}},\\&\text {C9}:\,p_{k,n}^{\text {D}} \ge 0,\,k\in {\mathcal {K}},\,n\in {\mathcal {N}},\\&\text {C10}:\,W_{k,n}^{\text {U}} \ge 0,\,k\in {\mathcal {K}},\,n\in {\mathcal {N}},\\&\text {C11}:\,W_{k,n}^{\text {D}} \ge 0,\,k\in {\mathcal {K}},\,n\in {\mathcal {N}},\\&\text {C12}:\,a_{k,n}\in \{0,1 \}, \,k\in {\mathcal {K}}, n\in {\mathcal {N}}, \\&\text {C13}:\,\sum _{n\in {\mathcal {N}}}a_{k,n}=1, \,k\in {\mathcal {K}}, \end{aligned}$$

where C1 and C2 indicate the power constraints of the offloading and the downloading phases, respectively. C3 represents the latency requirement of each IoT device. C4 as well as C5, and C6 express the total bandwidth requirement and the power budget of each AP, respectively. C7 gives the computation resource constraint of each AP. C8–C9 and C10–C11 depict the non-negativity of the transmit power and the allocated bandwidth, respectively. C12 illustrates the binarity of the offloading decision. Moreover, C13 indicates that each device can only select one AP to do the task offloading. The energy minimization problem \(\mathbf{P }(0)\) is a mixed-integer non-linear programming (MINLP) problem, which is in general NP-hard and is difficult to solve [7]. For ease of tractability, in the following section, a time-efficient sub-optimal algorithm with ensured convergence performance is proposed to do the joint offloading and resource management for IoT devices.

4 Methodology of resource management

In this section, we first demonstrate that the resultant resource allocation problem under given offloading decision is convex. With which, we show that the optimal solution of \(\mathbf{P }(0)\) can be obtained via an exhaustive search oriented method, which suffers from a worst-case exponentially increased time complexity. Subsequently, a time-efficient joint offloading decision and resource allocation algorithm is proposed, which works in an iterative manner and has ensured convergence performance.

4.1 Resource allocation with fixed offloading decision

Given the offloading strategy of each IoT device, the original energy minimization problem \(\mathbf{P }(0)\), can be re-written as a resource management problem, denoted by \(\mathbf{P }(1)\), and it is given as follows:

$$\begin{aligned} \mathbf{P }(1):\,\min _{{\varvec{W}}, {\varvec{p}}, {\varvec{L}}} E_{\text {Total}} \end{aligned}$$
(16)

subject to C1 to C11.

Lemma 2

\(\mathbf{P }(1)\) is a convex optimization problem.

Proof

To show that problem \(\mathbf{P }(1)\) is convex, we only need to clarify the convexity of the objective function as well as the constraints C1 and C2 since the convexity of the other constraints is obvious.

According to (8), (9), and (13), it is easy to check that \(E_{k,n}^{\text {U}}\), \(E_{k,n}^{\text {D}}\) and \(E_{k,n}^{\text {P}}\) are all convex w.r.t. the variables \(({\varvec{W}},{\varvec{p}},{\varvec{L}})\). Namely, the objective function is convex. In addition, it increases with \({\varvec{W}}\) and \({\varvec{p}}\), which means C1 and C2 are equivalent to the constraints C1, C2 and \(W_{k,n}^m\le {{\tilde{W}}_{k,n}^{m}}\), where \(m\in \{{\text {U}},{\text {D}}\}\), according to Lemma 2. Together with Property 1, the proof is completed.□

With aforementioned analysis, \(\mathbf{P }(0)\) can be optimally solved by various existing convex optimization tools. In other words, the optimal joint offloading decision and resource allocation method for problem \(\mathbf{P }(0)\) can be obtained via a full search methodology.

Lemma 3

The optimal solution for \(\mathbf{P }(0)\) suffers an exponentially increased time complexity.

Proof

The computation complexity of the full-search enabled optimal solution relies on the size of the strategy space, which is exponentially increased by K. The proof is completed.□

Since the exhaustive search scheme suffers from an exponentially increased computation complexity, which is not affordable for practical IoT systems, especially when the number of the devices is large. In the following subsection, a time-efficient algorithm is developed, which is implemented in an iterative manner and has ensured convergence property.

4.2 Time-efficient joint offloading strategy and resource management algorithm

In this subsection, we investigate a sub-optimal joint offloading decision and resource allocation method. Define \({\varvec{e}}_n\) as a N-dimension vector, whose items are all zeros except the nth component, which is set to be 1. In addition, denote by \({\mathcal {E}}=\{{\varvec{e}}_n |n\in {\mathcal {N}}\}\). Obviously, we have \({\varvec{a}}_k \subset {\mathcal {E}}\). Moreover, define \({\varvec{a}}_k^t\) and \({\varvec{a}}^t=({\varvec{a}}_k^t)_{k\in {\mathcal {K}}}\) as the offloading decision of IoT device k and the offloading strategy of the system in the tth iteration, respectively. The joint optimization algorithm is conducted in an iterative manner. In the tth iteration, we first determine the optimal offloading scheme for device k, denoted by \({{\bar{{\varvec{a}}}}}_k^{t}\) and it is given below

$$\begin{aligned} {\bar{{\varvec{a}}}}_k^{t}=\mathop {\mathrm{argmin}}\limits _{{\varvec{a}}_k^t\subset {\mathcal {E}}} E_{\text {Total}} ({\varvec{a}}_{-k}^t), \end{aligned}$$
(17)

where \(E_{\text {Total}}\) represents the system energy consumption under the optimal resource mangement.Footnote 3 In addition, \({\varvec{a}}_{-k}^t\) is defined as follows:

$$\begin{aligned} {\varvec{a}}_{-k}^t=({\varvec{a}}_1^{t-1},{\varvec{a}}_2^{t-1},\ldots ,{\varvec{a}}_{k-1}^{t-1},{\varvec{a}}_{k}^{t},{\varvec{a}}_{k+1}^{t-1},\ldots ,{\varvec{a}}_K^{t-1}). \end{aligned}$$

Moreover, we denote by

$$\begin{aligned} {\bar{{\varvec{a}}}}_{-k}^t=({\varvec{a}}_1^{t-1},{\varvec{a}}_2^{t-1},\ldots ,{\varvec{a}}_{k-1}^{t-1},{\bar{{\varvec{a}}}}_k^{t},{\varvec{a}}_{k+1}^{t-1},\ldots ,{\varvec{a}}_K^{t-1}). \end{aligned}$$
(18)

Furthermore, let \(k^*\) be the user that satisfies

$$\begin{aligned} k^* \triangleq \mathop {\mathrm{argmin}}\limits _{k\in {\mathcal {K}}}E_{\text {Total}}({\bar{{\varvec{a}}}}_{-k}^t). \end{aligned}$$
(19)

With aforementioned definitions, we renew the offloading decision of the system in tth iteration as follows:

$$\begin{aligned} {\varvec{a}}^t={{\bar{{\varvec{a}}}}}_{-k^*}^t, \end{aligned}$$
(20)

until the stopping criteria are satisfied. For brevity, we summarize the pseudo-code of our sub-optimal joint offloading and resource allocation scheme in Algorithm 1.

figure a

The convergence performance of our designed joint optimization approach is discussed below:

Lemma 4

The convergence of the objective function in our developed joint offloading and resource management algorithm is ensured.

Proof

The proof for the convergence performance is obvious since the system’s energy consumption is degraded with the iterations. In addition, the total energy cost is lower bounded by zero. The proof is completed.□

Before ending this section, we analyze the computational complexity of our developed algorithm. Details are expressed in the following lemma:

Lemma 5

The time complexity of Algorithm 1 is \({\mathcal {O}}(KN)\).

Proof

Based on Algorithm 1, to update the system’s offloading strategy, we need to calculate the energy consumption under all the possible offloading policies for each IoT device, i.e., (17). The maximum number of the offloading schemes per device is N. In total, we have K IoT terminals, which completes the proof.□

5 Simulation results

In this section, Monte-Carlo simulation is conducted to evaluate the performance of our designed joint offloading decision and resource allocation algorithm for mission-critical IoT enabled MEC systems. We assume that the IoT devices are randomly and uniformly distributed within a disk with radius setting to be 250 m. In total, we have three APs, i.e., N = 3. In addition, we assume that the data size of each device is randomly selected from a given interval, namely, \({\mathbb {U}}_k \in [60,100]\) bytes for \(k\in {\mathcal {K}}\). Meanwhile, we stipulate that the output size for the task of IoT device k to be \({\mathbb {D}}_k \in [30,50]\) bytes, where \(k\in {\mathcal {K}}\). Besides, the computation capacity per AP is set to be 1 G CPU cycles per second. Moreover, the latency requirement of device k, referred to as \(T_k\), is assumed to be 5 s. The SNR loss and the noise power spectral density are set as \(\delta =1.5\) and \(N_0= -130\) dBm/Hz, respectively. Furthermore, we assume the system bandwidth and the transmit power per AP to be W = 1 MHz and \(P_n=1\) watt, respectively. At last, we declare that the applied small scale fading follows the Rayleigh fading with unit variance. Meanwhile, the distance dependent path loss is expressed as \(128.1 + 37.6 \log _{10} d\), in which d is in Km and represents the Euclidean distance between IoT devices and the APs. For brevity, we summarize the simulation parameters in Table 1.

Table 1 Simulation parameters

Two baseline schemes are taken into account for performance comparison, and they are listed below:

  • Baseline 1: In this scheme, different IoT devices select the nearest AP to do the task offloading.

  • Baseline 2: In this strategy, each device chooses its associated AP randomly.

Note that in foregoing mentioned two benchmark schemes, the resource management strategy is identical to that used in our explored scheme. In addition, to provide a comprehensive evaluation, a pair of system performance metrics are considered in this section, i.e., the convergence performance and the total energy consumption, respectively.

Figure 2 shows the convergence performance of our proposed joint offloading decision and resource allocation algorithm, i.e., Algorithm 1. We use the system energy consumption during different iterations to represent this performance. It can be seen from Fig. 2 that, for any given K, the designed method can converge rapidly, illustrating the efficiency of our joint optimization approach. In addition, as expected, a larger number of IoT devices, namely, K, induces a higher total energy consumption.

Fig. 2
figure 2

Convergence performance

Figure 3 depicts the energy consumption of four schemes with varying number of devices, where the x-axis is the number of IoT devices, while the y-axis expresses the total energy consumption. We claim that the optimal method in Fig. 3 is the exhaustive search oriented approach as stated in Sect. 4.1. Due to the limited workspace capacity of MATLAB, the values of the optimal scheme when K is larger than 12 are omitted. Each point on these two figures is obtained via averaging over 300 feasible instances. In addition, two baselines as mentioned before are taken into account to do the comparison with our devised algorithm. From Fig. 3, we can observe that, the energy consumption gap between Algorithm 1 and the optimal scheme is tiny, showing the energy efficiency of our proposed joint optimization approach. In addition, for any given K, Algorithm 1 outperforms the other two baselines, demonstrating the significance of offloading decision optimization in terms of saving total energy costs. More specific, when K = 24, Algorithm 1 saves energy by 15.8% and 63.3% when compared to Baseline 1 and Baseline 2, respectively. Moreover, it is worth noting that for any given K, Baseline 1 requires a less total energy than that of Baseline 2.

Fig. 3
figure 3

The total energy consumption v.s. IoT devices number, K

Figure 4 expresses the energy consumption of our devised algorithm compared to two baseline schemes, wherein the x-axis depicts the offloading time of the IoT devices, referred to as \(\tau _{\text {U}}\), while the y-axis represents the total energy cost of each approach. In addition, it is stipulated that \(\tau _{\text {D}}=30\) ms. From which, we see that the explored algorithm outperforms the benchmark strategies significantly, especially when \(\tau _{\text {U}}\) is large. For instance, when \(\tau _{\text {U}}=280\) ms, our developed method saves energy by 70.9% and 80.1% compared to Baseline 1 and Baseline 2, respectively. Unsurprisingly, for any given \(\tau _{\text {U}}\), Baseline 1 always needs less total energy compared to Baseline 2.

Fig. 4
figure 4

The total energy consumption v.s. the offloading transmission time, \(\tau _{\text {U}}\)

At last, we investigate the effect of the downloading transmission time, i.e., \(\tau _{\text {D}}\), on the total energy consumption of each schemes, as illustrated in Fig. 5. Therein, the total offloading time is set to be \(\tau _{\text {U}}=30\) ms. As expected, the proposed joint optimization strategy has the best energy efficiency among all the three schemes due to the sophisticatedly designed offloading decision as well as the resource management. Details are not repeated here to avoid redundancy. Comparing Fig. 4 with Fig. 5, we observe that increase the download time of all the IoT devices results in a fast-growing total energy consumption, i.e., the slope of the curves in Fig. 5 is larger than that of Fig. 4. This is because the data sizes of computing results at APs are smaller than that of the offloaded tasks. Increasing the transmission time for downloading phase degrades the time for tasks’ offloading and computing as the total latency, i.e., \(T_k\), is fixed for IoT device k, where \(k\in {\mathcal {K}}.\)

Fig. 5
figure 5

The total energy consumption versus the downloading transmission time, \(\tau _{\text {D}}\)

6 Conclusion and future work

In this paper, the energy cost minimization for mission-critical IoT in MEC system was studied. The formulated minimization problem is a MINLP problem, which has been decomposed into two sub-problems for analytical tractability, i.e., the task offloading decision-making sub-problem and the resource optimization sub-problem. We demonstrated that the resource management sub-problem of IoT devices under the fixed offloading strategy are convex and can be best solved by the existing convex optimization tools. On this basis, we showed that the optimal solution to the original optimization problem was attainable by an exhaustive search enabled approach, which suffered an exponentially increased time complexity. Thereafter, a time-efficient sub-optimal algorithm was designed, which was conducted in an iterative manner and had a provable convergence guarantee compared with a wide range of benchmark schemes, the simulation results show the effectiveness of our algorithm in terms of convergence and energy consumption. In the future, we will investigate the joint offloading decision and radio resource management for MEC-enabled IoT networks with ultra-low latency consideration. Besides, the optimization for AP deployment will be another important direction.