Admission control in a two-class loss system with periodically varying parameters and abandonments

Zayas-Cabán, Gabriel; Lewis, Mark E.

doi:10.1007/s11134-019-09620-3

Admission control in a two-class loss system with periodically varying parameters and abandonments

Published: 20 June 2019

Volume 94, pages 175–210, (2020)
Cite this article

Queueing Systems Aims and scope Submit manuscript

326 Accesses
4 Citations
Explore all metrics

Abstract

Motivated by service systems, such as telephone call centers and emergency departments, we consider admission control for a two-class multi-server loss system with periodically varying parameters and customers who may abandon from service. Assuming mild conditions for the parameters, a dynamic programming formulation is developed. We show that under the infinite horizon discounted problem, there exists an optimal threshold policy and provide conditions for a customer class to be preferred for each fixed time, extending stationary results to the non-stationary setting. We approximate the non-stationary problem by discretizing the time horizon into equally spaced intervals and examine how policies for this approximation change as a function of time and parameters numerically. We compare the performance of these approximations with several admission policies used in practice in a discrete-event simulation study. We show that simpler admission policies that ignore non-stationarity or abandonments lead to significant losses in rewards.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Admission control strategies for tandem Markovian loss systems

Article 13 January 2018

Scheduling of multi-class multi-server queueing systems with abandonments

Article 11 November 2015

Loss rates in the single-server queue with complete rejection

Article Open access 14 March 2015

References

Altman, E., Jiménez, T., Koole, G.: On optimal call admission control in resource-sharing system. IEEE Trans. Commun. 49(9), 1659–1668 (2001)
Article Google Scholar
Blackwell, D.: Discrete dynamic programming. Ann. Math. 33, 719–726 (1962)
Article Google Scholar
Buyukkoc, C., Varaiya, P., Walrand, J.: The $c\mu $-rule revisited. Adv. Appl. Probab. 17(1), 237–238 (1985)
Article Google Scholar
Chavis, J.T., Cochran, A.L., Kocher, K.E., Washington, V.N., Zayas-Caban, G.: A simulation model of patient flow through the emergency department to determine the impact of a short stay unit on hospital congestion. In: Roeder, T.M.K., Frazier, P.I., Szechtman, R., Zhou, E., Huschka, T., Chick, S.E. (eds.) Proceedings of the 2016 Winter Simulation Conference, pp. 1982–1993. Institute of Electrical and Electronics Engineers Inc, Piscataway, NJ (2016)
Chapter Google Scholar
Cudina, M., Ramanan, K.: Asymptotically optimal controls for time-inhomogeneous networks. SIAM J. Control Optim. 49(2), 611–645 (2011)
Article Google Scholar
Feinberg, E.A., Reiman, M.I.: Optimality of randomized trunk reservation. Probab. Eng. Inf. Sci. 8(04), 463–489 (1994)
Article Google Scholar
Fischer, W., Meier-Hellstern, K.: The Markov-modulated Poisson process (MMPP) cookbook. Perform. Eval. 18(2), 149–171 (1993)
Article Google Scholar
Fu, M.C., Marcus, S.I., Wang, I.-J.: Monotone optimal policies for a transient queueing staffing problem. Oper. Res. 48(2), 327–331 (2000)
Article Google Scholar
Green, L., Kolesar, P.: The pointwise stationary approximation for queues with nonstationary arrivals. Manag. Sci. 37(1), 84–97 (1991)
Article Google Scholar
Green, L.V., Kolesar, P.J.: On the accuracy of the simple peak hour approximation for Markovian queues. Manag. Sci. 41(8), 1353–1370 (1995)
Article Google Scholar
Green, L.V., Kolesar, P.J.: The lagged PSA for estimating peak congestion in multiserver Markovian queues with periodic arrival rates. Manag. Sci. 43(1), 80–87 (1997)
Article Google Scholar
Green, L.V., Kolesar, P.J.: A note on approximating peak congestion in $M_t/G/\infty $ queues with sinusoidal arrivals. Manag. Sci. 44(11–part–2), S137–S144 (1998)
Article Google Scholar
Green, L.V., Kolesar, P.J., Soares, J.: Improving the SIPP approach for staffing service systems that have cyclic demands. Oper. Res. 49(4), 549–564 (2001)
Article Google Scholar
Hampshire, R.C., Massey, W.A.: Dynamic Optimization with Applications to Dynamic Rate Queues. TUTORIALS in Operations Research, pp. 210–247. INFORMS Society, Catonsville (2010)
Google Scholar
Hordijk, A., Puterman, M.L.: On the convergence of policy iteration in finite state undiscounted Markov decision processes: the unichain case. Math. Oper. Res. 12(1), 163–176 (1987)
Article Google Scholar
Ingolfsson, A., Akhmetshina, E., Budge, S., Li, Y., Wu, X.: A survey and experimental comparison of service-level-approximation methods for nonstationary M (t)/M/s (t) queueing systems with exhaustive discipline. INFORMS J. Comput. 19(2), 201–214 (2007)
Article Google Scholar
Jain, A., Elmaraghy, H.: Production scheduling/rescheduling in flexible manufacturing. Int. J. Prod. Res. 35(1), 281–309 (1997)
Article Google Scholar
Jaśkiewicz, A.: A fixed point approach to solve the average cost optimality equation for semi-Markov decision processes with Feller transition probabilities. Commun. Stat. Theory Methods 36(14), 2559–2575 (2007)
Article Google Scholar
Kim, S.-H., Whitt, W.: Are call center and hospital arrivals well modeled by nonhomogeneous Poisson processes? Manuf. Serv. Oper. Manag. 16(3), 464–480 (2014)
Article Google Scholar
Koole, G.: Structural results for the control of queueing systems using event-based dynamic programming. Queueing Syst. 30(3–4), 323–339 (1998)
Article Google Scholar
Koole, G., Mandelbaum, A.: Queueing models of call centers: an introduction. Ann. Oper. Res. 113(1–4), 41–59 (2002)
Article Google Scholar
Lewis, M.E., Ayhan, H., Foley, R.D.: Bias optimal admission control policies for a multiclass nonstationary queueing system. J. Appl. Probab. 39, 20–37 (2002)
Article Google Scholar
Lewis, P.A.: Some results on tests for Poisson processes. Biometrika 52, 67–77 (1965)
Article Google Scholar
Lippman, S.: Applying a new device in the optimization of exponential queuing systems. Oper. Res. 23(4), 687–710 (1975)
Article Google Scholar
Martin-Löf, A.: Optimal control of a continuous-time Markov chain with periodic transition probabilities. Oper. Res. 15(5), 872–881 (1967)
Article Google Scholar
Miller, B.L.: A queueing reward system with several customer classes. Manag. Sci. 16(3), 234–245 (1969)
Article Google Scholar
Örmeci, E.L., Burnetas, A., van der Wal, J.: Admission policies for a two class loss system. Stoch. Model 17(4), 513–539 (2001)
Article Google Scholar
Örmeci, E.L., van der Wal, J.: Admission policies for a two class loss system with general interarrival times. Stoch. Models 22(1), 37–53 (2006)
Article Google Scholar
Puterman, M.L.: Markov Decision Processes: Discrete Stochastic Dynamic Programming. Wiley Series in Probability and Mathematical Statistics. Wiley, New York (1994)
Book Google Scholar
Puterman, M.L., Shin, M.C.: Modified policy iteration algorithms for discounted Markov decision problems. Manag. Sci. 24(11), 1127–1137 (1978)
Article Google Scholar
Roubos, A., Bhulai, S., Koole, G.: Flexible staffing for call centers with non-stationary arrival rates In: Markov decision processes in practice, pp. 487–503. Springer, Cham (2017)
Google Scholar
Roubos, D., Bhulai, S.: Approximate dynamic programming techniques for the control of time-varying queuing systems applied to call centers with abandonments and retrials. Probab. Eng. Inf. Sci. 24(01), 27–45 (2010)
Article Google Scholar
Rowe, B.H., Channan, P., Bullard, M., Blitz, S., Saunders, L.D., Rosychuk, R.J., Lari, H., Craig, W.R., Holroyd, B.R.: Characteristics of patients who leave emergency departments without being seen. Acad. Emerg. Med. 13(8), 848–852 (2006)
Article Google Scholar
Savin, S.V., Cohen, M.A., Gans, N., Katalan, Z.: Capacity management in rental businesses with two customer bases. Oper. Res. 53(4), 617–631 (2005)
Article Google Scholar
Stidham Jr., S.: Optimal control of admission to a queueing system. IEEE Trans. Autom. Control 30(8), 705–713 (1985)
Article Google Scholar
Wakuta, K.: Arbitrary state semi-Markov decision processes. Optimization 18(3), 447–454 (1987)
Article Google Scholar
Wei, Q., Guo, X.: New average optimality conditions for semi-Markov decision processes in Borel spaces. J. Optim. Theory Appl. 153(3), 709–732 (2012)
Article Google Scholar
Yoon, S., Lewis, M.E.: Optimal pricing and admission control in a queueing system with periodically varying parameters. Queueing Syst. 47(3), 177–199 (2004)
Article Google Scholar
Zayas-Cabán, G., Xie, J., Green, L.V., Lewis, M.E.: Dynamic control of a tandem system with abandonments. Queueing Syst. 84(3–4), 279–293 (2016)
Article Google Scholar

Download references

Acknowledgements

We would like to thank the Associate Editor and the two referees for several and insightful comments.

Author information

Authors and Affiliations

Industrial and Systems Engineering, University of Wisconsin, Madison, WI, USA
Gabriel Zayas-Cabán
School of Operations Research and Information Engineering, Cornell University, Ithaca, NY, USA
Mark E. Lewis

Authors

Gabriel Zayas-Cabán
View author publications
You can also search for this author in PubMed Google Scholar
Mark E. Lewis
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Gabriel Zayas-Cabán.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

A Appendix

This section is dedicated to showing that the discounted reward optimality equations have a solution and that these solutions can be computed via successive approximations. The result is an application of Theorems 4.1 and 4.2 in Wakuta [36]. There are minor adjustments to the proofs since the distribution of the state change and the time between decision points is not separable. The following notation and definitions are used throughout this Appendix.

Let $\mathbb {X}$ and $\mathbb {A}$, respectively, denote the state and action space for a generic semi-Markov decision process (SMDP).
Let the Euclidean distance between $\ell = \left( \left( i,j,k,z\right) ,a\right) $, $\ell ' = \left( \left( i',j',k',z'\right) ,a'\right) \in \text {Gr}(\mathbb {A})$ be given by
$$\begin{aligned} \Vert \ell -\ell '\Vert&:= \sqrt{(i-i')^2 + (j-j')^2 + (k-k')^2 + (z-z')^2 + (a-a')^2}. \end{aligned}$$

Definition A1

The transition kernel $Q(\cdot |x,a)$ is called strongly continuous provided the mapping $(x,a) \rightarrow \int _{x' \in \mathbb {X}} \! f(x') \, Q(\mathrm {d}x'|x,a)$ is continuous and bounded whenever f is measurable and bounded on $\mathbb {X}$.

1.1 A.1 Discounted reward optimality equations (DROE)

In this section, we show that the DROE have a solution and that the solution can be computed via the (contraction) mappings defined in Sect. 4. Before doing so we note that because the rewards are bounded below, it suffices (and is in fact equivalent) to consider the case with nonnegative rewards. That is to say that adding the lower bound on rewards to all rewards does not change the optimal control. Let this lower bound be denoted L and, throughout this section, assume $r(x,a) \ge 0$.

Lemma 2

The following hold:

1.
There exist a Borel measurable function w on $\mathbb {X}$ with $w(x) \ge 1, x \in \mathbb {X}$, a number $\xi $ ($0< \xi < 1$), and a number $M > 0$ such that
1. (a)
  for any $(x,a) \in Gr(\mathbb {A})$
  $$\begin{aligned} \Vert r(x,a)\Vert \le M w(x). \end{aligned}$$
2. (b)
  for any policy, the sequence of decision times $\{t_n, n \ge 0\}$ are such that $\limsup _{n \rightarrow \infty } t_n = \infty $ almost surely.
2.
$\mathbb {X}$ and $\mathbb {A}$ are non-empty Borel subsets of a complete separable metric space.
3.
A(x) is a Borel measurable compact-valued multi-function from $\mathbb {X}$ into $\mathbb {A}$.
4.
The one-stage reward function r(x, a) is upper-semicontinuous on $\mathbb {A}$ for each fixed $x \in \mathbb {X}$.
5.
The transition kernel Q is strongly continuous.

We remark that it is usual that Statement 1(b) (see Proposition 3.1 of [36]) is proved under the assumption of the existence of a Lyapunov function (see inequality (3.2) in [36]).

Proof

Statement 1(a) holds with $w(x) \equiv 1$ and for any M that maximizes the one-stage reward functions r(x, a) (below and above). For Statement 1(b), notice that if the initial state is (i, j, k, 0), the expected number of transitions before the dummy transition at time T is ${{\,\mathrm{\mathbb {E}}\,}}N(T) < \infty $ (in fact $\varPsi T$), where N(T) is a Poisson process of rate $\varPsi $. This implies that $N(T) < \infty $ almost surely. At time T, there is a renewal. Since there are deterministic renewals at times $(T, 2T, \ldots )$, we have that it requires an infinite time to see an infinite number of transitions (almost surely).

Statements 2, 3 and 4 hold by construction and by the finiteness of A(x) for each x. For Statement 5, it is enough to show that

$$\begin{aligned}&\int _z^T \Big [ \lambda _{1}(t) f(i,j,1,t) + \lambda _{2}(t) f(i,j,2,t) \nonumber \\&\quad + \left( \mu _{i}^{1}(t) + \beta _{i}^{1}(t) \right) f(i,j,-1,t) + \left( \mu _{j}^{2}(t) + \beta _{j}^{2}(t) \right) f(i,j,-2,t) \nonumber \\&\quad + \big ( 1 - \lambda (t) - \mu _{i}^{1}(t) - \beta _{i}^{1}(t) -\mu _{j}^{2}(t) - \beta _{j}^{2}(t) \big ) f(i,j,0,t) \Big ] e^{-(1+\alpha )(t-z)}\,{\text {d}}t \end{aligned}$$

(8.1)

is continuous for any bounded, real-valued measurable function f. To this end, fix $\epsilon > 0$ and let $x = (i,j,k,z), x' = (i',j',k',z') \in \mathbb {X}$ with $z < z'$. Next, choose $\delta < 1$ so that $\Vert (x,a) - (x',a')\Vert < \delta $ implies that $i = i', j = j', k = k'$ and $a = a'$. It follows that

$$\begin{aligned}&\Big | \Big [\int _{z}^{T} \Big [ \lambda _{1}(t) f(i,j,1,t) + \lambda _{2}(t) f(i,j,2,t) \\&\quad + \left( \mu _{i}^{1}(t) + \beta _{i}^{1}(t) \right) f(i,j,-1,t) + \left( \mu _{j}^{2}(t) + \beta _{j}^{2}(t) \right) f(i,j,-2,t) \\&\quad + \big ( 1 - \lambda (t) - \mu _{i}^{1}(t) - \beta _{i}^{1}(t) -\mu _{j}^{2}(t) - \beta _{j}^{2}(t) \big ) f(i,j,0,t) \Big ] e^{-(1+\alpha )(t-z)}\,{\text {d}}t \Big ] \\&\quad - \Big [\int _{z'}^{T} \Big [ \lambda _{1}(t) f(i',j',1,t) + \lambda _{2}(t) f(i',j',2,t) \\&\quad + \left( \mu _{i'}^{1}(t) + \beta _{i'}^{1}(t) \right) f(i',j',-1,t) + \left( \mu _{j'}^{2}(t) + \beta _{j'}^{2}(t) \right) f(i',j',-2,t) \\&\quad + \big ( 1 - \lambda (t) - \mu _{i'}^{1}(t) - \beta _{i'}^{1}(t) -\mu _{j'}^{2}(t) - \beta _{j'}^{2}(t) \big ) f(i',j',0,t) \Big ] e^{-(1+\alpha )(t-z')}\,{\text {d}}t \Big ] \Big | \\&= \Big | \Big [\int _z^T \Big [ \lambda _{1}(t) f(i,j,1,t) + \lambda _{2}(t) f(i,j,2,t) \\&\quad + \left( \mu _{i}^{1}(t) + \beta _{i}^{1}(t) \right) f(i,j,-1,t) + \left( \mu _{j}^{2}(t) + \beta _{j}^{2}(t) \right) f(i,j,-2,t) \\&\quad + \big ( 1 - \lambda (t) - \mu _{i}^{1}(t) - \beta _{i}^{1}(t) -\mu _{j}^{2}(t) - \beta _{j}^{2}(t) \big ) f(i,j,0,t) \Big ] e^{-(1+\alpha )(t-z)}\,{\text {d}}t \Big ] \\&\quad - \Big [\int _{z'}^T \Big [ \lambda _{1}(t) f(i,j,1,t) + \lambda _{2}(t) f(i,j,2,t) \\&\quad + \left( \mu _{i}^{1}(t) + \beta _{i}^{1}(t) \right) f(i,j,-1,t) + \left( \mu _{j}^{2}(t) + \beta _{j}^{2}(t) \right) f(i,j,-2,t) \\&\quad + \big ( 1 - \lambda (t) - \beta _{i}^{1}(t) - \beta _{j}^{2}(t) - \mu _{i}^{1}(t) -\mu _{j}^{2}(t) \big ) f(i,j,0,t) \Big ] e^{-(1+\alpha )(t-z')}\,{\text {d}}t \Big ] \Big |, \end{aligned}$$

which simplifies to

$$\begin{aligned}&\Big | \Big [\int _{z}^{z'} \Big [ \lambda _{1}(t) f(i,j,1,t) + \lambda _{2}(t) f(i,j,2,t) \\&\quad + \left( \mu _{i}^{1}(t) + \beta _{i}^{1}(t) \right) f(i,j,-1,t) + \left( \mu _{j}^{2}(t) + \beta _{j}^{2}(t) \right) f(i,j,-2,t) \\&\quad + \big ( 1 - \lambda (t) - \mu _{i}^{1}(t) - \beta _{i}^{1}(t) -\mu _{j}^{2}(t) - \beta _{j}^{2}(t) \big ) f(i,j,0,t) \Big ] e^{-(1+\alpha )(t-z)}\,{\text {d}}t \Big ] \\&\quad + \Big (e^{-(z'-z)} - 1\Big )\Big [\int _{z'}^T \Big [ \lambda _{1}(t) f(i,j,1,t) + \lambda _{2}(t) f(i,j,2,t) \\&\quad + \left( \mu _{i}^{1}(t) + \beta _{i}^{1}(t) \right) f(i,j,-1,t) + \left( \mu _{j}^{2}(t) + \beta _{j}^{2}(t) \right) f(i,j,-2,t) \\&\quad + \big ( 1 - \lambda (t) - \mu _{i}^{1}(t) - \beta _{i}^{1}(t) -\mu _{j}^{2}(t) - \beta _{j}^{2}(t) \big ) f(i,j,0,t) \Big ] e^{-(1+\alpha )(t-z')}\,{\text {d}}t \Big ] \Big |. \end{aligned}$$

Let $M < \infty $ be such that $\sup _{x \in \mathbb {X}} |f(x)| \le M < \infty $. The convexity of the function $x^2$ for $0< x < \infty $ implies that $\Big ( \frac{f+g}{2} \Big )^2 \le \frac{1}{2} \big (f^2 + g^2 \big )$, and hence, that $(f+g)^2 \le 2f^2 + 2g^2$ for any two functions f, g. It follows that the previous expression is bounded above by

$$\begin{aligned}&\left[ 2 \left[ \int _{z}^{z'} \big ( \lambda _{1}(t) f(i,j,1,t) + \lambda _{2}(t) f(i,j,2,t) \right. \right. \\&\quad + \left( \mu _{i}^{1}(t) + \beta _{i}^{1}(t) \right) f(i,j,-1,t) + \left( \mu _{j}^{2}(t) + \beta _{j}^{2}(t) \right) f(i,j,-2,t) \\&\quad \left. \left. + \big ( 1 - \lambda (t) - \mu _{i}^{1}(t) - \beta _{i}^{1}(t) -\mu _{j}^{2}(t) - \beta _{j}^{2}(t) \big ) f(i,j,0,t) \right] e^{-(1+\alpha )(t-z)}\big ),{\text {d}}t \right] ^2 \\&\quad + 2 \left[ (e^{-(z'-z)} - 1) \left( \int _{z'}^T \Big [ \lambda _{1}(t) f(i,j,1,t) + \lambda _{2}(t) f(i,j,2,t)\right. \right. \\&\quad + \left( \mu _{i}^{1}(t) + \beta _{i}^{1}(t) \right) f(i,j,-1,t) + \left( \mu _{j}^{2}(t) + \beta _{j}^{2}(t) \right) f(i,j,-2,t) \\&\quad \left. \left. + \big ( 1 - \lambda (t) - \mu _{i}^{1}(t) - \beta _{i}^{1}(t) -\mu _{j}^{2}(t) - \beta _{j}^{2}(t) \big ) f(i,j,0,t) \right) e^{-(1+\alpha )(t-z')}\,{\text {d}}t \Big ]^2 \right] ^{\frac{1}{2}}. \end{aligned}$$

This last expression is bounded above by $\sqrt{2} M (1 - e^{-(1+\alpha )(z'-z)})$. The continuity of $e^{-x}$ on $-\infty< x < \infty $ implies that we can choose $\delta $ so that $z'-z < \delta $ implies $\sqrt{2} M (1 - e^{-(1+\alpha )(z'-z)}) < \epsilon $. This proves the last statement. $\square $

The previous lemma implies that Theorem 4.1 in [36] holds with some minor adjustments. We include the statement and proof of necessity (for the current model) for completeness. Sufficiency holds with almost no changes to that in [36] since A(x) is finite for each $x \in \mathbb {X}$.

Theorem 3

A policy $\pi $ is optimal if and only if its reward $v^{\pi }_{\alpha }(x)$ satisfies the discounted reward optimality equations (see (4.1)).

Proof

Suppose $v^{\pi '}_{\alpha }(x)$ satisfies the DROE. Fix $n \ge 1$. Denote the history under a generic policy $\pi = \{d_0, d_1, \ldots \}$ by

$$\begin{aligned} h_k = \{t_0,x,a_0,\ldots ,t_{k},x_{k},a_{k}\}, \end{aligned}$$

and note that

$$\begin{aligned} {{\,\mathrm{\mathbb {E}}\,}}_{\pi ,x}\Big [ \sum _{k=0}^{n-1} \big [ e^{-\alpha t_{k+1}} v_{\alpha }^{\pi '}(x_{k+1}) - {{\,\mathrm{\mathbb {E}}\,}}_{\pi ,h_k} e^{-\alpha t_{k+1}} v_{\alpha }^{\pi '}(x_{k+1}) \big ] \Big ] = 0, \end{aligned}$$

(8.2)

for any $n \ge 2$. For ease of notation, for any fixed $s \in \mathbb {X}$, let p(y|s, a) denote the probability of next entering state y given that the current state is s and action a is chosen. Let $\{\tau _n, n \ge 1\}$ be the inter-event times so that $\tau _n = t_n - t_{n-1}$. Consider the second term above,

$$\begin{aligned} {{\,\mathrm{\mathbb {E}}\,}}_{\pi ,h_k} e^{-\alpha t_{k+1}} v_{\alpha }^{\pi '}(x_{k+1})&= e^{-\alpha t_{k}} {{\,\mathrm{\mathbb {E}}\,}}_{\pi ,h_k} e^{-\alpha \tau _{k+1}} v_{\alpha }^{\pi '}(x_{k+1})\\&= e^{-\alpha t_{k}} \Big [ \int _{y \in \mathbb {X}} e^{-\alpha t(y)} v_{\alpha }^{\pi '}(y) p({\text {d}}y|x_k,a_k)\Big ] \\&= e^{-\alpha t_{k}} \Big [ r(x_k,a_k) + \int _{y \in \mathbb {X}} e^{-\alpha t(y)} v_{\alpha }^{\pi '}(y) p({\text {d}}y|x_k,a_k)\Big ] \\&\quad - e^{-\alpha t_{k}} r(x_k,a_k) \\&\le e^{-\alpha t_{k}} \Big [ v_{\alpha }^{\pi '}(x_k)\Big ] - e^{-\alpha t_{k}} r(x_k,a_k), \end{aligned}$$

where we have denoted by t(y) the discount time associated with next transition and the last inequality holds by taking the maximum in the DROE and the assumption that $v_{\alpha }^{\pi '}$ satisfies it. Thus, using (8.2) yields

$$\begin{aligned} {{\,\mathrm{\mathbb {E}}\,}}_{\pi ,x}\Big [ \sum _{k=0}^{n-1} e^{-\alpha t_{k+1}} v_{\alpha }^{\pi '}(x_{k+1}) - e^{-\alpha t_{k}} \Big [ v_{\alpha }^{\pi '}(x_{k})\Big ] + e^{-\alpha t_{k}} r(x_k,a_k)\Big ] \le 0. \end{aligned}$$

The first two terms in the sum telescope, yielding the following inequality:

$$\begin{aligned} {{\,\mathrm{\mathbb {E}}\,}}_{\pi ,x}\Big [e^{-\alpha t_{n}} v_{\alpha }^{\pi '}(x_{n}) - v_{\alpha }^{\pi '}(x) + \sum _{k=0}^{n-1} e^{-\alpha t_{k}} r(x_k,a_k)\Big ] \le 0. \end{aligned}$$

Statement 1(a) of Lemma 2 implies that $v_{\alpha }^{\pi '}$ is bounded. The Dominated Convergence Theorem and Statement 1(b) of Lemma 2 yield that ${{\,\mathrm{\mathbb {E}}\,}}_{\pi ,x}e^{-\alpha t_{n}} v_{\alpha }^{\pi '}(x_{n}) \rightarrow 0$ as $n \rightarrow \infty $. The last term converges to the total discounted reward of $\pi $ so that we have $v_{\alpha }^{\pi '} \ge v_{\alpha }^{\pi }$, which yields the result. $\square $

We conclude this section by stating Lemma 5.7 and Theorem 5.8 of [36], which follow in the same manner.

Theorem 4

The following hold for the discounted reward model:

1.
The mapping $U_{n, \alpha }$ maps the set of bounded functions on $\mathbb {X}$ back into the same set and is a contraction mapping. This implies that $v_{\alpha }$ is the unique solution to the DROE.
2.
There exists a Borel measurable function f that achieves the maximum in the DROE.
3.
There exists an optimal stationary policy.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Zayas-Cabán, G., Lewis, M.E. Admission control in a two-class loss system with periodically varying parameters and abandonments. Queueing Syst 94, 175–210 (2020). https://doi.org/10.1007/s11134-019-09620-3

Download citation

Received: 30 July 2016
Revised: 02 June 2019
Published: 20 June 2019
Issue Date: February 2020
DOI: https://doi.org/10.1007/s11134-019-09620-3

Keywords

Mathematics Subject Classification

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Admission control in a two-class loss system with periodically varying parameters and abandonments

Abstract

Access this article

Similar content being viewed by others

Admission control strategies for tandem Markovian loss systems

Scheduling of multi-class multi-server queueing systems with abandonments

Loss rates in the single-server queue with complete rejection

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher's Note

A Appendix

Definition A1

1.1 A.1 Discounted reward optimality equations (DROE)

Lemma 2

Proof

Theorem 3

Proof

Theorem 4

Rights and permissions

About this article

Cite this article

Keywords

Mathematics Subject Classification

Navigation

Admission control in a two-class loss system with periodically varying parameters and abandonments

Abstract

Access this article

Similar content being viewed by others

Admission control strategies for tandem Markovian loss systems

Scheduling of multi-class multi-server queueing systems with abandonments

Loss rates in the single-server queue with complete rejection

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher's Note

A Appendix

A Appendix

Definition A1

1.1 A.1 Discounted reward optimality equations (DROE)

Lemma 2

Proof

Theorem 3

Proof

Theorem 4

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Mathematics Subject Classification

Search

Navigation