Abstract
Numerous safety-related applications have rigid requirements for the threshold of latency limitation of message transmissions to enhance safety and traffic efficiency of vehicles on the road in vehicular ad-hoc networks (VANETs). Consequently, a medium access control (MAC) protocol for reliable and fast transmission of safety messages should focus on the maximum transmission delay. Herein, we refer to the concept of packet value and integrate it with the latency of different messages determined by the waiting interval of packets in a queue. Subsequently, we propose a reciprocal revenue function, i.e., the transmission probability of the participants, by considering both its own benefit and the other nodes’ revenue. Additionally, an inter-vehicle cross-layer cooperative game model based on the local optimal utility of participants is constructed. We then theoretically prove the existence of an equilibrium using the partially observable Markov decision process (POMDP), and provide a specific approach for obtaining the channel access probability of a vehicle by using deep reinforcement learning. Finally, the analysis and simulation results in saturated and non-saturated data traffic conditions are presented to evaluate the performance of Reciprocal Partially Observable MAC Protocol (RPO-MAC) proposed in this paper and compare it with the IEEE 802.11p standard protocol. These comparisons demonstrate the advantages of our proposed reciprocal revenue game method in the case of channel network congestion, especially in terms of delay. It is shown that the RPO-MAC protocol can provide a strong support to delay-sensitive safety related applications.
Similar content being viewed by others
References
Azarafrooz, M., Chandramouli, R., & Subbalakshmi, K. (2014). Reciprocity, fairness and learning in medium access control games. Computer Communication, 46, 22–28.
Babu, S., Patra, M., & Murthy, C. S. R. (2015). A novel context-aware variable interval mac protocol to enhance event-driven message delivery in ieee 802.11 p/wave vehicular networks. Vehicular Communications, 2(3), 172–183.
Burton, M., & Hill, G. (2009). 802.11 arbitration. White Paper. Durham: Certified Wireless Network Professional Inc.
Chen, L., Low, S. H., & Doyle, J. C. (2010). Random access game and medium access control design. IEEE/ACM Transactions on Networking (TON), 18(4), 1303–1316.
Cheng, N., Zhang, N., Lu, N., Shen, X., Mark, J. W., & Liu, F. (2013). Opportunistic spectrum access for cr-vanets: A game-theoretic approach. IEEE Transactions on Vehicular Technology, 63(1), 237–251.
Committee ICSLMS, et al. (1999) Wireless lan medium access control (mac) and physical layer (phy) specifications. ANSI/IEEE Std 80211-1999
Cui, T., Chen, L., & Low, S. H. (2008). A game-theoretic framework for medium access control. IEEE Journal on Selected Areas in Communications, 26(7), 1116–1127.
Dang, D. N. M., Hong, C. S., Lee, S., & Huh, E. N. (2014). An efficient and reliable mac in vanets. IEEE Communications Letters, 18(4), 616–619.
Fazio, P., De Rango, F., & Sottile, C. (2016). A predictive cross-layered interference management in a multichannel mac with reactive routing in vanet. IEEE Transactions on Mobile Computing, 15(8), 1850–1862.
Hajek, B. (1984). Optimal control of two interacting service stations. IEEE transactions on automatic control, 29(6), 491–499.
Hegde, N., & Proutiere, A. (2012). Simulation-based optimization algorithms with applications to dynamic spectrum access. In: Information Sciences and Systems (CISS), 2012 46th Annual Conference on, IEEE, pp 1–6
Jiang, L., Shah, D., Shin, J., & Walrand, J. (2010). Distributed random access algorithm: Scheduling and congestion control. IEEE Transactions on Information Theory, 56(12), 6182–6207.
Ju, P., & Song, W. (2016). Repeated game analysis for cooperative mac with incentive design for wireless networks. IEEE Transactions on Vehicular Technology, 65(7), 5045–5059.
Karkus, P., Hsu, D., & Lee, W.S. (2017). Qmdp-net: Deep learning for planning under partial observability. In: Advances in Neural Information Processing Systems, pp 4694–4704
Kenney, J. B. (2011). Dedicated short-range communications (dsrc) standards in the united states. Proceedings of the IEEE, 99(7), 1162–1182.
Khanduri, R., & Rattan, S. (2013). Performance comparison analysis between ieee 802.11 a/b/g/n standards. International Journal of Computer Applications, 78(1), 13–20.
Kim, T. H., Ni, J., Srikant, R., & Vaidya, N. H. (2013). Throughput-optimal csma with imperfect carrier sensing. IEEE/ACM Transactions on Networking (TON), 21(5), 1636–1650.
Kumar, P. R., & Varaiya, P. (2015). Stochastic systems: Estimation, identification, and adaptive control (Vol. 75). Philadelphia: SIAM.
Kwon, Y., & Rhee, B. (2016). Bayesian game-theoretic approach based on 802.11 p mac protocol to alleviate beacon collision under urban vanets. International Journal of Automotive Technology, 17(1), 183–191.
Lee, J. H., Ernst, T., & Ma, X. (2014). Performance analysis of secure beaconing messages for geonetworking. Security and Communication Networks, 7(12), 2555–2563.
Lee, J.-W., Chiang, M., & Calderbank, A. R. (2006). Utility-optimal medium access control: Reverse and forward engineering. In Proceedings IEEE INFOCOM 2006. 25TH IEEE International Conference on Computer Communications (pp. 1–13). Barcelona, Spain. https://doi.org/10.1109/INFOCOM.2006.252.
Lee, J. W., Tang, A., Huang, J., Chiang, M., & Calderbank, A. R. (2007). Reverse-engineering mac: A non-cooperative game model. IEEE Journal on Selected Areas in Communications, 25(6), 1135–1147.
Liu, L., Xia, W., & Shen, L. (2009). An adaptive multi-channel mac protocol with dynamic interval division in vehicular environment. In: 2009 First International Conference on Information Science and Engineering, IEEE, pp 2534–2537
Maalej, Y., Abderrahim, A., Guizani, M., Hamdaoui, B., & Balti, E. (2016). Advanced activity-aware multi-channel operations1609. 4 in vanets for vehicular clouds. In: 2016 IEEE Global Communications Conference (GLOBECOM), IEEE, pp 1–6
MacDermed, L., Isbell, C., & Weiss, L. (2011). Markov games of incomplete information for multi-agent reinforcement learning. In: Workshops at the Twenty-Fifth AAAI Conference on Artificial Intelligence
Nayyar, A., Mahajan, A., & Teneketzis, D. (2011). Optimal control strategies in delayed sharing information structures. IEEE Transactions on Automatic Control, 56(7), 1606–1620.
Nguyen, V., Kim, O. T. T., Pham, C., Oo, T. Z., Tran, N. H., Hong, C. S., & Huh, E. N. (2018). A survey on adaptive multi-channel mac protocols in vanets using markov models. IEEE Access, 6, 16493–16514.
Ni, J., Tan, B., & Srikant, R. (2012). Q-csma: Queue-length-based csma/ca algorithms for achieving maximum throughput and low delay in wireless networks. IEEE/ACM Transactions on Networking (TON), 20(3), 825–836.
Omar, H. A., Zhuang, W., & Li, L. (2012). Vemac: A tdma-based mac protocol for reliable broadcast in vanets. IEEE transactions on mobile computing, 12(9), 1724–1736.
Qiu, H. J., Ho, I. W. H., Chi, K. T., & Xie, Y. (2015). A methodology for studying 802.11 p vanet broadcasting performance with practical vehicle distribution. IEEE transactions on vehicular technology, 64(10), 4756–4769.
Rajagopalan, S., Shah, D., & Shin, J. (2009). Network adiabatic theorem: an efficient randomized protocol for contention resolution. In: Proceedings of the eleventh international joint conference on Measurement and modeling of computer systems, pp 133–144
Reeves, D.M., & Wellman, M.P. (2004). Computing best-response strategies in infinite games of incomplete information. In: Proceedings of the 20th conference on Uncertainty in artificial intelligence, AUAI Press, pp 470–478
Sahoo, J., Wu, E. H. K., Sahu, P. K., & Gerla, M. (2013). Congestion-controlled-coordinator-based mac for safety-critical message transmission in vanets. IEEE Transactions on Intelligent Transportation Systems, 14(3), 1423–1437.
Smith, J. E., & McCardle, K. F. (2002). Structural properties of stochastic dynamic programs. Operations Research, 50(5), 796–809.
Song, C., Tan, G., Yu, C., Ding, N., & Zhang, F. (2017). Apdm: An adaptive multi-priority distributed multichannel mac protocol for vehicular ad hoc networks in unsaturated conditions. Computer Communications, 104, 119–133.
Vasal, D., & Anastasopoulos, A. (2014). Stochastic control of relay channels with cooperative and strategic users. IEEE Transactions on Communications, 62(10), 3434–3446.
Wang, J., Lang, P., Zhu, J., Deng, W., & Xu, S. (2018). Application-value-awareness cross-layer mac cooperative game for vehicular networks. Vehicular Communications, 13, 27–37.
Wang, Q., Leng, S., Fu, H., & Zhang, Y. (2011a). An ieee 802.11 p-based multichannel mac scheme with channel coordination for vehicular ad hoc networks. IEEE transactions on intelligent transportation systems, 13(2), 449–458.
Wang, Q., Leng, S., Zhang, Y., & Fu, H. (2011b). A qos supported multi-channel mac for vehicular ad hoc networks. In: 2011 IEEE 73rd Vehicular Technology Conference (VTC Spring), IEEE, pp 1–5
Yagan, D., & Tham, C. K., (2007). Coordinated reinforcement learning for decentralized optimal control. In: Approximate Dynamic Programming and Reinforcement Learning 2007. ADPRL 2007. IEEE International Symposium on, IEEE, pp. 296–302
Yao, Y., Zhang, K., & Zhou, X. (2017). A flexible multi-channel coordination mac protocol for vehicular ad hoc networks. IEEE Communications Letters, 21(6), 1305–1308.
Acknowledgements
This work was supported by National Nature Science Foundation[61572229 and 6171101066]; Jilin Provincial Science and Technology Development Foundation [20170204074GX and 20180201068GX]; Jilin Provincial International Cooperation Foundation [20180414015GH]; CERNET Innovation Project [NGII20170413].
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Appendices
Appendix A
Proof
When \(t=1\) , it can be assumed that the initial channel usage state is idle, and obviously all primitive values \(R_i^1\), \(R_j^i\) of any two vehicles are independent of each other.
Suppose that when \(t=k,k \in N^+\), the conclusion is true, i.e.,
Then, when \(t=k+1\),
For any vehicle, i
So, when \(t=k+1\),
Thus,
\(\square\)
Appendix B
Proof
where \(a_i^t\) is a function of \(r_i^t\). \({\mathbb {P}}^{\varvec{\psi _{-i}}}\left( \varvec{a_{-i}^t}|r_i^{1:t},o^{1:t-1},a_i^{1:t}\right)\) can be switched into \({\mathbb {P}}^{\varvec{\psi _{-i}}} \left( \varvec{a_{-i}^t}|o^{1:t-1}\right)\) because each action \(a_i\) is conditionally independent given the historical observations \(o^{1:t-1}\)(Lemma 1).
For the second part,
\(\square\)
Appendix C
Proof
Fix \(\varvec{\phi }\),
which
Since packet values of all vehicles \(R_i^{1:t}\) are conditionally independent with the observations \(o^{1:t-1}\) (Lemma 1), we can get:
So, \(b_i^{t+1}\) can be expressed as follows:
We can further simplify this function because the delay of vehicle i in time \(t + 1\) is meaningful only when it chooses to wait at time t.Thus,
where \(\varvec{1}(\cdot )\) is the indicator function and H is independent of policy \(\psi\). \(\square\)
Appendix D
Proof
\(\square\)
Rights and permissions
About this article
Cite this article
Wang, J., Guo, X., Liu, X. et al. RPO-MAC: reciprocal Partially observable MAC protocol based on application-value-awareness in VANETs. Wireless Netw 27, 2509–2528 (2021). https://doi.org/10.1007/s11276-021-02602-5
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11276-021-02602-5