Abstract
As a convenient and economic computing model, cloud computing promotes the development of intelligence. Solving the workflow scheduling is a significant topic to promote the development of the cloud computing. In this work, an Actor-Critic architecture is utilized to solve this problem achieving the task executive time minimization under the task precedence constraint. It is similar to the list-based heuristic algorithm which includes the task prioritizing phase and task allocation phase. However, the results of the two phases interact with each other. In the task prioritizing phase, given a workflow represented as the data communication time matrix and task computation time matrix, a distribution over different task permutations by the improved Pointer network can be predicted. Then, the heuristic algorithm based on the HEFT achieves the task allocation to get the task executive time. Using negative task executive time as the reward signals, the model parameters by a policy gradient method in the first phase can be optimized. The simulation experiment is done from the task executive time, and the results shows that the workflow scheduling by the deep reinforcement learning is more effective comparing with other four single objective heuristic algorithms.
Similar content being viewed by others
References
Arabnejad H, Barbosa JG (2014) List scheduling algorithm for heterogeneous systems by an optimistic cost table. IEEE Trans Parallel Distrib Syst 25(3):682–694
Asghari A, Sohrabi MK, Yaghmaee F (2020) Online scheduling of dependent tasks of cloud’s workflows to enhance resource utilization and reduce the makespan using multiple reinforcement learning-based agents. Soft Comput. https://doi.org/10.1007/s00500-020-04931-7
Barto AG, Sutton RD, Anderson CW (1970) Neuron like elements that can solve difficult learning control problems. IEEE Trans Syst Man Cybern 13(5):834–846
Bittencourt LF, Sakellariou R, Madeira ERM (2010) DAG scheduling using a lookahead variant of the heterogeneous earliest finish time algorithm. In: 2010 18th Euromicro conference on parallel, distributed and network-based processing, pp 27–34. https://doi.org/10.1109/PDP.2010.56
Bodrow W (2017) Impact of industry 4.0 in service oriented firm. Adv Manuf 5(4):394–400
Cai XJ, Hu ZM, Chen JJ (2020a) A many-objective optimization recommendation algorithm based on knowledge mining. Inf Sci 537:148–161
Cai XJ, Hu ZM, Chen JJ (2020b) A many-objective optimization recommendation algorithm based on knowledge mining. Inf Sci 537:148–161
Cai X, Geng S, Wu D, Cai J, Chen J (2020c) A multi-cloud model based many-objective intelligent algorithm for efficient task scheduling in internet of things. IEEE Internet Things J. https://doi.org/10.1109/JIOT.2020.3040019
Cui ZH, Xue F, Cai XJ, Cao Y, Wang GG, Chen JJ (2018) Detection of malicious code variants based on deep learning. IEEE Trans Ind Inform 14(7):3187–3196
Cui ZH, Zhang JJ, Wu D, Cai XJ, Wang H, Zhang WS, Chen JJ (2020) Hybrid many-objective particle swarm optimization algorithm for green coal production problem. Inf Sci 518:256–271
Cui ZH, Xu XH, Xue F, Cai XJ, Cao Y, Zhang WS, Chen JJ (2020) Personalized recommendation system based on collaborative filtering for IoT scenarios. IEEE Trans Serv Comput 13(4):685–695
Dong TT, Xue F, Xiao CB, Li JT (2020) Task scheduling based on deep reinforcement learning in a cloud manufacturing environment. Concurr Comput Pract Exp 32(11):e5654
Dzmitry B, Kyunghyun C, Yoshua B (2015) Neural machine translation by jointly learning to align and translate. In: International conference on learning representations. arXiv:1409.0473
Faragardi HR, Sedghpour MRS, Faziahmadi S, Fahringer T, Rasouli N (2020) GRP-HEFT: a budget-constrained resource provisioning scheme for workflow scheduling in IaaS clouds. IEEE Trans Parallel Distrib Syst 31(6):1239–1254
Hassan M, Rehmani MH, Chen JJ (2020a) DEAL: differentially private auction for blockchain based microgrids energy trading. IEEE Trans Serv Comput 13(2):263–275
Hassan M, Rehmani MH, Chen JJ (2020b) Differential privacy techniques for cyber physical systems: a survey. IEEE Commun Surv Tutor 22(1):746–789
Hopfield JJ, Tank DW (1985) Neural computation of decisions in optimization problems. Biol Cybern 52(3):141–152
Irwan B, Hieu P, Quoc V L, Mohammad N, Samy B (2016) Neural combinatorial optimization with reinforcement learning. arXiv:1611.09940
Juve G, Chervenak A, Deelman E, Bharathi S, Mehta G, Vahi K (2013) Characterizing and profiling scientific workflows. Future Gener Comput Syst 29(3):682–692
Karpagam M, Geetha K, Rajan C (2020) A reactive search optimization algorithm for scientific workflow scheduling using clustering techniques. J Ambient Intell Hum Comput. https://doi.org/10.1007/s12652-020-02480-3
Khan SQ, Ghani A, Khurram M (2020) Frequency-dependent synaptic plasticity model for neurocomputing applications. Int J Bioinspired Comput 16(1):56–66
Kumar H, Giri S (2020) Optimisation of makespan of a flow shop problem using multi layer neural network. Int J Comput Sci Math 11(2):107–122
Liu QM (2020) Integrated deteriorating maintenance and patient scheduling for single medical device with heuristic algorithm. Int J Bioinspired Comput 16(2):121–131
Market Share: IT Services, Worldwide (2019). https://www.gartner.com/en/documents/3983385/market-share-it-services-worldwide-2019. id: g00717813
Mnih V, Kavukcuoglu K, Silver D, Rusu AA, Veness J, Bellemare MG et al (2015) Human-level control through deep reinforcement learning. Nature 518(7540):529–533
Mohammadreza N, Afshin O, Martin T, Lawrence VS (2018) Reinforcement learning for solving the vehicle routing problem. arXiv:1802.04240
Nedjah N, Mourelle LD, Morais RG (2020) Inspiration-wise swarm intelligence meta-heuristics for continuous optimisation: a survey—part I. Int J Bioinspired Comput 15(4):207–223
Oriol V, Meire F, Navdeep J (2015) Pointer networks. In: Advances in neural information processing systems, pp 2692–2700. arXiv:1506.03134
Rajasekar P, Palanichamy Y (2020) Scheduling multiple scientific workflows using containers on IaaS cloud. J Ambient Intell Hum Comput. https://doi.org/10.1007/s12652-020-02483-0
Ronald W (1992) Simple statistical gradient following algorithms for connectionnist reinforcement learning. Mach Learn 8(3–4):229–256
Sun D, Gao S, Liu XY, Li FY, Buyya R (2020) Performance-aware deployment of streaming applications in distributed stream computing systems. Int J Bioinspired Comput 15(1):52–62
Sutton RS, Barto AG (1998) Reinforcement learning: an introduction. MIT Press, Cambridge
Tong Z, Chen HJ, Deng XM, Li KL, Li KQ (2019) A scheduling scheme in the cloud computing environment using deep Q-learning. Inf Sci 512:1170–1191
Tong Z, Deng XM, Chen HJ, Mei J, Liu H (2020) QL-HEFT: a novel machine learning scheduling scheme base on cloud computing environment. Neural Comput Appl 32(10):5553–5570
Topcuoglu H, Hariri S, Wu MY (2002) Performance-effective and low-complexity task scheduling for heterogeneous computing. IEEE Trans Parallel Distrib Syst 13(3):260–274
Wang PH, Huang JR, Cui ZH, Xie LP, Chen JJ (2020) A Gaussian error correction multi-objective positioning model with NSGA-II. Concurr Comput Pract Exp 32(5):e5464
Watkins CJCH, Dayan P (1992) Q-learning. Mach Learn 8(3–4):279–292
Yuan D, Yang Y, Liu X, Chen JJ (2010) A data placement strategy in scientific cloud workflows. Future Gener Comput Syst 26(8):1200–1214
Zhang X, Li XT, Yin MH (2020) An enhanced genetic algorithm for the distributed assembly permutation flowshop scheduling problem. Int J Bioinspired Comput 15(2):113–124
Acknowledgements
This paper is supported by Humanity and Social Science Research of Ministry of Education (20YJCZH200), Beijing Intelligent Logistics System Collaborative Innovation Center Open Topic (No. BILSCIC-2019KF- 05), Grass-roots Academic Team Building Project of Beijing Wuzi University (No. 2019XJJCTD04).
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Dong, T., Xue, F., Xiao, C. et al. Workflow scheduling based on deep reinforcement learning in the cloud environment. J Ambient Intell Human Comput 12, 10823–10835 (2021). https://doi.org/10.1007/s12652-020-02884-1
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s12652-020-02884-1