Publication | Open Access
Multi-Objective Workflow Scheduling With Deep-Q-Network-Based Multi-Agent Reinforcement Learning
277 Citations · 31 References · Year: 2019
Cloud computing enables large-scale workflow execution, yet optimal scheduling under multiple conflicting objectives remains inadequately addressed: existing methods rely on expert-dependent encodings that limit scheduling performance. This study applies a deep Q-network within a multi-agent reinforcement learning framework to schedule multiple workflows on IaaS clouds. Scheduling is formulated as a Markov game whose state encodes the number of workflow applications and the heterogeneous VMs, and whose rewards combine makespan and cost; the game converges to a correlated-equilibrium policy in a dynamic real-time environment without prior expert knowledge. Validated on well-known scientific workflow templates and Amazon EC2, the proposed approach outperforms traditional multi-objective scheduling algorithms such as NSGA-II, MOPSO, and game-theoretic greedy methods in the optimality of the generated plans.
Cloud computing provides an effective platform for executing large-scale and complex workflow applications with a pay-as-you-go model. Nevertheless, various challenges, especially optimal scheduling for multiple conflicting objectives, are yet to be addressed properly. Existing multi-objective workflow scheduling approaches are still limited in many ways, e.g., their encoding is restricted by prior expert knowledge when handling a dynamic real-time problem, which strongly influences scheduling performance. In this paper, we apply a deep-Q-network model in a multi-agent reinforcement learning setting to guide the scheduling of multiple workflows over infrastructure-as-a-service clouds. To optimize multi-workflow completion time and user cost, we consider a Markov game model, which takes the number of workflow applications and heterogeneous virtual machines as state input and the maximum completion time and cost as rewards. The game model is capable of seeking a correlated equilibrium between the makespan and cost criteria without prior expert knowledge and converges to the correlated-equilibrium policy in a dynamic real-time environment. To validate our proposed approach, we conduct extensive case studies based on multiple well-known scientific workflow templates and the Amazon EC2 cloud. The experimental results clearly suggest that our proposed approach outperforms traditional ones, e.g., non-dominated sorting genetic algorithm-II (NSGA-II), multi-objective particle swarm optimization (MOPSO), and game-theoretic greedy algorithms, in terms of the optimality of the generated scheduling plans.
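The abstract's core idea, learning a Q-function that assigns workflow tasks to heterogeneous VMs while trading off makespan against cost through a weighted reward, can be illustrated with a minimal sketch. This is not the authors' implementation: the paper uses a deep Q-network inside a multi-agent Markov game seeking a correlated equilibrium, whereas the sketch below simplifies to a single agent with a linear Q-approximation, and the VM speeds, prices, and objective weights are hypothetical illustration values.

```python
# Minimal single-agent sketch of makespan/cost-aware task-to-VM scheduling
# via Q-learning with a linear function approximator. All VM parameters,
# weights, and hyperparameters are hypothetical, not from the paper.
import random
import numpy as np

random.seed(0)
np.random.seed(0)

VM_SPEED = np.array([1.0, 2.0, 4.0])   # hypothetical relative VM speeds
VM_PRICE = np.array([0.1, 0.25, 0.6])  # hypothetical price per time unit
N_VMS = len(VM_SPEED)
ALPHA_TIME, ALPHA_COST = 0.5, 0.5      # weights on the two objectives

def features(vm_ready, task_len, action):
    """State-action features: normalized VM ready times, task length,
    one-hot action, and a bias term."""
    one_hot = np.eye(N_VMS)[action]
    return np.concatenate([vm_ready / 10.0, [task_len / 5.0], one_hot, [1.0]])

def step(vm_ready, task_len, action):
    """Assign a task to a VM; reward penalizes added makespan and cost."""
    exec_time = task_len / VM_SPEED[action]
    cost = exec_time * VM_PRICE[action]
    new_ready = vm_ready.copy()
    new_ready[action] += exec_time
    delta_makespan = new_ready.max() - vm_ready.max()
    reward = -(ALPHA_TIME * delta_makespan + ALPHA_COST * cost)
    return new_ready, reward

def train(episodes=300, tasks_per_wf=8, lr=0.01, gamma=0.9, eps=0.2):
    """Epsilon-greedy TD(0) updates on the linear Q weights."""
    w = np.zeros(features(np.zeros(N_VMS), 0.0, 0).size)
    for _ in range(episodes):
        vm_ready = np.zeros(N_VMS)
        tasks = [random.uniform(1, 5) for _ in range(tasks_per_wf)]
        for i, t in enumerate(tasks):
            qs = [w @ features(vm_ready, t, a) for a in range(N_VMS)]
            a = random.randrange(N_VMS) if random.random() < eps else int(np.argmax(qs))
            nxt, r = step(vm_ready, t, a)
            nq = max(w @ features(nxt, tasks[i + 1], b) for b in range(N_VMS)) \
                if i + 1 < len(tasks) else 0.0
            td = r + gamma * nq - qs[a]
            w += lr * td * features(vm_ready, t, a)
            vm_ready = nxt
    return w

def schedule(w, tasks):
    """Greedy rollout of the learned policy; returns (makespan, total cost)."""
    vm_ready = np.zeros(N_VMS)
    total_cost = 0.0
    for t in tasks:
        a = int(np.argmax([w @ features(vm_ready, t, b) for b in range(N_VMS)]))
        exec_time = t / VM_SPEED[a]
        total_cost += exec_time * VM_PRICE[a]
        vm_ready[a] += exec_time
    return vm_ready.max(), total_cost

w = train()
makespan, cost = schedule(w, [3.0, 2.0, 4.0, 1.5])
print(f"makespan={makespan:.2f}, cost={cost:.2f}")
```

The paper replaces the linear approximator with a deep Q-network and runs one such learner per agent in a Markov game, with the state additionally encoding the number of workflow applications; the weighted-reward structure sketched here mirrors the makespan/cost trade-off the abstract describes.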
| Year | Citations |
|---|---|
| 2002 | 46.1K |
| 2004 | 4.3K |
| 2016 | 1.1K |
| 2014 | 444 |
| 2016 | 350 |
| 2018 | 349 |
| 2014 | 273 |
| 2017 | 253 |
| 2018 | 251 |
| 2018 | 206 |