A Survey On Task Scheduling Based On Various Meta-Heuristics and Machine Learning Algorithms in Cloud Computing
A Survey On Task Scheduling Based On Various Meta-Heuristics and Machine Learning Algorithms in Cloud Computing
ISSN No:-2456-2165
Abstract:- The development of cloud computing in computing architecture that primarily consists of a group of
current decades has led to it serving as the basis for a interlinked and VMs that are provided flexibly and offered
variety of systems. It enables customers to access a list of as 1 or as greater than 1 integrated processing facilities
specified resources, act immediately and adaptably to depending on Service-Level Agreements (SLAs) formed via
customer preferences, and only be charged for actual negotiations between the service providers of clouds and
utilization. One of the most important problems in cloud customers” [2]. A large-scale dispersed processing
computing is Task Scheduling (TS). The issue is how to architecture called cloud computing is abstract, virtualized
equitably distribute and organize the user-provided and dynamically operated dependent on the monetary scale
tasks for Virtual Machine (VM) execution. Also, user of the operator. The primary function of cloud computing is
experience is directly impacted by the effectiveness of the management of computer resources, storage, multiple
scheduling efficiency. As a result, the TS issue in cloud platforms and applications that are rented out to outside
computing has to be more precisely addressed. In cloud customers over the internet [3].
computing, the TS is essential such that the optimal
scheduling of task requests may boost network Cloud computing is a quickly developing model for
efficiency. The main objective of TS is to assign tasks to processing that aims to alleviate cloud clients from the
appropriate processors to create the shortest deadline maintenance of hardware, software, networks and
achievable without compromising on priority criteria. information resources, as well as, transfer such obligations
Numerous research has been conducted to design TS to cloud service providers [4]. The essential features of
schemes based on various metaheuristic and machine cloud computing are distribution, virtualization and
learning algorithms that satisfy several criteria such as flexibility. Clouds offer a huge variety of resources, such as
minimization of the makespan, execution cost and computing platforms, data centers, storage, networks,
energy. They have demonstrated that conventional TS is firewalls, and applications delivered as services. In addition,
effective only to satisfy certain criteria and have devised it offers strategies for controlling those services ensuring
an optimum solution using multi-objectives in cloud that cloud clients may utilize them without experiencing any
computing. This paper presents a systematic and performance-related issues. The 3 categories of cloud
extensive analysis of TS algorithms in cloud computing computing services are Infrastructure as a Service (IaaS),
depending on the different optimization and machine Platform as a Service (PaaS) and Software as a Service
learning algorithms. Also, it addresses the challenges in (SaaS). Such categories are based on the degree of
those algorithms and recommends a few possible abstraction and the communication pattern of the providers
solutions for improving the utilization of cloud [5].
computing.
A. Architecture of Cloud Computing
Keywords:- Cloud computing, Task scheduling, Virtual Different types of enterprises use cloud computing
machine, Makespan, Metaheuristic, Machine learning, platforms to preserve information in the clouds therefore
Optimization. they may retrieve it anytime they need it. The 2 types of
cloud infrastructure as shown in Fig. 1 are a front end and a
I. INTRODUCTION back end, which are linked by the internet [6].
The back end is in the role of providing cloud B. Task Scheduling and Its Categories
applications with data protection. The back end is used by The cloud comprises a variety of resources, which are
the network operators. It oversees the management of every distinct from one another in terms of various resources, and
resource required to provide operations. It includes a since the expense of executing jobs in the cloud with those
security system, a huge amount of information storage, resources is distinct, therefore TS in the cloud differs from
hosts, VMs, traffic management systems, deployment conventional strategies of TS. As a result, TS in the cloud
models, etc. requires more emphasis since cloud operations rely on it. TS
is crucial for increasing the adaptability and dependability of
Indeed, individuals interact with the front end. cloud-based applications. The primary aim of allocating jobs
Programs and user interactions are required for front-end to resources in line with scheduling constraints is to
access to cloud computing. Computers, web browsers, and determine the optimal schedule in which to perform multiple
smartphones are included. The access methods for cloud jobs such that to provide the client with the optimum
storage are distinct from those for traditional storage because outcome [7].
the cloud holds a large quantity of information from a wide
range of individuals. The majority of operators implement In cloud computing, various resources, including
several access methods. The following includes a few cloud containers, firewalls, and networks, are often dynamically
computing architectural components: assigned by the order and specifications of the job and its
User infrastructure: It is regarded as a front-end subtasks. As a result, work scheduling in the cloud becomes
component. It offers a Graphical User Interface (GUI) for a flexible issue because no previously established schedule
communicating with the cloud. can be helpful while executing a job. Since the workflow is
Internet: It serves as a channel for 2 ends to interact with unpredictable, processing methods are also unpredictable,
one another. and resources are also unpredictable when several workloads
Application: The client might seek access to any are using resources concurrently, the scheduling is
application or network. unpredictable due to these factors.
Service: It offers IaaS, SaaS and PaaS.
TS in the cloud refers to selecting the optimal
Runtime cloud: It provides the VMs with an operational
resources provided for workload completion or allocating
and dynamic platform.
system resources to workloads in a way that minimizes the
Storage: It is one of the key components of cloud workload execution period. In scheduling strategies, a
computing infrastructure. It provides a lot of storage collection of workloads is formed by assigning a weight to
capacity in the cloud for handling and storing all jobs, with the significance of individual workload
information. depending on a variety of factors. After that, workloads are
Infrastructure: It provides functions at the application, selected based on their importance and given to the
host and network levels. It includes both hardware and processing systems that can meet a predetermined target
software components. function [8].
Management: It is employed to handle every component
of the back end. As well, it creates cooperation among Two major categories of TS are:
them. Fixed scheduling: It schedules workloads in a well-
Security: It executes a privacy method in the back end. known setting, i.e. it contains the data regarding the
overall arrangement of workloads, resource allocation
before processing and prediction of the workload
processing period [9].
Various algorithms have been developed over the past II. LITERATURE SURVEY
decades for TS in cloud applications. The primary purpose
of this paper is to give a comprehensive overview of TS A. Survey on Task Scheduling Based on Optimization
algorithms in cloud computing using various optimization Algorithms in Cloud Computing
and machine learning techniques. Also, a comparative An Improved Particle Swarm Optimization (IPSO)
analysis is presented to highlight the benefits and drawbacks algorithm [21] are developed to achieve the best distribution
of those algorithms in a tabular form, which supports us to for a huge amount of tasks. This was performed by
suggest possible future directions. partitioning the allocated tasks into batches dynamically.
Also, the resource usage condition was taken in the
The following sections have been prepared as follows: generation of all batches. Once obtaining a sub-optimal
Section II studies and analyzes the TS based on various result for all batches, every sub-optimal result for batches
optimization algorithms, whereas Section III studies and was added to the absolute distribution map. Moreover, the
analyzes the TS based on machine learning algorithms in loads over the absolute distribution map were balanced by
cloud systems. Section IV summarizes the complete study the IPSO.
and offers suggestions for future enhancement.
TS scheme depending on the DRL model [38] such as An improved training-enabled TS model depending on
DQN to adaptively schedule tasks with precedence the task Criticality and Collapse-Aware Scheduling (CCAS)
connection to cloud servers to reduce the task scheme [44]. In this scheme, 2 distinct strategies were
implementation period. To achieve this, the aspects of designed such as the TS strategy depending on task CCAS
servers and tasks were considered as state inputs and server and an ensemble forecast strategy such as Gradient Boosting
numbers were considered as activities. To reduce the DT (GBDT) to proactively estimate the system usage and
execution period, the negative change value of makespan task implementation status by capturing the high-level
from a particular state to the other state was described as the attributes via training the task variables. Also, a smart
incentive. Also, the task precedence connection restraint was scheduling scheme was adopted for best resource
accomplished during the state shift phase. The issue of TS of distribution.
cloud-based systems and intended to reduce the
computational cost under resource and deadline restraints An independent TS method in cloud computing based
[39]. To solve this issue, a clipped double deep Q-learning on the utilization of the Multi-Objective Artificial Bee
method was introduced based on the target network and Colony with Q-learning (MOABCQ) algorithm [45]. This
experience relay schemes, which allocates the tasks to their algorithm was used to compute the order of tasks for
corresponding VMs. appropriate resources and schedule the most suitable tasks
according to the execution time, cost and usage of resources.
Also, it was integrated with the First Come First Serve
(FCFS) and the Largest Job First (LJF) heuristic TS schemes
to achieve load balancing among VMs.
This part compares the merits and demerits of the different metaheuristic-based TS algorithms for cloud applications in
Table 1.
Table 1 addresses that many researchers focused on environments. From this viewpoint, a few researchers design
optimized TS in cloud applications using various machine learning algorithms with and without meta-
metaheuristic algorithms like PSO, GA, NSGA-II, ABC, etc. heuristics to accomplish dynamic TS in cloud systems,
Each algorithm has disadvantages regarding exploration, which are studied in Section III. Here, the merits and
exploitation and convergence abilities. To combat these demerits of those machine learning-based TS algorithms are
issues, more advanced and machine learning algorithms listed in Table 2.
must be developed to achieve TS in dynamic cloud