Mobile Edge Computing
Mobile Edge Computing
To better understand the internal logic of MEC, we first present a hierarchical archi-
tecture that vertically divides the edge computing system into three layers: the user
layer, the edge layer, and the cloud layer, as shown in Fig. 2.1. The user layer is distin-
guished by the wireless communication mode between mobile devices and wireless
infrastructures. The edge and cloud layers mainly refer to the computing resources
of the edge and cloud servers, respectively.
Devices in the user layer include sensors, smartphones, vehicles, smart meters, and
radio-frequency identification devices. These devices access edge servers via wire-
less communication and then offload computation-intensive tasks to the lightweight,
distributed edge servers to process. According to wireless network topology and
communication modes, the communication between mobile devices and a wireless
infrastructure can be split into the following three modes.
• Heterogeneous network: Next generation wireless networks will run applications
that require large demand for high data rates. One solution to help reduce the
data rate requirement is the densification of the network by deploying small cells.
Such densification results in higher spectral efficiency and can reduce the power
consumption of a mobile device due to its communication with small nearby
cell base stations. This solution significantly improves network coverage. The
concurrent operation of macro base stations (MBSs) and micro, pico, femto, and
© The Author(s) 2022 9
Y. Zhang, Mobile Edge Computing, Simula SpringerBriefs on Computing 9,
https://doi.org/10.1007/978-3-030-83944-4_2
10 2 Mobile Edge Computing
• Local execution: The entire computation task is completed locally. If the compu-
tational resources of the edge servers are unavailable or the wireless channel is of
poor quality, which can result in high transmission latency, local execution can be
preferred.
• Full offloading: The entire computation task is offloaded and processed by an edge
server.
• Partial offloading: Part of the computation task is processed locally while the rest
is offloaded to an edge server.
The computation offloading decision is very difficult, since it requires considering
multiple factors, such as application requirements, the quality of the communication
link, and the computing resource capacities of edge servers.
As the noted above, the CPU is the primary engine for computation. The CPU’s
performance is controlled by CPU cycles f m . The state-of-the-art mobile CPU archi-
tecture adopts an advanced dynamic frequency and voltage scaling technique, which
allows for stepping up or down CPU cycles, increasing and reducing energy con-
sumption, respectively. In practice, the value of f m is bounded by a maximum value,
f max , which reflects the limitation of the mobile’s computation capability. A compu-
tation task can be described as D (d, c, T ), where d denotes the data size of the
computation task, c is the required number of CPU cycles for computing one bit of
the computation task, and T denotes the maximum latency allowed to accomplish
the task. The local execution time for a computing task D can now be expressed as
dc
TL = (2.1)
fm
which indicates that more CPU cycles are required to reduce the execution latency.
2.2 Computation Model 13
Since devices are energy constrained, the energy consumption of local execution
is a critical performance metric for computing efficiency. According to [14], the
energy consumption of each CPU cycle is given by ς f m2 , where ς is the effective
switched capacitance, depending on the chip architecture. The energy consumption
for executing task D with f m CPU cycles can be derived as
E L = ς dc f m2 (2.2)
From (2.1) and (2.2), if T L is greater than the maximum latency or if the device’s
battery capacity is less than E L , the device should offload the task to edge servers to
process. Otherwise, local execution can support the computation task.
In this section, we present two computation models of the full offloading for a single-
user MEC system and a multi-user MEC system, respectively.
The single-user MEC system is the simplest case and consists of a single device
and a single edge server. Denote Fe as the computational resource capacity of the
edge server. The device offloads the entire computation task to the edge server to
process. The task computation time is thus given by
dc
t F,computing = (2.3)
Fe
Since the process of offloading involves wireless transmission, the total task execution
time is the sum of the task computation time and the task transmission time, which
can be expressed as
dc d
T F,s = + s (2.4)
Fe r
where r s is the wireless transmission data rate between the device and the edge server.
The energy consumption for completing the offloaded computation task also includes
two parts: the energy consumption for computation and the energy consumption for
wireless transmission. The total energy consumption can be expressed as
d
E F,s = ς dcFe2 + p (2.5)
rs
where p is the transmission power of the device.
In the multi-user MEC system, several devices can be associated with the same
edge server and offload their tasks to the edge server simultaneously. In this case,
each device is assigned only to a part of the edge server’s computational resources.
Denote the computation task of device i as Di (di , ci , Ti ), where di denotes the
14 2 Mobile Edge Computing
data size of the computation task on device i, ci is the required number of CPU
cycles for computing one bit of the computation task, and Ti denotes the maximum
latency allowed to accomplish the task. Let f ei be the computational resources that
the edge server allocates to device i. Since the process of offloading involves wireless
transmission, the total task execution time of device i can be expressed as
di ci di
Ti F,m = + m (2.6)
f ei ri
where rim is the wireless transmission data rate between device i and the edge server.
The corresponding energy consumption of completing the offloaded computation
task of device i can be expressed as
di
E iF,m = ς di ci ( f ei )2 + pi (2.7)
rim
Partial offloading is a very complex process that can be affected by different factors,
such as the offloadability of an application [16], the dependency of the offloadable
parts [17], and user preferences and channel connection quality [32]. To simplify
the description, we assume each computation task can be offloaded and arbitrarily
divided into two parts. One part is executed on the device and the other is offloaded
to an edge server for edge execution.
Let λ (0 ≤ λ ≤ 1) be the offloading ratio, which represents the ratio of the
offloaded task to the total task. That is, an amount λd is offloaded to the edge server
to be computed and the rest, (1 − λ) d, is computed locally. The task computation
time upon partial offloading can be expressed as
(1 − λ)dc λdc
t P,computing = + (2.8)
fm Fe
2.2 Computation Model 15
Since one part of the computation task (i.e., λd) involves wireless transmission, the
total time for completing this task can be expressed as
(1 − λ)dc λdc λd
TP = + + (2.9)
fm Fe r
The energy consumption required for completing this task consists of three parts:
λd
E P = ς (1 − λ)dc f m2 + ς λdcFe2 + p (2.10)
r
where the first term indicates the local energy consumption for processing the amount
(1 − λ) d, the second term indicates the energy consumption for processing the
amount λd on the edge server, and the third term is the energy consumption of the
wireless transmission. In partial offloading, the key problem is to decide the offload-
ing ratio, considering system constraints. For example, if the energy or computational
resources of the device are almost used up, offloading the task to the edge server is
desirable (i.e., the offloading ratio should be close to one). If the quality of the wire-
less channel is poor or the available computational resources of the edge server are
limited, local execution could be a better choice. Note that the above models can be
easily extended to the multi-user MEC system.
The key problem in edge computing is making the offloading decision. According
to the previous description, the results of the offloading decision are either local
execution, full offloading, or partial offloading. Combining local execution and full
offloading, the problem can be modeled as a zero–one binary offloading problem.
Partial offloading can be modeled as a continuous offloading decision making prob-
lem. First, we introduce the research on binary offloading in the next section.
Binary offloading mainly involves small-scale computation tasks that have high com-
putational resource requirements. Such tasks will be offloaded in entirety to the edge
server. Computing offloading can effectively reduce the task completion delay and
save the energy consumption of devices. When the device does not choose offload-
ing (i.e., local execution), the task completion delay involves only the local task
computation time. When the device chooses offloading, the task completion delay
involves three parts: (1) the wireless transmission time of the computation task from
the device to the edge server, (2) the task computation time spent on the edge server,
16 2 Mobile Edge Computing
and (3) the wireless transmission time of the computation result from the edge server
to the device. Similarly, when the device does not offload the task, the total energy
consumption required to complete the task includes only local task computation
energy consumption. If the device offloads any of the computation task, the total
energy consumption consists of two parts: the energy consumption of the wireless
transmission from the device to the edge server and the energy consumption of the
computation on the edge server.
One of the dynamic programming approaches to find the optimal radio scheduling
offloading policy is deterministic, while the other is randomized. Numerical exper-
iments showed both offline policies can reduce energy consumption compared to
offloading-only and static processing strategies. The authors in [22] further extended
the work in [23] from single user to multi-user by jointly optimizing resource allo-
cation and computation offloading to guarantee fairness between users, low energy
consumption, and average queuing/delay constraints. Another multi-user offloading
decision strategy was proposed in [24] to minimize system energy consumption. This
paper determined three multi-user types based on the time and energy cost of the task
computing process. The first type of user can compute tasks on the MEC server. The
second type of user computes the task on local equipment. The third type of user can
decide to either implement tasks locally or offload tasks to the MEC server. Based on
the user classification, a joint computation offloading and radio resource allocation
algorithm was proposed. The proposed algorithm can decrease energy consumption
by up to 15% compared to computation without offloading.
The literature cited above focused on binary offloading strategies. In a binary offload-
ing problem, the computing task is considered as a whole. However, in practical
applications, computing tasks are often divided into multiple parts [27]. According
to the divisible nature of computing tasks, devices can offload part of a task, rather
than its entirety, to the edge server. There are thus two types of tasks: (1) tasks that
can be divided into multiple discrete segments that can all be offloaded to the MEC
server for execution and (2) tasks that can be split into two consecutive parts, non-
offloadable and offloadable, and only the offloadable part can be offloaded. Next, we
introduce works focused on partial offloading.
the conditions under which local execution is optimal. Analyzing the optimality of
total offloading, the authors concluded that total offloading cannot be optimal under
dynamic voltage scaling of the device. The authors in [30] proposed a joint schedul-
ing and computation offloading algorithm for multi-component applications using
an integer programming approach. The optimal offloading decision involves which
components need to be offloaded, as well as their scheduling order. The proposed
algorithm provides a greater degree of freedom in the solution by moving away
from a compiler predetermined scheduling order for the components toward a more
wireless-aware scheduling order. For some component dependency graph structures,
the proposed algorithm can shorten execution times by the parallel processing of
appropriate components on the devices and in the cloud. To minimize the expected
energy consumption of the mobile device, an energy-efficient scheduling policy for
collaborative task execution between the mobile device and a cloud clone was pro-
posed in [31]. The authors formulated the energy-efficient task scheduling problem
as a constrained stochastic shortest path problem on a directed acyclic graph. They
also considered three alternative stochastic wireless channel models: the block fad-
ing channel, the independent and identically distributed stochastic channel, and the
Markovian stochastic channel. To solve the formulated problem, the authors lever-
aged a one-climb policy and designed a heuristic algorithm to determine the task
execution decision.
online algorithm based on Lyapunov optimization was proposed that decides the
optimal CPU frequency of the device for local execution and allocates the trans-
mission power and bandwidth when offloading the application to an edge server.
The numerical results demonstrated that computation offloading can reduce power
consumption up to roughly 90% and reduce execution delays by approximately by
98%.
A wide variety of research challenges and opportunities exists for future research
on computation offloading. However, the MEC research is still in its infancy, and
many critical factors have been overlooked for simplicity. In this section, we point
out several open challenges and shed light on possible future research directions.
• Multi-server scheduling: The collaboration of multiple MEC servers allows for
their resources to be jointly managed in serving a large number of mobile devices
simultaneously. Server cooperation not only can improve resource utilization but
also can provide mobile users with more resources to enhance user experience.
However, the increase in network size hinders practical MEC server scheduling.
Too many offloading users will cause severe inter-user communication interference
and the system will need to make large numbers of offloading decisions. More
comprehensive research is required for multi-server scheduling.
• Multi-resource optimization: The architecture of mobile edge networks involves
various resources: computing, caching, and communication resources. The effi-
cient integration of these resources to achieve optimal performance for all users
and applications is quite challenging. Efficient resource management requires the
design of distributed low-complexity resource optimization algorithms, consider-
ing radio and computational resource constraints and computation overhead.
• User mobility: User mobility is a key challenge in mobile edge networks. Since
the movement and trajectory of users provide location and personal preference
information for edge servers, the contact times between users and MEC servers is
dynamic, which will impact the offloading strategy. Moreover, the frequent mobil-
ity of users causes frequent handovers among edge servers, which will increase
computation latency and thus deteriorate user experience. Therefore, mobility
management techniques from both horizontal and vertical perspectives should be
implemented to allow users seamless access to edge servers.
• Security: Security is one of the main concerns of technology advisers in securing
MEC deployments. The deployment of edge cloud servers is creating novel security
challenges due to the exploitation of mobile device information. The growing
rate of the evolution of security solutions cannot keep up with the pace of new
security challenges. Many existing security protocols assume full connectivity,
which is not realistic in mobile edge networks, since many links are intermittent
by default. On the other hand, in MEC, user data are offloaded to an MEC server
2.4 Challenges and Future Directions 21
that gives access control to other mobile users. This introduces challenges, such
as data integrity and authorization. For example, offloaded data can be modified
or accessed by malicious users. Moreover, data owners and data servers possess
dissimilar identities and business interests that make the scenario more vulnerable.
Therefore, a comprehensive scientistic research study is required to avoid any
security issues that can damage MEC systems.
This chapter first introduced the hierarchical mobile edge computing architecture
with a cloud plane, an edge plane, and a user plane. Then, three types of computation
models were discussed in detail for the typical computation offloading problem in
MEC. In terms of the offloading decision, current research on computation offloading
was surveyed, as were the binary offloading and partial offloading problems. Finally,
several open challenges and future directions were discussed.
Open Access This chapter is licensed under the terms of the Creative Commons Attribution 4.0
International License (http://creativecommons.org/licenses/by/4.0/), which permits use, sharing,
adaptation, distribution and reproduction in any medium or format, as long as you give appropriate
credit to the original author(s) and the source, provide a link to the Creative Commons license and
indicate if changes were made.
The images or other third party material in this chapter are included in the chapter’s Creative
Commons license, unless indicated otherwise in a credit line to the material. If material is not
included in the chapter’s Creative Commons license and your intended use is not permitted by
statutory regulation or exceeds the permitted use, you will need to obtain permission directly from
the copyright holder.