Journal Article10.1145/3603703
Reinforcement Learning Methods for Computing Offloading: A Systematic Review
Mohammad Hossein Rezvani
43
TL;DR: In this paper , the authors provide a systematic review of the widely used reinforcement learning (RL) approaches in computation offloading, including both binary offloading and partial offloading techniques.
read more
Abstract: Today, cloud computation offloading may not be an appropriate solution for delay-sensitive applications due to the long distance between end-devices and remote datacenters. In addition, offloading to a remote cloud can consume bandwidth and dramatically increase costs. On the other hand, end-devices such as sensors, cameras, and smartphones have limited computing and storage capacity. Processing tasks on such battery-powered and energy-constrained devices becomes even more complex. To address these challenges, a new paradigm called Edge Computing (EC) emerged nearly a decade ago to bring computing resources closer to end-devices. Here, edge servers located between the end-device and the remote cloud perform user tasks. Recently, several new computing paradigms such as Mobile Edge Computing (MEC) and Fog Computing (FC) have emerged to complement Cloud Computing (CC) and EC. Although these paradigms are heterogeneous, they can further reduce energy consumption and task response time, especially for delay-sensitive applications. The computation offloading is a multi-objective, NP-hard optimization problem. A significant part of previous research in this field is devoted to Machine Learning (ML) methods. One of the essential types of ML is Reinforcement Learning (RL), in which an agent learns how to make the best decision using the experiences gained from the environment. This paper provides a systematic review of the widely used RL approaches in computation offloading. It covers research in complementary paradigms such as mobile cloud computing (MCC), edge computing, fog computing, and the Internet of Things (IoT). We explain the reasons for using various RL methods in computation offloading from a technical point of view. This analysis includes both binary offloading and partial offloading techniques. For each method, the essential elements of RL and the characteristics of the environment are discussed regarding the most important criteria. Research challenges and Future trends are also mentioned.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
Machine learning-based computation offloading in edge and fog: a systematic review
Sanaz Taheri-abed,Amir Masoud Eftekhari Moghadam,Mohammad Rezvani +2 more
TL;DR: This paper reviews machine learning-based computation offloading mechanisms in the edge and fog environment and discusses various performance metrics, tools, and case studies and analyzes their advantages and disadvantages.
20
Trajectory-Aware Offloading Decision in UAV-Aided Edge Computing: A Comprehensive Survey
Tanmay Baidya,Ahmadun Nabi,Sangman Moh +2 more
- 13 Mar 2024
TL;DR: Trajectory-aware offloading decision in UAV-aided edge computing optimizes overall system performance by considering the trajectory of the UAVs.
8
Machine learning methods for service placement: a systematic review
Parviz Keshavarz Haddadha,Mohammad Rezvani,Mahdi MollaMotalebi,Achyut Shankar +3 more
TL;DR: This systematic review shows that one of the most important trends in ML is reinforcement learning, with a 56% share of research, and the findings show that 96% of applications use a distributed microservice architecture.
8
Proximal Policy Optimization for Efficient D2D-Assisted Computation Offloading and Resource Allocation in Multi-Access Edge Computing
Chen Zhang,Celimuge Wu,Min Lin,Yangfei Lin,William Liu +4 more
TL;DR: This work builds a realistic MEC network environment and develops a Markov decision process (MDP) model that minimizes time loss and energy consumption, and proposes a proximal policy optimization (PPO) based Device-to-Device (D2D)-assisted computation offloading and resource allocation scheme.
7
Multi-objective task offloading for highly dynamic heterogeneous Vehicular Edge Computing: An efficient reinforcement learning approach
Zhidong Huang,Xiaofei Wu,Shoubin Dong +2 more
5
References
•Book
Reinforcement Learning: An Introduction
Richard S. Sutton,Andrew G. Barto +1 more
- 01 Jan 1988
TL;DR: This book provides a clear and simple account of the key ideas and algorithms of reinforcement learning, which ranges from the history of the field's intellectual foundations to the most recent developments and applications.
Human-level control through deep reinforcement learning
Volodymyr Mnih,Koray Kavukcuoglu,David Silver,Andrei Rusu,Joel Veness,Marc G. Bellemare,Alex Graves,Martin Riedmiller,Andreas K. Fidjeland,Georg Ostrovski,Stig Petersen,Charles Beattie,Amir Sadik,Ioannis Antonoglou,Helen King,Dharshan Kumaran,Daan Wierstra,Shane Legg,Demis Hassabis +18 more
TL;DR: This work bridges the divide between high-dimensional sensory inputs and actions, resulting in the first artificial agent that is capable of learning to excel at a diverse array of challenging tasks.
Mastering the game of Go with deep neural networks and tree search
David Silver,Aja Huang,Chris J. Maddison,Arthur Guez,Laurent Sifre,George van den Driessche,Julian Schrittwieser,Ioannis Antonoglou,Veda Panneershelvam,Marc Lanctot,Sander Dieleman,Dominik Grewe,John Nham,Nal Kalchbrenner,Ilya Sutskever,Timothy P. Lillicrap,Madeleine Leach,Koray Kavukcuoglu,Thore Graepel,Demis Hassabis +19 more
TL;DR: Using this search algorithm, the program AlphaGo achieved a 99.8% winning rate against other Go programs, and defeated the human European Go champion by 5 games to 0.5, the first time that a computer program has defeated a human professional player in the full-sized game of Go.
•Posted Content
Proximal Policy Optimization Algorithms
TL;DR: A new family of policy gradient methods for reinforcement learning, which alternate between sampling data through interaction with the environment, and optimizing a "surrogate" objective function using stochastic gradient ascent, are proposed.
18K
The NIST Definition of Cloud Computing
Peter Mell,Timothy Grance +1 more
- 28 Sep 2011
TL;DR: This cloud model promotes availability and is composed of five essential characteristics, three service models, and four deployment models.
17.6K