Concepedia

Publication | Closed Access

Online Learning Aided Decentralized Multi-User Task Offloading for Mobile Edge Computing

25

Citations

31

References

2023

Year

Abstract

Mobile edge computing facilitates users to offload computation tasks to edge servers for meeting their stringent delay requirements. Previous works mainly explore task offloading when system-side information is given (e.g., server processing speed, cellular data rate), or centralized offloading under system uncertainty. But both generally fall short of handling task placement involving many coexisting users in an uncertain environment. In this paper, we develop a <i>multi-user</i> offloading framework considering <i>unknown yet stochastic</i> system-side information to enable a <i>decentralized user-initiated</i> service placement under overlapping server coverage. Specifically, we formulate the dynamic task placement as an online multi-user multi-armed bandit process, and propose a decentralized epoch based offloading (DEBO) to optimize user rewards which are subjected under network delay. We consider both cases without and with neighboring edge feedback once users’ tasks are processed, where the latter incorporates system-side information sharing among edge servers for an enhanced task placement. For both cases, we show that DEBO can gradually deduce the optimal user-server assignment during dynamic offloading, thereby achieving a <i>close-to-optimal</i> service performance and <i>tight</i> <inline-formula><tex-math notation="LaTeX">$O(\log _{2}\!\!T)$</tex-math></inline-formula> regret. Moreover, we generalize DEBO to various common scenarios such as unknown reward gap, dynamic entering or leaving of clients, and fair reward distribution, while further exploring when users’ offloaded tasks require <i>heterogeneous</i> computing resources. Particularly, we accomplish a sub-linear regret for each of these instances. Real measurements based evaluations corroborate the superiority of our offloading schemes over state-of-the-art approaches in optimizing delay-sensitive rewards.

References

YearCitations

Page 1