University of Oulu

D. Wen, M. Bennis and K. Huang, "Accelerating Partitioned Edge Learning via Joint Parameter-and-Bandwidth Allocation," GLOBECOM 2020 - 2020 IEEE Global Communications Conference, Taipei, Taiwan, 2020, pp. 1-6, doi: 10.1109/GLOBECOM42002.2020.9347992

Accelerating partitioned edge learning via joint parameter-and-bandwidth allocation

Saved in:
Author: Wen, Dingzhu1; Bennis, Mehdi2; Huang, Kaibin1
Organizations: 1Univeristy of Hong Kong
2University of Oulu
Format: article
Version: accepted version
Access: open
Online Access: PDF Full Text (PDF, 0.1 MB)
Persistent link: http://urn.fi/urn:nbn:fi-fe202102235684
Language: English
Published: Institute of Electrical and Electronics Engineers, 2020
Publish Date: 2021-02-23
Description:

Abstract

In this paper, we consider the framework of partitioned edge learning for iteratively training a large-scale model using many resource-constrained devices (called workers). To this end, in each iteration, the model is dynamically partitioned into parametric blocks, which are downloaded to worker groups for updating using their local data. Then, the local updates are uploaded to and cascaded by the server for updating a global model. To reduce resource usage by minimizing the total learning-and-communication latency, this work focuses on the novel joint design of parameter (computation load) and bandwidth allocation (for downloading and uploading). Two design approaches are adopted. First, a practical sequential approach, called partially integrated parameter-and-bandwidth allocation (PABA), yields one scheme, namely parameter aware bandwidth allocation. It allocates the largest bandwidth to the slowest worker. Second, PABA are jointly optimized. Despite its being a nonconvex problem, an efficient and optimal solution algorithm is derived by intelligently nesting a bisection search and solving a convex problem. Experimental results using real data demonstrate that integrating PABA can substantially improve the performance of partitioned edge learning in terms of latency (by e.g., 46%) and accuracy (by e.g., 4%).

see all

Series: IEEE Global Communications Conference
ISSN: 2334-0983
ISSN-E: 2576-6813
ISSN-L: 2334-0983
ISBN: 978-1-7281-8298-8
ISBN Print: 978-1-7281-8299-5
Article number: 9347992
DOI: 10.1109/GLOBECOM42002.2020.9347992
OADOI: https://oadoi.org/10.1109/GLOBECOM42002.2020.9347992
Host publication: GLOBECOM 2020 - 2020 IEEE Global Communications Conference
Conference: IEEE Global Communications Conference
Type of Publication: A4 Article in conference proceedings
Field of Science: 213 Electronic, automation and communications engineering, electronics
Subjects:
Copyright information: © 2020 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.