BottleNet++: An End-to-End Approach for Feature Compression in Device-Edge Co-Inference Systems (2019-10-31T00:00:00.000000Z)

TL;DR

An end-to-end architecture that consists of an encoder, a non-trainable channel layer, and a decoder for more efficient feature compression and transmission, which achieves a much higher compression ratio than existing methods.

Abstract

The emergence of various intelligent mobile applications demands the deployment of powerful deep learning models at resource-constrained mobile devices. The device-edge co-inference framework provides a promising solution by splitting a neural network at a mobile device and an edge computing server. In order to balance the on-device computation and the communication overhead, the splitting point needs to be carefully picked, while the intermediate feature needs to be compressed before transmission. Existing studies decoupled the design of model splitting, feature compression, and communication, which may lead to excessive resource consumption of the mobile device. In this paper, we introduce an end-to-end architecture, named BottleNet++, that consists of an encoder, a non-trainable channel layer, and a decoder for more efficient feature compression and transmission. The encoder and decoder essentially implement joint source-channel coding via lightweight convolutional neural networks (CNNs), while explicitly considering the effect of channel noise. By exploiting the strong sparsity and the fault-tolerant property of the intermediate feature in deep neural network (DNNs), BottleNet++ achieves a much higher compression ratio than existing methods. Compared with merely transmitting intermediate data without feature compression, BottleNet++ achieves up to 64× bandwidth reduction over the additive white Gaussian noise channel and up to 256× bit compression ratio in the binary erasure channel, with less than 2% reduction in accuracy of classification.

Authors

Jiawei Shao

3 papers

Jun Zhang

1 papers

BottleNet++: An End-to-End Approach for Feature Compression in Device-Edge Co-Inference Systems

TL;DR

Abstract

Authors

References26 items

Distilled Split Deep Neural Networks for Edge-Assisted Real-Time Systems

Improving Device-Edge Cooperative Inference of Deep Learning via 2-Step Pruning

BottleNet: A Deep Learning Architecture for Intelligent Mobile Cloud Computing Services

JALAD: Joint Accuracy-And Latency-Aware Deep Structure Decoupling for Edge-Cloud Execution

Neural Joint Source-Channel Coding

Deep Joint Source-channel Coding for Wireless Image Transmission

Near-Lossless Deep Feature Compression for Collaborative Intelligence

Deep Learning for Joint Source-Channel Coding of Text

Deep Feature Compression for Collaborative Object Detection

Edge-Host Partitioning of Deep Neural Networks with Feature Space Encoding for Resource-Constrained Internet-of-Things Platforms

JointDNN: An Efficient Training and Inference Engine for Intelligent Mobile Cloud Computing Services

Model Compression and Acceleration for Deep Neural Networks: The Principles, Progress, and Challenges

Neurosurgeon: Collaborative Intelligence Between the Cloud and Mobile Edge

Densely Connected Convolutional Networks

Scalable Distributed Computing Hierarchy: Cloud, Fog and Dew Computing

Deep Residual Learning for Image Recognition

Internet of Things: A Survey on Enabling Technologies, Protocols, and Applications

Very Deep Convolutional Networks for Large-Scale Image Recognition

ImageNet classification with deep convolutional neural networks

Joint Source-Channel Coding for Video Communications

Joint source/channel coding for wireless channels

Determining and improving the fault tolerance of multilayer perceptrons in a pattern-recognition application

Region-Based Convolutional Networks for Accurate Object Detection and Segmentation

Learning Multiple Layers of Features from Tiny Images

A Mathematical Theory of Communication

LDPC Codes: An Introduction

Field of Study

Journal Information

Name

Page