Model Optimization Techniques for Embedded Artificial Intelligence

doi:10.1109/CDS52072.2021.00008

Proceedings Article•10.1109/CDS52072.2021.00008

- 01 Jan 2021

3

TL;DR: The authors compare and discuss state-of-the-art methods within three categories of model optimization techniques, to guide software and hardware developers in selecting the best method for their objective.

Citations

•Journal Article•10.3390/bdcc7010044

Disclosing Edge Intelligence: A Systematic Meta-Survey

Vincenzo Barbuto, +3 more

- 02 Mar 2023

- Big data and cognitive computing

TL;DR: The authors survey the broad landscape of edge intelligence through a systematic analysis of state-of-the-art manuscripts, in the form of a tertiary study.

38

•Journal Article•10.1109/eei63073.2024.10696459

Deep Learning Based Herbal Plant Recognition

Jiang Li, +3 more

- 28 Jun 2024

TL;DR: This study proposes a feature fusion focal pyramid network (PFN) for herbal plant recognition, achieving 96.91% accuracy on a new dataset (CHMP-50) with 50 classes and outperforming other deep learning algorithms in accuracy, recall, and F1 score.

•Proceedings Article•10.1109/mvip62238.2024.10491145

Convergence of Deep Learning and Edge Computing using Model Optimization

Peyman Babaei

- 06 Mar 2024

TL;DR: This paper investigates whether a typical convolutional neural network can be deployed on edge systems with limited computing resources and memory by applying quantization, weight pruning, and weight clustering. It shows that a collaborative algorithm can produce a model small enough to deploy even on microcontrollers.

References

•Proceedings Article•10.1109/CVPR.2017.243

Densely Connected Convolutional Networks

Gao Huang, +3 more

- 21 Jul 2017

TL;DR: DenseNet connects each layer to every other layer in a feed-forward fashion, which alleviates the vanishing-gradient problem, strengthens feature propagation, encourages feature reuse, and substantially reduces the number of parameters.

46.1K

•Proceedings Article•10.1109/CVPR.2018.00474

MobileNetV2: Inverted Residuals and Linear Bottlenecks

Mark Sandler, +4 more

- 18 Jun 2018

TL;DR: MobileNetV2 is based on an inverted residual structure in which the shortcut connections run between the thin bottleneck layers, and the intermediate expansion layer uses lightweight depthwise convolutions to filter features as a source of non-linearity.

19.4K

•Proceedings Article•10.1109/CVPR.2017.634

Aggregated Residual Transformations for Deep Neural Networks

Saining Xie, +4 more

- 21 Jul 2017

TL;DR: ResNeXt is a simple, highly modularized network architecture for image classification, constructed by repeating a building block that aggregates a set of transformations with the same topology.

11.2K

•Proceedings Article

Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding

Song Han, +3 more

- 15 Feb 2016

TL;DR: Deep Compression is a three-stage pipeline of pruning, trained quantization, and Huffman coding that reduces the storage requirement of neural networks by 35x to 49x without affecting their accuracy.

8.5K

•Book Chapter•10.1007/978-3-030-01264-9_8

ShuffleNet V2: Practical Guidelines for Efficient CNN Architecture Design

Ningning Ma, +3 more

- 08 Sep 2018

TL;DR: ShuffleNet V2 argues for evaluating the direct metric on the target platform rather than considering only FLOPs and, based on a series of controlled experiments, derives several practical guidelines for efficient network design.

6.6K