Please use this identifier to cite or link to this item: https://scholarhub.balamand.edu.lb/handle/uob/7052
DC FieldValueLanguage
dc.contributor.authorakkad, Ghattasen_US
dc.contributor.authorMansour, Alien_US
dc.contributor.authorInaty, Elieen_US
dc.date.accessioned2023-10-02T07:20:00Z-
dc.date.available2023-10-02T07:20:00Z-
dc.date.issued2024-05-11-
dc.identifier.urihttps://scholarhub.balamand.edu.lb/handle/uob/7052-
dc.description.abstractThe exponential increase in generated data as well as the advances in high-performance computing has paved the way for the use of complex machine learning methods. Indeed, the availability of Graphical Processing Units (GPU) and Tensor Processing Units (TPU) have made it possible to train and prototype Deep Neural Networks (DNN) on large-scale data sets and for a variety of applications, i.e., vision, robotics, biomedical, etc. The popularity of these DNNs originates from their efficacy and state-of-the-art inference accuracy. However, this is obtained at the cost of a considerably high computational complexity. Such drawbacks rendered their implementation on limited resources, edge devices, without a major loss in inference speed and accuracy, a dire and challenging task. To this extent, it has become extremely important to design innovative architectures and dedicated accelerators to deploy these DNNs to embedded and re-configurable processors in a high-performance low complexity structure. In this study, we present a survey on recent advances in deep learning accelerators (DLA) for heterogeneous systems and Reduced Instruction Set Computer (RISC-V) processors given their open-source nature, accessibility, customizability and universality. After reading this article, the readers should have a comprehensive overview of the recent progress in this domain, cutting edge knowledge of recent embedded machine learning trends and substantial insights for future research directions and challenges.en_US
dc.language.isoengen_US
dc.subjectComputer architectureen_US
dc.subjectConvolutional Neural Network (CNN)en_US
dc.subjectEmbedded machine learningen_US
dc.subjectField programmable gate arraysen_US
dc.subjectHardware acceleratorsen_US
dc.subjectPipelinesen_US
dc.subjectReduced instruction set computingen_US
dc.subjectRegistersen_US
dc.subjectRISC-Ven_US
dc.subjectRocketsen_US
dc.subjectSurveysen_US
dc.subjectTransformersen_US
dc.titleEmbedded Deep Learning Accelerators: A Survey on Recent Advancesen_US
dc.typeJournal Articleen_US
dc.identifier.doi10.1109/TAI.2023.3311776-
dc.identifier.scopus2-s2.0-85171589455-
dc.identifier.urlhttps://api.elsevier.com/content/abstract/scopus_id/85171589455-
dc.contributor.affiliationDepartment of Computer Engineeringen_US
dc.description.startpage1en_US
dc.description.endpage19en_US
dc.date.catalogued2023-10-02-
dc.description.statusPublisheden_US
dc.relation.ispartoftextIEEE Transactions on Artificial Intelligenceen_US
Appears in Collections:Department of Computer Engineering
Show simple item record

SCOPUSTM   
Citations

3
checked on Nov 16, 2024

Record view(s)

171
checked on Nov 21, 2024

Google ScholarTM

Check

Altmetric

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.