Design of lightweight neural networks for resource-constrained devices
DOI:
https://doi.org/10.37868/sei.v7i2.id542Abstract
Practical realization of modern artificial intelligence systems, especially deep neural networks, on edge platforms presents a daunting challenge. The root cause lies in the critical gap between the computational requirements of these models and the drastically limited capabilities of edge platforms in terms of processing power, memory, storage, and energy consumption. This constraint often requires applications to rely on cloud processing, which presents essential problems in terms of added latency, privacy, and persistent internet connectivity. To overcome this problem, this study presents an architecture for designing and deploying compact neural networks. The methodology begins with the choice of the MobileNet architecture as an initial reference, then adopts advanced model compression schemes, i.e., pruning to eliminate redundant neural connections, and quantization to reduce the numerical precision of weights, substantially contributing to model size reduction and computational requirements. The optimized models were then implemented and evaluated on Raspberry Pi and Arduino Nano boards to test their usability in practical situations. Experimental results clearly demonstrate that optimized models realized an energy consumption reduction of 40% and a latency reduction of 43.75%, while retaining an impressive level of performance in terms of an accuracy loss of less than 3%. This research provides evidence in support of bridging the gap between complex AI and resource-limited hardware, thereby enabling the realization of real-time, compact, and secure on-device intelligent applications.
Published
How to Cite
Issue
Section
Copyright (c) 2025 Zainab Khudhur Mohsin, Rafeef Fauzi Najim Alshammari, Elham Mohammed Thabit A. Alsaadi

This work is licensed under a Creative Commons Attribution 4.0 International License.
Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under a Creative Commons Attribution License that allows others to share the work with an acknowledgement of the work's authorship and initial publication in this journal.
Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgement of its initial publication in this journal.





