Power-efficient implementation of ternary neural networks in edge devices

Molina, Manuel; Méndez, Javier; Morales Santos, Diego Pedro; Castillo Morales, María Encarnación; López Vallejo, Marisa; Pegalajar Cuéllar, Manuel

doi:10.1109/JIOT.2022.3172843

79252.pdf (505.3Kb)

Identificadores

URI: https://hdl.handle.net/10481/101050

DOI: 10.1109/JIOT.2022.3172843

Exportar

Editorial

IEEE

Fecha

2022-05-05

Referencia bibliográfica

M. Molina, J. Mendez, D. P. Morales, E. Castillo, M. L. Vallejo and M. Pegalajar, "Power-Efficient Implementation of Ternary Neural Networks in Edge Devices," in IEEE Internet of Things Journal, vol. 9, no. 20, pp. 20111-20121, 15 Oct.15, 2022, doi: 10.1109/JIOT.2022.3172843

Patrocinador

Gobierno de España, ref. PGC2018-09733, NEUROWARE; Horizonte 2020, ref. 876019; Gobierno de España, ref. MIA.2021.M04.0008

Resumen

There is a growing interest in pushing computation to the edge, especially the problem-solving abilities of artificial neural networks (ANNs). This article presents a simplified method to obtain a ternary neural network based on the multilayer perceptron. The method is focused on resource-constrained devices, where memory, computing power, and battery are some of the most relevant constraints. A dynamic threshold is estimated to perform ternarization, and a new pruning technique is proposed to obtain a drastic reduction in the ANN’s size, with the corresponding decrease in resource utilization and power consumption of the resulting hardware. In addition, a support framework has been developed to automate hardware design exploration and generation from the network trained in software. Experimental results show that the proposed method and architecture, when implemented in a field-programmable gate array (FPGA), provide excellent figures in power (0.11–0.13 W) and efficiency (1225–1448 kfps/W) with respect to state of the art, being its efficiency double than the maximum one reported previously.

Colecciones

DETC - Artículos

Excepto si se señala otra cosa, la licencia del ítem se describe como Creative Commons Attribution-NonCommercial-NoDerivs 3.0 License