Energy-Efficient
Computing
~Low-power
Architecture and Systems/SoCs~
Our research in
power/energy-efficient computing systems is
critical for addressing the growing demand for
more powerful and sustainable technology. As our
reliance on computing devices increases, so does
the need to manage their energy consumption.
Efficient computing systems can reduce energy
costs, minimize environmental impact, and extend
the battery life of portable devices. Moreover,
in large-scale data centers, energy efficiency
translates to significant savings and reduced
carbon footprints. Our research in this area
drives innovation in hardware and software
design, leading to smarter, greener technologies
that benefit both users and the planet.
Scaling
Deep-Learning Pneumonia Detection Inference on a
Reconfigurable Platform
Jangkun
Wang, Khanh N. Dang and Abderazek Ben Abdallah,
“Scaling Deep-Learning Pneumonia Detection
Inference on a Reconfigurable Self-Contained
Hardware Platform”, 2023 IEEE 6th International
Conference on Electronics Technology (ICET), May
12-15, 2023.
Artificial Intelligence (AI) has
been used in applications to alleviate specific
problems in academia and industry. For instance,
in healthcare, where edge-based computing
platforms are heavily used, when it comes to
latency and security issues, the increased demands
of application of AI applications such as deep
learning require a specific platform to meet the
latency, security, and power consumption
challenges. This work presents methods and
architectures for scaling deep learning inference
for pneumonia detection in chest X-ray images
based on a reconfigurable self-contained hardware
platform named AIRBiS 1. The performance
evaluation results show that the proposed approach
achieves 95.2% detection accuracy of pneumonia
over the collected test data with the
computer-aided diagnosis scenario. The secure
collaborative-learning approach achieves
comparable accuracy to the conventional training
scenario. However, for rapid batch detection, the
detection could be accelerated by 0.023s.
Moreover, the system inference acceleration is 13
times (on average) more energy-efficient than
conventional approaches.
|
|
- Wang, Jiangkun, Ogbodo Mark
Ikechukwu, Khanh N. Dang, and Abderazek Ben
Abdallah. 2022. "Spike-Event X-ray Image
Classification for 3D-NoC-Based Neuromorphic
Pneumonia Detection" Electronics 11, no. 24:
4157.
https://doi.org/10.3390/electronics11244157
The
success of deep learning in extending the
frontiers of artificial intelligence has
accelerated the application of AI-enabled systems
in addressing various challenges in different
fields. In healthcare, deep learning is deployed
on edge computing platforms to address security
and latency challenges, even though these
platforms are often resource-constrained. Deep
learning systems are based on conventional
artificial neural networks, which are
computationally complex, require high power, and
have low energy efficiency, making them unsuitable
for edge computing platforms. Since these systems
are also used in critical applications such as
bio-medicine, it is expedient that their
reliability is considered when designing them. For
biomedical applications, the spatio-temporal
nature of information processing of spiking neural
networks could be merged with a fault-tolerant
3-dimensional network on chip (3D-NoC) hardware to
obtain an excellent multi-objective performance
accuracy while maintaining low latency and low
power consumption. In this work, we propose a
reconfigurable 3D-NoC-based neuromorphic system
for biomedical applications based on a
fault-tolerant spike routing scheme. The
performance evaluation results over X-ray images
for pneumonia (i.e., COVID-19) detection show that
the proposed system achieves 88.43% detection
accuracy over the collected test data and could be
accelerated to achieve 4.6% better inference
latency than the ANN-based system while consuming
32% less power. Furthermore, the proposed system
maintains high accuracy for up to 30% inter-neuron
communication faults with increased latency
- Patent: Abderazek Ben Abdallah,
Huankun Huang, Nam Khanh Dang, Jiangning Song, "AIプ
ロセッサ," 特願2020-194733 (2020 年11月24日)
|
Embedded Multicore SoC
Architecture and Design for Real-time ECG Processing
Recent
technological advances in wireless networking,
microelectronics and the Internet allow us to
fundamentally change the way elderly health care
services are practiced. Traditionally, embedded personal
medical monitoring systems have been used only to
collect data. Data processing and analysis are performed
off-line, making such devices impractical for continual
monitoring and early detection of medical disorders. The
goal of this project is to research about efficient
novel in-body snart embedded system to effectively
monitor elderly health status remotely. In particular,
we investigate an extreme area in the design space of
networked embedded objects: the domain of low energy,
and real-time. Issues related to the design,
implementation and deployment of such systems are also
studied.
|
Low-Power
Queue Processor Architecture and Design (1999-2008)
This project
focuses on the research about a novel low power and high
performance parallel processor processor based on Queue
computation model, where Queue programs are generated by
traversing a given data flow graph using level order
traversal. The Queue processor uses a circular
queue-register to manipulatelates operands
and results, and exploits parallelism dynamically with
"little efforts" when compared with conventional
architectures. The nonexistence of false dependencies
allows programs to expose maximum parallelism that the
queue processor can execute without complex and
power-hungry hardware such as register renaming and large
instruction windows. Parallel processing allows queue
processors to speed-up the execution of applications. We
are researching and developing a complete tool-chain for
this promising computing model consisting of: compiler,
assembler, functional and cycle accurate simulator, and
hardware design.
- A. Ben
Abdallah, A. Canedo, T. Yoshinga, and M. Sowa, The
QC-2 Parallel Queue Processor Architecture, Journal of
Parallel and Distributed Computing, Vol. 68, No. 2,
pp. 235-245, 2008.
- A. Ben
Abdallah, M. Masuda, A. Canedo, K. Kuroda,Natural
Instruction Level Parallelism-aware Compiler for
High-Performance Processor Architecture, The Journal
of supercomputing, Volume 57, Number 3, pp. 314-338,
Sept. 2011.
- A. Canedo,
A. Ben Abdallah, and M. Sowa, ''Efficient Compilation
for Queue Size Constrained Queue Processors, The
Journal of Parallel Computing, Vol.35, pp. 213-225,
2009.
- A. Ben
Abdallah, QueueCore Instruction Set
Architecture, Technical Report,
Parallel/Distributed Systems Laboratory, Graduate
School of Information Systems, 2003.
- A. Ben Abdallah, QC-1 Processing Stages
Algorithms, Technical Report,
Parallel/Distributed Systems Laboratory, Graduate
School of Information Systems, 2003.
|