Skip to main content

Showing 1–22 of 22 results for author: Masera, G

  1. arXiv:2406.14263  [pdf, other

    cs.AR

    Scalable and RISC-V Programmable Near-Memory Computing Architectures for Edge Nodes

    Authors: Michele Caon, Clément Choné, Pasquale Davide Schiavone, Alexandre Levisse, Guido Masera, Maurizio Martina, David Atienza

    Abstract: The widespread adoption of data-centric algorithms, particularly Artificial Intelligence (AI) and Machine Learning (ML), has exposed the limitations of centralized processing infrastructures, driving a shift towards edge computing. This necessitates stringent constraints on energy efficiency, which traditional von Neumann architectures struggle to meet. The Compute-In-Memory (CIM) paradigm has eme… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

    Comments: 14 pages, 12 figures, submitted to IEEE Transactions on Emerging Topics in Computing

  2. arXiv:2402.09780  [pdf, other

    cs.LG

    TinyCL: An Efficient Hardware Architecture for Continual Learning on Autonomous Systems

    Authors: Eugenio Ressa, Alberto Marchisio, Maurizio Martina, Guido Masera, Muhammad Shafique

    Abstract: The Continuous Learning (CL) paradigm consists of continuously evolving the parameters of the Deep Neural Network (DNN) model to progressively learn to perform new tasks without reducing the performance on previous tasks, i.e., avoiding the so-called catastrophic forgetting. However, the DNN parameter update in CL-based autonomous systems is extremely resource-hungry. The existing DNN accelerators… ▽ More

    Submitted 15 February, 2024; originally announced February 2024.

  3. Custom Memory Design for Logic-in-Memory: Drawbacks and Improvements over Conventional Memories

    Authors: Fabrizio Ottati, Giovanna Turvani, Marco Vacca, Guido Masera

    Abstract: The speed of modern digital systems is severely limited by memory latency (the ``Memory Wall'' problem). Data exchange between Logic and Memory is also responsible for a large part of the system energy consumption. Logic--In--Memory (LiM) represents an attractive solution to this problem. By performing part of the computations directly inside the memory the system speed can be improved while reduc… ▽ More

    Submitted 11 April, 2023; originally announced April 2023.

  4. arXiv:2304.03986  [pdf, other

    cs.LG

    SwiftTron: An Efficient Hardware Accelerator for Quantized Transformers

    Authors: Alberto Marchisio, Davide Dura, Maurizio Capra, Maurizio Martina, Guido Masera, Muhammad Shafique

    Abstract: Transformers' compute-intensive operations pose enormous challenges for their deployment in resource-constrained EdgeAI / tinyML devices. As an established neural network compression technique, quantization reduces the hardware computational and memory resources. In particular, fixed-point quantization is desirable to ease the computations using lightweight blocks, like adders and multipliers, of… ▽ More

    Submitted 25 April, 2023; v1 submitted 8 April, 2023; originally announced April 2023.

    Comments: To appear at the 2023 International Joint Conference on Neural Networks (IJCNN), Queensland, Australia, June 2023

  5. arXiv:2208.02253  [pdf, other

    cs.NE cs.LG

    LaneSNNs: Spiking Neural Networks for Lane Detection on the Loihi Neuromorphic Processor

    Authors: Alberto Viale, Alberto Marchisio, Maurizio Martina, Guido Masera, Muhammad Shafique

    Abstract: Autonomous Driving (AD) related features represent important elements for the next generation of mobile robots and autonomous vehicles focused on increasingly intelligent, autonomous, and interconnected systems. The applications involving the use of these features must provide, by definition, real-time decisions, and this property is key to avoid catastrophic accidents. Moreover, all the decision… ▽ More

    Submitted 3 August, 2022; originally announced August 2022.

    Comments: To appear at the 2022 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2022)

  6. arXiv:2208.00331  [pdf, other

    cs.AR cs.LG

    CoNLoCNN: Exploiting Correlation and Non-Uniform Quantization for Energy-Efficient Low-precision Deep Convolutional Neural Networks

    Authors: Muhammad Abdullah Hanif, Giuseppe Maria Sarda, Alberto Marchisio, Guido Masera, Maurizio Martina, Muhammad Shafique

    Abstract: In today's era of smart cyber-physical systems, Deep Neural Networks (DNNs) have become ubiquitous due to their state-of-the-art performance in complex real-world applications. The high computational complexity of these networks, which translates to increased energy consumption, is the foremost obstacle towards deploying large DNNs in resource-constrained systems. Fixed-Point (FP) implementations… ▽ More

    Submitted 30 July, 2022; originally announced August 2022.

    Comments: 8 pages, 15 figures, 2 tables

  7. arXiv:2206.10200  [pdf, other

    cs.LG

    Enabling Capsule Networks at the Edge through Approximate Softmax and Squash Operations

    Authors: Alberto Marchisio, Beatrice Bussolino, Edoardo Salvati, Maurizio Martina, Guido Masera, Muhammad Shafique

    Abstract: Complex Deep Neural Networks such as Capsule Networks (CapsNets) exhibit high learning capabilities at the cost of compute-intensive operations. To enable their deployment on edge devices, we propose to leverage approximate computing for designing approximate variants of the complex operations like softmax and squash. In our experiments, we evaluate tradeoffs between area, power consumption, and c… ▽ More

    Submitted 21 June, 2022; originally announced June 2022.

    Comments: To appear at the ACM/IEEE International Symposium on Low Power Electronics and Design (ISLPED), August 2022, Boston, MA, USA

  8. arXiv:2109.00533  [pdf, other

    cs.LG cs.CR cs.NE

    R-SNN: An Analysis and Design Methodology for Robustifying Spiking Neural Networks against Adversarial Attacks through Noise Filters for Dynamic Vision Sensors

    Authors: Alberto Marchisio, Giacomo Pira, Maurizio Martina, Guido Masera, Muhammad Shafique

    Abstract: Spiking Neural Networks (SNNs) aim at providing energy-efficient learning capabilities when implemented on neuromorphic chips with event-based Dynamic Vision Sensors (DVS). This paper studies the robustness of SNNs against adversarial attacks on such DVS-based systems, and proposes R-SNN, a novel methodology for robustifying SNNs through efficient DVS-noise filtering. We are the first to generate… ▽ More

    Submitted 1 September, 2021; originally announced September 2021.

    Comments: To appear at the 2021 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2021). arXiv admin note: text overlap with arXiv:2107.00415

  9. arXiv:2107.00415  [pdf, other

    cs.CV cs.LG

    DVS-Attacks: Adversarial Attacks on Dynamic Vision Sensors for Spiking Neural Networks

    Authors: Alberto Marchisio, Giacomo Pira, Maurizio Martina, Guido Masera, Muhammad Shafique

    Abstract: Spiking Neural Networks (SNNs), despite being energy-efficient when implemented on neuromorphic hardware and coupled with event-based Dynamic Vision Sensors (DVS), are vulnerable to security threats, such as adversarial attacks, i.e., small perturbations added to the input for inducing a misclassification. Toward this, we propose DVS-Attacks, a set of stealthy yet efficient adversarial attack meth… ▽ More

    Submitted 1 July, 2021; originally announced July 2021.

    Comments: Accepted for publication at IJCNN 2021

  10. arXiv:2107.00401  [pdf, other

    cs.NE cs.LG

    CarSNN: An Efficient Spiking Neural Network for Event-Based Autonomous Cars on the Loihi Neuromorphic Research Processor

    Authors: Alberto Viale, Alberto Marchisio, Maurizio Martina, Guido Masera, Muhammad Shafique

    Abstract: Autonomous Driving (AD) related features provide new forms of mobility that are also beneficial for other kind of intelligent and autonomous systems like robots, smart transportation, and smart industries. For these applications, the decisions need to be made fast and in real-time. Moreover, in the quest for electric mobility, this task must follow low power policy, without affecting much the auto… ▽ More

    Submitted 1 July, 2021; originally announced July 2021.

    Comments: Accepted for publication at IJCNN 2021

  11. Hardware and Software Optimizations for Accelerating Deep Neural Networks: Survey of Current Trends, Challenges, and the Road Ahead

    Authors: Maurizio Capra, Beatrice Bussolino, Alberto Marchisio, Guido Masera, Maurizio Martina, Muhammad Shafique

    Abstract: Currently, Machine Learning (ML) is becoming ubiquitous in everyday life. Deep Learning (DL) is already present in many applications ranging from computer vision for medicine to autonomous driving of modern cars as well as other sectors in security, healthcare, and finance. However, to achieve impressive performance, these algorithms employ very deep networks, requiring a significant computational… ▽ More

    Submitted 21 December, 2020; originally announced December 2020.

    Comments: Accepted for publication in IEEE Access

  12. Q-CapsNets: A Specialized Framework for Quantizing Capsule Networks

    Authors: Alberto Marchisio, Beatrice Bussolino, Alessio Colucci, Maurizio Martina, Guido Masera, Muhammad Shafique

    Abstract: Capsule Networks (CapsNets), recently proposed by the Google Brain team, have superior learning capabilities in machine learning tasks, like image classification, compared to the traditional CNNs. However, CapsNets require extremely intense computations and are difficult to be deployed in their original form at the resource-constrained edge devices. This paper makes the first attempt to quantize C… ▽ More

    Submitted 17 April, 2020; v1 submitted 15 April, 2020; originally announced April 2020.

    Comments: Accepted for publication at Design Automation Conference 2020 (DAC 2020)

  13. FasTrCaps: An Integrated Framework for Fast yet Accurate Training of Capsule Networks

    Authors: Alberto Marchisio, Beatrice Bussolino, Alessio Colucci, Muhammad Abdullah Hanif, Maurizio Martina, Guido Masera, Muhammad Shafique

    Abstract: Recently, Capsule Networks (CapsNets) have shown improved performance compared to the traditional Convolutional Neural Networks (CNNs), by encoding and preserving spatial relationships between the detected features in a better way. This is achieved through the so-called Capsules (i.e., groups of neurons) that encode both the instantiation probability and the spatial information. However, one of th… ▽ More

    Submitted 18 May, 2020; v1 submitted 24 May, 2019; originally announced May 2019.

    Comments: Accepted for publication at the 2020 International Joint Conference on Neural Networks (IJCNN)

  14. arXiv:1802.00580  [pdf, ps, other

    cs.AR

    A Multi-Kernel Multi-Code Polar Decoder Architecture

    Authors: Gabriele Coppolino, Carlo Condo, Guido Masera, Warren J. Gross

    Abstract: Polar codes have received increasing attention in the past decade, and have been selected for the next generation of wireless communication standard. Most research on polar codes has focused on codes constructed from a $2\times2$ polarization matrix, called binary kernel: codes constructed from binary kernels have code lengths that are bound to powers of $2$. A few recent works have proposed const… ▽ More

    Submitted 2 February, 2018; originally announced February 2018.

  15. arXiv:1301.1465   

    cs.AR

    A joint communication and application simulator for NoC-based SoCs

    Authors: Carlo Condo, Amer Baghdadi, Guido Masera

    Abstract: NoCs have become a widespread paradigm in the system-on-chip design world, not only for multi-purpose SoCs, but also for application-specific ICs. The common approach in the NoC design world is to separate the design of the interconnection from the design of the processing elements: this is well suited for a large number of developments, but the need for joint application and NoC design is not unc… ▽ More

    Submitted 31 May, 2013; v1 submitted 8 January, 2013; originally announced January 2013.

    Comments: Withdrawn, due to extended and revised version being published

  16. arXiv:1105.2624  [pdf, other

    cs.AR

    A Flexible LDPC code decoder with a Network on Chip as underlying interconnect architecture

    Authors: Carlo Condo, Guido Masera

    Abstract: LDPC (Low Density Parity Check) codes are among the most powerful and widely adopted modern error correcting codes. The iterative decoding algorithms required for these codes involve high computational complexity and high processing throughput is achieved by allocating a sufficient number of processing elements (PEs). Supporting multiple heterogeneous LDPC codes on a parallel decoder poses serious… ▽ More

    Submitted 13 May, 2011; originally announced May 2011.

  17. arXiv:1105.1014  [pdf, ps, other

    cs.AR

    Improving Network-on-Chip-based turbo decoder architectures

    Authors: Maurizio Martina, Guido Masera

    Abstract: In this work novel results concerning Network-on-Chip-based turbo decoder architectures are presented. Stemming from previous publications, this work concentrates first on improving the throughput by exploiting adaptive-bandwidth reduction techniques. This technique shows in the best case an improvement of more than 60 Mb/s. Moreover, it is known that double-binary turbo decoders require higher ar… ▽ More

    Submitted 5 May, 2011; originally announced May 2011.

  18. arXiv:1006.4030  [pdf

    cs.IT

    A Novel VLSI Architecture of Fixed-complexity Sphere Decoder

    Authors: Bin Wu, Guido Masera

    Abstract: Fixed-complexity Sphere Decoder (FSD) is a recently proposed technique for Multiple-Input Multiple-Output (MIMO) detection. It has several outstanding features such as constant throughput and large potential parallelism, which makes it suitable for efficient VLSI implementation. However, to our best knowledge, no VLSI implementation of FSD has been reported in the literature, although some FPGA pr… ▽ More

    Submitted 21 June, 2010; originally announced June 2010.

    Comments: 8 pages, this paper has been accepted by the conference DSD 2010

  19. arXiv:1001.4694  [pdf

    cs.AR

    VLSI Architectures for WIMAX Channel Decoders

    Authors: Maurizio Martina, Guido Masera

    Abstract: This chapter describes the main architectures proposed in the literature to implement the channel decoders required by the WiMax standard, namely convolutional codes, turbo codes (both block and convolutional) and LDPC. Then it shows a complete design of a convolutional turbo code encoder/decoder system for WiMax.

    Submitted 26 January, 2010; originally announced January 2010.

    Comments: To appear in the book "WIMAX, New Developments", M. Upena, D. Dalal, Y. Kosta (Ed.), ISBN978-953-7619-53-4

  20. Turbo NOC: a framework for the design of Network On Chip based turbo decoder architectures

    Authors: Maurizio Martina, Guido Masera

    Abstract: This work proposes a general framework for the design and simulation of network on chip based turbo decoder architectures. Several parameters in the design space are investigated, namely the network topology, the parallelism degree, the rate at which messages are sent by processing nodes over the network and the routing strategy. The main results of this analysis are: i) the most suited topologi… ▽ More

    Submitted 10 September, 2009; originally announced September 2009.

    Comments: submitted to IEEE Trans. on Circuits and Systems I (submission date 27 may 2009)

  21. arXiv:0711.2383  [pdf, ps, other

    cs.AR

    Decoding the Golden Code: a VLSI design

    Authors: Barbara Cerato, Guido Masera, Emanuele Viterbo

    Abstract: The recently proposed Golden code is an optimal space-time block code for 2 X 2 multiple-input multiple-output (MIMO) systems. The aim of this work is the design of a VLSI decoder for a MIMO system coded with the Golden code. The architecture is based on a rearrangement of the sphere decoding algorithm that achieves maximum-likelihood (ML) decoding performance. Compared to other approaces, the p… ▽ More

    Submitted 15 November, 2007; originally announced November 2007.

    Comments: 25 pages, 10 figures

    ACM Class: B.7.1

  22. arXiv:0710.4840  [pdf

    cs.AR

    Testing Logic Cores using a BIST P1500 Compliant Approach: A Case of Study

    Authors: P. Bernardi, G. Masera, F. Quaglio, M. Sonza Reorda

    Abstract: In this paper we describe how we applied a BIST-based approach to the test of a logic core to be included in System-on-a-chip (SoC) environments. The approach advantages are the ability to protect the core IP, the simple test interface (thanks also to the adoption of the P1500 standard), the possibility to run the test at-speed, the reduced test time, and the good diagnostic capabilities. The pa… ▽ More

    Submitted 25 October, 2007; originally announced October 2007.

    Comments: Submitted on behalf of EDAA (http://www.edaa.com/)

    Journal ref: Dans Design, Automation and Test in Europe | Designers'Forum - DATE'05, Munich : Allemagne (2005)