Skip to main content

Showing 1–10 of 10 results for author: Que, Z

  1. arXiv:2406.15313  [pdf, other

    cs.IR cs.CL

    STARD: A Chinese Statute Retrieval Dataset with Real Queries Issued by Non-professionals

    Authors: Weihang Su, Yiran Hu, Anzhe Xie, Qingyao Ai, Zibing Que, Ning Zheng, Yun Liu, Weixing Shen, Yiqun Liu

    Abstract: Statute retrieval aims to find relevant statutory articles for specific queries. This process is the basis of a wide range of legal applications such as legal advice, automated judicial decisions, legal document drafting, etc. Existing statute retrieval benchmarks focus on formal and professional queries from sources like bar exams and legal case documents, thereby neglecting non-professional quer… ▽ More

    Submitted 21 June, 2024; originally announced June 2024.

  2. arXiv:2402.01876  [pdf, other

    hep-ex cs.LG physics.ins-det

    Ultrafast jet classification on FPGAs for the HL-LHC

    Authors: Patrick Odagiu, Zhiqiang Que, Javier Duarte, Johannes Haller, Gregor Kasieczka, Artur Lobanov, Vladimir Loncar, Wayne Luk, Jennifer Ngadiuba, Maurizio Pierini, Philipp Rincke, Arpita Seksaria, Sioni Summers, Andre Sznajder, Alexander Tapper, Thea K. Aarrestad

    Abstract: Three machine learning models are used to perform jet origin classification. These models are optimized for deployment on a field-programmable gate array device. In this context, we demonstrate how latency and resource consumption scale with the input size and choice of algorithm. Moreover, the models proposed here are designed to work on the type of data and under the foreseen conditions at the C… ▽ More

    Submitted 4 July, 2024; v1 submitted 2 February, 2024; originally announced February 2024.

    Comments: 13 pages, 3 figures, 3 tables. Mach. Learn.: Sci. Technol (2024)

    Report number: FERMILAB-PUB-24-0030-CMS-CSAID-PPD

  3. arXiv:2308.06849  [pdf, other

    cs.LG cs.AR

    When Monte-Carlo Dropout Meets Multi-Exit: Optimizing Bayesian Neural Networks on FPGA

    Authors: Hongxiang Fan, Hao Chen, Liam Castelli, Zhiqiang Que, He Li, Kenneth Long, Wayne Luk

    Abstract: Bayesian Neural Networks (BayesNNs) have demonstrated their capability of providing calibrated prediction for safety-critical applications such as medical imaging and autonomous driving. However, the high algorithmic complexity and the poor hardware performance of BayesNNs hinder their deployment in real-life applications. To bridge this gap, this paper proposes a novel multi-exit Monte-Carlo Drop… ▽ More

    Submitted 13 August, 2023; originally announced August 2023.

  4. MetaML: Automating Customizable Cross-Stage Design-Flow for Deep Learning Acceleration

    Authors: Zhiqiang Que, Shuo Liu, Markus Rognlien, Ce Guo, Jose G. F. Coutinho, Wayne Luk

    Abstract: This paper introduces a novel optimization framework for deep neural network (DNN) hardware accelerators, enabling the rapid development of customized and automated design flows. More specifically, our approach aims to automate the selection and configuration of low-level optimization techniques, encompassing DNN and FPGA low-level optimizations. We introduce novel optimization and transformation… ▽ More

    Submitted 14 June, 2023; originally announced June 2023.

    Comments: 5 pages, Accepted at FPL'23

  5. arXiv:2209.14065  [pdf, other

    cs.AR cs.LG physics.ins-det

    LL-GNN: Low Latency Graph Neural Networks on FPGAs for High Energy Physics

    Authors: Zhiqiang Que, Hongxiang Fan, Marcus Loo, He Li, Michaela Blott, Maurizio Pierini, Alexander Tapper, Wayne Luk

    Abstract: This work presents a novel reconfigurable architecture for Low Latency Graph Neural Network (LL-GNN) designs for particle detectors, delivering unprecedented low latency performance. Incorporating FPGA-based GNNs into particle detectors presents a unique challenge since it requires sub-microsecond latency to deploy the networks for online event selection with a data rate of hundreds of terabytes p… ▽ More

    Submitted 9 January, 2024; v1 submitted 28 September, 2022; originally announced September 2022.

    Comments: This paper has been accepted by ACM Transactions on Embedded Computing Systems (TECS)

  6. arXiv:2111.12787  [pdf, other

    cs.LG cs.AR eess.SY

    Algorithm and Hardware Co-design for Reconfigurable CNN Accelerator

    Authors: Hongxiang Fan, Martin Ferianc, Zhiqiang Que, He Li, Shuanglong Liu, Xinyu Niu, Wayne Luk

    Abstract: Recent advances in algorithm-hardware co-design for deep neural networks (DNNs) have demonstrated their potential in automatically designing neural architectures and hardware designs. Nevertheless, it is still a challenging optimization problem due to the expensive training cost and the time-consuming hardware implementation, which makes the exploration on the vast design space of neural architect… ▽ More

    Submitted 24 November, 2021; originally announced November 2021.

  7. arXiv:2110.13041  [pdf, other

    cs.LG cs.AR physics.data-an physics.ins-det

    Applications and Techniques for Fast Machine Learning in Science

    Authors: Allison McCarn Deiana, Nhan Tran, Joshua Agar, Michaela Blott, Giuseppe Di Guglielmo, Javier Duarte, Philip Harris, Scott Hauck, Mia Liu, Mark S. Neubauer, Jennifer Ngadiuba, Seda Ogrenci-Memik, Maurizio Pierini, Thea Aarrestad, Steffen Bahr, Jurgen Becker, Anne-Sophie Berthold, Richard J. Bonventre, Tomas E. Muller Bravo, Markus Diefenthaler, Zhen Dong, Nick Fritzsche, Amir Gholami, Ekaterina Govorkova, Kyle J Hazelwood , et al. (62 additional authors not shown)

    Abstract: In this community review report, we discuss applications and techniques for fast machine learning (ML) in science -- the concept of integrating power ML methods into the real-time experimental data processing loop to accelerate scientific discovery. The material for the report builds on two workshops held by the Fast ML for Science community and covers three main areas: applications for fast ML ac… ▽ More

    Submitted 25 October, 2021; originally announced October 2021.

    Comments: 66 pages, 13 figures, 5 tables

    Report number: FERMILAB-PUB-21-502-AD-E-SCD

    Journal ref: Front. Big Data 5, 787421 (2022)

  8. arXiv:2106.14089  [pdf, other

    cs.LG cs.AR physics.ins-det

    Accelerating Recurrent Neural Networks for Gravitational Wave Experiments

    Authors: Zhiqiang Que, Erwei Wang, Umar Marikar, Eric Moreno, Jennifer Ngadiuba, Hamza Javed, Bartłomiej Borzyszkowski, Thea Aarrestad, Vladimir Loncar, Sioni Summers, Maurizio Pierini, Peter Y Cheung, Wayne Luk

    Abstract: This paper presents novel reconfigurable architectures for reducing the latency of recurrent neural networks (RNNs) that are used for detecting gravitational waves. Gravitational interferometers such as the LIGO detectors capture cosmic events such as black hole mergers which happen at unknown times and of varying durations, producing time-series data. We have developed a new architecture capable… ▽ More

    Submitted 26 June, 2021; originally announced June 2021.

    Comments: Accepted at the 2021 32nd IEEE International Conference on Application-specific Systems, Architectures and Processors (ASAP)

  9. arXiv:2106.06048  [pdf, other

    cs.LG

    Optimizing Bayesian Recurrent Neural Networks on an FPGA-based Accelerator

    Authors: Martin Ferianc, Zhiqiang Que, Hongxiang Fan, Wayne Luk, Miguel Rodrigues

    Abstract: Neural networks have demonstrated their outstanding performance in a wide range of tasks. Specifically recurrent architectures based on long-short term memory (LSTM) cells have manifested excellent capability to model time dependencies in real-world data. However, standard recurrent architectures cannot estimate their uncertainty which is essential for safety-critical applications such as in medic… ▽ More

    Submitted 7 November, 2021; v1 submitted 4 June, 2021; originally announced June 2021.

    Comments: Accepted to FPT'21. Martin Ferianc and Zhiqiang Que share an equal contribution. Updated copyright footer

  10. arXiv:2105.02158  [pdf, other

    cs.CV eess.IV

    VoxelContext-Net: An Octree based Framework for Point Cloud Compression

    Authors: Zizheng Que, Guo Lu, Dong Xu

    Abstract: In this paper, we propose a two-stage deep learning framework called VoxelContext-Net for both static and dynamic point cloud compression. Taking advantages of both octree based methods and voxel based schemes, our approach employs the voxel context to compress the octree structured data. Specifically, we first extract the local voxel representation that encodes the spatial neighbouring context in… ▽ More

    Submitted 5 May, 2021; originally announced May 2021.

    Comments: CVPR2021