Skip to main content

Showing 1–14 of 14 results for author: Lou, C

  1. arXiv:2406.16747  [pdf, other

    cs.CL cs.LG

    Sparser is Faster and Less is More: Efficient Sparse Attention for Long-Range Transformers

    Authors: Chao Lou, Zixia Jia, Zilong Zheng, Kewei Tu

    Abstract: Accommodating long sequences efficiently in autoregressive Transformers, especially within an extended context window, poses significant challenges due to the quadratic computational complexity and substantial KV memory requirements inherent in self-attention mechanisms. In this work, we introduce SPARSEK Attention, a novel sparse attention mechanism designed to overcome these computational and me… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

    Comments: preprint

  2. arXiv:2310.17670  [pdf, ps, other

    cs.LG

    Unknown Health States Recognition With Collective Decision Based Deep Learning Networks In Predictive Maintenance Applications

    Authors: Chuyue Lou, M. Amine Atoui

    Abstract: At present, decision making solutions developed based on deep learning (DL) models have received extensive attention in predictive maintenance (PM) applications along with the rapid improvement of computing power. Relying on the superior properties of shared weights and spatial pooling, Convolutional Neural Network (CNN) can learn effective representations of health states from industrial data. Ma… ▽ More

    Submitted 25 October, 2023; originally announced October 2023.

  3. arXiv:2310.11964  [pdf, other

    cs.CL

    AMR Parsing with Causal Hierarchical Attention and Pointers

    Authors: Chao Lou, Kewei Tu

    Abstract: Translation-based AMR parsers have recently gained popularity due to their simplicity and effectiveness. They predict linearized graphs as free texts, avoiding explicit structure modeling. However, this simplicity neglects structural locality in AMR graphs and introduces unnecessary tokens to represent coreferences. In this paper, we introduce new target forms of AMR parsing and a novel model, CHA… ▽ More

    Submitted 18 October, 2023; originally announced October 2023.

    Comments: EMNLP 2023

  4. arXiv:2308.10529  [pdf, other

    cs.CL

    SeqGPT: An Out-of-the-box Large Language Model for Open Domain Sequence Understanding

    Authors: Tianyu Yu, Chengyue Jiang, Chao Lou, Shen Huang, Xiaobin Wang, Wei Liu, Jiong Cai, Yangning Li, Yinghui Li, Kewei Tu, Hai-Tao Zheng, Ningyu Zhang, Pengjun Xie, Fei Huang, Yong Jiang

    Abstract: Large language models (LLMs) have shown impressive ability for open-domain NLP tasks. However, LLMs are sometimes too footloose for natural language understanding (NLU) tasks which always have restricted output and input format. Their performances on NLU tasks are highly related to prompts or demonstrations and are shown to be poor at performing several representative NLU tasks, such as event extr… ▽ More

    Submitted 21 August, 2023; originally announced August 2023.

    Comments: Initial version of SeqGPT

  5. arXiv:2306.02671  [pdf, other

    cs.CL

    Improving Grammar-based Sequence-to-Sequence Modeling with Decomposition and Constraints

    Authors: Chao Lou, Kewei Tu

    Abstract: Neural QCFG is a grammar-based sequence-tosequence (seq2seq) model with strong inductive biases on hierarchical structures. It excels in interpretability and generalization but suffers from expensive inference. In this paper, we study two low-rank variants of Neural QCFG for faster inference with different trade-offs between efficiency and expressiveness. Furthermore, utilizing the symbolic interf… ▽ More

    Submitted 5 June, 2023; originally announced June 2023.

    Comments: ACL 2023

  6. arXiv:2304.03285  [pdf, other

    cs.CV

    $\text{DC}^2$: Dual-Camera Defocus Control by Learning to Refocus

    Authors: Hadi Alzayer, Abdullah Abuolaim, Leung Chun Chan, Yang Yang, Ying Chen Lou, Jia-Bin Huang, Abhishek Kar

    Abstract: Smartphone cameras today are increasingly approaching the versatility and quality of professional cameras through a combination of hardware and software advancements. However, fixed aperture remains a key limitation, preventing users from controlling the depth of field (DoF) of captured images. At the same time, many smartphones now have multiple cameras with different fixed apertures -- specifica… ▽ More

    Submitted 6 April, 2023; originally announced April 2023.

    Comments: CVPR 2023. See the project page at https://defocus-control.github.io

  7. arXiv:2206.04685  [pdf, other

    cs.LG cs.AR cs.NE

    Predictive Exit: Prediction of Fine-Grained Early Exits for Computation- and Energy-Efficient Inference

    Authors: Xiangjie Li, Chenfei Lou, Zhengping Zhu, Yuchi Chen, Yingtao Shen, Yehan Ma, An Zou

    Abstract: By adding exiting layers to the deep learning networks, early exit can terminate the inference earlier with accurate results. The passive decision-making of whether to exit or continue the next layer has to go through every pre-placed exiting layer until it exits. In addition, it is also hard to adjust the configurations of the computing platforms alongside the inference proceeds. By incorporating… ▽ More

    Submitted 28 December, 2022; v1 submitted 9 June, 2022; originally announced June 2022.

  8. arXiv:2203.14260  [pdf, other

    cs.CV cs.CL

    Unsupervised Vision-Language Parsing: Seamlessly Bridging Visual Scene Graphs with Language Structures via Dependency Relationships

    Authors: Chao Lou, Wenjuan Han, Yuhuan Lin, Zilong Zheng

    Abstract: Understanding realistic visual scene images together with language descriptions is a fundamental task towards generic visual understanding. Previous works have shown compelling comprehensive results by building hierarchical structures for visual scenes (e.g., scene graphs) and natural languages (e.g., dependency trees), individually. However, how to construct a joint vision-language (VL) structure… ▽ More

    Submitted 1 June, 2022; v1 submitted 27 March, 2022; originally announced March 2022.

    Comments: Updated

  9. arXiv:2203.04665  [pdf, other

    cs.CL

    Nested Named Entity Recognition as Latent Lexicalized Constituency Parsing

    Authors: Chao Lou, Songlin Yang, Kewei Tu

    Abstract: Nested named entity recognition (NER) has been receiving increasing attention. Recently, (Fu et al, 2021) adapt a span-based constituency parser to tackle nested NER. They treat nested entities as partially-observed constituency trees and propose the masked inside algorithm for partial marginalization. However, their method cannot leverage entity heads, which have been shown useful in entity menti… ▽ More

    Submitted 9 March, 2022; originally announced March 2022.

    Comments: ACL 2022 camera ready

  10. arXiv:2106.03562  [pdf

    cs.RO

    Robotic Electrospinning Actuated by Non-Circular Joint Continuum Manipulator for Endoluminal Therapy

    Authors: Zicong Wu, Chuqian Lou, Zhu Jin, Shaoping Huang, Ning Liu, Yun Zou, Mirko Kovac, Anzhu Gao, Guang-Zhong Yang

    Abstract: Electrospinning has exhibited excellent benefits to treat the trauma for tissue engineering due to its produced micro/nano fibrous structure. It can effectively adhere to the tissue surface for long-term continuous therapy. This paper develops a robotic electrospinning platform for endoluminal therapy. The platform consists of a continuum manipulator, the electrospinning device, and the actuation… ▽ More

    Submitted 7 June, 2021; originally announced June 2021.

  11. arXiv:1905.05652  [pdf

    cs.HC cs.CV

    "Tom" pet robot applied to urban autism

    Authors: Xingqian Li, Chenwei Lou, Jian Zhao, HuaPeng Wei, Hongwei Zhao

    Abstract: With the fast development of network information technology, more and more people are immersed in the virtual community environment brought by the network, ignoring the social interaction in real life. The consequent urban autism problem has become more and more serious. Promoting offline communication between people " and "eliminating loneliness through emotional communication between pet robots… ▽ More

    Submitted 14 May, 2019; originally announced May 2019.

  12. arXiv:1506.02990  [pdf, other

    cs.IT

    Convolutional-Code-Specific CRC Code Design

    Authors: Chung-Yu Lou, Babak Daneshrad, Richard D. Wesel

    Abstract: Cyclic redundancy check (CRC) codes check if a codeword is correctly received. This paper presents an algorithm to design CRC codes that are optimized for the code-specific error behavior of a specified feedforward convolutional code. The algorithm utilizes two distinct approaches to computing undetected error probability of a CRC code used with a specific convolutional code. The first approach en… ▽ More

    Submitted 9 June, 2015; originally announced June 2015.

    Comments: 12 pages, 8 figures, journal paper

  13. arXiv:1410.2904  [pdf, other

    cs.IT

    Optimizing Pilot Length for a Go/No-Go Decision in Two-State Block Fading Channels with Feedback

    Authors: Chung-Yu Lou, Babak Daneshrad, Richard D. Wesel

    Abstract: We propose an approach where each user independently seeks to minimize the amount of time that they occupy the channel. Essentially, we seek to minimize the number of transmitted symbols required to communicate a packet assuming variable-length coding with feedback. Users send a pilot sequence to estimate the channel quality and decide whether to proceed with a transmission or wait for the next op… ▽ More

    Submitted 10 October, 2014; originally announced October 2014.

    Comments: 6 pages, 3 figures, conference

  14. Performance Indicator for MIMO MMSE Receivers in the Presence of Channel Estimation Error

    Authors: Eren Eraslan, Babak Daneshrad, Chung-Yu Lou

    Abstract: We present the derivation of post-processing SNR for Minimum-Mean-Squared-Error (MMSE) receivers with imperfect channel estimates, and show that it is an accurate indicator of the error rate performance of MIMO systems in the presence of channel estimation error. Simulation results show the tightness of the analysis.

    Submitted 13 November, 2012; v1 submitted 30 October, 2012; originally announced October 2012.

    Comments: 4 pages, 3 figures. Submitted to IEEE Wireless Communications Letters