Skip to main content

Showing 1–50 of 65 results for author: Ren, Q

  1. arXiv:2407.02891  [pdf, other

    cs.LG cs.AI cs.CL

    GPTQT: Quantize Large Language Models Twice to Push the Efficiency

    Authors: Yipin Guo, Yilin Lang, Qinyuan Ren

    Abstract: Due to their large size, generative Large Language Models (LLMs) require significant computing and storage resources. This paper introduces a new post-training quantization method, GPTQT, to reduce memory usage and enhance processing speed by expressing the weight of LLM in 3bit/2bit. Practice has shown that minimizing the quantization error of weights is ineffective, leading to overfitting. There… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

    Comments: Accepted by 11th IEEE International Conference on Cybernetics and Intelligent Systems

  2. arXiv:2407.02881  [pdf, other

    cs.LG cs.AI cs.CV

    ShiftAddAug: Augment Multiplication-Free Tiny Neural Network with Hybrid Computation

    Authors: Yipin Guo, Zihao Li, Yilin Lang, Qinyuan Ren

    Abstract: Operators devoid of multiplication, such as Shift and Add, have gained prominence for their compatibility with hardware. However, neural networks (NNs) employing these operators typically exhibit lower accuracy compared to conventional NNs with identical structures. ShiftAddAug uses costly multiplication to augment efficient but less powerful multiplication-free operators, improving performance wi… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

    Comments: Accepted by 2024 CVPR Workshop : Efficient Deep Learning for Computer Vision

  3. arXiv:2407.02878  [pdf, other

    cs.RO cs.AI

    Efficient Fusion and Task Guided Embedding for End-to-end Autonomous Driving

    Authors: Yipin Guo, Yilin Lang, Qinyuan Ren

    Abstract: To address the challenges of sensor fusion and safety risk prediction, contemporary closed-loop autonomous driving neural networks leveraging imitation learning typically require a substantial volume of parameters and computational resources to run neural networks. Given the constrained computational capacities of onboard vehicular computers, we introduce a compact yet potent solution named Effici… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

    Comments: Best Paper Award of the IEEE 13th Data-Driven Control and Learning Systems Conference

  4. arXiv:2406.19853  [pdf, other

    cs.CL cs.AI

    YuLan: An Open-source Large Language Model

    Authors: Yutao Zhu, Kun Zhou, Kelong Mao, Wentong Chen, Yiding Sun, Zhipeng Chen, Qian Cao, Yihan Wu, Yushuo Chen, Feng Wang, Lei Zhang, Junyi Li, Xiaolei Wang, Lei Wang, Beichen Zhang, Zican Dong, Xiaoxue Cheng, Yuhan Chen, Xinyu Tang, Yupeng Hou, Qiangqiang Ren, Xincheng Pang, Shufang Xie, Wayne Xin Zhao, Zhicheng Dou , et al. (13 additional authors not shown)

    Abstract: Large language models (LLMs) have become the foundation of many applications, leveraging their extensive capabilities in processing and understanding natural language. While many open-source LLMs have been released with technical reports, the lack of training details hinders further research and development. This paper presents the development of YuLan, a series of open-source LLMs with $12$ billi… ▽ More

    Submitted 28 June, 2024; originally announced June 2024.

  5. arXiv:2406.13583  [pdf, other

    cs.CV

    Low-Rank Mixture-of-Experts for Continual Medical Image Segmentation

    Authors: Qian Chen, Lei Zhu, Hangzhou He, Xinliang Zhang, Shuang Zeng, Qiushi Ren, Yanye Lu

    Abstract: The primary goal of continual learning (CL) task in medical image segmentation field is to solve the "catastrophic forgetting" problem, where the model totally forgets previously learned features when it is extended to new categories (class-level) or tasks (task-level). Due to the privacy protection, the historical data labels are inaccessible. Prevalent continual learning methods primarily focus… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

  6. arXiv:2406.11921  [pdf, other

    cs.LG cs.AI

    Rethinking Spatio-Temporal Transformer for Traffic Prediction:Multi-level Multi-view Augmented Learning Framework

    Authors: Jiaqi Lin, Qianqian Ren

    Abstract: Traffic prediction is a challenging spatio-temporal forecasting problem that involves highly complex spatio-temporal correlations. This paper proposes a Multi-level Multi-view Augmented Spatio-temporal Transformer (LVSTformer) for traffic prediction. The model aims to capture spatial dependencies from three different levels: local geographic, global semantic, and pivotal nodes, along with long- an… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

  7. arXiv:2406.05914  [pdf, other

    eess.AS cs.SD eess.SP

    Soundscape Captioning using Sound Affective Quality Network and Large Language Model

    Authors: Yuanbo Hou, Qiaoqiao Ren, Andrew Mitchell, Wenwu Wang, Jian Kang, Tony Belpaeme, Dick Botteldooren

    Abstract: We live in a rich and varied acoustic world, which is experienced by individuals or communities as a soundscape. Computational auditory scene analysis, disentangling acoustic scenes by detecting and classifying events, focuses on objective attributes of sounds, such as their category and temporal characteristics, ignoring the effect of sounds on people and failing to explore the relationship betwe… ▽ More

    Submitted 9 June, 2024; originally announced June 2024.

    Comments: Code: https://github.com/Yuanbo2020/SoundSCaper

  8. arXiv:2405.13025  [pdf, other

    cs.CL cs.AI cs.CY

    A survey on fairness of large language models in e-commerce: progress, application, and challenge

    Authors: Qingyang Ren, Zilin Jiang, Jinghan Cao, Sijia Li, Chiqu Li, Yiyang Liu, Shuning Huo, Tiange He, Yuan Chen

    Abstract: This survey explores the fairness of large language models (LLMs) in e-commerce, examining their progress, applications, and the challenges they face. LLMs have become pivotal in the e-commerce domain, offering innovative solutions and enhancing customer experiences. This work presents a comprehensive survey on the applications and challenges of LLMs in e-commerce. The paper begins by introducing… ▽ More

    Submitted 21 June, 2024; v1 submitted 15 May, 2024; originally announced May 2024.

    Comments: 21 pages, 9 figures

  9. arXiv:2405.09708  [pdf, ps, other

    cs.RO cs.AI stat.CO

    No More Mumbles: Enhancing Robot Intelligibility through Speech Adaptation

    Authors: Qiaoqiao Ren, Yuanbo Hou, Dick Botteldooren, Tony Belpaeme

    Abstract: Spoken language interaction is at the heart of interpersonal communication, and people flexibly adapt their speech to different individuals and environments. It is surprising that robots, and by extension other digital devices, are not equipped to adapt their speech and instead rely on fixed speech parameters, which often hinder comprehension by the user. We conducted a speech comprehension study… ▽ More

    Submitted 15 May, 2024; originally announced May 2024.

    Comments: IEEE Robotics and Automation Letters (IEEE RAL)

  10. arXiv:2404.17394  [pdf, other

    cs.CL cs.HC cs.RO

    Child Speech Recognition in Human-Robot Interaction: Problem Solved?

    Authors: Ruben Janssens, Eva Verhelst, Giulio Antonio Abbo, Qiaoqiao Ren, Maria Jose Pinto Bernal, Tony Belpaeme

    Abstract: Automated Speech Recognition shows superhuman performance for adult English speech on a range of benchmarks, but disappoints when fed children's speech. This has long sat in the way of child-robot interaction. Recent evolutions in data-driven speech recognition, including the availability of Transformer architectures and unprecedented volumes of training data, might mean a breakthrough for child s… ▽ More

    Submitted 26 April, 2024; originally announced April 2024.

    Comments: Presented at 2024 International Symposium on Technological Advances in Human-Robot Interaction

  11. arXiv:2404.00443  [pdf, ps, other

    cs.RO

    UDE-based Dynamic Motion Force Control of Mobile Manipulators

    Authors: Songqun Gao, Wendi Ding, Qinyuan Ren, Ben M. Chen

    Abstract: Mobile manipulators are known for their superior mobility over manipulators on fixed bases, offering promising applications in smart industry and housekeeping scenarios. However, the dynamic coupling nature between the mobile base and the manipulator presents challenges for the physical interactive tasks of the mobile manipulator. Current methods suffer from complex modeling processes and poor tra… ▽ More

    Submitted 30 March, 2024; originally announced April 2024.

  12. arXiv:2403.07865  [pdf, other

    cs.CL cs.AI cs.CR cs.LG cs.SE

    CodeAttack: Revealing Safety Generalization Challenges of Large Language Models via Code Completion

    Authors: Qibing Ren, Chang Gao, Jing Shao, Junchi Yan, Xin Tan, Wai Lam, Lizhuang Ma

    Abstract: The rapid advancement of Large Language Models (LLMs) has brought about remarkable generative capabilities but also raised concerns about their potential misuse. While strategies like supervised fine-tuning and reinforcement learning from human feedback have enhanced their safety, these methods primarily focus on natural languages, which may not generalize to other domains. This paper introduces C… ▽ More

    Submitted 9 June, 2024; v1 submitted 12 March, 2024; originally announced March 2024.

    Comments: ACL Findings 2024, Code is available at https://github.com/renqibing/CodeAttack

  13. arXiv:2402.01163  [pdf, other

    cs.CV

    Enhanced Urban Region Profiling with Adversarial Self-Supervised Learning

    Authors: Weiliang Chan, Qianqian Ren, Jinbao Li

    Abstract: Urban region profiling is pivotal for smart cities, but mining fine-grained semantics from noisy and incomplete urban data remains challenging. In response, we propose a novel self-supervised graph collaborative filtering model for urban region embedding called EUPAS. Specifically, region heterogeneous graphs containing human mobility data, point of interests (POIs) information, and geographic nei… ▽ More

    Submitted 2 February, 2024; originally announced February 2024.

  14. arXiv:2401.18057  [pdf, other

    cs.LG

    Rank Supervised Contrastive Learning for Time Series Classification

    Authors: Qianying Ren, Dongsheng Luo, Dongjin Song

    Abstract: Recently, various contrastive learning techniques have been developed to categorize time series data and exhibit promising performance. A general paradigm is to utilize appropriate augmentations and construct feasible positive samples such that the encoder can yield robust and discriminative representations by mapping similar data points closer together in the feature space while pushing dissimila… ▽ More

    Submitted 31 January, 2024; originally announced January 2024.

  15. Distillation Enhanced Time Series Forecasting Network with Momentum Contrastive Learning

    Authors: Haozhi Gao, Qianqian Ren, Jinbao Li

    Abstract: Contrastive representation learning is crucial in time series analysis as it alleviates the issue of data noise and incompleteness as well as sparsity of supervision signal. However, existing constrastive learning frameworks usually focus on intral-temporal features, which fails to fully exploit the intricate nature of time series data. To address this issue, we propose DE-TSMCL, an innovative dis… ▽ More

    Submitted 25 June, 2024; v1 submitted 31 January, 2024; originally announced January 2024.

  16. arXiv:2401.15071  [pdf, other

    cs.CV

    From GPT-4 to Gemini and Beyond: Assessing the Landscape of MLLMs on Generalizability, Trustworthiness and Causality through Four Modalities

    Authors: Chaochao Lu, Chen Qian, Guodong Zheng, Hongxing Fan, Hongzhi Gao, Jie Zhang, Jing Shao, Jingyi Deng, Jinlan Fu, Kexin Huang, Kunchang Li, Lijun Li, Limin Wang, Lu Sheng, Meiqi Chen, Ming Zhang, Qibing Ren, Sirui Chen, Tao Gui, Wanli Ouyang, Yali Wang, Yan Teng, Yaru Wang, Yi Wang, Yinan He , et al. (11 additional authors not shown)

    Abstract: Multi-modal Large Language Models (MLLMs) have shown impressive abilities in generating reasonable responses with respect to multi-modal contents. However, there is still a wide gap between the performance of recent MLLM-based applications and the expectation of the broad public, even though the most powerful OpenAI's GPT-4 and Google's Gemini have been deployed. This paper strives to enhance unde… ▽ More

    Submitted 29 January, 2024; v1 submitted 26 January, 2024; originally announced January 2024.

  17. arXiv:2401.09067  [pdf, other

    cs.LG cs.AI cs.CV

    Towards Continual Learning Desiderata via HSIC-Bottleneck Orthogonalization and Equiangular Embedding

    Authors: Depeng Li, Tianqi Wang, Junwei Chen, Qining Ren, Kenji Kawaguchi, Zhigang Zeng

    Abstract: Deep neural networks are susceptible to catastrophic forgetting when trained on sequential tasks. Various continual learning (CL) methods often rely on exemplar buffers or/and network expansion for balancing model stability and plasticity, which, however, compromises their practical value due to privacy and memory concerns. Instead, this paper considers a strict yet realistic setting, where the tr… ▽ More

    Submitted 17 January, 2024; originally announced January 2024.

    Comments: Accepted to AAAI 2024

  18. arXiv:2312.09952  [pdf, other

    eess.AS cs.SD

    Multi-level graph learning for audio event classification and human-perceived annoyance rating prediction

    Authors: Yuanbo Hou, Qiaoqiao Ren, Siyang Song, Yuxin Song, Wenwu Wang, Dick Botteldooren

    Abstract: WHO's report on environmental noise estimates that 22 M people suffer from chronic annoyance related to noise caused by audio events (AEs) from various sources. Annoyance may lead to health issues and adverse effects on metabolic and cognitive systems. In cities, monitoring noise levels does not provide insights into noticeable AEs, let alone their relations to annoyance. To create annoyance-relat… ▽ More

    Submitted 15 December, 2023; originally announced December 2023.

    Comments: Accepted by ICASSP 2024

  19. arXiv:2311.09030  [pdf

    eess.AS cs.SD

    AI-based soundscape analysis: Jointly identifying sound sources and predicting annoyance

    Authors: Yuanbo Hou, Qiaoqiao Ren, Huizhong Zhang, Andrew Mitchell, Francesco Aletta, Jian Kang, Dick Botteldooren

    Abstract: Soundscape studies typically attempt to capture the perception and understanding of sonic environments by surveying users. However, for long-term monitoring or assessing interventions, sound-signal-based approaches are required. To this end, most previous research focused on psycho-acoustic quantities or automatic sound recognition. Few attempts were made to include appraisal (e.g., in circumplex… ▽ More

    Submitted 15 November, 2023; originally announced November 2023.

    Comments: The Journal of the Acoustical Society of America, 154 (5), 3145

    Journal ref: The Journal of the Acoustical Society of America, 154, 3145 (2023)

  20. arXiv:2310.13347  [pdf, other

    cs.CV cs.AI

    NurViD: A Large Expert-Level Video Database for Nursing Procedure Activity Understanding

    Authors: Ming Hu, Lin Wang, Siyuan Yan, Don Ma, Qingli Ren, Peng Xia, Wei Feng, Peibo Duan, Lie Ju, Zongyuan Ge

    Abstract: The application of deep learning to nursing procedure activity understanding has the potential to greatly enhance the quality and safety of nurse-patient interactions. By utilizing the technique, we can facilitate training and education, improve quality control, and enable operational compliance monitoring. However, the development of automatic recognition systems in this field is currently hinder… ▽ More

    Submitted 20 October, 2023; originally announced October 2023.

    Comments: Accepted by NeurIPS 2023 Datasets and Benchmarks Track

  21. arXiv:2309.11907  [pdf, other

    cs.AI

    Learning to Recover for Safe Reinforcement Learning

    Authors: Haoyu Wang, Xin Yuan, Qinqing Ren

    Abstract: Safety controllers is widely used to achieve safe reinforcement learning. Most methods that apply a safety controller are using handcrafted safety constraints to construct the safety controller. However, when the environment dynamics are sophisticated, handcrafted safety constraints become unavailable. Therefore, it worth to research on constructing safety controllers by learning algorithms. We pr… ▽ More

    Submitted 21 September, 2023; originally announced September 2023.

  22. arXiv:2309.11876  [pdf, other

    cs.CV cs.AI

    Multi-level Asymmetric Contrastive Learning for Volumetric Medical Image Segmentation Pre-training

    Authors: Shuang Zeng, Lei Zhu, Xinliang Zhang, Qian Chen, Hangzhou He, Lujia Jin, Zifeng Tian, Qiushi Ren, Zhaoheng Xie, Yanye Lu

    Abstract: Medical image segmentation is a fundamental yet challenging task due to the arduous process of acquiring large volumes of high-quality labeled data from experts. Contrastive learning offers a promising but still problematic solution to this dilemma. Because existing medical contrastive learning strategies focus on extracting image-level representation, which ignores abundant multi-level representa… ▽ More

    Submitted 13 May, 2024; v1 submitted 21 September, 2023; originally announced September 2023.

  23. arXiv:2309.08854  [pdf, other

    cs.RO

    Intention-Aware Planner for Robust and Safe Aerial Tracking

    Authors: Qiuyu Ren, Huan Yu, Jiajun Dai, Zhi Zheng, Jun Meng, Li Xu, Chao Xu, Fei Gao, Yanjun Cao

    Abstract: Autonomous target tracking with quadrotors has wide applications in many scenarios, such as cinematographic follow-up shooting or suspect chasing. Target motion prediction is necessary when designing the tracking planner. However, the widely used constant velocity or constant rotation assumption can not fully capture the dynamics of the target. The tracker may fail when the target happens to move… ▽ More

    Submitted 20 March, 2024; v1 submitted 15 September, 2023; originally announced September 2023.

    Comments: 8 pages, 10 figures, submitted to 2024 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)

  24. arXiv:2309.06912  [pdf, other

    cs.IR

    Multi-behavior Recommendation with SVD Graph Neural Networks

    Authors: Shengxi Fu, Qianqian Ren, Xingfeng Lv, Jinbao Li

    Abstract: Graph Neural Networks (GNNs) have been extensively employed in the field of recommendation systems, offering users personalized recommendations and yielding remarkable outcomes. Recently, GNNs incorporating contrastive learning have demonstrated promising performance in handling the sparse data problem of recommendation systems. However, existing contrastive learning methods still have limitations… ▽ More

    Submitted 9 May, 2024; v1 submitted 13 September, 2023; originally announced September 2023.

  25. arXiv:2308.15150  [pdf, other

    cs.NE

    Unleashing the Potential of Spiking Neural Networks for Sequential Modeling with Contextual Embedding

    Authors: Xinyi Chen, Jibin Wu, Huajin Tang, Qinyuan Ren, Kay Chen Tan

    Abstract: The human brain exhibits remarkable abilities in integrating temporally distant sensory inputs for decision-making. However, existing brain-inspired spiking neural networks (SNNs) have struggled to match their biological counterpart in modeling long-term temporal relationships. To address this problem, this paper presents a novel Contextual Embedding Leaky Integrate-and-Fire (CE-LIF) spiking neuro… ▽ More

    Submitted 29 August, 2023; originally announced August 2023.

  26. arXiv:2308.11980  [pdf, other

    eess.AS cs.SD

    Joint Prediction of Audio Event and Annoyance Rating in an Urban Soundscape by Hierarchical Graph Representation Learning

    Authors: Yuanbo Hou, Siyang Song, Cheng Luo, Andrew Mitchell, Qiaoqiao Ren, Weicheng Xie, Jian Kang, Wenwu Wang, Dick Botteldooren

    Abstract: Sound events in daily life carry rich information about the objective world. The composition of these sounds affects the mood of people in a soundscape. Most previous approaches only focus on classifying and detecting audio events and scenes, but may ignore their perceptual quality that may impact humans' listening mood for the environment, e.g. annoyance. To this end, this paper proposes a novel… ▽ More

    Submitted 23 August, 2023; originally announced August 2023.

    Comments: INTERSPEECH 2023, Code and models: https://github.com/Yuanbo2020/HGRL

  27. arXiv:2308.04949  [pdf, other

    cs.CV

    Branches Mutual Promotion for End-to-End Weakly Supervised Semantic Segmentation

    Authors: Lei Zhu, Hangzhou He, Xinliang Zhang, Qian Chen, Shuang Zeng, Qiushi Ren, Yanye Lu

    Abstract: End-to-end weakly supervised semantic segmentation aims at optimizing a segmentation model in a single-stage training process based on only image annotations. Existing methods adopt an online-trained classification branch to provide pseudo annotations for supervising the segmentation branch. However, this strategy makes the classification branch dominate the whole concurrent training process, hind… ▽ More

    Submitted 9 August, 2023; originally announced August 2023.

  28. arXiv:2307.03212  [pdf, other

    cs.CV cs.AI cs.CY cs.LG

    Attentive Graph Enhanced Region Representation Learning

    Authors: Weiliang Chen, Qianqian Ren, Jinbao Li

    Abstract: Representing urban regions accurately and comprehensively is essential for various urban planning and analysis tasks. Recently, with the expansion of the city, modeling long-range spatial dependencies with multiple data sources plays an important role in urban region representation. In this paper, we propose the Attentive Graph Enhanced Region Representation Learning (ATGRL) model, which aims to c… ▽ More

    Submitted 31 May, 2024; v1 submitted 6 July, 2023; originally announced July 2023.

  29. arXiv:2306.09718  [pdf, ps, other

    cs.CV cs.AI

    Label-noise-tolerant medical image classification via self-attention and self-supervised learning

    Authors: Hongyang Jiang, Mengdi Gao, Yan Hu, Qiushi Ren, Zhaoheng Xie, Jiang Liu

    Abstract: Deep neural networks (DNNs) have been widely applied in medical image classification and achieve remarkable classification performance. These achievements heavily depend on large-scale accurately annotated training data. However, label noise is inevitably introduced in the medical image annotation, as the labeling process heavily relies on the expertise and experience of annotators. Meanwhile, DNN… ▽ More

    Submitted 16 June, 2023; originally announced June 2023.

    Comments: 11pages, 8 figures

  30. arXiv:2305.08062  [pdf, other

    stat.ML cs.AI cs.LG

    Off-Policy Evaluation for Large Action Spaces via Conjunct Effect Modeling

    Authors: Yuta Saito, Qingyang Ren, Thorsten Joachims

    Abstract: We study off-policy evaluation (OPE) of contextual bandit policies for large discrete action spaces where conventional importance-weighting approaches suffer from excessive variance. To circumvent this variance issue, we propose a new estimator, called OffCEM, that is based on the conjunct effect model (CEM), a novel decomposition of the causal effect into a cluster effect and a residual effect. O… ▽ More

    Submitted 2 June, 2023; v1 submitted 14 May, 2023; originally announced May 2023.

    Comments: accepted at ICML2023. arXiv admin note: text overlap with arXiv:2202.06317

  31. Handoff-Aware Distributed Computing in High Altitude Platform Station (HAPS)-Assisted Vehicular Networks

    Authors: Qiqi Ren, Omid Abbasi, Gunes Karabulut Kurt, Halim Yanikomeroglu, Jian Chen

    Abstract: Distributed computing enables Internet of vehicle (IoV) services by collaboratively utilizing the computing resources from the network edge and the vehicles. However, the computing interruption issue caused by frequent edge network handoffs, and a severe shortage of computing resources are two problems in providing IoV services. High altitude platform station (HAPS) computing can be a promising ad… ▽ More

    Submitted 7 May, 2023; originally announced May 2023.

  32. arXiv:2305.01939  [pdf, other

    cs.LG cs.AI cs.CV

    Where We Have Arrived in Proving the Emergence of Sparse Symbolic Concepts in AI Models

    Authors: Qihan Ren, Jiayang Gao, Wen Shen, Quanshi Zhang

    Abstract: This paper aims to prove the emergence of symbolic concepts in well-trained AI models. We prove that if (1) the high-order derivatives of the model output w.r.t. the input variables are all zero, (2) the AI model can be used on occluded samples and will yield higher confidence when the input sample is less occluded, and (3) the confidence of the AI model does not significantly degrade on occluded… ▽ More

    Submitted 3 May, 2023; originally announced May 2023.

  33. arXiv:2303.15182  [pdf, other

    cs.LG

    Hybrid Augmented Automated Graph Contrastive Learning

    Authors: Yifu Chen, Qianqian Ren, Liu Yong

    Abstract: Graph augmentations are essential for graph contrastive learning. Most existing works use pre-defined random augmentations, which are usually unable to adapt to different input graphs and fail to consider the impact of different nodes and edges on graph semantics. To address this issue, we propose a framework called Hybrid Augmented Automated Graph Contrastive Learning (HAGCL). HAGCL consists of a… ▽ More

    Submitted 23 March, 2023; originally announced March 2023.

  34. arXiv:2302.13095  [pdf, other

    cs.LG cs.AI cs.CV

    Bayesian Neural Networks Avoid Encoding Complex and Perturbation-Sensitive Concepts

    Authors: Qihan Ren, Huiqi Deng, Yunuo Chen, Siyu Lou, Quanshi Zhang

    Abstract: In this paper, we focus on mean-field variational Bayesian Neural Networks (BNNs) and explore the representation capacity of such BNNs by investigating which types of concepts are less likely to be encoded by the BNN. It has been observed and studied that a relatively small set of interactive concepts usually emerge in the knowledge representation of a sufficiently-trained neural network, and such… ▽ More

    Submitted 1 December, 2023; v1 submitted 25 February, 2023; originally announced February 2023.

  35. arXiv:2301.12344  [pdf, other

    cs.RO eess.SY

    TJ-FlyingFish: Design and Implementation of an Aerial-Aquatic Quadrotor with Tiltable Propulsion Units

    Authors: Xuchen Liu, Minghao Dou, Dongyue Huang, Biao Wang, Jinqiang Cui, Qinyuan Ren, Lihua Dou, Zhi Gao, Jie Chen, Ben M. Chen

    Abstract: Aerial-aquatic vehicles are capable to move in the two most dominant fluids, making them more promising for a wide range of applications. We propose a prototype with special designs for propulsion and thruster configuration to cope with the vast differences in the fluid properties of water and air. For propulsion, the operating range is switched for the different mediums by the dual-speed propulsi… ▽ More

    Submitted 6 February, 2023; v1 submitted 28 January, 2023; originally announced January 2023.

    Comments: 6 pages, 9 figures, accepted to 2023 IEEE International Conference on Robotics and Automation (ICRA)

  36. arXiv:2211.00730  [pdf, other

    cs.RO

    Tactile interaction with a robot leads to increased risk-taking

    Authors: Qiaoqiao Ren, Tony Belpaeme

    Abstract: Tactile interaction plays a crucial role in interactions between people. Touch can, for example, help people calm down and lower physiological stress responses. Consequently, it is believed that tactile and haptic interaction matter also in human-robot interaction. We study if the intensity of the tactile interaction has an impact on people, and do so by studying whether different intensities of t… ▽ More

    Submitted 1 November, 2022; originally announced November 2022.

    Comments: 10 pages, 5 figures, conference

    MSC Class: International conference of social robotics

  37. Slim-neck by GSConv: A lightweight-design for real-time detector architectures

    Authors: Hulin Li, Jun Li, Hanbing Wei, Zheng Liu, Zhenfei Zhan, Qiliang Ren

    Abstract: Real-time object detection is significant for industrial and research fields. On edge devices, a giant model is difficult to achieve the real-time detecting requirement and a lightweight model built from a large number of the depth-wise separable convolutional could not achieve the sufficient accuracy. We introduce a new lightweight convolutional technique, GSConv, to lighten the model but maintai… ▽ More

    Submitted 1 July, 2024; v1 submitted 6 June, 2022; originally announced June 2022.

  38. Artificial Open World for Evaluating AGI: a Conceptual Design

    Authors: Bowen Xu, Quansheng Ren

    Abstract: How to evaluate Artificial General Intelligence (AGI) is a critical problem that is discussed and unsolved for a long period. In the research of narrow AI, this seems not a severe problem, since researchers in that field focus on some specific problems as well as one or some aspects of cognition, and the criteria for evaluation are explicitly defined. By contrast, an AGI agent should solve problem… ▽ More

    Submitted 2 June, 2022; originally announced June 2022.

  39. arXiv:2202.10206  [pdf, other

    cs.CR

    DECLOAK: Enable Secure and Cheap Multi-Party Transactions on Legacy Blockchains by a Minimally Trusted TEE Network

    Authors: Qian Ren, Yue Li, Yingjun Wu, Yuchen Wu, Hong Lei, Lei Wang, Bangdao Chen

    Abstract: As the confidentiality and scalability of smart contracts have become a crucial demand of blockchains, off-chain contract execution frameworks have been promising. Some have recently expanded off-chain contracts to Multi-Party Computation (MPC), which seek to transition the on-chain states by off-chain MPC. The most general problem among these solutions is MPT, since its off-chain MPC takes on- an… ▽ More

    Submitted 22 May, 2023; v1 submitted 21 February, 2022; originally announced February 2022.

    Comments: arXiv admin note: text overlap with arXiv:2106.13926

  40. arXiv:2201.10797  [pdf, other

    cs.CL cs.LG cs.NE

    An Automated Question-Answering Framework Based on Evolution Algorithm

    Authors: Sinan Tan, Hui Xue, Qiyu Ren, Huaping Liu, Jing Bai

    Abstract: Building a deep learning model for a Question-Answering (QA) task requires a lot of human effort, it may need several months to carefully tune various model architectures and find a best one. It's even harder to find different excellent models for multiple datasets. Recent works show that the best model structure is related to the dataset used, and one single model cannot adapt to all tasks. In th… ▽ More

    Submitted 26 January, 2022; originally announced January 2022.

    Comments: In Proceedings of the AAAI 2019 Workshop (WS13) on Reasoning and Complex Question-Answering (RCQA-19) https://researcher.watson.ibm.com/researcher/view_group.php?id=9632

  41. arXiv:2201.09907  [pdf, other

    cs.LG stat.AP

    Ordinal-Quadruplet: Retrieval of Missing Classes in Ordinal Time Series

    Authors: Jurijs Nazarovs, Cristian Lumezanu, Qianying Ren, Yuncong Chen, Takehiko Mizoguchi, Dongjin Song, Haifeng Chen

    Abstract: In this paper, we propose an ordered time series classification framework that is robust against missing classes in the training data, i.e., during testing we can prescribe classes that are missing during training. This framework relies on two main components: (1) our newly proposed ordinal-quadruplet loss, which forces the model to learn latent representation while preserving the ordinal relation… ▽ More

    Submitted 24 January, 2022; originally announced January 2022.

  42. arXiv:2201.00402  [pdf, other

    math.OC cs.AI cs.CR cs.LG

    A General Framework for Evaluating Robustness of Combinatorial Optimization Solvers on Graphs

    Authors: Han Lu, Zenan Li, Runzhong Wang, Qibing Ren, Junchi Yan, Xiaokang Yang

    Abstract: Solving combinatorial optimization (CO) on graphs is among the fundamental tasks for upper-stream applications in data mining, machine learning and operations research. Despite the inherent NP-hard challenge for CO, heuristics, branch-and-bound, learning-based solvers are developed to tackle CO problems as accurately as possible given limited time budgets. However, a practical metric for the sensi… ▽ More

    Submitted 4 June, 2022; v1 submitted 28 December, 2021; originally announced January 2022.

  43. arXiv:2112.14379  [pdf, other

    cs.CV

    Background-aware Classification Activation Map for Weakly Supervised Object Localization

    Authors: Lei Zhu, Qi She, Qian Chen, Xiangxi Meng, Mufeng Geng, Lujia Jin, Zhe Jiang, Bin Qiu, Yunfei You, Yibao Zhang, Qiushi Ren, Yanye Lu

    Abstract: Weakly supervised object localization (WSOL) relaxes the requirement of dense annotations for object localization by using image-level classification masks to supervise its learning process. However, current WSOL methods suffer from excessive activation of background locations and need post-processing to obtain the localization mask. This paper attributes these issues to the unawareness of backgro… ▽ More

    Submitted 28 December, 2021; originally announced December 2021.

  44. arXiv:2111.06236  [pdf, other

    cs.LG cs.AI cs.CV

    Discovering and Explaining the Representation Bottleneck of DNNs

    Authors: Huiqi Deng, Qihan Ren, Hao Zhang, Quanshi Zhang

    Abstract: This paper explores the bottleneck of feature representations of deep neural networks (DNNs), from the perspective of the complexity of interactions between input variables encoded in DNNs. To this end, we focus on the multi-order interaction between input variables, where the order represents the complexity of interactions. We discover that a DNN is more likely to encode both too simple interacti… ▽ More

    Submitted 7 November, 2022; v1 submitted 11 November, 2021; originally announced November 2021.

  45. arXiv:2111.03549  [pdf, other

    cs.CV cs.AI cs.LG

    Interpreting Representation Quality of DNNs for 3D Point Cloud Processing

    Authors: Wen Shen, Qihan Ren, Dongrui Liu, Quanshi Zhang

    Abstract: In this paper, we evaluate the quality of knowledge representations encoded in deep neural networks (DNNs) for 3D point cloud processing. We propose a method to disentangle the overall model vulnerability into the sensitivity to the rotation, the translation, the scale, and local 3D structures. Besides, we also propose metrics to evaluate the spatial smoothness of encoding 3D structures, and the r… ▽ More

    Submitted 5 November, 2021; originally announced November 2021.

  46. arXiv:2109.10750  [pdf, other

    cs.RO

    Control of Pneumatic Artificial Muscles with SNN-based Cerebellar-like Model

    Authors: Hongbo Zhang, Yunshuang Li, Yipin Guo, Xinyi Chen, Qinyuan Ren

    Abstract: Soft robotics technologies have gained growing interest in recent years, which allows various applications from manufacturing to human-robot interaction. Pneumatic artificial muscle (PAM), a typical soft actuator, has been widely applied to soft robots. The compliance and resilience of soft actuators allow soft robots to behave compliant when interacting with unstructured environments, while the u… ▽ More

    Submitted 22 September, 2021; originally announced September 2021.

  47. arXiv:2109.02396  [pdf, other

    cs.LG cs.DC

    Byzantine-Robust Federated Learning via Credibility Assessment on Non-IID Data

    Authors: Kun Zhai, Qiang Ren, Junli Wang, Chungang Yan

    Abstract: Federated learning is a novel framework that enables resource-constrained edge devices to jointly learn a model, which solves the problem of data protection and data islands. However, standard federated learning is vulnerable to Byzantine attacks, which will cause the global model to be manipulated by the attacker or fail to converge. On non-iid data, the current methods are not effective in defen… ▽ More

    Submitted 6 September, 2021; originally announced September 2021.

  48. arXiv:2107.02763  [pdf, other

    cs.CE

    Predicting Surface Heat Flux on Complex Systems via Conv-LSTM

    Authors: Yinpeng Wang, Nianru Wang, Qiang Ren

    Abstract: Existing algorithms with iterations as the principle for 3D inverse heat conduction problems (IHCPs) are usually time-consuming. With the recent advancements in deep learning techniques, it is possible to apply the neural network to compute IHCPs. In this paper, a new framework based on Convolutional-LSTM is introduced to predict the transient heat flux via measured temperature. The inverse heat c… ▽ More

    Submitted 29 June, 2021; originally announced July 2021.

    Comments: 11 pages, 9 figures

  49. arXiv:2106.14928  [pdf, ps, other

    eess.SY cs.NI

    Caching and Computation Offloading in High Altitude Platform Station (HAPS) Assisted Intelligent Transportation Systems

    Authors: Qiqi Ren, Omid Abbasi, Gunes Karabulut Kurt, Halim Yanikomeroglu, Jian Chen

    Abstract: Edge intelligence, a new paradigm to accelerate artificial intelligence (AI) applications by leveraging computing resources on the network edge, can be used to improve intelligent transportation systems (ITS). However, due to physical limitations and energy-supply constraints, the computing powers of edge equipment are usually limited. High altitude platform station (HAPS) computing can be conside… ▽ More

    Submitted 13 January, 2022; v1 submitted 28 June, 2021; originally announced June 2021.

  50. arXiv:2106.13926  [pdf, other

    cs.CR

    Cloak: Transitioning States on Legacy Blockchains Using Secure and Publicly Verifiable Off-Chain Multi-Party Computation

    Authors: Qian Ren, Yingjun Wu, Han Liu, Yue Li, Anne Victor, Hong Lei, Lei Wang, Bangdao Chen

    Abstract: In recent years, the confidentiality of smart contracts has become a fundamental requirement for practical applications. While many efforts have been made to develop architectural capabilities for enforcing confidential smart contracts, a few works arise to extend confidential smart contracts to Multi-Party Computation (MPC), i.e., multiple parties jointly evaluate a transaction off-chain and comm… ▽ More

    Submitted 10 February, 2023; v1 submitted 25 June, 2021; originally announced June 2021.

    Comments: accepted by ACSAC'22