Skip to main content

Showing 1–35 of 35 results for author: Ding, P

  1. arXiv:2406.07025  [pdf, other

    cs.LG cs.AI q-bio.QM stat.ML

    Entropy-Reinforced Planning with Large Language Models for Drug Discovery

    Authors: Xuefeng Liu, Chih-chan Tien, Peng Ding, Songhao Jiang, Rick L. Stevens

    Abstract: The objective of drug discovery is to identify chemical compounds that possess specific pharmaceutical properties toward a binding target. Existing large language models (LLMS) can achieve high token matching scores in terms of likelihood for molecule generation. However, relying solely on LLM decoding often results in the generation of molecules that are either invalid due to a single misused tok… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

    Comments: Published in ICML2024

  2. arXiv:2404.14025  [pdf, other

    cs.CV

    DHRNet: A Dual-Path Hierarchical Relation Network for Multi-Person Pose Estimation

    Authors: Yonghao Dang, Jianqin Yin, Liyuan Liu, Pengxiang Ding, Yuan Sun, Yanzhu Hu

    Abstract: Multi-person pose estimation (MPPE) presents a formidable yet crucial challenge in computer vision. Most existing methods predominantly concentrate on isolated interaction either between instances or joints, which is inadequate for scenarios demanding concurrent localization of both instances and joints. This paper introduces a novel CNN-based single-stage method, named Dual-path Hierarchical Rela… ▽ More

    Submitted 26 April, 2024; v1 submitted 22 April, 2024; originally announced April 2024.

  3. Towards more realistic human motion prediction with attention to motion coordination

    Authors: Pengxiang Ding, Jianqin Yin

    Abstract: Joint relation modeling is a curial component in human motion prediction. Most existing methods rely on skeletal-based graphs to build the joint relations, where local interactive relations between joint pairs are well learned. However, the motion coordination, a global joint relation reflecting the simultaneous cooperation of all joints, is usually weakened because it is learned from part to whol… ▽ More

    Submitted 4 April, 2024; originally announced April 2024.

    Comments: Accepted by TCSVT

  4. arXiv:2403.19913  [pdf, other

    cs.CL cs.AI cs.LG cs.RO

    MANGO: A Benchmark for Evaluating Mapping and Navigation Abilities of Large Language Models

    Authors: Peng Ding, Jiading Fang, Peng Li, Kangrui Wang, Xiaochen Zhou, Mo Yu, Jing Li, Matthew R. Walter, Hongyuan Mei

    Abstract: Large language models such as ChatGPT and GPT-4 have recently achieved astonishing performance on a variety of natural language processing tasks. In this paper, we propose MANGO, a benchmark to evaluate their capabilities to perform text-based mapping and navigation. Our benchmark includes 53 mazes taken from a suite of textgames: each maze is paired with a walkthrough that visits every location b… ▽ More

    Submitted 28 March, 2024; originally announced March 2024.

  5. arXiv:2403.14520  [pdf, other

    cs.CV

    Cobra: Extending Mamba to Multi-Modal Large Language Model for Efficient Inference

    Authors: Han Zhao, Min Zhang, Wei Zhao, Pengxiang Ding, Siteng Huang, Donglin Wang

    Abstract: In recent years, the application of multimodal large language models (MLLM) in various fields has achieved remarkable success. However, as the foundation model for many downstream tasks, current MLLMs are composed of the well-known Transformer network, which has a less efficient quadratic computation complexity. To improve the efficiency of such basic models, we propose Cobra, a linear computation… ▽ More

    Submitted 5 June, 2024; v1 submitted 21 March, 2024; originally announced March 2024.

    Comments: Update ablation results

  6. arXiv:2403.13834  [pdf, other

    cs.LG

    Few-shot Learning on Heterogeneous Graphs: Challenges, Progress, and Prospects

    Authors: Pengfei Ding, Yan Wang, Guanfeng Liu

    Abstract: Few-shot learning on heterogeneous graphs (FLHG) is attracting more attention from both academia and industry because prevailing studies on heterogeneous graphs often suffer from label sparsity. FLHG aims to tackle the performance degradation in the face of limited annotated data and there have been numerous recent studies proposing various methods and applications. In this paper, we provide a com… ▽ More

    Submitted 9 March, 2024; originally announced March 2024.

  7. arXiv:2403.13358  [pdf, other

    cs.RO cs.CV cs.LG

    GeRM: A Generalist Robotic Model with Mixture-of-experts for Quadruped Robot

    Authors: Wenxuan Song, Han Zhao, Pengxiang Ding, Can Cui, Shangke Lyu, Yaning Fan, Donglin Wang

    Abstract: Multi-task robot learning holds significant importance in tackling diverse and complex scenarios. However, current approaches are hindered by performance issues and difficulties in collecting training datasets. In this paper, we propose GeRM (Generalist Robotic Model). We utilize offline reinforcement learning to optimize data utilization strategies to learn from both demonstrations and sub-optima… ▽ More

    Submitted 9 April, 2024; v1 submitted 20 March, 2024; originally announced March 2024.

  8. arXiv:2402.01271  [pdf, other

    eess.AS cs.SD

    An Intra-BRNN and GB-RVQ Based END-TO-END Neural Audio Codec

    Authors: Linping Xu, Jiawei Jiang, Dejun Zhang, Xianjun Xia, Li Chen, Yijian Xiao, Piao Ding, Shenyi Song, Sixing Yin, Ferdous Sohel

    Abstract: Recently, neural networks have proven to be effective in performing speech coding task at low bitrates. However, under-utilization of intra-frame correlations and the error of quantizer specifically degrade the reconstructed audio quality. To improve the coding quality, we present an end-to-end neural speech codec, namely CBRC (Convolutional and Bidirectional Recurrent neural Codec). An interleave… ▽ More

    Submitted 2 February, 2024; originally announced February 2024.

    Comments: INTERSPEECH 2023

  9. arXiv:2401.03597  [pdf, other

    cs.LG cs.AI

    Few-Shot Causal Representation Learning for Out-of-Distribution Generalization on Heterogeneous Graphs

    Authors: Pengfei Ding, Yan Wang, Guanfeng Liu, Nan Wang, Xiaofang Zhou

    Abstract: Heterogeneous graph few-shot learning (HGFL) has been developed to address the label sparsity issue in heterogeneous graphs (HGs), which consist of various types of nodes and edges. The core concept of HGFL is to extract knowledge from rich-labeled classes in a source HG, transfer this knowledge to a target HG to facilitate learning new classes with few-labeled training data, and finally make pred… ▽ More

    Submitted 16 April, 2024; v1 submitted 7 January, 2024; originally announced January 2024.

  10. arXiv:2312.14457  [pdf, other

    cs.RO cs.CV

    QUAR-VLA: Vision-Language-Action Model for Quadruped Robots

    Authors: Pengxiang Ding, Han Zhao, Wenxuan Song, Wenjie Zhang, Min Zhang, Siteng Huang, Ningxi Yang, Donglin Wang

    Abstract: The important manifestation of robot intelligence is the ability to naturally interact and autonomously make decisions. Traditional approaches to robot control often compartmentalize perception, planning, and decision-making, simplifying system design but limiting the synergy between different information streams. This compartmentalization poses challenges in achieving seamless autonomous reasonin… ▽ More

    Submitted 6 July, 2024; v1 submitted 22 December, 2023; originally announced December 2023.

  11. arXiv:2312.11972  [pdf, other

    cs.CV

    Expressive Forecasting of 3D Whole-body Human Motions

    Authors: Pengxiang Ding, Qiongjie Cui, Min Zhang, Mengyuan Liu, Haofan Wang, Donglin Wang

    Abstract: Human motion forecasting, with the goal of estimating future human behavior over a period of time, is a fundamental task in many real-world applications. However, existing works typically concentrate on predicting the major joints of the human body without considering the delicate movements of the human hands. In practical applications, hand gesture plays an important role in human communication w… ▽ More

    Submitted 4 April, 2024; v1 submitted 19 December, 2023; originally announced December 2023.

    Comments: Accepted by AAAI24

  12. arXiv:2311.08268  [pdf, other

    cs.CL

    A Wolf in Sheep's Clothing: Generalized Nested Jailbreak Prompts can Fool Large Language Models Easily

    Authors: Peng Ding, Jun Kuang, Dan Ma, Xuezhi Cao, Yunsen Xian, Jiajun Chen, Shujian Huang

    Abstract: Large Language Models (LLMs), such as ChatGPT and GPT-4, are designed to provide useful and safe responses. However, adversarial prompts known as 'jailbreaks' can circumvent safeguards, leading LLMs to generate potentially harmful content. Exploring jailbreak prompts can help to better reveal the weaknesses of LLMs and further steer us to secure them. Unfortunately, existing jailbreak methods eith… ▽ More

    Submitted 6 April, 2024; v1 submitted 14 November, 2023; originally announced November 2023.

    Comments: Acccepted by NAACL 2024, 18 pages, 7 figures, 13 tables

  13. arXiv:2309.15769  [pdf, other

    math.ST cs.LG stat.ME

    Algebraic and Statistical Properties of the Ordinary Least Squares Interpolator

    Authors: Dennis Shen, Dogyoon Song, Peng Ding, Jasjeet S. Sekhon

    Abstract: Deep learning research has uncovered the phenomenon of benign overfitting for overparameterized statistical models, which has drawn significant theoretical interest in recent years. Given its simplicity and practicality, the ordinary least squares (OLS) interpolator has become essential to gain foundational insights into this phenomenon. While properties of OLS are well established in classical, u… ▽ More

    Submitted 30 May, 2024; v1 submitted 27 September, 2023; originally announced September 2023.

  14. arXiv:2308.05275  [pdf, other

    cs.LG cs.AI

    Cross-heterogeneity Graph Few-shot Learning

    Authors: Pengfei Ding, Yan Wang, Guanfeng Liu

    Abstract: In recent years, heterogeneous graph few-shot learning has been proposed to address the label sparsity issue in heterogeneous graphs (HGs), which contain various types of nodes and edges. The existing methods have achieved good performance by transferring generalized knowledge extracted from rich-labeled classes in source HG(s) to few-labeled classes in a target HG. However, these methods only con… ▽ More

    Submitted 9 August, 2023; originally announced August 2023.

  15. arXiv:2308.03271  [pdf, other

    cs.LG cs.AI

    Local Structure-aware Graph Contrastive Representation Learning

    Authors: Kai Yang, Yuan Liu, Zijuan Zhao, Peijin Ding, Wenqian Zhao

    Abstract: Traditional Graph Neural Network (GNN), as a graph representation learning method, is constrained by label information. However, Graph Contrastive Learning (GCL) methods, which tackle the label problem effectively, mainly focus on the feature information of the global graph or small subgraph structure (e.g., the first-order neighborhood). In the paper, we propose a Local Structure-aware Graph Cont… ▽ More

    Submitted 6 August, 2023; originally announced August 2023.

  16. arXiv:2307.06566  [pdf, other

    cs.CV cs.AI

    Regression-Oriented Knowledge Distillation for Lightweight Ship Orientation Angle Prediction with Optical Remote Sensing Images

    Authors: Zhan Shi, Xin Ding, Peng Ding, Chun Yang, Ru Huang, Xiaoxuan Song

    Abstract: Ship orientation angle prediction (SOAP) with optical remote sensing images is an important image processing task, which often relies on deep convolutional neural networks (CNNs) to make accurate predictions. This paper proposes a novel framework to reduce the model sizes and computational costs of SOAP models without harming prediction accuracy. First, a new SOAP model called Mobile-SOAP is desig… ▽ More

    Submitted 13 July, 2023; originally announced July 2023.

  17. arXiv:2306.13699  [pdf, other

    q-bio.QM cs.AI cs.LG q-bio.BM

    Curvature-enhanced Graph Convolutional Network for Biomolecular Interaction Prediction

    Authors: Cong Shen, Pingjian Ding, Junjie Wee, Jialin Bi, Jiawei Luo, Kelin Xia

    Abstract: Geometric deep learning has demonstrated a great potential in non-Euclidean data analysis. The incorporation of geometric insights into learning architecture is vital to its success. Here we propose a curvature-enhanced graph convolutional network (CGCN) for biomolecular interaction prediction, for the first time. Our CGCN employs Ollivier-Ricci curvature (ORC) to characterize network local struct… ▽ More

    Submitted 23 June, 2023; originally announced June 2023.

  18. arXiv:2306.07470  [pdf, other

    cs.CV cs.AI

    Reviving Shift Equivariance in Vision Transformers

    Authors: Peijian Ding, Davit Soselia, Thomas Armstrong, Jiahao Su, Furong Huang

    Abstract: Shift equivariance is a fundamental principle that governs how we perceive the world - our recognition of an object remains invariant with respect to shifts. Transformers have gained immense popularity due to their effectiveness in both language and vision tasks. While the self-attention operator in vision transformers (ViT) is permutation-equivariant and thus shift-equivariant, patch embedding, p… ▽ More

    Submitted 12 June, 2023; originally announced June 2023.

    Comments: 9 pages, 3 figures

  19. arXiv:2306.00812  [pdf, other

    eess.AS cs.SD

    Harmonic enhancement using learnable comb filter for light-weight full-band speech enhancement model

    Authors: Xiaohuai Le, Tong Lei, Li Chen, Yiqing Guo, Chao He, Cheng Chen, Xianjun Xia, Hua Gao, Yijian Xiao, Piao Ding, Shenyi Song, Jing Lu

    Abstract: With fewer feature dimensions, filter banks are often used in light-weight full-band speech enhancement models. In order to further enhance the coarse speech in the sub-band domain, it is necessary to apply a post-filtering for harmonic retrieval. The signal processing-based comb filters used in RNNoise and PercepNet have limited performance and may cause speech quality degradation due to inaccura… ▽ More

    Submitted 1 June, 2023; originally announced June 2023.

    Comments: accepted by Interspeech 2023

  20. arXiv:2304.04540  [pdf, other

    cs.CV

    FreConv: Frequency Branch-and-Integration Convolutional Networks

    Authors: Zhaowen Li, Xu Zhao, Peigeng Ding, Zongxin Gao, Yuting Yang, Ming Tang, Jinqiao Wang

    Abstract: Recent researches indicate that utilizing the frequency information of input data can enhance the performance of networks. However, the existing popular convolutional structure is not designed specifically for utilizing the frequency information contained in datasets. In this paper, we propose a novel and effective module, named FreConv (frequency branch-and-integration convolution), to replace th… ▽ More

    Submitted 10 April, 2023; originally announced April 2023.

    Comments: Accepted by ICME2023

  21. Instance-incremental Scene Graph Generation from Real-world Point Clouds via Normalizing Flows

    Authors: Chao Qi, Jianqin Yin, Jinghang Xu, Pengxiang Ding

    Abstract: This work introduces a new task of instance-incremental scene graph generation: Given a scene of the point cloud, representing it as a graph and automatically increasing novel instances. A graph denoting the object layout of the scene is finally generated. It is an important task since it helps to guide the insertion of novel 3D objects into a real-world scene in vision-based applications like aug… ▽ More

    Submitted 28 August, 2023; v1 submitted 20 February, 2023; originally announced February 2023.

    Comments: Accepted by IEEE TCSVT. The supplementary material is available in the media column of the journal version of the article

  22. arXiv:2207.05068  [pdf, other

    cs.LG cs.AI

    Few-Shot Semantic Relation Prediction across Heterogeneous Graphs

    Authors: Pengfei Ding, Yan Wang, Guanfeng Liu, Xiaofang Zhou

    Abstract: Semantic relation prediction aims to mine the implicit relationships between objects in heterogeneous graphs, which consist of different types of objects and different types of links. In real-world scenarios, new semantic relations constantly emerge and they typically appear with only a few labeled data. Since a variety of semantic relations exist in multiple heterogeneous graphs, the transferable… ▽ More

    Submitted 10 July, 2022; originally announced July 2022.

  23. arXiv:2107.03575  [pdf, other

    cs.CV

    Uncertainty-aware Human Motion Prediction

    Authors: Pengxiang Ding, Jianqin Yin

    Abstract: Human motion prediction is essential for tasks such as human motion analysis and human-robot interactions. Most existing approaches have been proposed to realize motion prediction. However, they ignore an important task, the evaluation of the quality of the predicted result. It is far more enough for current approaches in actual scenarios because people can't know how to interact with the machine… ▽ More

    Submitted 7 July, 2021; originally announced July 2021.

  24. arXiv:2106.02094  [pdf, other

    cs.CY cs.LG

    Adaptive Epidemic Forecasting and Community Risk Evaluation of COVID-19

    Authors: Vishrawas Gopalakrishnan, Sayali Navalekar, Pan Ding, Ryan Hooley, Jacob Miller, Raman Srinivasan, Ajay Deshpande, Xuan Liu, Simone Bianco, James H. Kaufman

    Abstract: Pandemic control measures like lock-down, restrictions on restaurants and gatherings, social-distancing have shown to be effective in curtailing the spread of COVID-19. However, their sustained enforcement has negative economic effects. To craft strategies and policies that reduce the hardship on the people and the economy while being effective against the pandemic, authorities need to understand… ▽ More

    Submitted 3 June, 2021; originally announced June 2021.

    Comments: 9 pages, 11 figures

    MSC Class: 34B60; 35Q99 ACM Class: I.2.1; I.6.0

  25. arXiv:2105.09711  [pdf, other

    cs.CV

    An Attractor-Guided Neural Networks for Skeleton-Based Human Motion Prediction

    Authors: Pengxiang Ding, Junying Wang, Jianqin Yin

    Abstract: Joint relation modeling is a curial component in human motion prediction. Most existing methods tend to design skeletal-based graphs to build the relations among joints, where local interactions between joint pairs are well learned. However, the global coordination of all joints, which reflects human motion's balance property, is usually weakened because it is learned from part to whole progressiv… ▽ More

    Submitted 6 May, 2022; v1 submitted 20 May, 2021; originally announced May 2021.

  26. arXiv:2103.16689  [pdf, ps, other

    cs.LG stat.ME stat.ML

    Multi-Source Causal Inference Using Control Variates

    Authors: Wenshuo Guo, Serena Wang, Peng Ding, Yixin Wang, Michael I. Jordan

    Abstract: While many areas of machine learning have benefited from the increasing availability of large and varied datasets, the benefit to causal inference has been limited given the strong assumptions needed to ensure identifiability of causal effects; these are often not satisfied in real-world datasets. For example, many large observational datasets (e.g., case-control studies in epidemiology, click-thr… ▽ More

    Submitted 5 June, 2021; v1 submitted 30 March, 2021; originally announced March 2021.

  27. arXiv:2001.04488  [pdf, other

    eess.IV cs.LG stat.ML

    Deep Residual Dense U-Net for Resolution Enhancement in Accelerated MRI Acquisition

    Authors: Pak Lun Kevin Ding, Zhiqiang Li, Yuxiang Zhou, Baoxin Li

    Abstract: Typical Magnetic Resonance Imaging (MRI) scan may take 20 to 60 minutes. Reducing MRI scan time is beneficial for both patient experience and cost considerations. Accelerated MRI scan may be achieved by acquiring less amount of k-space data (down-sampling in the k-space). However, this leads to lower resolution and aliasing artifacts for the reconstructed images. There are many existing approaches… ▽ More

    Submitted 13 January, 2020; originally announced January 2020.

    Comments: SPIE Medical Imaging 2019

  28. arXiv:1911.00962  [pdf, other

    cs.CV cs.LG eess.IV

    Conservative Wasserstein Training for Pose Estimation

    Authors: Xiaofeng Liu, Yang Zou, Tong Che, Peng Ding, Ping Jia, Jane You, Kumar B. V. K

    Abstract: This paper targets the task with discrete and periodic class labels ($e.g.,$ pose/orientation estimation) in the context of deep learning. The commonly used cross-entropy or regression loss is not well matched to this problem as they ignore the periodic nature of the labels and the class similarity, or assume labels are continuous value. We propose to incorporate inter-class correlations in a Wass… ▽ More

    Submitted 3 November, 2019; originally announced November 2019.

    Comments: ICCV 2019

  29. arXiv:1911.00055  [pdf, other

    cs.LG cs.LO stat.ML

    DRUM: End-To-End Differentiable Rule Mining On Knowledge Graphs

    Authors: Ali Sadeghian, Mohammadreza Armandpour, Patrick Ding, Daisy Zhe Wang

    Abstract: In this paper, we study the problem of learning probabilistic logical rules for inductive and interpretable link prediction. Despite the importance of inductive link prediction, most previous works focused on transductive link prediction and cannot manage previously unseen entities. Moreover, they are black-box models that are not easily explainable for humans. We propose DRUM, a scalable and diff… ▽ More

    Submitted 31 October, 2019; originally announced November 2019.

  30. arXiv:1910.06583  [pdf, other

    cs.CV

    TrajectoryNet: a new spatio-temporal feature learning network for human motion prediction

    Authors: Xiaoli Liu, Jianqin Yin, Jin Liu, Pengxiang Ding, Jun Liu, Huaping Liu

    Abstract: Human motion prediction is an increasingly interesting topic in computer vision and robotics. In this paper, we propose a new 2D CNN based network, TrajectoryNet, to predict future poses in the trajectory space. Compared with most existing methods, our model focuses on modeling the motion dynamics with coupled spatio-temporal features, local-global spatial features and global temporal co-occurrenc… ▽ More

    Submitted 20 March, 2020; v1 submitted 15 October, 2019; originally announced October 2019.

  31. arXiv:1811.09763  [pdf, other

    cs.CV cs.LG

    Mean Local Group Average Precision (mLGAP): A New Performance Metric for Hashing-based Retrieval

    Authors: Pak Lun Kevin Ding, Yikang Li, Baoxin Li

    Abstract: The research on hashing techniques for visual data is gaining increased attention in recent years due to the need for compact representations supporting efficient search/retrieval in large-scale databases such as online images. Among many possibilities, Mean Average Precision(mAP) has emerged as the dominant performance metric for hashing-based retrieval. One glaring shortcoming of mAP is its inab… ▽ More

    Submitted 23 November, 2018; originally announced November 2018.

  32. arXiv:1810.04040  [pdf, other

    cs.IR cs.AI cs.LG

    Person-Job Fit: Adapting the Right Talent for the Right Job with Joint Representation Learning

    Authors: Chen Zhu, Hengshu Zhu, Hui Xiong, Chao Ma, Fang Xie, Pengliang Ding, Pan Li

    Abstract: Person-Job Fit is the process of matching the right talent for the right job by identifying talent competencies that are required for the job. While many qualitative efforts have been made in related fields, it still lacks of quantitative ways of measuring talent competencies as well as the job's talent requirements. To this end, in this paper, we propose a novel end-to-end data-driven model based… ▽ More

    Submitted 8 October, 2018; originally announced October 2018.

    Comments: 16 pages, 5 figures

    Journal ref: ACM Transactions on Management Information Systems (2018)

  33. arXiv:1802.00212  [pdf, other

    cs.LG

    Training Neural Networks by Using Power Linear Units (PoLUs)

    Authors: Yikang Li, Pak Lun Kevin Ding, Baoxin Li

    Abstract: In this paper, we introduce "Power Linear Unit" (PoLU) which increases the nonlinearity capacity of a neural network and thus helps improving its performance. PoLU adopts several advantages of previously proposed activation functions. First, the output of PoLU for positive inputs is designed to be identity to avoid the gradient vanishing problem. Second, PoLU has a non-zero output for negative inp… ▽ More

    Submitted 1 February, 2018; originally announced February 2018.

  34. Recruitment Market Trend Analysis with Sequential Latent Variable Models

    Authors: Chen Zhu, Hengshu Zhu, Hui Xiong, Pengliang Ding, Fang Xie

    Abstract: Recruitment market analysis provides valuable understanding of industry-specific economic growth and plays an important role for both employers and job seekers. With the rapid development of online recruitment services, massive recruitment data have been accumulated and enable a new paradigm for recruitment market analysis. However, traditional methods for recruitment market analysis largely rely… ▽ More

    Submitted 8 December, 2017; originally announced December 2017.

    Comments: 11 pages, 30 figure, SIGKDD 2016

    MSC Class: 62H25

  35. arXiv:1703.02859  [pdf, other

    cs.CL

    A World of Difference: Divergent Word Interpretations among People

    Authors: Tianran Hu, Ruihua Song, Maya Abtahian, Philip Ding, Xing Xie, Jiebo Luo

    Abstract: Divergent word usages reflect differences among people. In this paper, we present a novel angle for studying word usage divergence -- word interpretations. We propose an approach that quantifies semantic differences in interpretations among different groups of people. The effectiveness of our approach is validated by quantitative evaluations. Experiment results indicate that divergences in word in… ▽ More

    Submitted 28 March, 2017; v1 submitted 8 March, 2017; originally announced March 2017.

    Comments: 4 pages, 1 figure, published at ICWSM'17