Skip to main content

Showing 1–50 of 199 results for author: Zheng, D

  1. arXiv:2407.08366  [pdf, other

    cs.RO cs.CV

    An Economic Framework for 6-DoF Grasp Detection

    Authors: Xiao-Ming Wu, Jia-Feng Cai, Jian-Jian Jiang, Dian Zheng, Yi-Lin Wei, Wei-Shi Zheng

    Abstract: Robotic grasping in clutters is a fundamental task in robotic manipulation. In this work, we propose an economic framework for 6-DoF grasp detection, aiming to economize the resource cost in training and meanwhile maintain effective grasp performance. To begin with, we discover that the dense supervision is the bottleneck of current SOTA methods that severely encumbers the entire training overload… ▽ More

    Submitted 11 July, 2024; originally announced July 2024.

    Comments: 19 pages, 7 figures. Accepted in ECCV 2024!

  2. arXiv:2407.01904  [pdf, other

    cs.DS

    From Directed Steiner Tree to Directed Polymatroid Steiner Tree in Planar Graphs

    Authors: Chandra Chekuri, Rhea Jain, Shubhang Kulkarni, Da Wei Zheng, Weihao Zhu

    Abstract: In the Directed Steiner Tree (DST) problem the input is a directed edge-weighted graph $G=(V,E)$, a root vertex $r$ and a set $S \subseteq V$ of $k$ terminals. The goal is to find a min-cost subgraph that connects $r$ to each of the terminals. DST admits an $O(\log^2 k/\log \log k)$-approximation in quasi-polynomial time, and an $O(k^ε)$-approximation for any fixed $ε> 0$ in polynomial-time. Resol… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

  3. arXiv:2406.11884  [pdf, other

    cs.SI cs.AI

    Hierarchical Compression of Text-Rich Graphs via Large Language Models

    Authors: Shichang Zhang, Da Zheng, Jiani Zhang, Qi Zhu, Xiang song, Soji Adeshina, Christos Faloutsos, George Karypis, Yizhou Sun

    Abstract: Text-rich graphs, prevalent in data mining contexts like e-commerce and academic graphs, consist of nodes with textual features linked by various relations. Traditional graph machine learning models, such as Graph Neural Networks (GNNs), excel in encoding the graph structural information, but have limited capability in handling rich text on graph nodes. Large Language Models (LLMs), noted for thei… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

  4. arXiv:2406.06918  [pdf, other

    cs.SE

    Towards more realistic evaluation of LLM-based code generation: an experimental study and beyond

    Authors: Dewu Zheng, Yanlin Wang, Ensheng Shi, Ruikai Zhang, Yuchi Ma, Hongyu Zhang, Zibin Zheng

    Abstract: To evaluate the code generation capabilities of Large Language Models (LLMs) in complex real-world software development scenarios, many evaluation approaches have been developed. They typically leverage contextual code from the latest version of a project to facilitate LLMs in accurately generating the desired function. However, such evaluation approaches fail to consider the dynamic evolution of… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

  5. arXiv:2406.06022  [pdf, other

    cs.LG cs.DC

    GraphStorm: all-in-one graph machine learning framework for industry applications

    Authors: Da Zheng, Xiang Song, Qi Zhu, Jian Zhang, Theodore Vasiloudis, Runjie Ma, Houyu Zhang, Zichen Wang, Soji Adeshina, Israt Nisa, Alejandro Mottini, Qingjun Cui, Huzefa Rangwala, Belinda Zeng, Christos Faloutsos, George Karypis

    Abstract: Graph machine learning (GML) is effective in many business applications. However, making GML easy to use and applicable to industry applications with massive datasets remain challenging. We developed GraphStorm, which provides an end-to-end solution for scalable graph construction, graph model training and inference. GraphStorm has the following desirable properties: (a) Easy to use: it can perfor… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

    Journal ref: KDD 2024

  6. arXiv:2405.19596  [pdf, ps, other

    cs.IT

    The weight hierarchies of three classes of linear codes

    Authors: Wei Lu, Qingyao Wang, Xiaoqiang Wang, Dabin Zheng

    Abstract: Studying the generalized Hamming weights of linear codes is a significant research area within coding theory, as it provides valuable structural information about the codes and plays a crucial role in determining their performance in various applications. However, determining the generalized Hamming weights of linear codes, particularly their weight hierarchy, is generally a challenging task. In t… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

  7. AI-Assisted Assessment of Coding Practices in Modern Code Review

    Authors: Manushree Vijayvergiya, Małgorzata Salawa, Ivan Budiselić, Dan Zheng, Pascal Lamblin, Marko Ivanković, Juanjo Carin, Mateusz Lewko, Jovan Andonov, Goran Petrović, Daniel Tarlow, Petros Maniatis, René Just

    Abstract: Modern code review is a process in which an incremental code contribution made by a code author is reviewed by one or more peers before it is committed to the version control system. An important element of modern code review is verifying that code contributions adhere to best practices. While some of these best practices can be automatically verified, verifying others is commonly left to human re… ▽ More

    Submitted 22 May, 2024; originally announced May 2024.

    Comments: To appear at the ACM International Conference on AI-Powered Software (AIware '24)

  8. arXiv:2404.18271  [pdf, other

    cs.CL cs.LG

    Parameter-Efficient Tuning Large Language Models for Graph Representation Learning

    Authors: Qi Zhu, Da Zheng, Xiang Song, Shichang Zhang, Bowen Jin, Yizhou Sun, George Karypis

    Abstract: Text-rich graphs, which exhibit rich textual information on nodes and edges, are prevalent across a wide range of real-world business applications. Large Language Models (LLMs) have demonstrated remarkable abilities in understanding text, which also introduced the potential for more expressive modeling in text-rich graphs. Despite these capabilities, efficiently applying LLMs to representation lea… ▽ More

    Submitted 28 April, 2024; originally announced April 2024.

  9. arXiv:2404.18135  [pdf, other

    cs.RO

    Dexterous Grasp Transformer

    Authors: Guo-Hao Xu, Yi-Lin Wei, Dian Zheng, Xiao-Ming Wu, Wei-Shi Zheng

    Abstract: In this work, we propose a novel discriminative framework for dexterous grasp generation, named Dexterous Grasp TRansformer (DGTR), capable of predicting a diverse set of feasible grasp poses by processing the object point cloud with only one forward pass. We formulate dexterous grasp generation as a set prediction task and design a transformer-based grasping model for it. However, we identify tha… ▽ More

    Submitted 28 April, 2024; originally announced April 2024.

    Comments: Accepted to CVPR 2024

  10. arXiv:2403.17502  [pdf, other

    cs.CV

    SeNM-VAE: Semi-Supervised Noise Modeling with Hierarchical Variational Autoencoder

    Authors: Dihan Zheng, Yihang Zou, Xiaowen Zhang, Chenglong Bao

    Abstract: The data bottleneck has emerged as a fundamental challenge in learning based image restoration methods. Researchers have attempted to generate synthesized training data using paired or unpaired samples to address this challenge. This study proposes SeNM-VAE, a semi-supervised noise modeling method that leverages both paired and unpaired datasets to generate realistic degraded data. Our approach is… ▽ More

    Submitted 26 March, 2024; originally announced March 2024.

  11. arXiv:2403.12303  [pdf, other

    cs.CG

    Semialgebraic Range Stabbing, Ray Shooting, and Intersection Counting in the Plane

    Authors: Timothy M. Chan, Pingan Cheng, Da Wei Zheng

    Abstract: Polynomial partitioning techniques have recently led to improved geometric data structures for a variety of fundamental problems related to semialgebraic range searching and intersection searching in 3D and higher dimensions (e.g., see [Agarwal, Aronov, Ezra, and Zahl, SoCG 2019; Ezra and Sharir, SoCG 2021; Agarwal, Aronov, Ezra, Katz, and Sharir, SoCG 2022]). They have also led to improved algori… ▽ More

    Submitted 18 March, 2024; originally announced March 2024.

    Comments: SOCG 2024

  12. arXiv:2403.11157  [pdf, other

    cs.CV

    Selective Hourglass Mapping for Universal Image Restoration Based on Diffusion Model

    Authors: Dian Zheng, Xiao-Ming Wu, Shuzhou Yang, Jian Zhang, Jian-Fang Hu, Wei-Shi Zheng

    Abstract: Universal image restoration is a practical and potential computer vision task for real-world applications. The main challenge of this task is handling the different degradation distributions at once. Existing methods mainly utilize task-specific conditions (e.g., prompt) to guide the model to learn different distributions separately, named multi-partite mapping. However, it is not suitable for uni… ▽ More

    Submitted 17 March, 2024; originally announced March 2024.

    Comments: Accepted to CVPR2024

  13. arXiv:2403.09475  [pdf, other

    cs.CR

    Covert Communication for Untrusted UAV-Assisted Wireless Systems

    Authors: Chan Gao, Linying Tian, Dong Zheng

    Abstract: Wireless systems are of paramount importance for providing ubiquitous data transmission for smart cities. However, due to the broadcasting and openness of wireless channels, such systems face potential security challenges. UAV-assisted covert communication is a supporting technology for improving covert performances and has become a hot issue in the research of wireless communication security. Thi… ▽ More

    Submitted 14 March, 2024; originally announced March 2024.

  14. arXiv:2403.05530  [pdf, other

    cs.CL cs.AI

    Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

    Authors: Gemini Team, Petko Georgiev, Ving Ian Lei, Ryan Burnell, Libin Bai, Anmol Gulati, Garrett Tanzer, Damien Vincent, Zhufeng Pan, Shibo Wang, Soroosh Mariooryad, Yifan Ding, Xinyang Geng, Fred Alcober, Roy Frostig, Mark Omernick, Lexi Walker, Cosmin Paduraru, Christina Sorokin, Andrea Tacchetti, Colin Gaffney, Samira Daruki, Olcan Sercinoglu, Zach Gleicher, Juliette Love , et al. (1092 additional authors not shown)

    Abstract: In this report, we introduce the Gemini 1.5 family of models, representing the next generation of highly compute-efficient multimodal models capable of recalling and reasoning over fine-grained information from millions of tokens of context, including multiple long documents and hours of video and audio. The family includes two new models: (1) an updated Gemini 1.5 Pro, which exceeds the February… ▽ More

    Submitted 14 June, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

  15. arXiv:2403.02630  [pdf, other

    cs.LG cs.IR cs.SI

    FedHCDR: Federated Cross-Domain Recommendation with Hypergraph Signal Decoupling

    Authors: Hongyu Zhang, Dongyi Zheng, Lin Zhong, Xu Yang, Jiyuan Feng, Yunqing Feng, Qing Liao

    Abstract: In recent years, Cross-Domain Recommendation (CDR) has drawn significant attention, which utilizes user data from multiple domains to enhance the recommendation performance. However, current CDR methods require sharing user data across domains, thereby violating the General Data Protection Regulation (GDPR). Consequently, numerous approaches have been proposed for Federated Cross-Domain Recommenda… ▽ More

    Submitted 10 June, 2024; v1 submitted 4 March, 2024; originally announced March 2024.

    Comments: 16 pages, 5 figures

  16. arXiv:2403.00095  [pdf

    cs.CY physics.soc-ph

    Solving Jigsaw Puzzles using Iterative Random Sampling: Parallels with Development of Skill Mastery

    Authors: Neil Zhao, Diana Zheng

    Abstract: Skill mastery is a priority for success in all fields. We present a parallel between the development of skill mastery and the process of solving jigsaw puzzles. We show that iterative random sampling solves jigsaw puzzles in two phases: a lag phase that is characterized by little change and occupies the majority of the time, and a growth phase that marks rapid and imminent puzzle completion. Chang… ▽ More

    Submitted 29 February, 2024; originally announced March 2024.

    Comments: 26 pages, 15 figures, 1 table

  17. arXiv:2402.12554  [pdf, other

    cs.CL

    Archer: A Human-Labeled Text-to-SQL Dataset with Arithmetic, Commonsense and Hypothetical Reasoning

    Authors: Danna Zheng, Mirella Lapata, Jeff Z. Pan

    Abstract: We present Archer, a challenging bilingual text-to-SQL dataset specific to complex reasoning, including arithmetic, commonsense and hypothetical reasoning. It contains 1,042 English questions and 1,042 Chinese questions, along with 521 unique SQL queries, covering 20 English databases across 20 domains. Notably, this dataset demonstrates a significantly higher level of complexity compared to exist… ▽ More

    Submitted 24 February, 2024; v1 submitted 19 February, 2024; originally announced February 2024.

    Comments: EACL 2024

  18. arXiv:2402.12545  [pdf, other

    cs.CL

    TrustScore: Reference-Free Evaluation of LLM Response Trustworthiness

    Authors: Danna Zheng, Danyang Liu, Mirella Lapata, Jeff Z. Pan

    Abstract: Large Language Models (LLMs) have demonstrated impressive capabilities across various domains, prompting a surge in their practical applications. However, concerns have arisen regarding the trustworthiness of LLMs outputs, particularly in closed-book question-answering tasks, where non-experts may struggle to identify inaccuracies due to the absence of contextual or ground truth information. This… ▽ More

    Submitted 6 May, 2024; v1 submitted 19 February, 2024; originally announced February 2024.

  19. arXiv:2402.07999  [pdf, other

    cs.LG cs.SI

    NetInfoF Framework: Measuring and Exploiting Network Usable Information

    Authors: Meng-Chieh Lee, Haiyang Yu, Jian Zhang, Vassilis N. Ioannidis, Xiang Song, Soji Adeshina, Da Zheng, Christos Faloutsos

    Abstract: Given a node-attributed graph, and a graph task (link prediction or node classification), can we tell if a graph neural network (GNN) will perform well? More specifically, do the graph structure and the node features carry enough usable information for the task? Our goals are (1) to develop a fast tool to measure how much information is in the graph structure and in the node features, and (2) to e… ▽ More

    Submitted 20 March, 2024; v1 submitted 12 February, 2024; originally announced February 2024.

    Comments: Accepted to ICLR 2024 (Spotlight)

  20. arXiv:2401.16444  [pdf, other

    cs.HC cs.AI

    Enhancing Human Experience in Human-Agent Collaboration: A Human-Centered Modeling Approach Based on Positive Human Gain

    Authors: Yiming Gao, Feiyu Liu, Liang Wang, Zhenjie Lian, Dehua Zheng, Weixuan Wang, Wenjin Yang, Siqin Li, Xianliang Wang, Wenhui Chen, Jing Dai, Qiang Fu, Wei Yang, Lanxiao Huang, Wei Liu

    Abstract: Existing game AI research mainly focuses on enhancing agents' abilities to win games, but this does not inherently make humans have a better experience when collaborating with these agents. For example, agents may dominate the collaboration and exhibit unintended or detrimental behaviors, leading to poor experiences for their human partners. In other words, most game AI agents are modeled in a "se… ▽ More

    Submitted 28 January, 2024; originally announced January 2024.

    Comments: Accepted at ICLR 2024. arXiv admin note: text overlap with arXiv:2304.11632

  21. arXiv:2401.15560  [pdf

    cs.IT cs.CL

    An Analysis of Letter Dynamics in the English Alphabet

    Authors: Neil Zhao, Diana Zheng

    Abstract: The frequency with which the letters of the English alphabet appear in writings has been applied to the field of cryptography, the development of keyboard mechanics, and the study of linguistics. We expanded on the statistical analysis of the English alphabet by examining the average frequency which each letter appears in different categories of writings. We evaluated news articles, novels, plays,… ▽ More

    Submitted 27 January, 2024; originally announced January 2024.

    Comments: 22 pages, 6 figures, 5 tables

    MSC Class: 94A15

  22. arXiv:2401.00283  [pdf, other

    cs.IT eess.SP

    Near-Space Communications: the Last Piece of 6G Space-Air-Ground-Sea Integrated Network Puzzle

    Authors: Hongshan Liu, Tong Qin, Zhen Gao, Tianqi Mao, Keke Ying, Ziwei Wan, Li Qiao, Rui Na, Zhongxiang Li, Chun Hu, Yikun Mei, Tuan Li, Guanghui Wen, Lei Chen, Zhonghuai Wu, Ruiqi Liu, Gaojie Chen, Shuo Wang, Dezhi Zheng

    Abstract: This article presents a comprehensive study on the emerging near-space communications (NS-COM) within the context of space-air-ground-sea integrated network (SAGSIN). Specifically, we firstly explore the recent technical developments of NS-COM, followed by the discussions about motivations behind integrating NS-COM into SAGSIN. To further demonstrate the necessity of NS-COM, a comparative analysis… ▽ More

    Submitted 4 March, 2024; v1 submitted 30 December, 2023; originally announced January 2024.

    Comments: 28 pages, 8 figures, 2 tables

  23. arXiv:2312.08887  [pdf, other

    cs.CV cs.LG

    SpeedUpNet: A Plug-and-Play Hyper-Network for Accelerating Text-to-Image Diffusion Models

    Authors: Weilong Chai, DanDan Zheng, Jiajiong Cao, Zhiquan Chen, Changbao Wang, Chenguang Ma

    Abstract: Text-to-image diffusion models (SD) exhibit significant advancements while requiring extensive computational resources. Though many acceleration methods have been proposed, they suffer from generation quality degradation or extra training cost generalizing to new fine-tuned models. To address these limitations, we propose a novel and universal Stable-Diffusion (SD) acceleration module called Speed… ▽ More

    Submitted 20 December, 2023; v1 submitted 13 December, 2023; originally announced December 2023.

    Comments: Table 1. shows the comparison with existing methods, but the lack of experimental data of the LCM method under 12-step makes the table incomplete. We need to temporarily withdraw the manuscript and conduct corresponding experiments before resubmitting it

  24. arXiv:2312.06682  [pdf, other

    cs.AI cs.LG

    Learning to Denoise Unreliable Interactions for Link Prediction on Biomedical Knowledge Graph

    Authors: Tengfei Ma, Yujie Chen, Wen Tao, Dashun Zheng, Xuan Lin, Patrick Cheong-lao Pang, Yiping Liu, Yijun Wang, Bosheng Song, Xiangxiang Zeng

    Abstract: Link prediction in biomedical knowledge graphs (KGs) aims at predicting unknown interactions between entities, including drug-target interaction (DTI) and drug-drug interaction (DDI), which is critical for drug discovery and therapeutics. Previous methods prefer to utilize the rich semantic relations and topological structure of the KG to predict missing links, yielding promising outcomes. However… ▽ More

    Submitted 9 December, 2023; originally announced December 2023.

  25. arXiv:2312.05474  [pdf, ps, other

    cs.IT

    The duals of narrow-sense BCH codes with length $\frac{q^m-1}λ$

    Authors: Xiaoqiang Wang, Chengliang Xiao, Dabin Zheng

    Abstract: BCH codes are an interesting class of cyclic codes due to their efficient encoding and decoding algorithms. In the past sixty years, a lot of progress on the study of BCH codes has been made, but little is known about the properties of their duals. Recently, in order to study the duals of BCH codes and the lower bounds on their minimum distances, a new concept called dually-BCH code was proposed b… ▽ More

    Submitted 9 December, 2023; originally announced December 2023.

  26. arXiv:2312.02010  [pdf, other

    cs.CV cs.AI

    Towards Learning a Generalist Model for Embodied Navigation

    Authors: Duo Zheng, Shijia Huang, Lin Zhao, Yiwu Zhong, Liwei Wang

    Abstract: Building a generalist agent that can interact with the world is the intriguing target of AI systems, thus spurring the research for embodied navigation, where an agent is required to navigate according to instructions or respond to queries. Despite the major progress attained, previous works primarily focus on task-specific agents and lack generalizability to unseen scenarios. Recently, LLMs have… ▽ More

    Submitted 1 April, 2024; v1 submitted 4 December, 2023; originally announced December 2023.

    Comments: Accepted by CVPR 2024 (14 pages, 3 figures)

  27. arXiv:2311.18432  [pdf, ps, other

    cs.IT

    Three classes of new optimal cyclic $(r,δ)$ locally recoverable codes

    Authors: Yaozong Zhang, Dabin Zheng, Xiaoqiang Wang

    Abstract: An $(r, δ)$-locally repairable code ($(r, δ)$-LRC for short) was introduced by Prakash et al. for tolerating multiple failed nodes in distributed storage systems, and has garnered significant interest among researchers. An $(r,δ)$-LRC is called an optimal code if its parameters achieve the Singleton-like bound. In this paper, we construct three classes of $q$-ary optimal cyclic $(r,δ)$-LRCs with n… ▽ More

    Submitted 30 November, 2023; originally announced November 2023.

  28. arXiv:2311.10372  [pdf, other

    cs.SE

    A Survey of Large Language Models for Code: Evolution, Benchmarking, and Future Trends

    Authors: Zibin Zheng, Kaiwen Ning, Yanlin Wang, Jingwen Zhang, Dewu Zheng, Mingxi Ye, Jiachi Chen

    Abstract: General large language models (LLMs), represented by ChatGPT, have demonstrated significant potential in tasks such as code generation in software engineering. This has led to the development of specialized LLMs for software engineering, known as Code LLMs. A considerable portion of Code LLMs is derived from general LLMs through model fine-tuning. As a result, Code LLMs are often updated frequentl… ▽ More

    Submitted 8 January, 2024; v1 submitted 17 November, 2023; originally announced November 2023.

  29. arXiv:2311.07993  [pdf, other

    cs.CV

    Explicit Change Relation Learning for Change Detection in VHR Remote Sensing Images

    Authors: Dalong Zheng, Zebin Wu, Jia Liu, Chih-Cheng Hung, Zhihui Wei

    Abstract: Change detection has always been a concerned task in the interpretation of remote sensing images. It is essentially a unique binary classification task with two inputs, and there is a change relationship between these two inputs. At present, the mining of change relationship features is usually implicit in the network architectures that contain single-branch or two-branch encoders. However, due to… ▽ More

    Submitted 14 November, 2023; originally announced November 2023.

  30. 3DGAUnet: 3D generative adversarial networks with a 3D U-Net based generator to achieve the accurate and effective synthesis of clinical tumor image data for pancreatic cancer

    Authors: Yu Shi, Hannah Tang, Michael Baine, Michael A. Hollingsworth, Huijing Du, Dandan Zheng, Chi Zhang, Hongfeng Yu

    Abstract: Pancreatic ductal adenocarcinoma (PDAC) presents a critical global health challenge, and early detection is crucial for improving the 5-year survival rate. Recent medical imaging and computational algorithm advances offer potential solutions for early diagnosis. Deep learning, particularly in the form of convolutional neural networks (CNNs), has demonstrated success in medical image analysis tasks… ▽ More

    Submitted 27 November, 2023; v1 submitted 9 November, 2023; originally announced November 2023.

    Comments: Published on Cancers: Shi, Yu, Hannah Tang, Michael J. Baine, Michael A. Hollingsworth, Huijing Du, Dandan Zheng, Chi Zhang, and Hongfeng Yu. 2023. "3DGAUnet: 3D Generative Adversarial Networks with a 3D U-Net Based Generator to Achieve the Accurate and Effective Synthesis of Clinical Tumor Image Data for Pancreatic Cancer" Cancers 15, no. 23: 5496

  31. arXiv:2311.05141  [pdf, other

    cs.RO

    Differentiable Cloth Parameter Identification and State Estimation in Manipulation

    Authors: Dongzhe Zheng, Siqiong Yao, Wenqiang Xu, Cewu Lu

    Abstract: In the realm of robotic cloth manipulation, accurately estimating the cloth state during or post-execution is imperative. However, the inherent complexities in a cloth's dynamic behavior and its near-infinite degrees of freedom (DoF) pose significant challenges. Traditional methods have been restricted to using keypoints or boundaries as cues for cloth state, which do not holistically capture the… ▽ More

    Submitted 8 November, 2023; originally announced November 2023.

  32. arXiv:2311.05137  [pdf, other

    cs.RO

    Differentiable Fluid Physics Parameter Identification Via Stirring

    Authors: Wenqiang Xu, Dongzhe Zheng, Yutong Li, Jieji Ren, Cewu Lu

    Abstract: Fluid interactions permeate daily human activities, with properties like density and viscosity playing pivotal roles in household tasks. While density estimation is straightforward through Archimedes' principle, viscosity poses a more intricate challenge, especially given the varied behaviors of Newtonian and non-Newtonian fluids. These fluids, which differ in their stress-strain relationships, ar… ▽ More

    Submitted 8 November, 2023; originally announced November 2023.

  33. arXiv:2311.01267  [pdf, other

    cs.RO cs.AI cs.CV

    UniFolding: Towards Sample-efficient, Scalable, and Generalizable Robotic Garment Folding

    Authors: Han Xue, Yutong Li, Wenqiang Xu, Huanyu Li, Dongzhe Zheng, Cewu Lu

    Abstract: This paper explores the development of UniFolding, a sample-efficient, scalable, and generalizable robotic system for unfolding and folding various garments. UniFolding employs the proposed UFONet neural network to integrate unfolding and folding decisions into a single policy model that is adaptable to different garment types and states. The design of UniFolding is based on a garment's partial po… ▽ More

    Submitted 2 November, 2023; originally announced November 2023.

    Comments: CoRL 2023

  34. arXiv:2310.17331  [pdf

    cs.CE

    A novel solution for seepage problems using physics-informed neural networks

    Authors: Tianfu Luo, Yelin Feng, Qingfu Huang, Zongliang Zhang, Mingjiao Yan, Zaihong Yang, Dawei Zheng, Yang Yang

    Abstract: A Physics-Informed Neural Network (PINN) provides a distinct advantage by synergizing neural networks' capabilities with the problem's governing physical laws. In this study, we introduce an innovative approach for solving seepage problems by utilizing the PINN, harnessing the capabilities of Deep Neural Networks (DNNs) to approximate hydraulic head distributions in seepage analysis. To effectivel… ▽ More

    Submitted 25 November, 2023; v1 submitted 26 October, 2023; originally announced October 2023.

  35. arXiv:2310.15363  [pdf, other

    cs.CG

    An Optimal Algorithm for Higher-Order Voronoi Diagrams in the Plane: The Usefulness of Nondeterminism

    Authors: Timothy M. Chan, Pingan Cheng, Da Wei Zheng

    Abstract: We present the first optimal randomized algorithm for constructing the order-$k$ Voronoi diagram of $n$ points in two dimensions. The expected running time is $O(n\log n + nk)$, which improves the previous, two-decades-old result of Ramos (SoCG'99) by a $2^{O(\log^*k)}$ factor. To obtain our result, we (i) use a recent decision-tree technique of Chan and Zheng (SODA'22) in combination with Ramos's… ▽ More

    Submitted 23 October, 2023; originally announced October 2023.

    Comments: To appear in SODA 2024. 16 pages, 1 figure

  36. arXiv:2310.11873  [pdf, ps, other

    cs.IT

    The Weight Hierarchies of Linear Codes from Simplicial Complexes

    Authors: Chao Liu, Dabin Zheng, Wei Lu, Xiaoqiang Wang

    Abstract: The study of the generalized Hamming weight of linear codes is a significant research topic in coding theory as it conveys the structural information of the codes and determines their performance in various applications. However, determining the generalized Hamming weights of linear codes, especially the weight hierarchy, is generally challenging. In this paper, we investigate the generalized Hamm… ▽ More

    Submitted 18 October, 2023; originally announced October 2023.

  37. arXiv:2310.06328  [pdf, other

    cs.LG eess.SP

    Antenna Response Consistency Driven Self-supervised Learning for WIFI-based Human Activity Recognition

    Authors: Ke Xu, Jiangtao Wang, Hongyuan Zhu, Dingchang Zheng

    Abstract: Self-supervised learning (SSL) for WiFi-based human activity recognition (HAR) holds great promise due to its ability to address the challenge of insufficient labeled data. However, directly transplanting SSL algorithms, especially contrastive learning, originally designed for other domains to CSI data, often fails to achieve the expected performance. We attribute this issue to the inappropriate a… ▽ More

    Submitted 28 November, 2023; v1 submitted 10 October, 2023; originally announced October 2023.

  38. arXiv:2309.15668  [pdf, other

    cs.IT cs.NI

    A New Centralized Multi-Node Repair Scheme of MSR codes with Error-Correcting Capability

    Authors: Shenghua Li, Maximilien Gadouleau, Jiaojiao Wang, Dabin Zheng

    Abstract: Minimum storage regenerating (MSR) codes, with the MDS property and the optimal repair bandwidth, are widely used in distributed storage systems (DSS) for data recovery. In this paper, we consider the construction of $(n,k,l)$ MSR codes in the centralized model that can repair $h$ failed nodes simultaneously with $e$ out $d$ helper nodes providing erroneous information. We first propose the new re… ▽ More

    Submitted 27 September, 2023; originally announced September 2023.

  39. arXiv:2309.14954  [pdf, other

    q-bio.BM cs.AI

    Addressing preferred orientation in single-particle cryo-EM through AI-generated auxiliary particles

    Authors: Hui Zhang, Dihan Zheng, Qiurong Wu, Nieng Yan, Zuoqiang Shi, Mingxu Hu, Chenglong Bao

    Abstract: The single-particle cryo-EM field faces the persistent challenge of preferred orientation, lacking general computational solutions. We introduce cryoPROS, an AI-based approach designed to address the above issue. By generating the auxiliary particles with a conditional deep generative model, cryoPROS addresses the intrinsic bias in orientation estimation for the observed particles. We effectively… ▽ More

    Submitted 26 September, 2023; originally announced September 2023.

  40. arXiv:2309.12645  [pdf, other

    cs.IR

    KuaiSim: A Comprehensive Simulator for Recommender Systems

    Authors: Kesen Zhao, Shuchang Liu, Qingpeng Cai, Xiangyu Zhao, Ziru Liu, Dong Zheng, Peng Jiang, Kun Gai

    Abstract: Reinforcement Learning (RL)-based recommender systems (RSs) have garnered considerable attention due to their ability to learn optimal recommendation policies and maximize long-term user rewards. However, deploying RL models directly in online environments and generating authentic data through A/B tests can pose challenges and require substantial resources. Simulators offer an alternative approach… ▽ More

    Submitted 19 October, 2023; v1 submitted 22 September, 2023; originally announced September 2023.

  41. arXiv:2309.10722  [pdf, other

    cs.RO cs.AI

    LEA*: An A* Variant Algorithm with Improved Edge Efficiency for Robot Motion Planning

    Authors: Dongliang Zheng, Panagiotis Tsiotras

    Abstract: In this work, we introduce a new graph search algorithm, lazy edged based A* (LEA*), for robot motion planning. By using an edge queue and exploiting the idea of lazy search, LEA* is optimally vertex efficient similar to A*, and has improved edge efficiency compared to A*. LEA* is simple and easy to implement with minimum modification to A*, resulting in a very small overhead compared to previous… ▽ More

    Submitted 19 September, 2023; originally announced September 2023.

  42. FedDCSR: Federated Cross-domain Sequential Recommendation via Disentangled Representation Learning

    Authors: Hongyu Zhang, Dongyi Zheng, Xu Yang, Jiyuan Feng, Qing Liao

    Abstract: Cross-domain Sequential Recommendation (CSR) which leverages user sequence data from multiple domains has received extensive attention in recent years. However, the existing CSR methods require sharing origin user data across domains, which violates the General Data Protection Regulation (GDPR). Thus, it is necessary to combine federated learning (FL) and CSR to fully utilize knowledge from differ… ▽ More

    Submitted 16 January, 2024; v1 submitted 15 September, 2023; originally announced September 2023.

  43. arXiv:2309.04068  [pdf, ps, other

    cs.IT

    Two classes of reducible cyclic codes with large minimum symbol-pair distances

    Authors: Xiaoqiang Wang, Yue Su, Dabin Zheng, Wei Lu

    Abstract: The high-density data storage technology aims to design high-capacity storage at a relatively low cost. In order to achieve this goal, symbol-pair codes were proposed by Cassuto and Blaum \cite{CB10,CB11} to handle channels that output pairs of overlapping symbols. Such a channel is called symbol-pair read channel, which introduce new concept called symbol-pair weight and minimum symbol-pair dista… ▽ More

    Submitted 7 September, 2023; originally announced September 2023.

  44. arXiv:2308.15989  [pdf, other

    cs.CV

    DiffuVolume: Diffusion Model for Volume based Stereo Matching

    Authors: Dian Zheng, Xiao-Ming Wu, Zuhao Liu, Jingke Meng, Wei-shi Zheng

    Abstract: Stereo matching is a significant part in many computer vision tasks and driving-based applications. Recently cost volume-based methods have achieved great success benefiting from the rich geometry information in paired images. However, the redundancy of cost volume also interferes with the model training and limits the performance. To construct a more precise cost volume, we pioneeringly apply the… ▽ More

    Submitted 30 August, 2023; originally announced August 2023.

    Comments: 17 pages, 11 figures

  45. arXiv:2308.11159  [pdf, other

    cs.CV

    SwinV2DNet: Pyramid and Self-Supervision Compounded Feature Learning for Remote Sensing Images Change Detection

    Authors: Dalong Zheng, Zebin Wu, Jia Liu, Zhihui Wei

    Abstract: Among the current mainstream change detection networks, transformer is deficient in the ability to capture accurate low-level details, while convolutional neural network (CNN) is wanting in the capacity to understand global information and establish remote spatial relationships. Meanwhile, both of the widely used early fusion and late fusion frameworks are not able to well learn complete change fe… ▽ More

    Submitted 21 August, 2023; originally announced August 2023.

  46. arXiv:2308.06689  [pdf, other

    cs.CV

    Estimator Meets Equilibrium Perspective: A Rectified Straight Through Estimator for Binary Neural Networks Training

    Authors: Xiao-Ming Wu, Dian Zheng, Zuhao Liu, Wei-Shi Zheng

    Abstract: Binarization of neural networks is a dominant paradigm in neural networks compression. The pioneering work BinaryConnect uses Straight Through Estimator (STE) to mimic the gradients of the sign function, but it also causes the crucial inconsistency problem. Most of the previous methods design different estimators instead of STE to mitigate it. However, they ignore the fact that when reducing the e… ▽ More

    Submitted 25 August, 2023; v1 submitted 13 August, 2023; originally announced August 2023.

    Comments: 10 pages, 6 figures. Accepted in ICCV 2023

  47. arXiv:2308.04813  [pdf, other

    cs.CL

    CLEVA: Chinese Language Models EVAluation Platform

    Authors: Yanyang Li, Jianqiao Zhao, Duo Zheng, Zi-Yuan Hu, Zhi Chen, Xiaohui Su, Yongfeng Huang, Shijia Huang, Dahua Lin, Michael R. Lyu, Liwei Wang

    Abstract: With the continuous emergence of Chinese Large Language Models (LLMs), how to evaluate a model's capabilities has become an increasingly significant issue. The absence of a comprehensive Chinese benchmark that thoroughly assesses a model's performance, the unstandardized and incomparable prompting procedure, and the prevalent risk of contamination pose major challenges in the current evaluation of… ▽ More

    Submitted 16 October, 2023; v1 submitted 9 August, 2023; originally announced August 2023.

    Comments: EMNLP 2023 System Demonstrations camera-ready

  48. arXiv:2308.02412  [pdf, other

    eess.SP cs.AI cs.HC cs.LG

    Self-Supervised Learning for WiFi CSI-Based Human Activity Recognition: A Systematic Study

    Authors: Ke Xu, Jiangtao Wang, Hongyuan Zhu, Dingchang Zheng

    Abstract: Recently, with the advancement of the Internet of Things (IoT), WiFi CSI-based HAR has gained increasing attention from academic and industry communities. By integrating the deep learning technology with CSI-based HAR, researchers achieve state-of-the-art performance without the need of expert knowledge. However, the scarcity of labeled CSI data remains the most prominent challenge when applying d… ▽ More

    Submitted 19 July, 2023; originally announced August 2023.

  49. arXiv:2308.00890  [pdf, other

    cs.LG

    Tango: rethinking quantization for graph neural network training on GPUs

    Authors: Shiyang Chen, Da Zheng, Caiwen Ding, Chengying Huan, Yuede Ji, Hang Liu

    Abstract: Graph Neural Networks (GNNs) are becoming increasingly popular due to their superior performance in critical graph-related tasks. While quantization is widely used to accelerate GNN computation, quantized training faces unprecedented challenges. Current quantized GNN training systems often have longer training times than their full-precision counterparts for two reasons: (i) addressing the accurac… ▽ More

    Submitted 31 August, 2023; v1 submitted 1 August, 2023; originally announced August 2023.

  50. arXiv:2307.12612  [pdf, other

    cs.CV cs.AI

    Less is More: Focus Attention for Efficient DETR

    Authors: Dehua Zheng, Wenhui Dong, Hailin Hu, Xinghao Chen, Yunhe Wang

    Abstract: DETR-like models have significantly boosted the performance of detectors and even outperformed classical convolutional models. However, all tokens are treated equally without discrimination brings a redundant computational burden in the traditional encoder structure. The recent sparsification strategies exploit a subset of informative tokens to reduce attention complexity maintaining performance t… ▽ More

    Submitted 24 July, 2023; originally announced July 2023.

    Comments: 8 pages, 6 figures, accepted to ICCV2023