Skip to main content

Showing 1–50 of 194 results for author: lv, J

  1. arXiv:2407.11550  [pdf, other

    cs.CL cs.AI

    Optimizing KV Cache Eviction in LLMs: Adaptive Allocation for Enhanced Budget Utilization

    Authors: Yuan Feng, Junlin Lv, Yukun Cao, Xike Xie, S. Kevin Zhou

    Abstract: Large Language Models have excelled in various fields but encounter efficiency limitations due to the extensive KV cache required for long sequences inference. Many efforts try to evict non-critical cache elements during runtime, thereby reducing cache size within a given memory budget while preserving generation quality. Our reexamination of their underlying principles discerns that prevailing st… ▽ More

    Submitted 16 July, 2024; originally announced July 2024.

  2. arXiv:2407.10204  [pdf, other

    cs.LG

    Improving Graph Out-of-distribution Generalization on Real-world Data

    Authors: Can Xu, Yao Cheng, Jianxiang Yu, Haosen Wang, Jingsong Lv, Xiang Li

    Abstract: Existing methods for graph out-of-distribution (OOD) generalization primarily rely on empirical studies on synthetic datasets. Such approaches tend to overemphasize the causal relationships between invariant sub-graphs and labels, thereby neglecting the non-negligible role of environment in real-world scenarios. In contrast to previous studies that impose rigid independence assumptions on environm… ▽ More

    Submitted 14 July, 2024; originally announced July 2024.

    Comments: 21 pages, 5 figures

  3. arXiv:2407.04193  [pdf, ps, other

    cs.IT

    Combinatorial Constructions of Optimal Quaternary Additive Codes

    Authors: Chaofeng Guan, Jingjie Lv, Gaojun Luo, Zhi Ma

    Abstract: This paper aims to construct optimal quaternary additive codes with non-integer dimensions. Firstly, we propose combinatorial constructions of quaternary additive constant-weight codes, alongside additive anticode construction. Subsequently, we propose generalized Construction X, which facilitates the construction of non-integer dimensional optimal additive codes from linear codes. Then, we constr… ▽ More

    Submitted 4 July, 2024; originally announced July 2024.

    Comments: This work was mainly completed in the summer of 2023, and here we add some new developments. Everyone is welcome to discuss issues related to additional code with the first author

  4. arXiv:2407.03596  [pdf, other

    cs.CV

    Self Adaptive Threshold Pseudo-labeling and Unreliable Sample Contrastive Loss for Semi-supervised Image Classification

    Authors: Xuerong Zhang, Li Huang, Jing Lv, Ming Yang

    Abstract: Semi-supervised learning is attracting blooming attention, due to its success in combining unlabeled data. However, pseudo-labeling-based semi-supervised approaches suffer from two problems in image classification: (1) Existing methods might fail to adopt suitable thresholds since they either use a pre-defined/fixed threshold or an ad-hoc threshold adjusting scheme, resulting in inferior performan… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

    Comments: ICANN24 accepted

  5. arXiv:2407.03245  [pdf, other

    cs.RO cs.AI eess.SY

    TieBot: Learning to Knot a Tie from Visual Demonstration through a Real-to-Sim-to-Real Approach

    Authors: Weikun Peng, Jun Lv, Yuwei Zeng, Haonan Chen, Siheng Zhao, Jichen Sun, Cewu Lu, Lin Shao

    Abstract: The tie-knotting task is highly challenging due to the tie's high deformation and long-horizon manipulation actions. This work presents TieBot, a Real-to-Sim-to-Real learning from visual demonstration system for the robots to learn to knot a tie. We introduce the Hierarchical Feature Matching approach to estimate a sequence of tie's meshes from the demonstration video. With these estimated meshes… ▽ More

    Submitted 3 July, 2024; v1 submitted 3 July, 2024; originally announced July 2024.

    Comments: fix few typos

  6. arXiv:2407.00299  [pdf, other

    cs.RO cs.AI cs.CV cs.HC cs.LG

    Human-Agent Joint Learning for Efficient Robot Manipulation Skill Acquisition

    Authors: Shengcheng Luo, Quanquan Peng, Jun Lv, Kaiwen Hong, Katherine Rose Driggs-Campbell, Cewu Lu, Yong-Lu Li

    Abstract: Employing a teleoperation system for gathering demonstrations offers the potential for more efficient learning of robot manipulation. However, teleoperating a robot arm equipped with a dexterous hand or gripper, via a teleoperation system poses significant challenges due to its high dimensionality, complex motions, and differences in physiological structure. In this study, we introduce a novel s… ▽ More

    Submitted 2 July, 2024; v1 submitted 28 June, 2024; originally announced July 2024.

    Comments: 8 pages, 6 figures

  7. arXiv:2406.10580  [pdf, other

    cs.CV

    IMDL-BenCo: A Comprehensive Benchmark and Codebase for Image Manipulation Detection & Localization

    Authors: Xiaochen Ma, Xuekang Zhu, Lei Su, Bo Du, Zhuohang Jiang, Bingkui Tong, Zeyu Lei, Xinyu Yang, Chi-Man Pun, Jiancheng Lv, Jizhe Zhou

    Abstract: A comprehensive benchmark is yet to be established in the Image Manipulation Detection \& Localization (IMDL) field. The absence of such a benchmark leads to insufficient and misleading model evaluations, severely undermining the development of this field. However, the scarcity of open-sourced baseline models and inconsistent training and evaluation protocols make conducting rigorous experiments a… ▽ More

    Submitted 15 June, 2024; originally announced June 2024.

    Comments: Technical report

  8. arXiv:2406.01931  [pdf, other

    cs.CL

    Dishonesty in Helpful and Harmless Alignment

    Authors: Youcheng Huang, Jingkun Tang, Duanyu Feng, Zheng Zhang, Wenqiang Lei, Jiancheng Lv, Anthony G. Cohn

    Abstract: People tell lies when seeking rewards. Large language models (LLMs) are aligned to human values with reinforcement learning where they get rewards if they satisfy human preference. We find that this also induces dishonesty in helpful and harmless alignment where LLMs tell lies in generating harmless responses. Using the latest interpreting tools, we detect dishonesty, show how LLMs can be harmful… ▽ More

    Submitted 5 June, 2024; v1 submitted 3 June, 2024; originally announced June 2024.

  9. arXiv:2405.12081  [pdf, other

    cs.CL

    Selective Annotation via Data Allocation: These Data Should Be Triaged to Experts for Annotation Rather Than the Model

    Authors: Chen Huang, Yang Deng, Wenqiang Lei, Jiancheng Lv, Ido Dagan

    Abstract: To obtain high-quality annotations under limited budget, semi-automatic annotation methods are commonly used, where a portion of the data is annotated by experts and a model is then trained to complete the annotations for the remaining data. However, these methods mainly focus on selecting informative data for expert annotations to improve the model predictive ability (i.e., triage-to-human data),… ▽ More

    Submitted 20 May, 2024; originally announced May 2024.

    Comments: 18 pages, 4 figures

  10. arXiv:2405.11912  [pdf, other

    cs.CL cs.HC

    ARAIDA: Analogical Reasoning-Augmented Interactive Data Annotation

    Authors: Chen Huang, Yiping Jin, Ilija Ilievski, Wenqiang Lei, Jiancheng Lv

    Abstract: Human annotation is a time-consuming task that requires a significant amount of effort. To address this issue, interactive data annotation utilizes an annotation model to provide suggestions for humans to approve or correct. However, annotation models trained with limited labeled data are prone to generating incorrect suggestions, leading to extra human correction effort. To tackle this challenge,… ▽ More

    Submitted 1 June, 2024; v1 submitted 20 May, 2024; originally announced May 2024.

    Comments: Accepted to ACL 2024. Camera Ready

  11. arXiv:2405.10890  [pdf, other

    astro-ph.IM astro-ph.GA cs.AI

    A Versatile Framework for Analyzing Galaxy Image Data by Implanting Human-in-the-loop on a Large Vision Model

    Authors: Mingxiang Fu, Yu Song, Jiameng Lv, Liang Cao, Peng Jia, Nan Li, Xiangru Li, Jifeng Liu, A-Li Luo, Bo Qiu, Shiyin Shen, Liangping Tu, Lili Wang, Shoulin Wei, Haifeng Yang, Zhenping Yi, Zhiqiang Zou

    Abstract: The exponential growth of astronomical datasets provides an unprecedented opportunity for humans to gain insight into the Universe. However, effectively analyzing this vast amount of data poses a significant challenge. Astronomers are turning to deep learning techniques to address this, but the methods are limited by their specific training sets, leading to considerable duplicate workloads too. He… ▽ More

    Submitted 17 May, 2024; originally announced May 2024.

    Comments: 26 pages, 10 figures, to be published on Chinese Physics C

  12. arXiv:2405.10248  [pdf, other

    cs.HC cs.IR

    Co-Matching: Towards Human-Machine Collaborative Legal Case Matching

    Authors: Chen Huang, Xinwei Yang, Yang Deng, Wenqiang Lei, JianCheng Lv, Tat-Seng Chua

    Abstract: Recent efforts have aimed to improve AI machines in legal case matching by integrating legal domain knowledge. However, successful legal case matching requires the tacit knowledge of legal practitioners, which is difficult to verbalize and encode into machines. This emphasizes the crucial role of involving legal practitioners in high-stakes legal case matching. To address this, we propose a collab… ▽ More

    Submitted 16 May, 2024; originally announced May 2024.

    Comments: Draft V1: 23 pages, 7 figures

  13. arXiv:2405.07309  [pdf, other

    cs.RO cs.AI cs.CV cs.LG

    DiffGen: Robot Demonstration Generation via Differentiable Physics Simulation, Differentiable Rendering, and Vision-Language Model

    Authors: Yang Jin, Jun Lv, Shuqiang Jiang, Cewu Lu

    Abstract: Generating robot demonstrations through simulation is widely recognized as an effective way to scale up robot data. Previous work often trained reinforcement learning agents to generate expert policies, but this approach lacks sample efficiency. Recently, a line of work has attempted to generate robot demonstrations via differentiable simulation, which is promising but heavily relies on reward des… ▽ More

    Submitted 12 May, 2024; originally announced May 2024.

  14. arXiv:2405.06690  [pdf, other

    q-bio.BM cs.CL cs.LG

    DrugLLM: Open Large Language Model for Few-shot Molecule Generation

    Authors: Xianggen Liu, Yan Guo, Haoran Li, Jin Liu, Shudong Huang, Bowen Ke, Jiancheng Lv

    Abstract: Large Language Models (LLMs) have made great strides in areas such as language processing and computer vision. Despite the emergence of diverse techniques to improve few-shot learning capacity, current LLMs fall short in handling the languages in biology and chemistry. For example, they are struggling to capture the relationship between molecule structure and pharmacochemical properties. Consequen… ▽ More

    Submitted 7 May, 2024; originally announced May 2024.

    Comments: 17 pages, 3 figures

  15. arXiv:2405.03408  [pdf, other

    astro-ph.IM astro-ph.SR cs.CV

    An Image Quality Evaluation and Masking Algorithm Based On Pre-trained Deep Neural Networks

    Authors: Peng Jia, Yu Song, Jiameng Lv, Runyu Ning

    Abstract: With the growing amount of astronomical data, there is an increasing need for automated data processing pipelines, which can extract scientific information from observation data without human interventions. A critical aspect of these pipelines is the image quality evaluation and masking algorithm, which evaluates image qualities based on various factors such as cloud coverage, sky brightness, scat… ▽ More

    Submitted 6 May, 2024; originally announced May 2024.

    Comments: Accepted by the AJ. The code could be downloaded from: https://nadc.china-vo.org/res/r101415/ with DOI of: 10.12149/101415

  16. arXiv:2405.03197  [pdf, other

    cs.CV

    StyleSeg V2: Towards Robust One-shot Segmentation of Brain Tissue via Optimization-free Registration Error Perception

    Authors: Zhiwei Wang, Xiaoyu Zeng, Chongwei Wu, Jinxin lv, Xu Zhang, Wei Fang, Qiang Li

    Abstract: One-shot segmentation of brain tissue requires training registration-segmentation (reg-seg) dual-model iteratively, where reg-model aims to provide pseudo masks of unlabeled images for seg-model by warping a carefully-labeled atlas. However, the imperfect reg-model induces image-mask misalignment, poisoning the seg-model subsequently. Recent StyleSeg bypasses this bottleneck by replacing the unlab… ▽ More

    Submitted 18 May, 2024; v1 submitted 6 May, 2024; originally announced May 2024.

    Comments: 10 pages, 11 figures, 2 tables

  17. arXiv:2404.16484  [pdf, other

    cs.CV eess.IV

    Real-Time 4K Super-Resolution of Compressed AVIF Images. AIS 2024 Challenge Survey

    Authors: Marcos V. Conde, Zhijun Lei, Wen Li, Cosmin Stejerean, Ioannis Katsavounidis, Radu Timofte, Kihwan Yoon, Ganzorig Gankhuyag, Jiangtao Lv, Long Sun, Jinshan Pan, Jiangxin Dong, Jinhui Tang, Zhiyuan Li, Hao Wei, Chenyang Ge, Dongyang Zhang, Tianle Liu, Huaian Chen, Yi Jin, Menghan Zhou, Yiqiang Yan, Si Gao, Biao Wu, Shaoli Liu , et al. (50 additional authors not shown)

    Abstract: This paper introduces a novel benchmark as part of the AIS 2024 Real-Time Image Super-Resolution (RTSR) Challenge, which aims to upscale compressed images from 540p to 4K resolution (4x factor) in real-time on commercial GPUs. For this, we use a diverse test set containing a variety of 4K images ranging from digital art to gaming and photography. The images are compressed using the modern AVIF cod… ▽ More

    Submitted 25 April, 2024; originally announced April 2024.

    Comments: CVPR 2024, AI for Streaming (AIS) Workshop

  18. arXiv:2404.06926  [pdf, other

    cs.RO

    Gaussian-LIC: Photo-realistic LiDAR-Inertial-Camera SLAM with 3D Gaussian Splatting

    Authors: Xiaolei Lang, Laijian Li, Hang Zhang, Feng Xiong, Mu Xu, Yong Liu, Xingxing Zuo, Jiajun Lv

    Abstract: We present a real-time LiDAR-Inertial-Camera SLAM system with 3D Gaussian Splatting as the mapping backend. Leveraging robust pose estimates from our LiDAR-Inertial-Camera odometry, Coco-LIC, an incremental photo-realistic mapping system is proposed in this paper. We initialize 3D Gaussians from colorized LiDAR points and optimize them using differentiable rendering powered by 3D Gaussian Splattin… ▽ More

    Submitted 10 April, 2024; originally announced April 2024.

    Comments: Submitted to IROS 2024

  19. arXiv:2404.04317  [pdf, other

    stat.ML cs.LG q-bio.QM

    DeepLINK-T: deep learning inference for time series data using knockoffs and LSTM

    Authors: Wenxuan Zuo, Zifan Zhu, Yuxuan Du, Yi-Chun Yeh, Jed A. Fuhrman, Jinchi Lv, Yingying Fan, Fengzhu Sun

    Abstract: High-dimensional longitudinal time series data is prevalent across various real-world applications. Many such applications can be modeled as regression problems with high-dimensional time series covariates. Deep learning has been a popular and powerful tool for fitting these regression models. Yet, the development of interpretable and reproducible deep-learning models is challenging and remains un… ▽ More

    Submitted 5 April, 2024; originally announced April 2024.

  20. arXiv:2404.03304  [pdf, other

    cs.CL cs.AI

    Concept -- An Evaluation Protocol on Conversational Recommender Systems with System-centric and User-centric Factors

    Authors: Chen Huang, Peixin Qin, Yang Deng, Wenqiang Lei, Jiancheng Lv, Tat-Seng Chua

    Abstract: The conversational recommendation system (CRS) has been criticized regarding its user experience in real-world scenarios, despite recent significant progress achieved in academia. Existing evaluation protocols for CRS may prioritize system-centric factors such as effectiveness and fluency in conversation while neglecting user-centric aspects. Thus, we propose a new and inclusive evaluation protoco… ▽ More

    Submitted 6 May, 2024; v1 submitted 4 April, 2024; originally announced April 2024.

    Comments: 33 pages, 18 tables, and 10 figures. Our code is available at https://github.com/huangzichun/Concept4CRS

  21. arXiv:2404.01780  [pdf, other

    astro-ph.IM astro-ph.GA cs.CV

    CSST Strong Lensing Preparation: a Framework for Detecting Strong Lenses in the Multi-color Imaging Survey by the China Survey Space Telescope (CSST)

    Authors: Xu Li, Ruiqi Sun, Jiameng Lv, Peng Jia, Nan Li, Chengliang Wei, Zou Hu, Xinzhong Er, Yun Chen, Zhang Ban, Yuedong Fang, Qi Guo, Dezi Liu, Guoliang Li, Lin Lin, Ming Li, Ran Li, Xiaobo Li, Yu Luo, Xianmin Meng, Jundan Nie, Zhaoxiang Qi, Yisheng Qiu, Li Shao, Hao Tian , et al. (7 additional authors not shown)

    Abstract: Strong gravitational lensing is a powerful tool for investigating dark matter and dark energy properties. With the advent of large-scale sky surveys, we can discover strong lensing systems on an unprecedented scale, which requires efficient tools to extract them from billions of astronomical objects. The existing mainstream lens-finding tools are based on machine learning algorithms and applied to… ▽ More

    Submitted 2 April, 2024; originally announced April 2024.

    Comments: The paper is accepted by the AJ. The complete code could be downloaded with DOI of: 10.12149/101393. Comments are welcome

  22. arXiv:2403.15734  [pdf, other

    cond-mat.mtrl-sci cs.LG physics.comp-ph

    Space Group Informed Transformer for Crystalline Materials Generation

    Authors: Zhendong Cao, Xiaoshan Luo, Jian Lv, Lei Wang

    Abstract: We introduce CrystalFormer, a transformer-based autoregressive model specifically designed for space group-controlled generation of crystalline materials. The space group symmetry significantly simplifies the crystal space, which is crucial for data and compute efficient generative modeling of crystalline materials. Leveraging the prominent discrete and sequential nature of the Wyckoff positions,… ▽ More

    Submitted 23 March, 2024; originally announced March 2024.

    Comments: 17 pages, 8 figures

  23. arXiv:2403.13588  [pdf, other

    cs.SE cs.CL

    Genetic Auto-prompt Learning for Pre-trained Code Intelligence Language Models

    Authors: Chengzhe Feng, Yanan Sun, Ke Li, Pan Zhou, Jiancheng Lv, Aojun Lu

    Abstract: As Pre-trained Language Models (PLMs), a popular approach for code intelligence, continue to grow in size, the computational cost of their usage has become prohibitively expensive. Prompt learning, a recent development in the field of natural language processing, emerges as a potential solution to address this challenge. In this paper, we investigate the effectiveness of prompt learning in code in… ▽ More

    Submitted 20 March, 2024; originally announced March 2024.

  24. arXiv:2403.08433  [pdf, other

    cs.CV

    An Empirical Study of Parameter Efficient Fine-tuning on Vision-Language Pre-train Model

    Authors: Yuxin Tian, Mouxing Yang, Yunfan Li, Dayiheng Liu, Xingzhang Ren, Xi Peng, Jiancheng Lv

    Abstract: Recent studies applied Parameter Efficient Fine-Tuning techniques (PEFTs) to efficiently narrow the performance gap between pre-training and downstream. There are two important factors for various PEFTs, namely, the accessible data size and fine-tunable parameter size. A natural expectation for PEFTs is that the performance of various PEFTs is positively related to the data size and fine-tunable p… ▽ More

    Submitted 18 May, 2024; v1 submitted 13 March, 2024; originally announced March 2024.

    Comments: Accepted by ICME2024

  25. arXiv:2402.18233  [pdf, other

    cs.CV

    Zero-Shot Aerial Object Detection with Visual Description Regularization

    Authors: Zhengqing Zang, Chenyu Lin, Chenwei Tang, Tao Wang, Jiancheng Lv

    Abstract: Existing object detection models are mainly trained on large-scale labeled datasets. However, annotating data for novel aerial object classes is expensive since it is time-consuming and may require expert knowledge. Thus, it is desirable to study label-efficient object detection methods on aerial images. In this work, we propose a zero-shot method for aerial object detection named visual Descripti… ▽ More

    Submitted 1 March, 2024; v1 submitted 28 February, 2024; originally announced February 2024.

    Comments: 13 pages, 3 figures

  26. arXiv:2402.17525  [pdf, other

    cs.CV

    Diffusion Model-Based Image Editing: A Survey

    Authors: Yi Huang, Jiancheng Huang, Yifan Liu, Mingfu Yan, Jiaxi Lv, Jianzhuang Liu, Wei Xiong, He Zhang, Shifeng Chen, Liangliang Cao

    Abstract: Denoising diffusion models have emerged as a powerful tool for various image generation and editing tasks, facilitating the synthesis of visual content in an unconditional or input-conditional manner. The core idea behind them is learning to reverse the process of gradually adding noise to images, allowing them to generate high-quality samples from a complex distribution. In this survey, we provid… ▽ More

    Submitted 16 March, 2024; v1 submitted 27 February, 2024; originally announced February 2024.

  27. arXiv:2402.16907  [pdf, other

    eess.IV cs.CV cs.LG

    Diffusion Posterior Proximal Sampling for Image Restoration

    Authors: Hongjie Wu, Linchao He, Mingqin Zhang, Dongdong Chen, Kunming Luo, Mengting Luo, Ji-Zhe Zhou, Hu Chen, Jiancheng Lv

    Abstract: Diffusion models have demonstrated remarkable efficacy in generating high-quality samples. Existing diffusion-based image restoration algorithms exploit pre-trained diffusion models to leverage data priors, yet they still preserve elements inherited from the unconditional generation paradigm. These strategies initiate the denoising process with pure white noise and incorporate random noise at each… ▽ More

    Submitted 24 February, 2024; originally announced February 2024.

  28. arXiv:2402.14598  [pdf, other

    cs.NE cs.LG

    Brain-inspired Distributed Memorization Learning for Efficient Feature-free Unsupervised Domain Adaptation

    Authors: Jianming Lv, Depin Liang, Zequan Liang, Yaobin Zhang, Sijun Xia

    Abstract: Compared with gradient based artificial neural networks, biological neural networks usually show a more powerful generalization ability to quickly adapt to unknown environments without using any gradient back-propagation procedure. Inspired by the distributed memory mechanism of human brains, we propose a novel gradient-free Distributed Memorization Learning mechanism, namely DML, to support quick… ▽ More

    Submitted 4 February, 2024; originally announced February 2024.

    Comments: 15 pages,15 figures

  29. arXiv:2402.13234  [pdf, other

    cs.IR cs.CL

    Unlocking Insights: Semantic Search in Jupyter Notebooks

    Authors: Lan Li, Jinpeng Lv

    Abstract: Semantic search, a process aimed at delivering highly relevant search results by comprehending the searcher's intent and the contextual meaning of terms within a searchable dataspace, plays a pivotal role in information retrieval. In this paper, we investigate the application of large language models to enhance semantic search capabilities, specifically tailored for the domain of Jupyter Notebooks… ▽ More

    Submitted 20 February, 2024; originally announced February 2024.

  30. arXiv:2402.05954  [pdf, other

    cs.LG

    EasyFS: an Efficient Model-free Feature Selection Framework via Elastic Transformation of Features

    Authors: Jianming Lv, Sijun Xia, Depin Liang, Wei Chen

    Abstract: Traditional model-free feature selection methods treat each feature independently while disregarding the interrelationships among features, which leads to relatively poor performance compared with the model-aware methods. To address this challenge, we propose an efficient model-free feature selection framework via elastic expansion and compression of the features, namely EasyFS, to achieve better… ▽ More

    Submitted 4 February, 2024; originally announced February 2024.

  31. arXiv:2401.14876  [pdf, other

    cs.LG cs.AI

    Cross-Space Adaptive Filter: Integrating Graph Topology and Node Attributes for Alleviating the Over-smoothing Problem

    Authors: Chen Huang, Haoyang Li, Yifan Zhang, Wenqiang Lei, Jiancheng Lv

    Abstract: The vanilla Graph Convolutional Network (GCN) uses a low-pass filter to extract low-frequency signals from graph topology, which may lead to the over-smoothing problem when GCN goes deep. To this end, various methods have been proposed to create an adaptive filter by incorporating an extra filter (e.g., a high-pass filter) extracted from the graph topology. However, these methods heavily rely on t… ▽ More

    Submitted 10 February, 2024; v1 submitted 26 January, 2024; originally announced January 2024.

    Comments: Accepted to WWW 2024. V2: update the results on GCN-BC based on our rebuttal on OpenReview. Our code is available at https://github.com/huangzichun/Cross-Space-Adaptive-Filter

  32. arXiv:2401.12540  [pdf, other

    cs.IR cs.CL

    DREditor: An Time-efficient Approach for Building a Domain-specific Dense Retrieval Model

    Authors: Chen Huang, Duanyu Feng, Wenqiang Lei, Jiancheng Lv

    Abstract: Deploying dense retrieval models efficiently is becoming increasingly important across various industries. This is especially true for enterprise search services, where customizing search engines to meet the time demands of different enterprises in different domains is crucial. Motivated by this, we develop a time-efficient approach called DREditor to edit the matching rule of an off-the-shelf den… ▽ More

    Submitted 23 January, 2024; originally announced January 2024.

    Comments: 15 pages, 6 figures, Codes are available at https://github.com/huangzichun/DREditor

  33. arXiv:2401.10153  [pdf, other

    cs.NI cs.CV

    Importance-Aware Image Segmentation-based Semantic Communication for Autonomous Driving

    Authors: Jie Lv, Haonan Tong, Qiang Pan, Zhilong Zhang, Xinxin He, Tao Luo, Changchuan Yin

    Abstract: This article studies the problem of image segmentation-based semantic communication in autonomous driving. In real traffic scenes, detecting the key objects (e.g., vehicles, pedestrians and obstacles) is more crucial than that of other objects to guarantee driving safety. Therefore, we propose a vehicular image segmentation-oriented semantic communication system, termed VIS-SemCom, where image seg… ▽ More

    Submitted 16 January, 2024; originally announced January 2024.

    Comments: 10 pages, 8 figures

  34. arXiv:2401.07544  [pdf, other

    cs.CL

    See the Unseen: Better Context-Consistent Knowledge-Editing by Noises

    Authors: Youcheng Huang, Wenqiang Lei, Zheng Zhang, Jiancheng Lv, Shuicheng Yan

    Abstract: Knowledge-editing updates knowledge of large language models (LLMs) and contributes to the interpretability and application of LLMs. However, knowledge applying is context-consistent: LLMs can recall the same knowledge in different contexts. Existing works ignore this property and the editing lacks generalization. In this paper, we empirically find that the effects of different contexts upon LLMs… ▽ More

    Submitted 17 January, 2024; v1 submitted 15 January, 2024; originally announced January 2024.

  35. arXiv:2312.13309  [pdf, other

    cs.CV cs.AI

    Generate E-commerce Product Background by Integrating Category Commonality and Personalized Style

    Authors: Haohan Wang, Wei Feng, Yang Lu, Yaoyu Li, Zheng Zhang, Jingjing Lv, Xin Zhu, Junjie Shen, Zhangang Lin, Lixing Bo, Jingping Shao

    Abstract: The state-of-the-art methods for e-commerce product background generation suffer from the inefficiency of designing product-wise prompts when scaling up the production, as well as the ineffectiveness of describing fine-grained styles when customizing personalized backgrounds for some specific brands. To address these obstacles, we integrate the category commonality and personalized style into diff… ▽ More

    Submitted 19 December, 2023; originally announced December 2023.

    Comments: 12 pages, 11 figures

  36. Near-Optimal Resilient Aggregation Rules for Distributed Learning Using 1-Center and 1-Mean Clustering with Outliers

    Authors: Yuhao Yi, Ronghui You, Hong Liu, Changxin Liu, Yuan Wang, Jiancheng Lv

    Abstract: Byzantine machine learning has garnered considerable attention in light of the unpredictable faults that can occur in large-scale distributed learning systems. The key to secure resilience against Byzantine machines in distributed learning is resilient aggregation mechanisms. Although abundant resilient aggregation rules have been proposed, they are designed in ad-hoc manners, imposing extra barri… ▽ More

    Submitted 31 March, 2024; v1 submitted 20 December, 2023; originally announced December 2023.

    Comments: 17 pages, 4 figures. Accepted by the 38th Annual AAAI Conference on Artificial Intelligence (AAAI'24)

    Journal ref: AAAI 2024, 38, 16469-16477

  37. arXiv:2312.08822  [pdf, other

    cs.CV

    Planning and Rendering: Towards End-to-End Product Poster Generation

    Authors: Zhaochen Li, Fengheng Li, Wei Feng, Honghe Zhu, An Liu, Yaoyu Li, Zheng Zhang, Jingjing Lv, Xin Zhu, Junjie Shen, Zhangang Lin, Jingping Shao, Zhenglu Yang

    Abstract: End-to-end product poster generation significantly optimizes design efficiency and reduces production costs. Prevailing methods predominantly rely on image-inpainting methods to generate clean background images for given products. Subsequently, poster layout generation methods are employed to produce corresponding layout results. However, the background images may not be suitable for accommodating… ▽ More

    Submitted 14 December, 2023; originally announced December 2023.

  38. arXiv:2312.07280  [pdf, other

    cs.CL

    Towards Equipping Transformer with the Ability of Systematic Compositionality

    Authors: Chen Huang, Peixin Qin, Wenqiang Lei, Jiancheng Lv

    Abstract: One of the key factors in language productivity and human cognition is the ability of systematic compositionality, which refers to understanding composed unseen examples of seen primitives. However, recent evidence reveals that the Transformers have difficulty generalizing the composed context based on the seen primitives. To this end, we take the first step to propose a compositionality-aware Tra… ▽ More

    Submitted 12 December, 2023; originally announced December 2023.

    Comments: Accepted to AAAI 2024. Paper with appendix

  39. arXiv:2312.04055  [pdf

    cs.LG

    Jointly spatial-temporal representation learning for individual trajectories

    Authors: Fei Huang, Jianrong Lv, Yang Yue

    Abstract: Individual trajectories, rich in human-environment interaction information across space and time, serve as vital inputs for geospatial foundation models (GeoFMs). However, existing attempts at learning trajectory representations have overlooked the implicit spatial-temporal dependency within trajectories, failing to encode such dependency in a deep learning-friendly format. That poses a challenge… ▽ More

    Submitted 11 December, 2023; v1 submitted 7 December, 2023; originally announced December 2023.

    Comments: 27 pages, 3 tables, 7 figures

  40. arXiv:2312.00347  [pdf, other

    cs.CV cs.CL cs.MM

    RTQ: Rethinking Video-language Understanding Based on Image-text Model

    Authors: Xiao Wang, Yaoyu Li, Tian Gan, Zheng Zhang, Jingjing Lv, Liqiang Nie

    Abstract: Recent advancements in video-language understanding have been established on the foundation of image-text models, resulting in promising outcomes due to the shared knowledge between images and videos. However, video-language understanding presents unique challenges due to the inclusion of highly complex semantic details, which result in information redundancy, temporal dependency, and scene comple… ▽ More

    Submitted 17 December, 2023; v1 submitted 30 November, 2023; originally announced December 2023.

    Comments: Accepted by ACM MM 2023 as Oral representation

    Journal ref: In International Conference on Multimedia. ACM, 557--566 (2023)

  41. arXiv:2311.18214  [pdf, other

    astro-ph.IM astro-ph.GA astro-ph.SR cs.CV physics.optics

    Perception of Misalignment States for Sky Survey Telescopes with the Digital Twin and the Deep Neural Networks

    Authors: Miao Zhang, Peng Jia, Zhengyang Li, Wennan Xiang, Jiameng Lv, Rui Sun

    Abstract: Sky survey telescopes play a critical role in modern astronomy, but misalignment of their optical elements can introduce significant variations in point spread functions, leading to reduced data quality. To address this, we need a method to obtain misalignment states, aiding in the reconstruction of accurate point spread functions for data processing methods or facilitating adjustments of optical… ▽ More

    Submitted 29 November, 2023; originally announced November 2023.

    Comments: The aforementioned submission has been accepted by Optics Express. We kindly request any feedback or comments to be directed to the corresponding author, Peng Jia (robinmartin20@gmail.com), or the second corresponding author, Zhengyang Li (lizy@niaot.ac.cn). Please note that Zhengyang is currently stationed in the South Antarctica and will not be available until after February 1st, 2024

  42. arXiv:2311.12631  [pdf, other

    cs.CV

    GPT4Motion: Scripting Physical Motions in Text-to-Video Generation via Blender-Oriented GPT Planning

    Authors: Jiaxi Lv, Yi Huang, Mingfu Yan, Jiancheng Huang, Jianzhuang Liu, Yifan Liu, Yafei Wen, Xiaoxin Chen, Shifeng Chen

    Abstract: Recent advances in text-to-video generation have harnessed the power of diffusion models to create visually compelling content conditioned on text prompts. However, they usually encounter high computational costs and often struggle to produce videos with coherent physical motions. To tackle these issues, we propose GPT4Motion, a training-free framework that leverages the planning capability of lar… ▽ More

    Submitted 23 April, 2024; v1 submitted 21 November, 2023; originally announced November 2023.

  43. arXiv:2311.04247  [pdf, other

    cs.LG cs.AI

    Analysis and Applications of Deep Learning with Finite Samples in Full Life-Cycle Intelligence of Nuclear Power Generation

    Authors: Chenwei Tang, Wenqiang Zhou, Dong Wang, Caiyang Yu, Zhenan He, Jizhe Zhou, Shudong Huang, Yi Gao, Jianming Chen, Wentao Feng, Jiancheng Lv

    Abstract: The advent of Industry 4.0 has precipitated the incorporation of Artificial Intelligence (AI) methods within industrial contexts, aiming to realize intelligent manufacturing, operation as well as maintenance, also known as industrial intelligence. However, intricate industrial milieus, particularly those relating to energy exploration and production, frequently encompass data characterized by long… ▽ More

    Submitted 7 November, 2023; originally announced November 2023.

  44. arXiv:2311.03798  [pdf, other

    cs.CL

    Noisy Pair Corrector for Dense Retrieval

    Authors: Hang Zhang, Yeyun Gong, Xingwei He, Dayiheng Liu, Daya Guo, Jiancheng Lv, Jian Guo

    Abstract: Most dense retrieval models contain an implicit assumption: the training query-document pairs are exactly matched. Since it is expensive to annotate the corpus manually, training pairs in real-world applications are usually collected automatically, which inevitably introduces mismatched-pair noise. In this paper, we explore an interesting and challenging problem in dense retrieval, how to train an… ▽ More

    Submitted 7 November, 2023; originally announced November 2023.

    Comments: Findings of EMNLP 2023

  45. arXiv:2311.00186  [pdf, other

    astro-ph.IM astro-ph.GA astro-ph.SR cs.CV

    Image Restoration with Point Spread Function Regularization and Active Learning

    Authors: Peng Jia, Jiameng Lv, Runyu Ning, Yu Song, Nan Li, Kaifan Ji, Chenzhou Cui, Shanshan Li

    Abstract: Large-scale astronomical surveys can capture numerous images of celestial objects, including galaxies and nebulae. Analysing and processing these images can reveal intricate internal structures of these objects, allowing researchers to conduct comprehensive studies on their morphology, evolution, and physical properties. However, varying noise levels and point spread functions can hamper the accur… ▽ More

    Submitted 31 October, 2023; originally announced November 2023.

    Comments: To be published in the MNRAS

  46. arXiv:2310.16659  [pdf, other

    cs.RO cs.LG

    UAV Pathfinding in Dynamic Obstacle Avoidance with Multi-agent Reinforcement Learning

    Authors: Qizhen Wu, Lei Chen, Kexin Liu, Jinhu Lv

    Abstract: Multi-agent reinforcement learning based methods are significant for online planning of feasible and safe paths for agents in dynamic and uncertain scenarios. Although some methods like fully centralized and fully decentralized methods achieve a certain measure of success, they also encounter problems such as dimension explosion and poor convergence, respectively. In this paper, we propose a novel… ▽ More

    Submitted 25 October, 2023; originally announced October 2023.

  47. arXiv:2310.14170  [pdf, other

    cs.LG

    Learning Invariant Molecular Representation in Latent Discrete Space

    Authors: Xiang Zhuang, Qiang Zhang, Keyan Ding, Yatao Bian, Xiao Wang, Jingsong Lv, Hongyang Chen, Huajun Chen

    Abstract: Molecular representation learning lays the foundation for drug discovery. However, existing methods suffer from poor out-of-distribution (OOD) generalization, particularly when data for training and testing originate from different environments. To address this issue, we propose a new framework for learning molecular representations that exhibit invariance and robustness against distribution shift… ▽ More

    Submitted 22 October, 2023; originally announced October 2023.

  48. arXiv:2310.11989  [pdf, other

    cs.LG

    Image Clustering with External Guidance

    Authors: Yunfan Li, Peng Hu, Dezhong Peng, Jiancheng Lv, Jianping Fan, Xi Peng

    Abstract: The core of clustering is incorporating prior knowledge to construct supervision signals. From classic k-means based on data compactness to recent contrastive clustering guided by self-supervision, the evolution of clustering methods intrinsically corresponds to the progression of supervision signals. At present, substantial efforts have been devoted to mining internal supervision signals from dat… ▽ More

    Submitted 16 July, 2024; v1 submitted 18 October, 2023; originally announced October 2023.

    Journal ref: ICML 2024 (Oral)

  49. arXiv:2310.09183  [pdf, other

    cs.LG cs.AI cs.DC

    PRIOR: Personalized Prior for Reactivating the Information Overlooked in Federated Learning

    Authors: Mingjia Shi, Yuhao Zhou, Kai Wang, Huaizheng Zhang, Shudong Huang, Qing Ye, Jiangcheng Lv

    Abstract: Classical federated learning (FL) enables training machine learning models without sharing data for privacy preservation, but heterogeneous data characteristic degrades the performance of the localized model. Personalized FL (PFL) addresses this by synthesizing personalized models from a global model via training on local data. Such a global model may overlook the specific information that the cli… ▽ More

    Submitted 10 November, 2023; v1 submitted 13 October, 2023; originally announced October 2023.

    Comments: Accepted by NeurIPS 2023

    MSC Class: 68T07 ACM Class: I.2.11

  50. arXiv:2310.08986  [pdf, other

    cs.CV

    VCL Challenges 2023 at ICCV 2023 Technical Report: Bi-level Adaptation Method for Test-time Adaptive Object Detection

    Authors: Chenyu Lin, Yusheng He, Zhengqing Zang, Chenwei Tang, Tao Wang, Jiancheng Lv

    Abstract: This report outlines our team's participation in VCL Challenges B Continual Test_time Adaptation, focusing on the technical details of our approach. Our primary focus is Testtime Adaptation using bi_level adaptations, encompassing image_level and detector_level adaptations. At the image level, we employ adjustable parameterbased image filters, while at the detector level, we leverage adjustable pa… ▽ More

    Submitted 13 October, 2023; originally announced October 2023.