Showing 1–50 of 65 results for author: Fan, G

  1. arXiv:2407.11548  [pdf, other]

    cs.IR

    A PLMs based protein retrieval framework

    Authors: Yuxuan Wu, Xiao Yi, Yang Tan, Huiqun Yu, Guisheng Fan

    Abstract: Protein retrieval, which targets the deconstruction of the relationship between sequences, structures and functions, empowers the advancing of biology. Basic Local Alignment Search Tool (BLAST), a sequence-similarity-based algorithm, has proved the efficiency of this field. Despite the existing tools for protein retrieval, they prioritize sequence similarity and probably overlook proteins that are…

    Submitted 16 July, 2024; originally announced July 2024.

    Comments: 16 pages, 12 figures

    ACM Class: H.3.3

  2. arXiv:2407.11033  [pdf, other]

    cs.LG cs.CL

    Hadamard Adapter: An Extreme Parameter-Efficient Adapter Tuning Method for Pre-trained Language Models

    Authors: Yuyan Chen, Qiang Fu, Ge Fan, Lun Du, Jian-Guang Lou, Shi Han, Dongmei Zhang, Zhixu Li, Yanghua Xiao

    Abstract: In recent years, pre-trained language models (PLMs) have swept into various fields of artificial intelligence and achieved great success. However, most PLMs, such as T5 and GPT3, have a huge number of parameters; fine-tuning them is often expensive and time-consuming, and storing them takes up a lot of space. Therefore, it is necessary to adopt a parameter-efficient approach to reduce parameters of P…

    Submitted 4 July, 2024; originally announced July 2024.

    Comments: Accepted to CIKM 2023 (Long Paper)

  3. arXiv:2407.04121  [pdf, other]

    cs.CL cs.AI

    Hallucination Detection: Robustly Discerning Reliable Answers in Large Language Models

    Authors: Yuyan Chen, Qiang Fu, Yichen Yuan, Zhihao Wen, Ge Fan, Dayiheng Liu, Dongmei Zhang, Zhixu Li, Yanghua Xiao

    Abstract: Large Language Models (LLMs) have gained widespread adoption in various natural language processing tasks, including question answering and dialogue systems. However, a major drawback of LLMs is the issue of hallucination, where they generate unfaithful or inconsistent content that deviates from the input source, leading to severe consequences. In this paper, we propose a robust discriminator name…

    Submitted 4 July, 2024; originally announced July 2024.

    Comments: Accepted to CIKM 2023 (Long Paper)

  4. arXiv:2407.04118  [pdf, other]

    cs.CL cs.AI

    MAPO: Boosting Large Language Model Performance with Model-Adaptive Prompt Optimization

    Authors: Yuyan Chen, Zhihao Wen, Ge Fan, Zhengyu Chen, Wei Wu, Dayiheng Liu, Zhixu Li, Bang Liu, Yanghua Xiao

    Abstract: Prompt engineering, as an efficient and effective way to leverage Large Language Models (LLMs), has drawn a lot of attention from the research community. The existing research primarily emphasizes the importance of adapting prompts to specific tasks, rather than specific LLMs. However, a good prompt is not solely defined by its wording, but is also bound to the nature of the LLM in question. In this w…

    Submitted 4 July, 2024; originally announced July 2024.

    Comments: Accepted to EMNLP 2023 (Findings)

  5. arXiv:2406.19720  [pdf]

    cs.HC cs.AI

    CUPID: Improving Battle Fairness and Position Satisfaction in Online MOBA Games with a Re-matchmaking System

    Authors: Ge Fan, Chaoyun Zhang, Kai Wang, Yingjie Li, Junyang Chen, Zenglin Xu

    Abstract: The multiplayer online battle arena (MOBA) genre has gained significant popularity and economic success, attracting considerable research interest within the Human-Computer Interaction community. Enhancing the gaming experience requires a deep understanding of player behavior, and a crucial aspect of MOBA games is matchmaking, which aims to assemble teams of comparable skill levels. However, exist…

    Submitted 28 June, 2024; originally announced June 2024.

    Comments: 38 pages, accepted by CSCW 24

  6. arXiv:2405.16854  [pdf, other]

    cs.MA

    Knowing What Not to Do: Leverage Language Model Insights for Action Space Pruning in Multi-agent Reinforcement Learning

    Authors: Zhihao Liu, Xianliang Yang, Zichuan Liu, Yifan Xia, Wei Jiang, Yuanyu Zhang, Lijuan Li, Guoliang Fan, Lei Song, Bian Jiang

    Abstract: Multi-agent reinforcement learning (MARL) is employed to develop autonomous agents that can learn to adopt cooperative or competitive strategies within complex environments. However, the linear increase in the number of agents leads to a combinatorial explosion of the action space, which may result in algorithmic instability, difficulty in convergence, or entrapment in local optima. While research…

    Submitted 27 May, 2024; originally announced May 2024.

  7. arXiv:2404.17936  [pdf, other]

    cs.CV

    FDCE-Net: Underwater Image Enhancement with Embedding Frequency and Dual Color Encoder

    Authors: Zheng Cheng, Guodong Fan, Jingchun Zhou, Min Gan, C. L. Philip Chen

    Abstract: Underwater images often suffer from various issues such as low brightness, color shift, blurred details, and noise due to light absorption and scattering caused by water and suspended particles. Previous underwater image enhancement (UIE) methods have primarily focused on spatial domain enhancement, neglecting the frequency domain information inherent in the images. However, the degradation factor…

    Submitted 27 April, 2024; originally announced April 2024.

    Comments: 16 pages, 13 figures

  8. arXiv:2404.17780  [pdf, other]

    cs.MA cs.AI

    Verco: Learning Coordinated Verbal Communication for Multi-agent Reinforcement Learning

    Authors: Dapeng Li, Hang Dong, Lu Wang, Bo Qiao, Si Qin, Qingwei Lin, Dongmei Zhang, Qi Zhang, Zhiwei Xu, Bin Zhang, Guoliang Fan

    Abstract: In recent years, multi-agent reinforcement learning algorithms have made significant advancements in diverse gaming environments, leading to increased interest in the broader application of such techniques. To address the prevalent challenge of partial observability, communication-based algorithms have improved cooperative performance through the sharing of numerical embedding between agents. Howe…

    Submitted 27 April, 2024; originally announced April 2024.

    Comments: 12 pages, 6 figures

  9. arXiv:2404.17400  [pdf, other]

    cs.CV cs.AI eess.IV

    Spatial-frequency Dual-Domain Feature Fusion Network for Low-Light Remote Sensing Image Enhancement

    Authors: Zishu Yao, Guodong Fan, Jinfu Fan, Min Gan, C. L. Philip Chen

    Abstract: Low-light remote sensing images generally feature high resolution and high spatial complexity, with continuously distributed surface features in space. This continuity in scenes leads to extensive long-range correlations in spatial domains within remote sensing images. Convolutional Neural Networks, which rely on local correlations for long-distance modeling, struggle to establish long-range corre…

    Submitted 26 April, 2024; originally announced April 2024.

    Comments: 14 pages

  10. arXiv:2404.14850  [pdf, other]

    cs.CL cs.LG q-bio.BM

    Simple, Efficient and Scalable Structure-aware Adapter Boosts Protein Language Models

    Authors: Yang Tan, Mingchen Li, Bingxin Zhou, Bozitao Zhong, Lirong Zheng, Pan Tan, Ziyi Zhou, Huiqun Yu, Guisheng Fan, Liang Hong

    Abstract: Fine-tuning pre-trained protein language models (PLMs) has emerged as a prominent strategy for enhancing downstream prediction tasks, often outperforming traditional supervised learning approaches. As a widely applied powerful technique in natural language processing, employing Parameter-Efficient Fine-Tuning techniques could potentially enhance the performance of PLMs. However, the direct transfe…

    Submitted 23 April, 2024; originally announced April 2024.

    Comments: 30 pages, 4 figures, 8 tables

  11. arXiv:2404.14661  [pdf, other]

    cs.CV astro-ph.EP cs.LG

    First Mapping the Canopy Height of Primeval Forests in the Tallest Tree Area of Asia

    Authors: Guangpeng Fan, Fei Yan, Xiangquan Zeng, Qingtao Xu, Ruoyoulan Wang, Binghong Zhang, Jialing Zhou, Liangliang Nan, Jinhu Wang, Zhiwei Zhang, Jia Wang

    Abstract: We have developed the world's first canopy height map of the distribution area of world-level giant trees. This mapping is crucial for discovering more individual and community world-level giant trees, and for analyzing and quantifying the effectiveness of biodiversity conservation measures in the Yarlung Tsangpo Grand Canyon (YTGC) National Nature Reserve. We proposed a method to map the canopy h…

    Submitted 22 April, 2024; originally announced April 2024.

  12. arXiv:2404.06939  [pdf, other]

    cs.ET cs.AI

    Fast System Technology Co-Optimization Framework for Emerging Technology Based on Graph Neural Networks

    Authors: Tianliang Ma, Guangxi Fan, Xuguang Sun, Zhihui Deng, Kainlu Low, Leilai Shao

    Abstract: This paper proposes a fast system technology co-optimization (STCO) framework that optimizes power, performance, and area (PPA) for next-generation IC design, addressing the challenges and opportunities presented by novel materials and device architectures. We focus on accelerating the technology level of STCO using AI techniques, by employing graph neural network (GNN)-based approaches for both T…

    Submitted 10 April, 2024; originally announced April 2024.

    Comments: Accepted by the 61st Design Automation Conference (DAC)

  13. arXiv:2403.14128  [pdf, other]

    cs.DB

    Gen-T: Table Reclamation in Data Lakes

    Authors: Grace Fan, Roee Shraga, Renée J. Miller

    Abstract: We introduce the problem of Table Reclamation. Given a Source Table and a large table repository, reclamation finds a set of tables that, when integrated, reproduce the source table as closely as possible. Unlike query discovery problems like Query-by-Example or by-Target, Table Reclamation focuses on reclaiming the data in the Source Table as fully as possible using real tables that may be incomp…

    Submitted 22 March, 2024; v1 submitted 21 March, 2024; originally announced March 2024.

    Comments: to appear at ICDE 2024

  14. arXiv:2402.14323  [pdf, other]

    cs.SE cs.AI

    REPOFUSE: Repository-Level Code Completion with Fused Dual Context

    Authors: Ming Liang, Xiaoheng Xie, Gehao Zhang, Xunjin Zheng, Peng Di, wei jiang, Hongwei Chen, Chengpeng Wang, Gang Fan

    Abstract: The success of language models in code assistance has spurred the proposal of repository-level code completion as a means to enhance prediction accuracy, utilizing the context from the entire codebase. However, this amplified context can inadvertently increase inference latency, potentially undermining the developer experience and deterring tool adoption - a challenge we termed the Context-Latency…

    Submitted 22 February, 2024; v1 submitted 22 February, 2024; originally announced February 2024.

  15. arXiv:2401.01571  [pdf, other]

    cs.SE cs.PL

    CodeFuse-Query: A Data-Centric Static Code Analysis System for Large-Scale Organizations

    Authors: Xiaoheng Xie, Gang Fan, Xiaojun Lin, Ang Zhou, Shijie Li, Xunjin Zheng, Yinan Liang, Yu Zhang, Na Yu, Haokun Li, Xinyu Chen, Yingzhuang Chen, Yi Zhen, Dejun Dong, Xianjin Fu, Jinzhou Su, Fuxiong Pan, Pengshuai Luo, Youzheng Feng, Ruoxiang Hu, Jing Fan, Jinguo Zhou, Xiao Xiao, Peng Di

    Abstract: In the domain of large-scale software development, the demands for dynamic and multifaceted static code analysis exceed the capabilities of traditional tools. To bridge this gap, we present CodeFuse-Query, a system that redefines static code analysis through the fusion of Domain Optimized System Design and Logic Oriented Computation Design. CodeFuse-Query reimagines code analysis as a data compu…

    Submitted 3 January, 2024; originally announced January 2024.

  16. arXiv:2312.12784  [pdf, other]

    cs.LG

    Fast Cell Library Characterization for Design Technology Co-Optimization Based on Graph Neural Networks

    Authors: Tianliang Ma, Guangxi Fan, Zhihui Deng, Xuguang Sun, Kainlu Low, Leilai Shao

    Abstract: Design technology co-optimization (DTCO) plays a critical role in achieving optimal power, performance, and area (PPA) for advanced semiconductor process development. Cell library characterization is essential in DTCO flow, but traditional methods are time-consuming and costly. To overcome these challenges, we propose a graph neural network (GNN)-based machine learning model for rapid and accurate…

    Submitted 19 March, 2024; v1 submitted 20 December, 2023; originally announced December 2023.

  17. arXiv:2312.09009  [pdf, other]

    cs.AI

    Adaptive parameter sharing for multi-agent reinforcement learning

    Authors: Dapeng Li, Na Lou, Bin Zhang, Zhiwei Xu, Guoliang Fan

    Abstract: Parameter sharing, as an important technique in multi-agent systems, can effectively solve the scalability issue in large-scale agent problems. However, the effectiveness of parameter sharing largely depends on the environment setting. When agents have different identities or tasks, naive parameter sharing makes it difficult to generate sufficiently differentiated strategies for agents. Inspired b…

    Submitted 14 December, 2023; originally announced December 2023.

    Comments: 5 pages, accepted for ICASSP 2024

  18. arXiv:2312.04245  [pdf, other]

    cs.MA cs.AI

    Mastering Complex Coordination through Attention-based Dynamic Graph

    Authors: Guangchong Zhou, Zhiwei Xu, Zeren Zhang, Guoliang Fan

    Abstract: The coordination between agents in multi-agent systems has become a popular topic in many fields. To capture the inner relationship between agents, the graph structure is combined with existing methods and improves the results. But in large-scale tasks with numerous agents, an overly complex graph would lead to a boost in computational cost and a decline in performance. Here we present DAGMIX, a nov…

    Submitted 7 December, 2023; originally announced December 2023.

  19. arXiv:2311.13884  [pdf, other]

    cs.AI

    Controlling Large Language Model-based Agents for Large-Scale Decision-Making: An Actor-Critic Approach

    Authors: Bin Zhang, Hangyu Mao, Jingqing Ruan, Ying Wen, Yang Li, Shao Zhang, Zhiwei Xu, Dapeng Li, Ziyue Li, Rui Zhao, Lijuan Li, Guoliang Fan

    Abstract: The remarkable progress in Large Language Models (LLMs) opens up new avenues for addressing planning and decision-making problems in Multi-Agent Systems (MAS). However, as the number of agents increases, the issues of hallucination in LLMs and coordination in MAS have become increasingly prominent. Additionally, the efficient utilization of tokens emerges as a critical consideration when employing…

    Submitted 23 January, 2024; v1 submitted 23 November, 2023; originally announced November 2023.

    Comments: 13 pages, 11 figures

  20. arXiv:2310.17415  [pdf, other]

    cs.CL cs.AI q-bio.BM

    PETA: Evaluating the Impact of Protein Transfer Learning with Sub-word Tokenization on Downstream Applications

    Authors: Yang Tan, Mingchen Li, Pan Tan, Ziyi Zhou, Huiqun Yu, Guisheng Fan, Liang Hong

    Abstract: Large protein language models are adept at capturing the underlying evolutionary information in primary structures, offering significant practical value for protein engineering. Compared to natural language models, protein amino acid sequences have a smaller data volume and a limited combinatorial space. Choosing an appropriate vocabulary size to optimize the pre-trained model is a pivotal issue.…

    Submitted 26 October, 2023; originally announced October 2023.

    Comments: 46 pages, 4 figures, 9 tables

  21. arXiv:2310.08837  [pdf, other]

    cs.SE

    Static Code Analysis in the AI Era: An In-depth Exploration of the Concept, Function, and Potential of Intelligent Code Analysis Agents

    Authors: Gang Fan, Xiaoheng Xie, Xunjin Zheng, Yinan Liang, Peng Di

    Abstract: The escalating complexity of software systems and accelerating development cycles pose a significant challenge in managing code errors and implementing business logic. Traditional techniques, while cornerstone for software quality assurance, exhibit limitations in handling intricate business logic and extensive codebases. To address these challenges, we introduce the Intelligent Code Analysis Agen…

    Submitted 12 October, 2023; originally announced October 2023.

  22. arXiv:2310.06266  [pdf, other]

    cs.SE cs.AI cs.LG

    CodeFuse-13B: A Pretrained Multi-lingual Code Large Language Model

    Authors: Peng Di, Jianguo Li, Hang Yu, Wei Jiang, Wenting Cai, Yang Cao, Chaoyu Chen, Dajun Chen, Hongwei Chen, Liang Chen, Gang Fan, Jie Gong, Zi Gong, Wen Hu, Tingting Guo, Zhichao Lei, Ting Li, Zheng Li, Ming Liang, Cong Liao, Bingchang Liu, Jiachen Liu, Zhiwei Liu, Shaojun Lu, Min Shen, et al. (13 additional authors not shown)

    Abstract: Code Large Language Models (Code LLMs) have gained significant attention in the industry due to their wide applications in the full lifecycle of software engineering. However, the effectiveness of existing models in understanding non-English inputs for multi-lingual code-related tasks is still far from well studied. This paper introduces CodeFuse-13B, an open-sourced pre-trained code LLM. It is sp…

    Submitted 10 January, 2024; v1 submitted 9 October, 2023; originally announced October 2023.

    Comments: Accepted by ICSE-SEIP 2024

  23. arXiv:2309.10002  [pdf, other]

    cs.LG math.NA

    Energy stable neural network for gradient flow equations

    Authors: Ganghua Fan, Tianyu Jin, Yuan Lan, Yang Xiang, Luchan Zhang

    Abstract: In this paper, we propose an energy stable network (EStable-Net) for solving gradient flow equations. The solution update scheme in our neural network EStable-Net is inspired by a proposed auxiliary variable based equivalent form of the gradient flow equation. EStable-Net enables decreasing of a discrete energy along the neural network, which is consistent with the property in the evolution proces…

    Submitted 17 September, 2023; originally announced September 2023.

  24. arXiv:2309.05929  [pdf]

    eess.IV cs.CV

    Introducing Shape Prior Module in Diffusion Model for Medical Image Segmentation

    Authors: Zhiqing Zhang, Guojia Fan, Tianyong Liu, Nan Li, Yuyang Liu, Ziyu Liu, Canwei Dong, Shoujun Zhou

    Abstract: Medical image segmentation is critical for diagnosing and treating spinal disorders. However, the presence of high noise, ambiguity, and uncertainty makes this task highly challenging. Factors such as unclear anatomical boundaries, inter-class similarities, and irrational annotations contribute to this challenge. Achieving both accurate and diverse segmentation templates is essential to support ra…

    Submitted 11 September, 2023; originally announced September 2023.

  25. arXiv:2309.01114  [pdf, other]

    cs.CL cs.AI

    MedChatZH: a Better Medical Adviser Learns from Better Instructions

    Authors: Yang Tan, Mingchen Li, Zijie Huang, Huiqun Yu, Guisheng Fan

    Abstract: Generative large language models (LLMs) have shown great success in various applications, including question-answering (QA) and dialogue systems. However, in specialized domains like traditional Chinese medical QA, these models may perform unsatisfactorily without fine-tuning on domain-specific datasets. To address this, we introduce MedChatZH, a dialogue model designed specifically for traditiona…

    Submitted 3 September, 2023; originally announced September 2023.

    Comments: 7 pages, 3 figures

  26. arXiv:2308.11624  [pdf]

    cs.LG cs.AI cs.AR

    Revolutionizing TCAD Simulations with Universal Device Encoding and Graph Attention Networks

    Authors: Guangxi Fan, Leilai Shao, Kain Lu Low

    Abstract: An innovative methodology that leverages artificial intelligence (AI) and graph representation for semiconductor device encoding in TCAD device simulation is proposed. A graph-based universal encoding scheme is presented that not only considers material-level and device-level embeddings, but also introduces a novel spatial relationship embedding inspired by interpolation operations typically used…

    Submitted 23 January, 2024; v1 submitted 1 August, 2023; originally announced August 2023.

    Comments: 32 pages, 13 figures and 4 tables

  27. arXiv:2305.19623  [pdf, other]

    cs.CV cs.LG

    Point-GCC: Universal Self-supervised 3D Scene Pre-training via Geometry-Color Contrast

    Authors: Guofan Fan, Zekun Qi, Wenkai Shi, Kaisheng Ma

    Abstract: Geometry and color information provided by the point clouds are both crucial for 3D scene understanding. Two pieces of information characterize the different aspects of point clouds, but existing methods lack an elaborate design for the discrimination and relevance. Hence we explore a 3D self-supervised paradigm that can better utilize the relations of point cloud information. Specifically, we pro…

    Submitted 1 June, 2023; v1 submitted 31 May, 2023; originally announced May 2023.

  28. arXiv:2305.07856  [pdf, other]

    cs.MA cs.AI

    Stackelberg Decision Transformer for Asynchronous Action Coordination in Multi-Agent Systems

    Authors: Bin Zhang, Hangyu Mao, Lijuan Li, Zhiwei Xu, Dapeng Li, Rui Zhao, Guoliang Fan

    Abstract: Asynchronous action coordination presents a pervasive challenge in Multi-Agent Systems (MAS), which can be represented as a Stackelberg game (SG). However, the scalability of existing Multi-Agent Reinforcement Learning (MARL) methods based on SG is severely constrained by network structures or environmental limitations. To address this issue, we propose the Stackelberg Decision Transformer (STEER)…

    Submitted 13 May, 2023; originally announced May 2023.

    Comments: 11 pages, 7 papers

  29. arXiv:2305.04316  [pdf, other]

    cs.PL cs.SE

    Synthesizing Conjunctive Queries for Code Search

    Authors: Chengpeng Wang, Peisen Yao, Wensheng Tang, Gang Fan, Charles Zhang

    Abstract: This paper presents Squid, a new conjunctive query synthesis algorithm for searching code with target patterns. Given positive and negative examples along with a natural language description, Squid analyzes the relations derived from the examples by a Datalog-based program analyzer and synthesizes a conjunctive query expressing the search intent. The synthesized query can be further used to search…

    Submitted 11 May, 2023; v1 submitted 7 May, 2023; originally announced May 2023.

    Comments: 32 pages, 7 figures, and 1 table. Accepted by ECOOP 2023

  30. arXiv:2304.14656  [pdf, other]

    cs.MA cs.AI cs.LG

    From Explicit Communication to Tacit Cooperation: A Novel Paradigm for Cooperative MARL

    Authors: Dapeng Li, Zhiwei Xu, Bin Zhang, Guoliang Fan

    Abstract: Centralized training with decentralized execution (CTDE) is a widely-used learning paradigm that has achieved significant success in complex tasks. However, partial observability issues and the absence of effectively shared signals between agents often limit its effectiveness in fostering cooperation. While communication can address this challenge, it simultaneously reduces the algorithm's practic…

    Submitted 28 April, 2023; originally announced April 2023.

    Comments: 16 pages, 10 figures

  31. arXiv:2304.12532  [pdf, other]

    cs.MA cs.AI eess.SY

    SEA: A Spatially Explicit Architecture for Multi-Agent Reinforcement Learning

    Authors: Dapeng Li, Zhiwei Xu, Bin Zhang, Guoliang Fan

    Abstract: Spatial information is essential in various fields. How to explicitly model according to the spatial location of agents is also very important for the multi-agent problem, especially when the number of agents is changing and the scale is enormous. Inspired by the point cloud task in computer vision, we propose a spatial information extraction structure for multi-agent reinforcement learning in thi…

    Submitted 24 April, 2023; originally announced April 2023.

    Comments: 8 pages, 6 figures, accepted by IJCNN 2023

  32. arXiv:2304.10351  [pdf, other]

    cs.MA cs.AI

    Inducing Stackelberg Equilibrium through Spatio-Temporal Sequential Decision-Making in Multi-Agent Reinforcement Learning

    Authors: Bin Zhang, Lijuan Li, Zhiwei Xu, Dapeng Li, Guoliang Fan

    Abstract: In multi-agent reinforcement learning (MARL), self-interested agents attempt to establish equilibrium and achieve coordination depending on game structure. However, existing MARL approaches are mostly bound by the simultaneous actions of all agents in the Markov game (MG) framework, and few works consider the formation of equilibrium strategies via asynchronous action coordination. In view of the…

    Submitted 10 December, 2023; v1 submitted 20 April, 2023; originally announced April 2023.

    Comments: Accepted as a conference paper to the 32nd International Joint Conference on Artificial Intelligence (IJCAI-23)

  33. arXiv:2303.11716  [pdf, other]

    cs.LG cs.AI q-fin.RM

    Style Miner: Find Significant and Stable Explanatory Factors in Time Series with Constrained Reinforcement Learning

    Authors: Dapeng Li, Feiyang Pan, Jia He, Zhiwei Xu, Dandan Tu, Guoliang Fan

    Abstract: In high-dimensional time-series analysis, it is essential to have a set of key factors (namely, the style factors) that explain the change of the observed variable. For example, volatility modeling in finance relies on a set of risk factors, and climate change studies in climatology rely on a set of causal factors. The ideal low-dimensional style factors should balance significance (with high expl…

    Submitted 21 March, 2023; originally announced March 2023.

    Comments: 9 pages, 6 figures

  34. arXiv:2302.02318  [pdf, other]

    cs.CV

    Contrast with Reconstruct: Contrastive 3D Representation Learning Guided by Generative Pretraining

    Authors: Zekun Qi, Runpei Dong, Guofan Fan, Zheng Ge, Xiangyu Zhang, Kaisheng Ma, Li Yi

    Abstract: Mainstream 3D representation learning approaches are built upon contrastive or generative modeling pretext tasks, where great improvements in performance on various downstream tasks have been achieved. However, we find these two paradigms have different characteristics: (i) contrastive models are data-hungry and suffer from a representation over-fitting issue; (ii) generative models have a data f…

    Submitted 22 May, 2023; v1 submitted 5 February, 2023; originally announced February 2023.

    Comments: Accepted at ICML 2023

  35. arXiv:2302.02180  [pdf, other]

    cs.MA cs.AI cs.LG

    Dual Self-Awareness Value Decomposition Framework without Individual Global Max for Cooperative Multi-Agent Reinforcement Learning

    Authors: Zhiwei Xu, Bin Zhang, Dapeng Li, Guangchong Zhou, Zeren Zhang, Guoliang Fan

    Abstract: Value decomposition methods have gained popularity in the field of cooperative multi-agent reinforcement learning. However, almost all existing methods follow the principle of Individual Global Max (IGM) or its variants, which limits their problem-solving capabilities. To address this, we propose a dual self-awareness value decomposition framework, inspired by the notion of dual self-awareness in…

    Submitted 16 May, 2023; v1 submitted 4 February, 2023; originally announced February 2023.

    Comments: 20 pages, 17 figures and 4 tables

  36. arXiv:2301.11011  [pdf, other]

    cs.PL cs.SE

    Verifying Data Constraint Equivalence in FinTech Systems

    Authors: Chengpeng Wang, Gang Fan, Peisen Yao, Fuxiong Pan, Charles Zhang

    Abstract: Data constraints are widely used in FinTech systems for monitoring data consistency and diagnosing anomalous data manipulations. However, many equivalent data constraints are created redundantly during the development cycle, slowing down the FinTech systems and causing unnecessary alerts. We present EqDAC, an efficient decision procedure to determine the data constraint equivalence. We first propo…

    Submitted 26 January, 2023; originally announced January 2023.

    Comments: 14 pages, 11 figures, accepted by ICSE 2023

  37. SESNet: sequence-structure feature-integrated deep learning method for data-efficient protein engineering

    Authors: Mingchen Li, Liqi Kang, Yi Xiong, Yu Guang Wang, Guisheng Fan, Pan Tan, Liang Hong

    Abstract: Deep learning has been widely used for protein engineering. However, it is limited by the lack of sufficient experimental data to train an accurate model for predicting the functional fitness of high-order mutants. Here, we develop SESNet, a supervised deep-learning model to predict the fitness for protein mutants by leveraging both sequence and structure information, and exploiting attention mech…

    Submitted 28 December, 2022; originally announced January 2023.

    Journal ref: Journal of Cheminformatics (2023) 15:12

  38. arXiv:2211.14091  [pdf, other]

    cs.CV

    Language-Assisted 3D Feature Learning for Semantic Scene Understanding

    Authors: Junbo Zhang, Guofan Fan, Guanghan Wang, Zhengyuan Su, Kaisheng Ma, Li Yi

    Abstract: Learning descriptive 3D features is crucial for understanding 3D scenes with diverse objects and complex structures. However, it is usually unknown whether important geometric attributes and scene context obtain enough emphasis in an end-to-end trained 3D scene understanding network. To guide 3D feature learning toward important geometric attributes and scene context, we explore the help of textua…

    Submitted 10 December, 2022; v1 submitted 25 November, 2022; originally announced November 2022.

    Comments: Accepted by AAAI 2023

  39. MV-HAN: A Hybrid Attentive Networks based Multi-View Learning Model for Large-scale Contents Recommendation

    Authors: Ge Fan, Chaoyun Zhang, Kai Wang, Junyang Chen

    Abstract: Industrial recommender systems usually employ multi-source data to improve the recommendation quality, while effectively sharing information between different data sources remains a challenge. In this paper, we introduce a novel Multi-View Approach with Hybrid Attentive Networks (MV-HAN) for contents retrieval at the matching stage of recommender systems. The proposed model enables high-order featu…

    Submitted 14 October, 2022; originally announced October 2022.

    Comments: accepted by ASE 2022

  40. arXiv:2210.01922  [pdf, other]

    cs.DB

    Semantics-aware Dataset Discovery from Data Lakes with Contextualized Column-based Representation Learning

    Authors: Grace Fan, Jin Wang, Yuliang Li, Dan Zhang, Renée Miller

    Abstract: Dataset discovery from data lakes is essential in many real application scenarios. In this paper, we propose Starmie, an end-to-end framework for dataset discovery from data lakes (with table union search as the main use case). Our proposed framework features a contrastive learning method to train column encoders from pre-trained language models in a fully unsupervised manner. The column encoder o…

    Submitted 15 January, 2023; v1 submitted 4 October, 2022; originally announced October 2022.

  41. arXiv:2209.13589  [pdf, other]

    cs.DB

    SANTOS: Relationship-based Semantic Table Union Search

    Authors: Aamod Khatiwada, Grace Fan, Roee Shraga, Zixuan Chen, Wolfgang Gatterbauer, Renée J. Miller, Mirek Riedewald

    Abstract: Existing techniques for unionable table search define unionability using metadata (tables must have the same or similar schemas) or column-based metrics (for example, the values in a table should be drawn from the same domain). In this work, we introduce the use of semantic relationships between pairs of columns in a table to improve the accuracy of union search. Consequently, we introduce a new n…

    Submitted 27 September, 2022; originally announced September 2022.

    Comments: 15 pages, 10 figures, to appear at SIGMOD 2023

  42. QuickSkill: Novice Skill Estimation in Online Multiplayer Games

    Authors: Chaoyun Zhang, Kai Wang, Hao Chen, Ge Fan, Yingjie Li, Lifang Wu, Bingchao Zheng

    Abstract: Matchmaking systems are vital for creating fair matches in online multiplayer games, which directly affects players' satisfaction and game experience. Most matchmaking systems rely largely on precise estimation of players' game skills to construct equitable games. However, the skill rating of a novice is usually inaccurate, as current matchmaking rating algorithms require considerable amou…

    Submitted 15 August, 2022; originally announced August 2022.

    Comments: Accepted by CIKM 2022 Applied Research Track

  43. arXiv:2208.05168  [pdf, other]

    cs.CR

    TokenPatronus: A Decentralized NFT Anti-theft Mechanism

    Authors: Zheng Cao, Yi Zhen, Gang Fan, Sheng Gao

    Abstract: The emergence of the metaverse brings tremendous evolution to Non-Fungible Tokens (NFTs), which can certify the ownership of unique digital assets in the cyber world. The NFT market has garnered unprecedented attention from investors and created billions of dollars in transaction volume. Meanwhile, securing NFTs is still a challenging issue. Recently, numerous incidents of NFT theft have been reporte…

    Submitted 10 August, 2022; originally announced August 2022.

    Comments: submitted to CESC 2022 as a work-in-progress paper

  44. arXiv:2206.02583  [pdf, other]

    cs.MA cs.AI

    Consensus Learning for Cooperative Multi-Agent Reinforcement Learning

    Authors: Zhiwei Xu, Bin Zhang, Dapeng Li, Zeren Zhang, Guangchong Zhou, Hao Chen, Guoliang Fan

    Abstract: Almost all multi-agent reinforcement learning algorithms without communication follow the principle of centralized training with decentralized execution. During centralized training, agents can be guided by the same signals, such as the global state. During decentralized execution, however, agents lack the shared signal. Inspired by viewpoint invariance and contrastive learning, we propose consens…

    Submitted 6 December, 2022; v1 submitted 6 June, 2022; originally announced June 2022.

    Comments: 14 pages, 13 figures, 2 tables

  45. arXiv:2204.09418  [pdf, other]

    cs.MA cs.AI cs.LG

    Mingling Foresight with Imagination: Model-Based Cooperative Multi-Agent Reinforcement Learning

    Authors: Zhiwei Xu, Dapeng Li, Bin Zhang, Yuan Zhan, Yunpeng Bai, Guoliang Fan

    Abstract: Recently, model-based agents have achieved better performance than model-free ones using the same computational budget and training time in single-agent environments. However, due to the complexity of multi-agent systems, it is tough to learn the model of the environment. The significant compounding error may hinder the learning process when model-based methods are applied to multi-agent tasks. Th…

    Submitted 6 December, 2022; v1 submitted 20 April, 2022; originally announced April 2022.

    Comments: 16 pages, 9 figures, 2 tables

  46. arXiv:2203.03265  [pdf, other]

    cs.AI cs.MA

    Efficient Policy Generation in Multi-Agent Systems via Hypergraph Neural Network

    Authors: Bin Zhang, Yunpeng Bai, Zhiwei Xu, Dapeng Li, Guoliang Fan

    Abstract: The application of deep reinforcement learning in multi-agent systems introduces extra challenges. In a scenario with numerous agents, one of the most important concerns currently being addressed is how to develop sufficient collaboration between diverse agents. To address this problem, we consider the form of agent interaction based on neighborhood and propose a multi-agent reinforcement learning…

    Submitted 11 October, 2022; v1 submitted 7 March, 2022; originally announced March 2022.

    Comments: 12 pages, 6 figures

  47. arXiv:2112.13513  [pdf]

    eess.IV cs.CV cs.LG

    MSHT: Multi-stage Hybrid Transformer for the ROSE Image Analysis of Pancreatic Cancer

    Authors: Tianyi Zhang, Yunlu Feng, Yu Zhao, Guangda Fan, Aiming Yang, Shangqin Lyu, Peng Zhang, Fan Song, Chenbin Ma, Yangyang Sun, Youdan Feng, Guanglei Zhang

    Abstract: Pancreatic cancer is one of the most malignant cancers in the world, which deteriorates rapidly with very high mortality. The rapid on-site evaluation (ROSE) technique innovates the workflow by immediately analyzing the fast stained cytopathological images with on-site pathologists, which enables faster diagnosis in this time-pressured process. However, the wider expansion of ROSE diagnosis has be…

    Submitted 27 December, 2021; originally announced December 2021.

    Comments: 12 pages, 10 figures

  48. arXiv:2112.06771  [pdf, other]

    cs.AI cs.LG cs.MA

    Cooperative Multi-Agent Reinforcement Learning with Hypergraph Convolution

    Authors: Yunpeng Bai, Chen Gong, Bin Zhang, Guoliang Fan, Xinwen Hou, Yu Liu

    Abstract: Recent years have witnessed the great success of multi-agent systems (MAS). Value decomposition, which decomposes joint action values into individual action values, has been an important work in MAS. However, many value decomposition methods ignore the coordination among different agents, leading to the notorious "lazy agents" problem. To enhance the coordination in MAS, this paper proposes HyperG…

    Submitted 28 April, 2022; v1 submitted 9 December, 2021; originally announced December 2021.

    Comments: 8 pages, 6 figures, accepted by IJCNN 2022

  49. arXiv:2111.05670  [pdf, other]

    cs.LG cs.MA

    DeCOM: Decomposed Policy for Constrained Cooperative Multi-Agent Reinforcement Learning

    Authors: Zhaoxing Yang, Rong Ding, Haiming Jin, Yifei Wei, Haoyi You, Guiyun Fan, Xiaoying Gan, Xinbing Wang

    Abstract: In recent years, multi-agent reinforcement learning (MARL) has presented impressive performance in various applications. However, physical limitations, budget restrictions, and many other factors usually impose constraints on a multi-agent system (MAS), which cannot be handled by traditional MARL frameworks. Specifically, this paper focuses on constrained MASes where agents work c…

    Submitted 10 November, 2021; originally announced November 2021.

    Comments: 25 pages

  50. arXiv:2110.07246  [pdf, other]

    cs.MA cs.AI cs.LG

    HAVEN: Hierarchical Cooperative Multi-Agent Reinforcement Learning with Dual Coordination Mechanism

    Authors: Zhiwei Xu, Yunpeng Bai, Bin Zhang, Dapeng Li, Guoliang Fan

    Abstract: Recently, some challenging tasks in multi-agent systems have been solved by some hierarchical reinforcement learning methods. Inspired by the intra-level and inter-level coordination in the human nervous system, we propose a novel value decomposition framework HAVEN based on hierarchical reinforcement learning for fully cooperative multi-agent problems. To address the instability arising from the…

    Submitted 6 December, 2022; v1 submitted 14 October, 2021; originally announced October 2021.

    Comments: 13 pages, 11 figures, 2 tables