
Showing 1–50 of 65 results for author: Fang, R

  1. arXiv:2407.08273   

    cs.CL

    RB-SQL: A Retrieval-based LLM Framework for Text-to-SQL

    Authors: Zhenhe Wu, Zhongqiu Li, Jie Zhang, Mengxiang Li, Yu Zhao, Ruiyu Fang, Zhongjiang He, Xuelong Li, Zhoujun Li, Shuangyong Song

    Abstract: Large language models (LLMs) with in-context learning have significantly improved the performance of the text-to-SQL task. Previous works generally focus on using an exclusive SQL generation prompt to improve the LLMs' reasoning ability. However, they mostly struggle to handle large databases with numerous tables and columns, and usually ignore the significance of pre-processing the database and extracting v…

    Submitted 12 July, 2024; v1 submitted 11 July, 2024; originally announced July 2024.

    Comments: Further improvement and modification are needed.

  2. arXiv:2407.06045  [pdf, other]

    cs.CV

    OpenCIL: Benchmarking Out-of-Distribution Detection in Class-Incremental Learning

    Authors: Wenjun Miao, Guansong Pang, Trong-Tung Nguyen, Ruohang Fang, Jin Zheng, Xiao Bai

    Abstract: Class incremental learning (CIL) aims to learn a model that can not only incrementally accommodate new classes, but also maintain the learned knowledge of old classes. Out-of-distribution (OOD) detection in CIL is to retain this incremental learning ability, while being able to reject unknown samples that are drawn from different distributions of the learned classes. This capability is crucial to…

    Submitted 9 July, 2024; v1 submitted 8 July, 2024; originally announced July 2024.

  3. arXiv:2406.10729  [pdf, other]

    cs.LG cs.AI cs.CV

    A Comprehensive Survey of Foundation Models in Medicine

    Authors: Wasif Khan, Seowung Leem, Kyle B. See, Joshua K. Wong, Shaoting Zhang, Ruogu Fang

    Abstract: Foundation models (FMs) are large-scale deep-learning models trained on extensive datasets using self-supervised techniques. These models serve as a base for various downstream tasks, including healthcare. FMs have been adopted with great success across various domains within healthcare, including natural language processing (NLP), computer vision, graph learning, biology, and omics. Existing heal…

    Submitted 15 June, 2024; originally announced June 2024.

    Comments: 44 pages, and a more compact version is under review

  4. arXiv:2406.10395  [pdf, other]

    eess.IV cs.CV q-bio.NC

    BrainFounder: Towards Brain Foundation Models for Neuroimage Analysis

    Authors: Joseph Cox, Peng Liu, Skylar E. Stolte, Yunchao Yang, Kang Liu, Kyle B. See, Huiwen Ju, Ruogu Fang

    Abstract: The burgeoning field of brain health research increasingly leverages artificial intelligence (AI) to interpret and analyze neurological data. This study introduces a novel approach towards the creation of medical foundation models by integrating a large-scale multi-modal magnetic resonance imaging (MRI) dataset derived from 41,400 participants in its own. Our method involves a novel two-stage pret…

    Submitted 14 June, 2024; originally announced June 2024.

    Comments: 17 pages, 5 figures, to be published in Medical Image Analysis

  5. arXiv:2406.01637  [pdf, other]

    cs.MA cs.AI

    Teams of LLM Agents can Exploit Zero-Day Vulnerabilities

    Authors: Richard Fang, Rohan Bindu, Akul Gupta, Qiusi Zhan, Daniel Kang

    Abstract: LLM agents have become increasingly sophisticated, especially in the realm of cybersecurity. Researchers have shown that LLM agents can exploit real-world vulnerabilities when given a description of the vulnerability and toy capture-the-flag problems. However, these agents still perform poorly on real-world vulnerabilities that are unknown to the agent ahead of time (zero-day vulnerabilities). I…

    Submitted 2 June, 2024; originally announced June 2024.

  6. arXiv:2405.14205  [pdf, other]

    cs.CL cs.AI cs.CV cs.LG cs.MA

    Agent Planning with World Knowledge Model

    Authors: Shuofei Qiao, Runnan Fang, Ningyu Zhang, Yuqi Zhu, Xiang Chen, Shumin Deng, Yong Jiang, Pengjun Xie, Fei Huang, Huajun Chen

    Abstract: Recent endeavors towards directly using large language models (LLMs) as agent models to execute interactive planning tasks have shown commendable results. Despite their achievements, however, they still struggle with brainless trial-and-error in global planning and generating hallucinatory actions in local planning due to their poor understanding of the "real" physical world. Imitating humans' m…

    Submitted 23 May, 2024; originally announced May 2024.

    Comments: Work in progress

  7. arXiv:2404.08144  [pdf, other]

    cs.CR cs.AI

    LLM Agents can Autonomously Exploit One-day Vulnerabilities

    Authors: Richard Fang, Rohan Bindu, Akul Gupta, Daniel Kang

    Abstract: LLMs have become increasingly powerful, both in their benign and malicious uses. With the increase in capabilities, researchers have been increasingly interested in their ability to exploit cybersecurity vulnerabilities. In particular, recent work has conducted preliminary studies on the ability of LLM agents to autonomously hack websites. However, these studies are limited to simple vulnerabili…

    Submitted 17 April, 2024; v1 submitted 11 April, 2024; originally announced April 2024.

  8. arXiv:2403.12963  [pdf, other]

    cs.CV

    FouriScale: A Frequency Perspective on Training-Free High-Resolution Image Synthesis

    Authors: Linjiang Huang, Rongyao Fang, Aiping Zhang, Guanglu Song, Si Liu, Yu Liu, Hongsheng Li

    Abstract: In this study, we delve into the generation of high-resolution images from pre-trained diffusion models, addressing persistent challenges, such as repetitive patterns and structural distortions, that emerge when models are applied beyond their trained resolutions. To address this issue, we introduce an innovative, training-free approach FouriScale from the perspective of frequency domain analysis.…

    Submitted 19 March, 2024; originally announced March 2024.

  9. arXiv:2403.03676  [pdf, other]

    cs.LG

    Simplified PCNet with Robustness

    Authors: Bingheng Li, Xuanting Xie, Haoxiang Lei, Ruiyi Fang, Zhao Kang

    Abstract: Graph Neural Networks (GNNs) have garnered significant attention for their success in learning the representation of homophilic or heterophilic graphs. However, they cannot generalize well to real-world graphs with different levels of homophily. In response, the Poisson-Charlier Network (PCNet), the previous work, allows graph representation to be learned from heterophily to homoph…

    Submitted 6 March, 2024; originally announced March 2024.

    Comments: 10 pages, 3 figures

  10. arXiv:2402.06664  [pdf, other]

    cs.CR cs.AI

    LLM Agents can Autonomously Hack Websites

    Authors: Richard Fang, Rohan Bindu, Akul Gupta, Qiusi Zhan, Daniel Kang

    Abstract: In recent years, large language models (LLMs) have become increasingly capable and can now interact with tools (i.e., call functions), read documents, and recursively call themselves. As a result, these LLMs can now function autonomously as agents. With the rise in capabilities of these agents, recent work has speculated on how LLM agents would affect cybersecurity. However, not much is known abou…

    Submitted 15 February, 2024; v1 submitted 6 February, 2024; originally announced February 2024.

  11. arXiv:2402.03049  [pdf, other]

    cs.CL cs.AI cs.HC cs.IR cs.LG

    EasyInstruct: An Easy-to-use Instruction Processing Framework for Large Language Models

    Authors: Yixin Ou, Ningyu Zhang, Honghao Gui, Ziwen Xu, Shuofei Qiao, Yida Xue, Runnan Fang, Kangwei Liu, Lei Li, Zhen Bi, Guozhou Zheng, Huajun Chen

    Abstract: In recent years, instruction tuning has gained increasing attention and emerged as a crucial technique to enhance the capabilities of Large Language Models (LLMs). To construct high-quality instruction datasets, many instruction processing approaches have been proposed, aiming to achieve a delicate balance between data quantity and data quality. Nevertheless, due to inconsistencies that persist am…

    Submitted 23 June, 2024; v1 submitted 5 February, 2024; originally announced February 2024.

    Comments: ACL 2024 System Demonstrations; Project website: https://zjunlp.github.io/project/EasyInstruct Code: https://github.com/zjunlp/EasyInstruct Video: https://youtu.be/rfQOWYfziFo Demo: https://huggingface.co/spaces/zjunlp/EasyInstruct

  12. arXiv:2401.17268  [pdf, other]

    cs.CL cs.AI cs.LG

    Weaver: Foundation Models for Creative Writing

    Authors: Tiannan Wang, Jiamin Chen, Qingrui Jia, Shuai Wang, Ruoyu Fang, Huilin Wang, Zhaowei Gao, Chunzhao Xie, Chuou Xu, Jihong Dai, Yibin Liu, Jialong Wu, Shengwei Ding, Long Li, Zhiwei Huang, Xinle Deng, Teng Yu, Gangan Ma, Han Xiao, Zixin Chen, Danjun Xiang, Yunxia Wang, Yuanyuan Zhu, Yi Xiao, Jing Wang, et al. (21 additional authors not shown)

    Abstract: This work introduces Weaver, our first family of large language models (LLMs) dedicated to content creation. Weaver is pre-trained on a carefully selected corpus that focuses on improving the writing capabilities of large language models. We then fine-tune Weaver for creative and professional writing purposes and align it to the preference of professional writers using a suite of novel methods for…

    Submitted 30 January, 2024; originally announced January 2024.

  13. arXiv:2401.05268  [pdf, other]

    cs.CL cs.AI cs.HC cs.LG cs.MA

    AutoAct: Automatic Agent Learning from Scratch for QA via Self-Planning

    Authors: Shuofei Qiao, Ningyu Zhang, Runnan Fang, Yujie Luo, Wangchunshu Zhou, Yuchen Eleanor Jiang, Chengfei Lv, Huajun Chen

    Abstract: Language agents have achieved considerable performance on various complex question-answering tasks by planning with external tools. Despite the incessant exploration in this field, existing language agent systems still struggle with costly, non-reproducible data reliance and face the challenge of compelling a single model for multiple functions. To this end, we introduce AutoAct, an automatic agen…

    Submitted 26 May, 2024; v1 submitted 10 January, 2024; originally announced January 2024.

    Comments: ACL 2024

  14. arXiv:2401.03804  [pdf, other]

    cs.CL cs.AI

    TeleChat Technical Report

    Authors: Zhongjiang He, Zihan Wang, Xinzhang Liu, Shixuan Liu, Yitong Yao, Yuyao Huang, Xuelong Li, Yongxiang Li, Zhonghao Che, Zhaoxi Zhang, Yan Wang, Xin Wang, Luwen Pu, Huinan Xu, Ruiyu Fang, Yu Zhao, Jie Zhang, Xiaomeng Huang, Zhilong Lu, Jiaxin Peng, Wenjun Zheng, Shiquan Wang, Bingkai Yang, Xuewei He, Zhuoru Jiang, et al. (11 additional authors not shown)

    Abstract: In this technical report, we present TeleChat, a collection of large language models (LLMs) with parameters of 3 billion, 7 billion and 12 billion. It includes pretrained language models as well as fine-tuned chat models that are aligned with human preferences. TeleChat is initially pretrained on an extensive corpus containing a diverse collection of texts from both English and Chinese languages, i…

    Submitted 1 April, 2024; v1 submitted 8 January, 2024; originally announced January 2024.

    Comments: 28 pages, 2 figures

    ACM Class: I.2.7

  15. arXiv:2312.10439  [pdf, other]

    cs.CV

    Simple Image-level Classification Improves Open-vocabulary Object Detection

    Authors: Ruohuan Fang, Guansong Pang, Xiao Bai

    Abstract: Open-Vocabulary Object Detection (OVOD) aims to detect novel objects beyond a given set of base categories on which the detection model is trained. Recent OVOD methods focus on adapting the image-level pre-trained vision-language models (VLMs), such as CLIP, to a region-level object detection task via, e.g., region-level knowledge distillation, regional prompt learning, or region-text pre-training,…

    Submitted 19 December, 2023; v1 submitted 16 December, 2023; originally announced December 2023.

    Comments: Accepted at AAAI 2024

  16. arXiv:2312.07899  [pdf]

    q-bio.QM cs.AI cs.CV cs.LG

    Morphological Profiling for Drug Discovery in the Era of Deep Learning

    Authors: Qiaosi Tang, Ranjala Ratnayake, Gustavo Seabra, Zhe Jiang, Ruogu Fang, Lina Cui, Yousong Ding, Tamer Kahveci, Jiang Bian, Chenglong Li, Hendrik Luesch, Yanjun Li

    Abstract: Morphological profiling is a valuable tool in phenotypic drug discovery. The advent of high-throughput automated imaging has enabled the capturing of a wide range of morphological features of cells or organisms in response to perturbations at the single-cell resolution. Concurrently, significant advances in machine learning and deep learning, especially in computer vision, have led to substantial…

    Submitted 15 January, 2024; v1 submitted 13 December, 2023; originally announced December 2023.

    Comments: 44 pages, 5 figures, 5 tables

  17. arXiv:2311.18835  [pdf, other]

    cs.CV

    InstructSeq: Unifying Vision Tasks with Instruction-conditioned Multi-modal Sequence Generation

    Authors: Rongyao Fang, Shilin Yan, Zhaoyang Huang, Jingqiu Zhou, Hao Tian, Jifeng Dai, Hongsheng Li

    Abstract: Empowering models to dynamically accomplish tasks specified through natural language instructions represents a promising path toward more capable and general artificial intelligence. In this work, we introduce InstructSeq, an instruction-conditioned multi-modal modeling framework that unifies diverse vision tasks through flexible natural language control and handling of both visual and textual dat…

    Submitted 30 November, 2023; originally announced November 2023.

    Comments: 10 pages

  18. arXiv:2311.05553  [pdf, other]

    cs.CL cs.AI

    Removing RLHF Protections in GPT-4 via Fine-Tuning

    Authors: Qiusi Zhan, Richard Fang, Rohan Bindu, Akul Gupta, Tatsunori Hashimoto, Daniel Kang

    Abstract: As large language models (LLMs) have increased in their capabilities, so has their potential for dual use. To reduce harmful outputs, producers and vendors of LLMs have used reinforcement learning with human feedback (RLHF). In tandem, LLM vendors have been increasingly enabling fine-tuning of their most powerful models. However, concurrent work has shown that fine-tuning can remove RLHF protectio…

    Submitted 5 April, 2024; v1 submitted 9 November, 2023; originally announced November 2023.

    Comments: Accepted to NAACL 2024. (7 pages)

  19. arXiv:2311.00300  [pdf]

    cs.CL

    Entity Alignment Method of Science and Technology Patent based on Graph Convolution Network and Information Fusion

    Authors: Runze Fang, Yawen Li, Yingxia Shao, Zeli Guan, Zhe Xue

    Abstract: The entity alignment of science and technology patents aims to link the equivalent entities in the knowledge graph of different science and technology patent data sources. Most entity alignment methods only use a graph neural network to obtain the embedding of graph structure or use attribute text description to obtain semantic representation, ignoring the process of multi-information fusion in scie…

    Submitted 1 November, 2023; originally announced November 2023.

  20. arXiv:2310.15436  [pdf, other]

    cs.SE

    VGX: Large-Scale Sample Generation for Boosting Learning-Based Software Vulnerability Analyses

    Authors: Yu Nong, Richard Fang, Guangbei Yi, Kunsong Zhao, Xiapu Luo, Feng Chen, Haipeng Cai

    Abstract: Accompanying the successes of learning-based defensive software vulnerability analyses is the lack of large and quality sets of labeled vulnerable program samples, which impedes further advancement of those defenses. Existing automated sample generation approaches have shown potential yet still fall short of practical expectations due to the high noise in the generated samples. This paper propose…

    Submitted 4 January, 2024; v1 submitted 23 October, 2023; originally announced October 2023.

    Comments: 13 pages, 6 figures, To appear at ICSE 2024

  21. arXiv:2310.12357  [pdf, other]

    cs.SE cs.CR

    Large Language Models for Code Analysis: Do LLMs Really Do Their Job?

    Authors: Chongzhou Fang, Ning Miao, Shaurya Srivastav, Jialin Liu, Ruoyu Zhang, Ruijie Fang, Asmita, Ryan Tsang, Najmeh Nazari, Han Wang, Houman Homayoun

    Abstract: Large language models (LLMs) have demonstrated significant potential in the realm of natural language understanding and programming code processing tasks. Their capacity to comprehend and generate human-like code has spurred research into harnessing LLMs for code analysis purposes. However, the existing body of literature falls short in delivering a systematic evaluation and assessment of LLMs' ef…

    Submitted 5 March, 2024; v1 submitted 18 October, 2023; originally announced October 2023.

    Comments: Accepted by Usenix Security 2024

  22. arXiv:2309.02088  [pdf, other]

    cs.CV cs.AI

    Dual Adversarial Alignment for Realistic Support-Query Shift Few-shot Learning

    Authors: Siyang Jiang, Rui Fang, Hsi-Wen Chen, Wei Ding, Ming-Syan Chen

    Abstract: Support-query shift few-shot learning aims to classify unseen examples (query set) to labeled data (support set) based on the learned embedding in a low-dimensional space under a distribution shift between the support set and the query set. However, in real-world scenarios the shifts are usually unknown and varied, making it difficult to estimate in advance. Therefore, in this paper, we propose a…

    Submitted 5 September, 2023; originally announced September 2023.

    Comments: Best student paper in PAKDD 2022

  23. arXiv:2308.16527  [pdf, other]

    cs.CV

    Unsupervised Recognition of Unknown Objects for Open-World Object Detection

    Authors: Ruohuan Fang, Guansong Pang, Lei Zhou, Xiao Bai, Jin Zheng

    Abstract: Open-World Object Detection (OWOD) extends the object detection problem to a realistic and dynamic scenario, where a detection model is required to be capable of detecting both known and unknown objects and incrementally learning newly introduced knowledge. Current OWOD models, such as ORE and OW-DETR, focus on pseudo-labeling regions with high objectness scores as unknowns, whose performance relies h…

    Submitted 31 August, 2023; originally announced August 2023.

  24. arXiv:2308.10453  [pdf]

    cs.CV cs.LG eess.IV

    DOMINO++: Domain-aware Loss Regularization for Deep Learning Generalizability

    Authors: Skylar E. Stolte, Kyle Volle, Aprinda Indahlastari, Alejandro Albizu, Adam J. Woods, Kevin Brink, Matthew Hale, Ruogu Fang

    Abstract: Out-of-distribution (OOD) generalization poses a serious challenge for modern deep learning (DL). OOD data consists of test data that is significantly different from the model's training data. DL models that perform well on in-domain test data could struggle on OOD data. Overcoming this discrepancy is essential to the reliable deployment of DL. Proper model calibration decreases the number of spur…

    Submitted 20 August, 2023; originally announced August 2023.

    Comments: 12 pages, 5 figures, 5 tables, Accepted by the International Conference on Medical Image Computing and Computer Assisted Intervention (MICCAI) 2023

  25. arXiv:2304.13397  [pdf, other]

    cs.CV cs.AI

    Filter Pruning via Filters Similarity in Consecutive Layers

    Authors: Xiaorui Wang, Jun Wang, Xin Tang, Peng Gao, Rui Fang, Guotong Xie

    Abstract: Filter pruning is widely adopted to compress and accelerate the Convolutional Neural Networks (CNNs), but most previous works ignore the relationship between filters and channels in different layers. Processing each layer independently fails to utilize the collaborative relationship across layers. In this paper, we intuitively propose a novel pruning method by explicitly leveraging the Filters Sim…

    Submitted 26 April, 2023; originally announced April 2023.

    Comments: Accepted by ICASSP 2023 (oral)

  26. arXiv:2303.05475  [pdf, other]

    cs.CV

    Mimic before Reconstruct: Enhancing Masked Autoencoders with Feature Mimicking

    Authors: Peng Gao, Renrui Zhang, Rongyao Fang, Ziyi Lin, Hongyang Li, Hongsheng Li, Qiao Yu

    Abstract: Masked Autoencoders (MAE) have been popular paradigms for large-scale vision representation pre-training. However, MAE solely reconstructs the low-level RGB signals after the decoder and lacks supervision upon high-level semantics for the encoder, thus suffering from sub-optimal learned representations and long pre-training epochs. To alleviate this, previous methods simply replace the pixel recon…

    Submitted 9 March, 2023; originally announced March 2023.

    Comments: 12 pages, 3 figures

  27. arXiv:2303.01503  [pdf, other]

    cs.CV

    FeatAug-DETR: Enriching One-to-Many Matching for DETRs with Feature Augmentation

    Authors: Rongyao Fang, Peng Gao, Aojun Zhou, Yingjie Cai, Si Liu, Jifeng Dai, Hongsheng Li

    Abstract: One-to-one matching is a crucial design in DETR-like object detection frameworks. It enables the DETR to perform end-to-end detection. However, it also faces challenges of lacking positive sample supervision and slow convergence speed. Several recent works proposed the one-to-many matching mechanism to accelerate training and boost detection performance. We revisit these methods and model them in…

    Submitted 2 March, 2023; originally announced March 2023.

    Comments: 12 pages, 6 figures

  28. arXiv:2302.06727  [pdf, other]

    cs.LG cs.CV eess.IV

    Deep Learning Predicts Prevalent and Incident Parkinson's Disease From UK Biobank Fundus Imaging

    Authors: Charlie Tran, Kai Shen, Kang Liu, Akshay Ashok, Adolfo Ramirez-Zamora, Jinghua Chen, Yulin Li, Ruogu Fang

    Abstract: Parkinson's disease is the world's fastest-growing neurological disorder. Research to elucidate the mechanisms of Parkinson's disease and automate diagnostics would greatly improve the treatment of patients with Parkinson's disease. Current diagnostic methods are expensive and have limited availability. Considering the insidious and preclinical onset and progression of the disease, a desirable scr…

    Submitted 18 February, 2024; v1 submitted 13 February, 2023; originally announced February 2023.

    Comments: 17 pages, 4 figures, 2 tables, 4 supplementary tables

  29. arXiv:2302.05142  [pdf]

    cs.CV cs.AI cs.LG

    DOMINO: Domain-aware Loss for Deep Learning Calibration

    Authors: Skylar E. Stolte, Kyle Volle, Aprinda Indahlastari, Alejandro Albizu, Adam J. Woods, Kevin Brink, Matthew Hale, Ruogu Fang

    Abstract: Deep learning has achieved state-of-the-art performance across medical imaging tasks; however, model calibration is often not considered. Uncalibrated models are potentially dangerous in high-risk applications since the user does not know when they will fail. Therefore, this paper proposes a novel domain-aware loss function to calibrate deep learning models. The proposed loss function applies…

    Submitted 10 February, 2023; originally announced February 2023.

    Comments: 7 pages, 1 figure, 1 table, accepted by the Software Impacts journal

  30. arXiv:2302.03008  [pdf, other]

    cs.LG cs.AI eess.IV

    LAVA: Granular Neuron-Level Explainable AI for Alzheimer's Disease Assessment from Fundus Images

    Authors: Nooshin Yousefzadeh, Charlie Tran, Adolfo Ramirez-Zamora, Jinghua Chen, Ruogu Fang, My T. Thai

    Abstract: Alzheimer's Disease (AD) is a progressive neurodegenerative disease and the leading cause of dementia. Early diagnosis is critical for patients to benefit from potential intervention and treatment. The retina has been hypothesized as a diagnostic site for AD detection owing to its anatomical connection with the brain. Developed AI models for this purpose have yet to provide a rational explanation…

    Submitted 16 March, 2023; v1 submitted 6 February, 2023; originally announced February 2023.

    Comments: 27 pages, 11 figures

  31. arXiv:2212.01977  [pdf, other]

    cs.LG cs.AI

    Distributed Pruning Towards Tiny Neural Networks in Federated Learning

    Authors: Hong Huang, Lan Zhang, Chaoyue Sun, Ruogu Fang, Xiaoyong Yuan, Dapeng Wu

    Abstract: Neural network pruning is an essential technique for reducing the size and complexity of deep neural networks, enabling large-scale models on devices with limited resources. However, existing pruning approaches heavily rely on training data for guiding the pruning strategies, making them ineffective for federated learning over distributed and confidential datasets. Additionally, the memory- and co…

    Submitted 11 July, 2023; v1 submitted 4 December, 2022; originally announced December 2022.

    Comments: This paper has been accepted to ICDCS 2023

  32. arXiv:2211.15003  [pdf, other]

    cs.CL

    STAGE: Span Tagging and Greedy Inference Scheme for Aspect Sentiment Triplet Extraction

    Authors: Shuo Liang, Wei Wei, Xian-Ling Mao, Yuanyuan Fu, Rui Fang, Dangyang Chen

    Abstract: Aspect Sentiment Triplet Extraction (ASTE) has become an emerging task in sentiment analysis research, aiming to extract triplets of the aspect term, its corresponding opinion term, and its associated sentiment polarity from a given sentence. Recently, many neural-network-based models with different tagging schemes have been proposed, but almost all of them have their limitations: heavily relying…

    Submitted 9 April, 2023; v1 submitted 27 November, 2022; originally announced November 2022.

    Comments: Accepted by AAAI 2023

  33. arXiv:2211.06475  [pdf, other]

    cs.NI

    High-Level Synthesis for Packet-Processing Pipelines

    Authors: Xiangyu Gao, Divya Raghunathan, Ruijie Fang, Tao Wang, Xiaotong Zhu, Anirudh Sivaraman, Srinivas Narayana, Aarti Gupta

    Abstract: Compiling high-level programs to target high-speed packet-processing pipelines is a challenging combinatorial optimization problem. The compiler must configure the pipeline's resources to match the high-level semantics of the program, while packing all of the program's computation into the pipeline's limited resources. State-of-the-art approaches tackle individual aspects of this problem. Yet, the…

    Submitted 18 November, 2022; v1 submitted 11 November, 2022; originally announced November 2022.

  34. arXiv:2210.12020  [pdf, other]

    cs.LG cs.AI

    HCL: Improving Graph Representation with Hierarchical Contrastive Learning

    Authors: Jun Wang, Weixun Li, Changyu Hou, Xin Tang, Yixuan Qiao, Rui Fang, Pengyong Li, Peng Gao, Guotong Xie

    Abstract: Contrastive learning has emerged as a powerful tool for graph representation learning. However, most contrastive learning methods learn features of graphs with fixed coarse-grained scale, which might underestimate either local or global information. To capture more hierarchical and richer representation, we propose a novel Hierarchical Contrastive Learning (HCL) framework that explicitly learns gr…

    Submitted 21 October, 2022; originally announced October 2022.

    Comments: Published at the 21st International Semantic Web Conference (ISWC 2022)

  35. arXiv:2210.08806  [pdf, other]

    cs.CL

    HCL-TAT: A Hybrid Contrastive Learning Method for Few-shot Event Detection with Task-Adaptive Threshold

    Authors: Ruihan Zhang, Wei Wei, Xian-Ling Mao, Rui Fang, Dangyang Chen

    Abstract: Conventional event detection models under supervised learning settings suffer from the inability to transfer to newly emerged event types owing to a lack of sufficient annotations. A commonly adopted solution is to follow an identify-then-classify manner, which first identifies the triggers and then converts the classification task via a few-shot learning paradigm. However, these methods still fall f…

    Submitted 20 October, 2022; v1 submitted 17 October, 2022; originally announced October 2022.

    Comments: This paper has been accepted by Findings of EMNLP 2022

  36. arXiv:2210.03291  [pdf]

    cs.IR

    A Relational Triple Extraction Method Based on Feature Reasoning for Technological Patents

    Authors: Runze Fang, Junping Du, Yingxia Shao, Zeli Guan

    Abstract: The relation triples extraction method based on table filling can address the issues of relation overlap and bias propagation. However, most of them only establish separate table features for each relationship, which ignores the implicit relationship between different entity pairs and different relationship features. Therefore, a feature reasoning relational triple extraction method based on table…

    Submitted 6 October, 2022; originally announced October 2022.

  37. arXiv:2209.06077  [pdf, other]

    eess.IV cs.CV cs.LG

    DOMINO: Domain-aware Model Calibration in Medical Image Segmentation

    Authors: Skylar E. Stolte, Kyle Volle, Aprinda Indahlastari, Alejandro Albizu, Adam J. Woods, Kevin Brink, Matthew Hale, Ruogu Fang

    Abstract: Model calibration measures the agreement between the predicted probability estimates and the true correctness likelihood. Proper model calibration is vital for high-risk applications. Unfortunately, modern deep neural networks are poorly calibrated, compromising trustworthiness and reliability. Medical image segmentation particularly suffers from this due to the natural uncertainty of tissue bound…

    Submitted 13 September, 2022; originally announced September 2022.

    Comments: 10 pages, 6 figures, 3 tables. Accepted by International Conference on Medical Image Computing and Computer Assisted Intervention (MICCAI) 2022 Oral Talk

  38. arXiv:2209.00793  [pdf, other]

    cs.LG cs.AI cs.CV cs.SI

    Structure-Preserving Graph Representation Learning

    Authors: Ruiyi Fang, Liangjian Wen, Zhao Kang, Jianzhuang Liu

    Abstract: Though graph representation learning (GRL) has made significant progress, it is still a challenge to extract and embed the rich topological structure and feature information in an adequate way. Most existing methods focus on local structure and fail to fully incorporate the global topological structure. To this end, we propose a novel Structure-Preserving Graph Representation Learning (SPGRL) meth…

    Submitted 7 December, 2022; v1 submitted 1 September, 2022; originally announced September 2022.

    Comments: Accepted by the IEEE International Conference on Data Mining (ICDM) 2022. arXiv admin note: text overlap with arXiv:2108.04822

  39. Multi-level Contrastive Learning Framework for Sequential Recommendation

    Authors: Ziyang Wang, Huoyu Liu, Wei Wei, Yue Hu, Xian-Ling Mao, Shaojian He, Rui Fang, Dangyang Chen

    Abstract: Sequential recommendation (SR) aims to predict the subsequent behaviors of users by understanding their successive historical behaviors. Recently, some methods for SR are devoted to alleviating the data sparsity problem (i.e., limited supervised signals for training), which take account of contrastive learning to incorporate self-supervised signals into SR. Despite their achievements, it is far fr…

    Submitted 27 August, 2022; originally announced August 2022.

    Journal ref: CIKM 2022

  40. Improving Personality Consistency in Conversation by Persona Extending

    Authors: Yifan Liu, Wei Wei, Jiayi Liu, Xianling Mao, Rui Fang, Dangyang Chen

    Abstract: Endowing chatbots with a consistent personality plays a vital role for agents to deliver human-like interactions. However, existing personalized approaches commonly generate responses in light of static predefined personas depicted with textual description, which may severely restrict the interactivity of human and the chatbot, especially when the agent needs to answer the query excluded in the pr…

    Submitted 23 August, 2022; originally announced August 2022.

    Comments: Accepted by CIKM 2022

  41. Improving Knowledge-aware Recommendation with Multi-level Interactive Contrastive Learning

    Authors: Ding Zou, Wei Wei, Ziyang Wang, Xian-Ling Mao, Feida Zhu, Rui Fang, Dangyang Chen

    Abstract: Incorporating Knowledge Graphs (KG) into recommender systems has attracted considerable attention. Recently, the technical trend of Knowledge-aware Recommendation (KGR) is to develop end-to-end models based on graph neural networks (GNNs). However, the extremely sparse user-item interactions significantly degrade the performance of the GNN-based models, as: 1) the sparse interaction means inadequat…

    Submitted 22 August, 2022; originally announced August 2022.

    Comments: Accepted to CIKM 2022

  42. arXiv:2207.09519

    cs.CV cs.AI cs.CL

    Tip-Adapter: Training-free Adaption of CLIP for Few-shot Classification

    Authors: Renrui Zhang, Zhang Wei, Rongyao Fang, Peng Gao, Kunchang Li, Jifeng Dai, Yu Qiao, Hongsheng Li

    Abstract: Contrastive Vision-Language Pre-training, known as CLIP, has provided a new paradigm for learning visual representations using large-scale image-text pairs. It shows impressive performance on downstream tasks by zero-shot knowledge transfer. To further enhance CLIP's adaption capability, existing methods proposed to fine-tune additional learnable modules, which significantly improves the few-shot…

    Submitted 19 July, 2022; originally announced July 2022.

    Comments: Accepted by ECCV 2022. arXiv admin note: substantial text overlap with arXiv:2111.03930

  43. arXiv:2205.14401

    cs.CV cs.AI

    Point-M2AE: Multi-scale Masked Autoencoders for Hierarchical Point Cloud Pre-training

    Authors: Renrui Zhang, Ziyu Guo, Rongyao Fang, Bin Zhao, Dong Wang, Yu Qiao, Hongsheng Li, Peng Gao

    Abstract: Masked Autoencoders (MAE) have shown great potentials in self-supervised pre-training for language and 2D image transformers. However, it still remains an open question on how to exploit masked autoencoding for learning 3D representations of irregular point clouds. In this paper, we propose Point-M2AE, a strong Multi-scale MAE pre-training framework for hierarchical self-supervised learning of 3D…

    Submitted 13 October, 2022; v1 submitted 28 May, 2022; originally announced May 2022.

    Comments: NeurIPS 2022

  44. arXiv:2205.11245

    cs.IR cs.CL

    PASH at TREC 2021 Deep Learning Track: Generative Enhanced Model for Multi-stage Ranking

    Authors: Yixuan Qiao, Hao Chen, Jun Wang, Yongquan Lai, Tuozhen Liu, Xianbin Ye, Xin Tang, Rui Fang, Peng Gao, Wenfeng Xie, Guotong Xie

    Abstract: This paper describes the PASH participation in TREC 2021 Deep Learning Track. In the recall stage, we adopt a scheme combining sparse and dense retrieval methods. In the multi-stage ranking phase, point-wise and pair-wise ranking strategies are used one after another based on a model continually pre-trained on general knowledge and document-level data. Compared to TREC 2020 Deep Learning Track, we have…

    Submitted 2 December, 2022; v1 submitted 18 May, 2022; originally announced May 2022.

    Comments: TREC 2021

  45. arXiv:2204.02251

    cs.CV

    RBGNet: Ray-based Grouping for 3D Object Detection

    Authors: Haiyang Wang, Shaoshuai Shi, Ze Yang, Rongyao Fang, Qi Qian, Hongsheng Li, Bernt Schiele, Liwei Wang

    Abstract: As a fundamental problem in computer vision, 3D object detection is experiencing rapid growth. To extract the point-wise features from the irregularly and sparsely distributed points, previous methods usually take a feature grouping module to aggregate the point features to an object candidate. However, these methods have not yet leveraged the surface geometry of foreground objects to enhance grou…

    Submitted 5 April, 2022; originally announced April 2022.

  46. arXiv:2201.07105

    cs.CL cs.LG

    Beyond modeling: NLP Pipeline for efficient environmental policy analysis

    Authors: Jordi Planas, Daniel Firebanks-Quevedo, Galina Naydenova, Ramansh Sharma, Cristina Taylor, Kathleen Buckingham, Rong Fang

    Abstract: As we enter the UN Decade on Ecosystem Restoration, creating effective incentive structures for forest and landscape restoration has never been more critical. Policy analysis is necessary for policymakers to understand the actors and rules involved in restoration in order to shift economic and financial incentives to the right places. Classical policy analysis is resource-intensive and complex, la…

    Submitted 8 January, 2022; originally announced January 2022.

    Comments: Accepted at Fragile Earth workshop proceedings at KDD 2021

  47. arXiv:2112.07463

    cs.SD eess.AS

    End-to-end speaker diarization with transformer

    Authors: Yongquan Lai, Xin Tang, Yuanyuan Fu, Rui Fang

    Abstract: Speaker diarization is connected to semantic segmentation in computer vision. Inspired by MaskFormer \cite{cheng2021per}, which treats semantic segmentation as a set-prediction problem, we propose an end-to-end approach to predict a set of targets consisting of binary masks, vocal activities and speaker vectors. Our model, which we coin DiFormer, is mainly based on a speaker encoder and…

    Submitted 14 December, 2021; originally announced December 2021.

    Comments: Submitted to ICASSP 2022

  48. arXiv:2112.00948

    cs.CV

    Visual-Semantic Transformer for Scene Text Recognition

    Authors: Xin Tang, Yongquan Lai, Ying Liu, Yuanyuan Fu, Rui Fang

    Abstract: Modeling semantic information is helpful for scene text recognition. In this work, we propose to model semantic and visual information jointly with a Visual-Semantic Transformer (VST). The VST first explicitly extracts primary semantic information from visual feature maps with a transformer module and a primary visual-semantic alignment module. The semantic information is then joined with the visu…

    Submitted 1 December, 2021; originally announced December 2021.

  49. arXiv:2111.03930

    cs.CV cs.CL

    Tip-Adapter: Training-free CLIP-Adapter for Better Vision-Language Modeling

    Authors: Renrui Zhang, Rongyao Fang, Wei Zhang, Peng Gao, Kunchang Li, Jifeng Dai, Yu Qiao, Hongsheng Li

    Abstract: Contrastive Vision-Language Pre-training, known as CLIP, has provided a new paradigm for learning visual representations by using large-scale contrastive image-text pairs. It shows impressive performance on zero-shot knowledge transfer to downstream tasks. To further enhance CLIP's few-shot capability, CLIP-Adapter proposed to fine-tune a lightweight residual feature adapter and significantly impr…

    Submitted 14 November, 2021; v1 submitted 6 November, 2021; originally announced November 2021.

    Comments: preprints

  50. arXiv:2110.04544

    cs.CV cs.CL

    CLIP-Adapter: Better Vision-Language Models with Feature Adapters

    Authors: Peng Gao, Shijie Geng, Renrui Zhang, Teli Ma, Rongyao Fang, Yongfeng Zhang, Hongsheng Li, Yu Qiao

    Abstract: Large-scale contrastive vision-language pre-training has shown significant progress in visual representation learning. Unlike traditional visual systems trained by a fixed set of discrete labels, a new paradigm was introduced in \cite{radford2021learning} to directly learn to align images with raw texts in an open-vocabulary setting. On downstream tasks, a carefully chosen text prompt is employed…

    Submitted 9 October, 2021; originally announced October 2021.

    Comments: Technical Report