Showing 1–50 of 186 results for author: Ren, P

  1. arXiv:2407.07844  [pdf, other]

    cs.CV

    OV-DINO: Unified Open-Vocabulary Detection with Language-Aware Selective Fusion

    Authors: Hao Wang, Pengzhen Ren, Zequn Jie, Xiao Dong, Chengjian Feng, Yinlong Qian, Lin Ma, Dongmei Jiang, Yaowei Wang, Xiangyuan Lan, Xiaodan Liang

    Abstract: Open-vocabulary detection is a challenging task due to the requirement of detecting objects based on class names, including those not encountered during training. Existing methods have shown strong zero-shot detection capabilities through pre-training on diverse large-scale datasets. However, these approaches still face two primary challenges: (i) how to universally integrate diverse data sources…

    Submitted 10 July, 2024; originally announced July 2024.

    Comments: Technical Report

  2. arXiv:2406.14891  [pdf, other]

    cs.CL cs.IR

    Generate-then-Ground in Retrieval-Augmented Generation for Multi-hop Question Answering

    Authors: Zhengliang Shi, Shuo Zhang, Weiwei Sun, Shen Gao, Pengjie Ren, Zhumin Chen, Zhaochun Ren

    Abstract: Multi-Hop Question Answering (MHQA) tasks present a significant challenge for large language models (LLMs) due to the intensive knowledge required. Current solutions, like Retrieval-Augmented Generation, typically retrieve potential documents from an external corpus to read an answer. However, the performance of this retrieve-then-read paradigm is constrained by the retriever and the inevitable no…

    Submitted 21 June, 2024; originally announced June 2024.

    Comments: ACL 2024 (main conference)

  3. arXiv:2406.04984  [pdf, other]

    cs.CL

    MEFT: Memory-Efficient Fine-Tuning through Sparse Adapter

    Authors: Jitai Hao, WeiWei Sun, Xin Xin, Qi Meng, Zhumin Chen, Pengjie Ren, Zhaochun Ren

    Abstract: Parameter-Efficient Fine-tuning (PEFT) facilitates the fine-tuning of Large Language Models (LLMs) under limited resources. However, the fine-tuning performance with PEFT on complex, knowledge-intensive tasks is limited due to the constrained model capacity, which originates from the limited number of additional trainable parameters. To overcome this limitation, we introduce a novel mechanism that…

    Submitted 7 June, 2024; originally announced June 2024.

    Comments: ACL 24

  4. arXiv:2405.20516  [pdf, other]

    cs.LG physics.geo-ph

    WaveCastNet: An AI-enabled Wavefield Forecasting Framework for Earthquake Early Warning

    Authors: Dongwei Lyu, Rie Nakata, Pu Ren, Michael W. Mahoney, Arben Pitarka, Nori Nakata, N. Benjamin Erichson

    Abstract: Large earthquakes can be destructive and quickly wreak havoc on a landscape. To mitigate immediate threats, early warning systems have been developed to alert residents, emergency responders, and critical infrastructure operators seconds to a minute before seismic waves arrive. These warnings provide time to take precautions and prevent damage. The success of these systems relies on fast, accurate…

    Submitted 30 May, 2024; originally announced May 2024.

  5. arXiv:2405.13034  [pdf, other]

    cs.CL cs.AI cs.HC

    Autonomous Workflow for Multimodal Fine-Grained Training Assistants Towards Mixed Reality

    Authors: Jiahuan Pei, Irene Viola, Haochen Huang, Junxiao Wang, Moonisa Ahsan, Fanghua Ye, Jiang Yiming, Yao Sai, Di Wang, Zhumin Chen, Pengjie Ren, Pablo Cesar

    Abstract: Autonomous artificial intelligence (AI) agents have emerged as promising protocols for automatically understanding the language-based environment, particularly with the exponential development of large language models (LLMs). However, a fine-grained, comprehensive understanding of multimodal environments remains under-explored. This work designs an autonomous workflow tailored for integrating AI a…

    Submitted 5 June, 2024; v1 submitted 16 May, 2024; originally announced May 2024.

    Comments: Accepted by ACL 2024

  6. arXiv:2405.01326  [pdf, other]

    cs.CV

    Multi-modal Learnable Queries for Image Aesthetics Assessment

    Authors: Zhiwei Xiong, Yunfan Zhang, Zhiqi Shen, Peiran Ren, Han Yu

    Abstract: Image aesthetics assessment (IAA) is attracting wide interest with the prevalence of social media. The problem is challenging due to its subjective and ambiguous nature. Instead of directly extracting aesthetic features solely from the image, user comments associated with an image could potentially provide complementary knowledge that is useful for IAA. With existing large-scale pre-trained models…

    Submitted 2 May, 2024; originally announced May 2024.

    Comments: Accepted by ICME2024

  7. arXiv:2404.17288  [pdf, other]

    cs.IR

    ExcluIR: Exclusionary Neural Information Retrieval

    Authors: Wenhao Zhang, Mengqi Zhang, Shiguang Wu, Jiahuan Pei, Zhaochun Ren, Maarten de Rijke, Zhumin Chen, Pengjie Ren

    Abstract: Exclusion is an important and universal linguistic skill that humans use to express what they do not want. However, in the information retrieval community, there is little research on exclusionary retrieval, where users express what they do not want in their queries. In this work, we investigate the scenario of exclusionary retrieval in document retrieval for the first time. We present ExcluIR, a set…

    Submitted 26 April, 2024; originally announced April 2024.

  8. arXiv:2404.11034  [pdf]

    cs.CY

    Exploring the Path of Transformation and Development for Study Abroad Consultancy Firms in China

    Authors: Ping Ren, Zhiqiang Zhao, Qian Yang

    Abstract: In recent years, with the changing landscape of international education and the growing demand from Chinese students, study abroad consultancy firms in China need to adopt transformational development strategies to address challenges and maintain competitiveness. This study investigated the relationships between key performance indicators and several factors through a questionnaire survey of 158 c…

    Submitted 16 April, 2024; originally announced April 2024.

  9. arXiv:2404.11029  [pdf]

    cs.CY

    Student self-management, academic achievement: Exploring the mediating role of self-efficacy and the moderating influence of gender insights from a survey conducted in 3 universities in America

    Authors: Zhiqiang Zhao, Ping Ren, Qian Yang

    Abstract: Excellent students are not only those who master more effective and efficient learning techniques to acquire and apply information. Even in the absence of correct learning, they are able to self-motivate, evaluate, and adjust their behavior. This study aims to explore the relationship between student self-management and academic achievement, with a focus on investigating the mediating role of self…

    Submitted 16 April, 2024; originally announced April 2024.

    Journal ref: Journal of Integrated Social Sciences and Humanities (2023): 1-12

  10. arXiv:2404.10393  [pdf, other]

    cs.LG cs.AI

    Offline Trajectory Generalization for Offline Reinforcement Learning

    Authors: Ziqi Zhao, Zhaochun Ren, Liu Yang, Fajie Yuan, Pengjie Ren, Zhumin Chen, Jun Ma, Xin Xin

    Abstract: Offline reinforcement learning (RL) aims to learn policies from static datasets of previously collected trajectories. Existing methods for offline RL either constrain the learned policy to the support of offline data or utilize model-based virtual environments to generate simulated rollouts. However, these methods suffer from (i) poor generalization to unseen states; and (ii) trivial improvement f…

    Submitted 16 April, 2024; originally announced April 2024.

  11. arXiv:2404.06451  [pdf, other]

    cs.CV

    SmartControl: Enhancing ControlNet for Handling Rough Visual Conditions

    Authors: Xiaoyu Liu, Yuxiang Wei, Ming Liu, Xianhui Lin, Peiran Ren, Xuansong Xie, Wangmeng Zuo

    Abstract: Human visual imagination usually begins with analogies or rough sketches. For example, given an image with a girl playing guitar before a building, one may analogously imagine how it seems like if Iron Man playing guitar before Pyramid in Egypt. Nonetheless, visual condition may not be precisely aligned with the imaginary result indicated by text prompt, and existing layout-controllable text-to-im…

    Submitted 9 April, 2024; originally announced April 2024.

  12. arXiv:2404.00684  [pdf, other]

    cs.IR cs.AI

    Generative Retrieval as Multi-Vector Dense Retrieval

    Authors: Shiguang Wu, Wenda Wei, Mengqi Zhang, Zhumin Chen, Jun Ma, Zhaochun Ren, Maarten de Rijke, Pengjie Ren

    Abstract: Generative retrieval generates identifiers of relevant documents in an end-to-end manner using a sequence-to-sequence architecture for a given query. The relation between generative retrieval and other retrieval methods, especially those based on matching within dense retrieval models, is not yet fully comprehended. Prior work has demonstrated that generative retrieval with atomic identifiers is e…

    Submitted 31 March, 2024; originally announced April 2024.

    Comments: 12 pages, 5 figures, 8 tables, accepted at SIGIR 2024

  13. arXiv:2403.18480  [pdf, other]

    cs.IR

    Enhanced Generative Recommendation via Content and Collaboration Integration

    Authors: Yidan Wang, Zhaochun Ren, Weiwei Sun, Jiyuan Yang, Zhixiang Liang, Xin Chen, Ruobing Xie, Su Yan, Xu Zhang, Pengjie Ren, Zhumin Chen, Xin Xin

    Abstract: Generative recommendation has emerged as a promising paradigm aimed at augmenting recommender systems with recent advancements in generative artificial intelligence. This task has been formulated as a sequence-to-sequence generation process, wherein the input sequence encompasses data pertaining to the user's previously interacted items, and the output sequence denotes the generative identifier fo…

    Submitted 27 March, 2024; originally announced March 2024.

  14. arXiv:2403.17706  [pdf, other]

    cs.CL cs.AI

    Enhanced Short Text Modeling: Leveraging Large Language Models for Topic Refinement

    Authors: Shuyu Chang, Rui Wang, Peng Ren, Haiping Huang

    Abstract: Crafting effective topic models for brief texts, like tweets and news headlines, is essential for capturing the swift shifts in social dynamics. Traditional topic models, however, often fall short in accurately representing the semantic intricacies of short texts due to their brevity and lack of contextual data. In our study, we harness the advanced capabilities of Large Language Models (LLMs) to…

    Submitted 26 March, 2024; originally announced March 2024.

    Comments: 6 pages, 4 figures

  15. arXiv:2403.16371  [pdf, other]

    cs.IR

    Uncovering Selective State Space Model's Capabilities in Lifelong Sequential Recommendation

    Authors: Jiyuan Yang, Yuanzi Li, Jingyu Zhao, Hanbing Wang, Muyang Ma, Jun Ma, Zhaochun Ren, Mengqi Zhang, Xin Xin, Zhumin Chen, Pengjie Ren

    Abstract: Sequential Recommenders have been widely applied in various online services, aiming to model users' dynamic interests from their sequential interactions. With users increasingly engaging with online platforms, vast amounts of lifelong user behavioral sequences have been generated. However, existing sequential recommender models often struggle to handle such lifelong sequences. The primary challeng…

    Submitted 24 March, 2024; originally announced March 2024.

  16. arXiv:2403.15245  [pdf, other]

    cs.CV cs.AI cs.LG

    Reasoning-Enhanced Object-Centric Learning for Videos

    Authors: Jian Li, Pu Ren, Yang Liu, Hao Sun

    Abstract: Object-centric learning aims to break down complex visual scenes into more manageable object representations, enhancing the understanding and reasoning abilities of machine learning systems toward the physical world. Recently, slot-based video models have demonstrated remarkable proficiency in segmenting and tracking objects, but they overlook the importance of the effective reasoning module. In t…

    Submitted 22 March, 2024; originally announced March 2024.

  17. arXiv:2403.05438  [pdf, other]

    cs.CV

    VideoElevator: Elevating Video Generation Quality with Versatile Text-to-Image Diffusion Models

    Authors: Yabo Zhang, Yuxiang Wei, Xianhui Lin, Zheng Hui, Peiran Ren, Xuansong Xie, Xiangyang Ji, Wangmeng Zuo

    Abstract: Text-to-image diffusion models (T2I) have demonstrated unprecedented capabilities in creating realistic and aesthetic images. On the contrary, text-to-video diffusion models (T2V) still lag far behind in frame quality and text alignment, owing to insufficient quality and quantity of training videos. In this paper, we introduce VideoElevator, a training-free and plug-and-play method, which elevates…

    Submitted 8 March, 2024; originally announced March 2024.

    Comments: Project page: https://videoelevator.github.io Code: https://github.com/YBYBZhang/VideoElevator

  18. arXiv:2403.03424  [pdf, other]

    cs.IR

    Generative News Recommendation

    Authors: Shen Gao, Jiabao Fang, Quan Tu, Zhitao Yao, Zhumin Chen, Pengjie Ren, Zhaochun Ren

    Abstract: Most existing news recommendation methods tackle this task by conducting semantic matching between candidate news and user representation produced by historical clicked news. However, they overlook the high-level connections among different news articles and also ignore the profound relationship between these news articles and users. And the definition of these methods dictates that they can only…

    Submitted 5 March, 2024; originally announced March 2024.

    Comments: Accepted by WWW 2024

  19. arXiv:2403.03031  [pdf, other]

    cs.CL

    Learning to Use Tools via Cooperative and Interactive Agents

    Authors: Zhengliang Shi, Shen Gao, Xiuyi Chen, Yue Feng, Lingyong Yan, Haibo Shi, Dawei Yin, Pengjie Ren, Suzan Verberne, Zhaochun Ren

    Abstract: Tool learning empowers large language models (LLMs) as agents to use external tools and extend their utility. Existing methods employ one single LLM-based agent to iteratively select and execute tools, thereafter incorporating execution results into the next action prediction. Despite their progress, these methods suffer from performance degradation when addressing practical tasks due to: (1) the…

    Submitted 22 June, 2024; v1 submitted 5 March, 2024; originally announced March 2024.

    Comments: working in process

  20. arXiv:2402.17992  [pdf]

    physics.app-ph cs.LG

    Physics-Informed Machine Learning for Seismic Response Prediction OF Nonlinear Steel Moment Resisting Frame Structures

    Authors: R. Bailey Bond, Pu Ren, Jerome F. Hajjar, Hao Sun

    Abstract: There is growing interest in using machine learning (ML) methods for structural metamodeling due to the substantial computational cost of traditional simulations. Purely data-driven strategies often face limitations in model robustness, interpretability, and dependency on extensive data. To address these challenges, this paper introduces a novel physics-informed machine learning (PiML) method that…

    Submitted 29 April, 2024; v1 submitted 27 February, 2024; originally announced February 2024.

    Comments: 34 pages, 12 figures

  21. arXiv:2402.17263  [pdf, other]

    cs.CL

    MELoRA: Mini-Ensemble Low-Rank Adapters for Parameter-Efficient Fine-Tuning

    Authors: Pengjie Ren, Chengshun Shi, Shiguang Wu, Mengqi Zhang, Zhaochun Ren, Maarten de Rijke, Zhumin Chen, Jiahuan Pei

    Abstract: Parameter-efficient fine-tuning (PEFT) is a popular method for tailoring pre-trained large language models (LLMs), especially as the models' scale and the diversity of tasks increase. Low-rank adaptation (LoRA) is based on the idea that the adaptation process is intrinsically low-dimensional, i.e., significant model changes can be represented with relatively few parameters. However, decreasing the…

    Submitted 24 June, 2024; v1 submitted 27 February, 2024; originally announced February 2024.

    Comments: ACL2024

    MSC Class: 68T50 ACM Class: I.2.7

  22. arXiv:2402.15734  [pdf, other]

    cs.LG stat.ML

    Data-Efficient Operator Learning via Unsupervised Pretraining and In-Context Learning

    Authors: Wuyang Chen, Jialin Song, Pu Ren, Shashank Subramanian, Dmitriy Morozov, Michael W. Mahoney

    Abstract: Recent years have witnessed the promise of coupling machine learning methods and physical domain-specific insights for solving scientific problems based on partial differential equations (PDEs). However, being data-intensive, these methods still require a large amount of PDE data. This reintroduces the need for expensive numerical PDE solutions, partially undermining the original goal of avoiding t…

    Submitted 13 June, 2024; v1 submitted 24 February, 2024; originally announced February 2024.

  23. arXiv:2402.13593  [pdf, other]

    cs.CL

    Knowledge Graph Enhanced Large Language Model Editing

    Authors: Mengqi Zhang, Xiaotian Ye, Qiang Liu, Pengjie Ren, Shu Wu, Zhumin Chen

    Abstract: Large language models (LLMs) are pivotal in advancing natural language processing (NLP) tasks, yet their efficacy is hampered by inaccuracies and outdated knowledge. Model editing emerges as a promising solution to address these challenges. However, existing editing methods struggle to track and incorporate changes in knowledge associated with edits, which limits the generalization ability of post…

    Submitted 21 February, 2024; originally announced February 2024.

  24. arXiv:2402.11176  [pdf, other]

    cs.CL cs.AI

    KnowTuning: Knowledge-aware Fine-tuning for Large Language Models

    Authors: Yougang Lyu, Lingyong Yan, Shuaiqiang Wang, Haibo Shi, Dawei Yin, Pengjie Ren, Zhumin Chen, Maarten de Rijke, Zhaochun Ren

    Abstract: Despite their success at many natural language processing (NLP) tasks, large language models still struggle to effectively leverage knowledge for knowledge-intensive tasks, manifesting limitations such as generating incomplete, non-factual, or illogical answers. These limitations stem from inadequate knowledge awareness of LLMs during vanilla fine-tuning. To address these problems, we propose a kn…

    Submitted 17 April, 2024; v1 submitted 16 February, 2024; originally announced February 2024.

  25. arXiv:2402.01135  [pdf, other]

    cs.IR cs.CL

    A Multi-Agent Conversational Recommender System

    Authors: Jiabao Fang, Shen Gao, Pengjie Ren, Xiuying Chen, Suzan Verberne, Zhaochun Ren

    Abstract: Due to strong capabilities in conducting fluent, multi-turn conversations with users, Large Language Models (LLMs) have the potential to further improve the performance of Conversational Recommender System (CRS). Unlike the aimless chit-chat that LLM excels at, CRS has a clear target. So it is imperative to control the dialogue flow in the LLM to successfully recommend appropriate items to the use…

    Submitted 1 February, 2024; originally announced February 2024.

  26. arXiv:2401.08021  [pdf, other]

    cs.HC

    All the Way There and Back: Inertial-Based, Phone-in-Pocket Indoor Wayfinding and Backtracking Apps for Blind Travelers

    Authors: Chia Hsuan Tsai, Fatemeh Elyasi, Peng Ren, Roberto Manduchi

    Abstract: We introduce two iOS apps that have been designed to support wayfinding and backtracking for blind travelers navigating in indoor building environments. Wayfinding involves determining and following a route through the building's corridors to reach a destination, and assumes that the app has access to the floor plan of the building. Backtracking one's route, on the other hand, requires no map know…

    Submitted 15 January, 2024; originally announced January 2024.

    Comments: Chia Hsuan Tsai, Fatemeh Elyasi and Peng Ren contributed equally to this research

  27. arXiv:2401.04423  [pdf, other]

    cs.IR

    Privacy-Preserving Sequential Recommendation with Collaborative Confusion

    Authors: Wei Wang, Yujie Lin, Pengjie Ren, Zhumin Chen, Tsunenori Mine, Jianli Zhao, Qiang Zhao, Moyan Zhang, Xianye Ben, Yujun Li

    Abstract: Sequential recommendation has attracted a lot of attention from both academia and industry, however the privacy risks associated to gathering and transferring users' personal interaction data are often underestimated or ignored. Existing privacy-preserving studies are mainly applied to traditional collaborative filtering or matrix factorization rather than sequential recommendation. Moreover, thes…

    Submitted 9 January, 2024; originally announced January 2024.

  28. arXiv:2401.01218  [pdf, other]

    cs.CL cs.AI cs.LG

    Self-Supervised Position Debiasing for Large Language Models

    Authors: Zhongkun Liu, Zheng Chen, Mengqi Zhang, Zhaochun Ren, Pengjie Ren, Zhumin Chen

    Abstract: Fine-tuning has been demonstrated to be an effective method to improve the domain performance of large language models (LLMs). However, LLMs might fit the dataset bias and shortcuts for prediction, leading to poor generation performance. Previous works have proven that LLMs are prone to exhibit position bias, i.e., leveraging information positioned at the beginning or end, or specific positional c…

    Submitted 29 June, 2024; v1 submitted 2 January, 2024; originally announced January 2024.

    Comments: Accepted by ACL 2024 findings, this is the camera-ready version; 21 pages, 22 figures

    ACM Class: I.2.7

  29. On the Effectiveness of Unlearning in Session-Based Recommendation

    Authors: Xin Xin, Liu Yang, Ziqi Zhao, Pengjie Ren, Zhumin Chen, Jun Ma, Zhaochun Ren

    Abstract: Session-based recommendation predicts users' future interests from previous interactions in a session. Despite the memorizing of historical samples, the request of unlearning, i.e., to remove the effect of certain training samples, also occurs for reasons such as user privacy or model fidelity. However, existing studies on unlearning are not tailored for the session-based recommendation. On the on…

    Submitted 22 December, 2023; originally announced December 2023.

    Comments: 10 pages, 5 figures

  30. Debiasing Sequential Recommenders through Distributionally Robust Optimization over System Exposure

    Authors: Jiyuan Yang, Yue Ding, Yidan Wang, Pengjie Ren, Zhumin Chen, Fei Cai, Jun Ma, Rui Zhang, Zhaochun Ren, Xin Xin

    Abstract: Sequential recommendation (SR) models are typically trained on user-item interactions which are affected by the system exposure bias, leading to the user preference learned from the biased SR model not being fully consistent with the true user preference. Exposure bias refers to the fact that user interactions are dependent upon the partial items exposed to the user. Existing debiasing methods do…

    Submitted 12 December, 2023; originally announced December 2023.

    Comments: Accept by WSDM 2024

  31. arXiv:2312.05762  [pdf, other]

    cs.CL

    Multi-Defendant Legal Judgment Prediction via Hierarchical Reasoning

    Authors: Yougang Lyu, Jitai Hao, Zihan Wang, Kai Zhao, Shen Gao, Pengjie Ren, Zhumin Chen, Fang Wang, Zhaochun Ren

    Abstract: Multiple defendants in a criminal fact description generally exhibit complex interactions, and cannot be well handled by existing Legal Judgment Prediction (LJP) methods which focus on predicting judgment results (e.g., law articles, charges, and terms of penalty) for single-defendant cases. To address this problem, we propose the task of multi-defendant LJP, which aims to automatically predict th…

    Submitted 9 December, 2023; originally announced December 2023.

    Comments: EMNLP2023 Findings

  32. arXiv:2312.05107  [pdf, other]

    cs.CV

    DreaMoving: A Human Video Generation Framework based on Diffusion Models

    Authors: Mengyang Feng, Jinlin Liu, Kai Yu, Yuan Yao, Zheng Hui, Xiefan Guo, Xianhui Lin, Haolan Xue, Chen Shi, Xiaowen Li, Aojie Li, Xiaoyang Kang, Biwen Lei, Miaomiao Cui, Peiran Ren, Xuansong Xie

    Abstract: In this paper, we present DreaMoving, a diffusion-based controllable video generation framework to produce high-quality customized human videos. Specifically, given target identity and posture sequences, DreaMoving can generate a video of the target identity moving or dancing anywhere driven by the posture sequences. To this end, we propose a Video ControlNet for motion-controlling and a Content G…

    Submitted 11 December, 2023; v1 submitted 8 December, 2023; originally announced December 2023.

    Comments: 5 pages, 5 figures, Tech. Report

  33. arXiv:2311.02446  [pdf, other]

    cs.IR

    Learning Robust Sequential Recommenders through Confident Soft Labels

    Authors: Shiguang Wu, Xin Xin, Pengjie Ren, Zhumin Chen, Jun Ma, Maarten de Rijke, Zhaochun Ren

    Abstract: Sequential recommenders that are trained on implicit feedback are usually learned as a multi-class classification task through softmax-based loss functions on one-hot class labels. However, one-hot training labels are sparse and may lead to biased training and sub-optimal performance. Dense, soft labels have been shown to help improve recommendation performance. But how to generate high-quality an…

    Submitted 4 November, 2023; originally announced November 2023.

  34. arXiv:2311.01555  [pdf, other]

    cs.IR cs.CL

    Instruction Distillation Makes Large Language Models Efficient Zero-shot Rankers

    Authors: Weiwei Sun, Zheng Chen, Xinyu Ma, Lingyong Yan, Shuaiqiang Wang, Pengjie Ren, Zhumin Chen, Dawei Yin, Zhaochun Ren

    Abstract: Recent studies have demonstrated the great potential of Large Language Models (LLMs) serving as zero-shot relevance rankers. The typical approach involves making comparisons between pairs or lists of documents. Although effective, these listwise and pairwise methods are not efficient and also heavily rely on intricate prompt engineering. To tackle this problem, we introduce a novel instruction dis…

    Submitted 2 November, 2023; originally announced November 2023.

  35. arXiv:2310.09846  [pdf, other]

    cs.IR

    Generalizing Few-Shot Named Entity Recognizers to Unseen Domains with Type-Related Features

    Authors: Zihan Wang, Ziqi Zhao, Zhumin Chen, Pengjie Ren, Maarten de Rijke, Zhaochun Ren

    Abstract: Few-shot named entity recognition (NER) has shown remarkable progress in identifying entities in low-resource domains. However, few-shot NER methods still struggle with out-of-domain (OOD) examples due to their reliance on manual labeling for the target domain. To address this limitation, recent studies enable generalization to an unseen target domain with only a few labeled examples using data au…

    Submitted 15 October, 2023; originally announced October 2023.

    Comments: Accepted at EMNLP findings

  36. arXiv:2310.02619  [pdf, other]

    cs.LG

    Generative Modeling of Regular and Irregular Time Series Data via Koopman VAEs

    Authors: Ilan Naiman, N. Benjamin Erichson, Pu Ren, Michael W. Mahoney, Omri Azencot

    Abstract: Generating realistic time series data is important for many engineering and scientific applications. Existing work tackles this problem using generative adversarial networks (GANs). However, GANs are unstable during training, and they can suffer from mode collapse. While variational autoencoders (VAEs) are known to be more robust to these issues, they are (surprisingly) less considered for tim…

    Submitted 13 May, 2024; v1 submitted 4 October, 2023; originally announced October 2023.

    Comments: Accepted to The Twelfth International Conference on Learning Representations, ICLR 2024

  37. arXiv:2309.08156  [pdf, other]

    cs.CL

    RADE: Reference-Assisted Dialogue Evaluation for Open-Domain Dialogue

    Authors: Zhengliang Shi, Weiwei Sun, Shuo Zhang, Zhen Zhang, Pengjie Ren, Zhaochun Ren

    Abstract: Evaluating open-domain dialogue systems is challenging for reasons such as the one-to-many problem, i.e., many appropriate responses other than just the golden response. As of now, automatic evaluation methods need better consistency with humans, while reliable human evaluation can be time- and cost-intensive. To this end, we propose the Reference-Assisted Dialogue Evaluation (RADE) approach under…

    Submitted 17 September, 2023; v1 submitted 15 September, 2023; originally announced September 2023.

    Comments: 19 pages, Accepted by ACL2023 main conference

  38. arXiv:2309.02861  [pdf, other]

    cs.CV

    Image Aesthetics Assessment via Learnable Queries

    Authors: Zhiwei Xiong, Yunfan Zhang, Zhiqi Shen, Peiran Ren, Han Yu

    Abstract: Image aesthetics assessment (IAA) aims to estimate the aesthetics of images. Depending on the content of an image, diverse criteria need to be selected to assess its aesthetics. Existing works utilize pre-trained vision backbones based on content knowledge to learn image aesthetics. However, training those backbones is time-consuming and suffers from attention dispersion. Inspired by learnable que…

    Submitted 6 September, 2023; originally announced September 2023.

    Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  39. arXiv:2308.16432  [pdf, other]

    cs.CR

    Efficient Additions and Montgomery Reductions of Large Integers for SIMD

    Authors: Pengchang Ren, Reiji Suda, Vorapong Suppakitpaisarn

    Abstract: This paper presents efficient algorithms, designed to leverage SIMD for performing Montgomery reductions and additions on integers larger than 512 bits. The existing algorithms encounter inefficiencies when parallelized using SIMD due to extensive dependencies in both operations, particularly noticeable in costly operations like ARM's SVE. To mitigate this problem, a novel addition algorithm is in…

    Submitted 30 August, 2023; originally announced August 2023.

  40. arXiv:2308.14469  [pdf, other]

    cs.CV

    Pixel-Aware Stable Diffusion for Realistic Image Super-resolution and Personalized Stylization

    Authors: Tao Yang, Rongyuan Wu, Peiran Ren, Xuansong Xie, Lei Zhang

    Abstract: Diffusion models have demonstrated impressive performance in various image generation, editing, enhancement and translation tasks. In particular, the pre-trained text-to-image stable diffusion models provide a potential solution to the challenging realistic image super-resolution (Real-ISR) and image stylization problems with their strong generative priors. However, the existing methods along this…

    Submitted 9 July, 2024; v1 submitted 28 August, 2023; originally announced August 2023.

    Journal ref: The European Conference on Computer Vision (ECCV) 2024

  41. arXiv:2308.14034  [pdf, other]

    cs.AI cs.CL

    Confucius: Iterative Tool Learning from Introspection Feedback by Easy-to-Difficult Curriculum

    Authors: Shen Gao, Zhengliang Shi, Minghang Zhu, Bowen Fang, Xin Xin, Pengjie Ren, Zhumin Chen, Jun Ma, Zhaochun Ren

    Abstract: Augmenting large language models (LLMs) with external tools has emerged as a promising approach to extending the capability of LLMs. Although some works employ open-source LLMs for the tool learning task, most of them are trained in a controlled environment in which LLMs only learn to execute the human-provided tools. However, selecting proper tools from the large toolset is also a crucial ability…

    Submitted 21 December, 2023; v1 submitted 27 August, 2023; originally announced August 2023.

    Comments: Accepted by AAAI 2024

  42. FastLLVE: Real-Time Low-Light Video Enhancement with Intensity-Aware Lookup Table

    Authors: Wenhao Li, Guangyang Wu, Wenyi Wang, Peiran Ren, Xiaohong Liu

    Abstract: Low-Light Video Enhancement (LLVE) has received considerable attention in recent years. One of the critical requirements of LLVE is inter-frame brightness consistency, which is essential for maintaining the temporal coherence of the enhanced video. However, most existing single-image-based methods fail to address this issue, resulting in flickering effect that degrades the overall quality after en…

    Submitted 13 August, 2023; originally announced August 2023.

    Comments: 11 pages, 9 figures, and 6 tables. Accepted by ACMMM 2023

    ACM Class: I.4.3

  43. arXiv:2308.04829  [pdf, other]

    cs.CV

    MixReorg: Cross-Modal Mixed Patch Reorganization is a Good Mask Learner for Open-World Semantic Segmentation

    Authors: Kaixin Cai, Pengzhen Ren, Yi Zhu, Hang Xu, Jianzhuang Liu, Changlin Li, Guangrun Wang, Xiaodan Liang

    Abstract: Recently, semantic segmentation models trained with image-level text supervision have shown promising results in challenging open-world scenarios. However, these models still face difficulties in learning fine-grained semantic alignment at the pixel level and predicting accurate object masks. To address this issue, we propose MixReorg, a novel and straightforward pre-training paradigm for semantic…

    Submitted 12 March, 2024; v1 submitted 9 August, 2023; originally announced August 2023.

  44. arXiv:2307.09751  [pdf, other]

    cs.IR cs.AI

    Information Retrieval Meets Large Language Models: A Strategic Report from Chinese IR Community

    Authors: Qingyao Ai, Ting Bai, Zhao Cao, Yi Chang, Jiawei Chen, Zhumin Chen, Zhiyong Cheng, Shoubin Dong, Zhicheng Dou, Fuli Feng, Shen Gao, Jiafeng Guo, Xiangnan He, Yanyan Lan, Chenliang Li, Yiqun Liu, Ziyu Lyu, Weizhi Ma, Jun Ma, Zhaochun Ren, Pengjie Ren, Zhiqiang Wang, Mingwen Wang, Ji-Rong Wen, Le Wu, et al. (8 additional authors not shown)

    Abstract: The research field of Information Retrieval (IR) has evolved significantly, expanding beyond traditional search to meet diverse user information needs. Recently, Large Language Models (LLMs) have demonstrated exceptional capabilities in text understanding, generation, and knowledge inference, opening up exciting avenues for IR research. LLMs not only facilitate generative retrieval but also offer…

    Submitted 26 July, 2023; v1 submitted 19 July, 2023; originally announced July 2023.

    Comments: 17 pages

  45. arXiv:2307.06703  [pdf, other]

    cs.CL

    Intent-calibrated Self-training for Answer Selection in Open-domain Dialogues

    Authors: Wentao Deng, Jiahuan Pei, Zhaochun Ren, Zhumin Chen, Pengjie Ren

    Abstract: Answer selection in open-domain dialogues aims to select an accurate answer from candidates. Recent success of answer selection models hinges on training with large amounts of labeled data. However, collecting large-scale labeled data is labor-intensive and time-consuming. In this paper, we introduce the predicted intent labels to calibrate answer labels in a self-training paradigm. Specifically,…

    Submitted 13 July, 2023; originally announced July 2023.

    Comments: This arXiv version is a pre-MIT Press publication version; the paper has been accepted by TACL. 16 pages, 3 figures, 4 tables

  46. arXiv:2307.03897  [pdf, other]

    cs.CL

    Answering Ambiguous Questions via Iterative Prompting

    Authors: Weiwei Sun, Hengyi Cai, Hongshen Chen, Pengjie Ren, Zhumin Chen, Maarten de Rijke, Zhaochun Ren

    Abstract: In open-domain question answering, due to the ambiguity of questions, multiple plausible answers may exist. To provide feasible answers to an ambiguous question, one approach is to directly predict all valid answers, but this can struggle with balancing relevance and diversity. An alternative is to gather candidate answers and aggregate them, but this method can be computationally costly and may n…

    Submitted 8 July, 2023; originally announced July 2023.

    Comments: To be published in ACL 2023

  47. arXiv:2306.16275  [pdf]

    cs.CL cs.AI cs.LG

    Leveraging GPT-4 for Food Effect Summarization to Enhance Product-Specific Guidance Development via Iterative Prompting

    Authors: Yiwen Shi, Ping Ren, Jing Wang, Biao Han, Taha ValizadehAslani, Felix Agbavor, Yi Zhang, Meng Hu, Liang Zhao, Hualou Liang

    Abstract: Food effect summarization from New Drug Application (NDA) is an essential component of product-specific guidance (PSG) development and assessment. However, manual summarization of food effect from extensive drug application review documents is time-consuming, which arouses a need to develop automated methods. Recent advances in large language models (LLMs) such as ChatGPT and GPT-4, have demonstra…

    Submitted 28 June, 2023; originally announced June 2023.

    Comments: 22 pages, 6 figures

  48. arXiv:2306.14070  [pdf, other]

    cs.CV eess.IV physics.comp-ph

    SuperBench: A Super-Resolution Benchmark Dataset for Scientific Machine Learning

    Authors: Pu Ren, N. Benjamin Erichson, Shashank Subramanian, Omer San, Zarija Lukic, Michael W. Mahoney

    Abstract: Super-Resolution (SR) techniques aim to enhance data resolution, enabling the retrieval of finer details, and improving the overall quality and fidelity of the data representation. There is growing interest in applying SR methods to complex spatiotemporal systems within the Scientific Machine Learning (SciML) community, with the hope of accelerating numerical simulations and/or improving forecasts…

    Submitted 24 June, 2023; originally announced June 2023.

  49. arXiv:2306.11335  [pdf, other]

    cs.RO cs.AI cs.CV cs.LG

    Surfer: Progressive Reasoning with World Models for Robotic Manipulation

    Authors: Pengzhen Ren, Kaidong Zhang, Hetao Zheng, Zixuan Li, Yuhang Wen, Fengda Zhu, Mas Ma, Xiaodan Liang

    Abstract: Considering how to make the model accurately understand and follow natural language instructions and perform actions consistent with world knowledge is a key challenge in robot manipulation. This mainly includes human fuzzy instruction reasoning and the following of physical knowledge. Therefore, the embodied intelligence agent must have the ability to model world knowledge from training data. How…

    Submitted 20 March, 2024; v1 submitted 20 June, 2023; originally announced June 2023.

  50. Towards Explainable Conversational Recommender Systems

    Authors: Shuyu Guo, Shuo Zhang, Weiwei Sun, Pengjie Ren, Zhumin Chen, Zhaochun Ren

    Abstract: Explanations in conventional recommender systems have demonstrated benefits in helping the user understand the rationality of the recommendations and improving the system's efficiency, transparency, and trustworthiness. In the conversational environment, multiple contextualized explanations need to be generated, which poses further challenges for explanations. To better measure explainability in c…

    Submitted 27 May, 2023; originally announced May 2023.