Skip to main content

Showing 1–16 of 16 results for author: Tu, Q

  1. arXiv:2405.10235  [pdf

    cs.DB physics.data-an

    Novel Data Models for Inter-operable LCA Frameworks

    Authors: Kourosh Malek, Max Dreger, Zirui Tang, Qingshi Tu

    Abstract: Life cycle assessment (LCA) plays a critical role in assessing the environmental impacts of a product, technology, or service throughout its entire life cycle. Nonetheless, many existing LCA tools and methods lack adequate metadata management, which can hinder their further development and wide adoption. In the example of LCA for clean energy technologies, metadata helps monitor data and the envir… ▽ More

    Submitted 16 May, 2024; originally announced May 2024.

  2. arXiv:2404.05569  [pdf, other

    cs.AI cs.CL cs.MA

    360$^\circ$REA: Towards A Reusable Experience Accumulation with 360° Assessment for Multi-Agent System

    Authors: Shen Gao, Hao Li, Chengrui Huang, Quan Tu, Zhiliang Tian, Minlie Huang, Shuo Shang

    Abstract: Large language model agents have demonstrated remarkable advancements across various complex tasks. Recent works focus on optimizing the agent team or employing self-reflection to iteratively solve complex tasks. Since these agents are all based on the same LLM, only conducting self-evaluation or removing underperforming agents does not substantively enhance the capability of the agents. We argue… ▽ More

    Submitted 26 June, 2024; v1 submitted 8 April, 2024; originally announced April 2024.

  3. arXiv:2403.11439  [pdf, other

    cs.CL

    StyleChat: Learning Recitation-Augmented Memory in LLMs for Stylized Dialogue Generation

    Authors: Jinpeng Li, Zekai Zhang, Quan Tu, Xin Cheng, Dongyan Zhao, Rui Yan

    Abstract: Large Language Models (LLMs) demonstrate superior performance in generative scenarios and have attracted widespread attention. Among them, stylized dialogue generation is essential in the context of LLMs for building intelligent and engaging dialogue agent. However the ability of LLMs is data-driven and limited by data bias, leading to poor performance on specific tasks. In particular, stylized di… ▽ More

    Submitted 17 March, 2024; originally announced March 2024.

  4. arXiv:2403.08312  [pdf, other

    cs.CL cs.AI

    StreamingDialogue: Prolonged Dialogue Learning via Long Context Compression with Minimal Losses

    Authors: Jia-Nan Li, Quan Tu, Cunli Mao, Zhengtao Yu, Ji-Rong Wen, Rui Yan

    Abstract: Standard Large Language Models (LLMs) struggle with handling dialogues with long contexts due to efficiency and consistency issues. According to our observation, dialogue contexts are highly structured, and the special token of \textit{End-of-Utterance} (EoU) in dialogues has the potential to aggregate information. We refer to the EoU tokens as ``conversational attention sinks'' (conv-attn sinks).… ▽ More

    Submitted 13 March, 2024; originally announced March 2024.

  5. arXiv:2403.03424  [pdf, other

    cs.IR

    Generative News Recommendation

    Authors: Shen Gao, Jiabao Fang, Quan Tu, Zhitao Yao, Zhumin Chen, Pengjie Ren, Zhaochun Ren

    Abstract: Most existing news recommendation methods tackle this task by conducting semantic matching between candidate news and user representation produced by historical clicked news. However, they overlook the high-level connections among different news articles and also ignore the profound relationship between these news articles and users. And the definition of these methods dictates that they can only… ▽ More

    Submitted 5 March, 2024; originally announced March 2024.

    Comments: Accepted by WWW 2024

  6. arXiv:2403.03102  [pdf, other

    cs.CL cs.AI

    "In Dialogues We Learn": Towards Personalized Dialogue Without Pre-defined Profiles through In-Dialogue Learning

    Authors: Chuanqi Cheng, Quan Tu, Wei Wu, Shuo Shang, Cunli Mao, Zhengtao Yu, Rui Yan

    Abstract: Personalized dialogue systems have gained significant attention in recent years for their ability to generate responses in alignment with different personas. However, most existing approaches rely on pre-defined personal profiles, which are not only time-consuming and labor-intensive to create but also lack flexibility. We propose In-Dialogue Learning (IDL), a fine-tuning framework that enhances t… ▽ More

    Submitted 12 March, 2024; v1 submitted 5 March, 2024; originally announced March 2024.

  7. arXiv:2401.01275  [pdf, other

    cs.CL

    CharacterEval: A Chinese Benchmark for Role-Playing Conversational Agent Evaluation

    Authors: Quan Tu, Shilong Fan, Zihang Tian, Rui Yan

    Abstract: Recently, the advent of large language models (LLMs) has revolutionized generative agents. Among them, Role-Playing Conversational Agents (RPCAs) attract considerable attention due to their ability to emotionally engage users. However, the absence of a comprehensive benchmark impedes progress in this field. To bridge this gap, we introduce CharacterEval, a Chinese benchmark for comprehensive RPCA… ▽ More

    Submitted 9 January, 2024; v1 submitted 2 January, 2024; originally announced January 2024.

  8. arXiv:2312.16132  [pdf, other

    cs.CL

    RoleEval: A Bilingual Role Evaluation Benchmark for Large Language Models

    Authors: Tianhao Shen, Sun Li, Quan Tu, Deyi Xiong

    Abstract: The rapid evolution of large language models necessitates effective benchmarks for evaluating their role knowledge, which is essential for establishing connections with the real world and providing more immersive interactions. This paper introduces RoleEval, a bilingual benchmark designed to assess the memorization, utilization, and reasoning capabilities of role knowledge. RoleEval comprises Role… ▽ More

    Submitted 16 February, 2024; v1 submitted 26 December, 2023; originally announced December 2023.

    Comments: Our dataset is available at https://github.com/Magnetic2014/RoleEval

  9. arXiv:2311.07468  [pdf, other

    cs.CL cs.AI cs.LG

    Are We Falling in a Middle-Intelligence Trap? An Analysis and Mitigation of the Reversal Curse

    Authors: Ang Lv, Kaiyi Zhang, Shufang Xie, Quan Tu, Yuhan Chen, Ji-Rong Wen, Rui Yan

    Abstract: Recent studies have highlighted a phenomenon in large language models (LLMs) known as "the reversal curse," in which the order of knowledge entities in the training data biases the models' comprehension. For example, if a model is trained on sentences where entity A consistently appears before entity B, it can respond to queries about A by providing B as the answer. However, it may encounter confu… ▽ More

    Submitted 16 November, 2023; v1 submitted 13 November, 2023; originally announced November 2023.

  10. arXiv:2310.17976  [pdf, other

    cs.CL

    InCharacter: Evaluating Personality Fidelity in Role-Playing Agents through Psychological Interviews

    Authors: Xintao Wang, Yunze Xiao, Jen-tse Huang, Siyu Yuan, Rui Xu, Haoran Guo, Quan Tu, Yaying Fei, Ziang Leng, Wei Wang, Jiangjie Chen, Cheng Li, Yanghua Xiao

    Abstract: Role-playing agents (RPAs), powered by large language models, have emerged as a flourishing field of applications. However, a key challenge lies in assessing whether RPAs accurately reproduce the personas of target characters, namely their character fidelity. Existing methods mainly focus on the knowledge and linguistic patterns of characters. This paper, instead, introduces a novel perspective to… ▽ More

    Submitted 7 June, 2024; v1 submitted 27 October, 2023; originally announced October 2023.

    Comments: ACL 2024

  11. arXiv:2310.16271  [pdf, other

    cs.CL cs.AI

    CycleAlign: Iterative Distillation from Black-box LLM to White-box Models for Better Human Alignment

    Authors: Jixiang Hong, Quan Tu, Changyu Chen, Xing Gao, Ji Zhang, Rui Yan

    Abstract: Language models trained on large-scale corpus often generate content that is harmful, toxic, or contrary to human preferences, making their alignment with human values a critical concern. Reinforcement learning from human feedback (RLHF) with algorithms like PPO is a prevalent approach for alignment but is often complex, unstable, and resource-intensive. Recently, ranking-based alignment methods h… ▽ More

    Submitted 24 October, 2023; originally announced October 2023.

  12. arXiv:2308.10278  [pdf, other

    cs.CL

    CharacterChat: Learning towards Conversational AI with Personalized Social Support

    Authors: Quan Tu, Chuanqi Chen, Jinpeng Li, Yanran Li, Shuo Shang, Dongyan Zhao, Ran Wang, Rui Yan

    Abstract: In our modern, fast-paced, and interconnected world, the importance of mental well-being has grown into a matter of great urgency. However, traditional methods such as Emotional Support Conversations (ESC) face challenges in effectively addressing a diverse range of individual personalities. In response, we introduce the Social Support Conversation (S2Conv) framework. It comprises a series of supp… ▽ More

    Submitted 20 August, 2023; originally announced August 2023.

    Comments: 10 pages, 6 figures, 5 tables

  13. arXiv:2307.00569  [pdf, other

    cs.CL

    SSP: Self-Supervised Post-training for Conversational Search

    Authors: Quan Tu, Shen Gao, Xiaolong Wu, Zhao Cao, Ji-Rong Wen, Rui Yan

    Abstract: Conversational search has been regarded as the next-generation search paradigm. Constrained by data scarcity, most existing methods distill the well-trained ad-hoc retriever to the conversational retriever. However, these methods, which usually initialize parameters by query reformulation to discover contextualized dependency, have trouble in understanding the dialogue structure information and st… ▽ More

    Submitted 2 July, 2023; originally announced July 2023.

    Comments: Accepted by ACL 2023 Findings, Long Paper

  14. arXiv:2205.12633  [pdf, other

    cs.CV eess.IV

    NTIRE 2022 Challenge on High Dynamic Range Imaging: Methods and Results

    Authors: Eduardo Pérez-Pellitero, Sibi Catley-Chandar, Richard Shaw, Aleš Leonardis, Radu Timofte, Zexin Zhang, Cen Liu, Yunbo Peng, Yue Lin, Gaocheng Yu, Jin Zhang, Zhe Ma, Hongbin Wang, Xiangyu Chen, Xintao Wang, Haiwei Wu, Lin Liu, Chao Dong, Jiantao Zhou, Qingsen Yan, Song Zhang, Weiye Chen, Yuhang Liu, Zhen Zhang, Yanning Zhang , et al. (68 additional authors not shown)

    Abstract: This paper reviews the challenge on constrained high dynamic range (HDR) imaging that was part of the New Trends in Image Restoration and Enhancement (NTIRE) workshop, held in conjunction with CVPR 2022. This manuscript focuses on the competition set-up, datasets, the proposed methods and their results. The challenge aims at estimating an HDR image from multiple respective low dynamic range (LDR)… ▽ More

    Submitted 25 May, 2022; originally announced May 2022.

    Comments: CVPR Workshops 2022. 15 pages, 21 figures, 2 tables

    Journal ref: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops, 2022

  15. arXiv:2203.13560  [pdf, other

    cs.CL

    MISC: A MIxed Strategy-Aware Model Integrating COMET for Emotional Support Conversation

    Authors: Quan Tu, Yanran Li, Jianwei Cui, Bin Wang, Ji-Rong Wen, Rui Yan

    Abstract: Applying existing methods to emotional support conversation -- which provides valuable assistance to people who are in need -- has two major limitations: (a) they generally employ a conversation-level emotion label, which is too coarse-grained to capture user's instant mental state; (b) most of them focus on expressing empathy in the response(s) rather than gradually reducing user's distress. To a… ▽ More

    Submitted 31 March, 2022; v1 submitted 25 March, 2022; originally announced March 2022.

    Comments: 12 pages, 5 figures, accepted by ACL 2022 main conference

  16. arXiv:2103.16664  [pdf

    cond-mat.mtrl-sci cs.LG

    A probabilistic deep learning approach to automate the interpretation of multi-phase diffraction spectra

    Authors: Nathan J. Szymanski, Christopher J. Bartel, Yan Zeng, Qingsong Tu, Gerbrand Ceder

    Abstract: Autonomous synthesis and characterization of inorganic materials requires the automatic and accurate analysis of X-ray diffraction spectra. For this task, we designed a probabilistic deep learning algorithm to identify complex multi-phase mixtures. At the core of this algorithm lies an ensemble convolutional neural network trained on simulated diffraction spectra, which are systematically augmente… ▽ More

    Submitted 30 March, 2021; originally announced March 2021.