Skip to main content

Showing 1–50 of 92 results for author: Zou, L

  1. arXiv:2407.10182  [pdf, other

    cs.SD eess.AS

    Few-Shot Bioacoustic Event Detection with Frame-Level Embedding Learning System

    Authors: PengYuan Zhao, ChengWei Lu, Liang Zou

    Abstract: This technical report presents our frame-level embedding learning system for the DCASE2024 challenge for few-shot bioacoustic event detection (Task 5).In this work, we used log-mel and PCEN for feature extraction of the input audio, Netmamba Encoder as the information interaction network, and adopted data augmentation strategies to improve the generalizability of the trained model as well as multi… ▽ More

    Submitted 14 July, 2024; originally announced July 2024.

  2. arXiv:2407.02328  [pdf, other

    cs.CL

    Efficient Sparse Attention needs Adaptive Token Release

    Authors: Chaoran Zhang, Lixin Zou, Dan Luo, Min Tang, Xiangyang Luo, Zihao Li, Chenliang Li

    Abstract: In recent years, Large Language Models (LLMs) have demonstrated remarkable capabilities across a wide array of text-centric tasks. However, their `large' scale introduces significant computational and storage challenges, particularly in managing the key-value states of the transformer, which limits their wider applicability. Therefore, we propose to adaptively release resources from caches and reb… ▽ More

    Submitted 2 July, 2024; originally announced July 2024.

    Comments: Accepted at ACL 2024(Findings)

  3. arXiv:2406.13216  [pdf, other

    cs.LG cs.AI

    Combining Optimal Transport and Embedding-Based Approaches for More Expressiveness in Unsupervised Graph Alignment

    Authors: Songyang Chen, Yu Liu, Lei Zou, Zexuan Wang, Youfang Lin, Yuxing Chen, Anqun Pan

    Abstract: Unsupervised graph alignment finds the one-to-one node correspondence between a pair of attributed graphs by only exploiting graph structure and node features. One category of existing works first computes the node representation and then matches nodes with close embeddings, which is intuitive but lacks a clear objective tailored for graph alignment in the unsupervised setting. The other category… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

    Comments: 12 pages,9 figures

  4. arXiv:2404.13066  [pdf, other

    cs.CL cs.AI

    Leveraging Large Language Model as Simulated Patients for Clinical Education

    Authors: Yanzeng Li, Cheng Zeng, Jialun Zhong, Ruoyu Zhang, Minhao Zhang, Lei Zou

    Abstract: Simulated Patients (SPs) play a crucial role in clinical medical education by providing realistic scenarios for student practice. However, the high cost of training and hiring qualified SPs, along with the heavy workload and potential risks they face in consistently portraying actual patients, limit students' access to this type of clinical training. Consequently, the integration of computer progr… ▽ More

    Submitted 24 April, 2024; v1 submitted 13 April, 2024; originally announced April 2024.

  5. arXiv:2404.09463  [pdf

    cs.LG

    PRIME: A CyberGIS Platform for Resilience Inference Measurement and Enhancement

    Authors: Debayan Mandal, Dr. Lei Zou, Rohan Singh Wilkho, Joynal Abedin, Bing Zhou, Dr. Heng Cai, Dr. Furqan Baig, Dr. Nasir Gharaibeh, Dr. Nina Lam

    Abstract: In an era of increased climatic disasters, there is an urgent need to develop reliable frameworks and tools for evaluating and improving community resilience to climatic hazards at multiple geographical and temporal scales. Defining and quantifying resilience in the social domain is relatively subjective due to the intricate interplay of socioeconomic factors with disaster resilience. Meanwhile, t… ▽ More

    Submitted 15 April, 2024; originally announced April 2024.

    Comments: 28 pages, 6 figures

  6. arXiv:2404.07999  [pdf, other

    cs.LG cs.CL

    A Multi-Level Framework for Accelerating Training Transformer Models

    Authors: Longwei Zou, Han Zhang, Yangdong Deng

    Abstract: The fast growing capabilities of large-scale deep learning models, such as Bert, GPT and ViT, are revolutionizing the landscape of NLP, CV and many other domains. Training such models, however, poses an unprecedented demand for computing power, which incurs exponentially increasing energy cost and carbon dioxide emissions. It is thus critical to develop efficient training solutions to reduce the t… ▽ More

    Submitted 6 April, 2024; originally announced April 2024.

    Comments: ICLR 2024

  7. arXiv:2404.06709  [pdf, other

    cs.CL

    CQIL: Inference Latency Optimization with Concurrent Computation of Quasi-Independent Layers

    Authors: Longwei Zou, Qingyang Wang, Han Zhao, Jiangang Kong, Yi Yang, Yangdong Deng

    Abstract: The fast-growing large scale language models are delivering unprecedented performance on almost all natural language processing tasks. However, the effectiveness of large language models are reliant on an exponentially increasing number of parameters. The overwhelming computation complexity incurs a high inference latency that negatively affects user experience. Existing methods to improve inferen… ▽ More

    Submitted 4 July, 2024; v1 submitted 9 April, 2024; originally announced April 2024.

    Comments: ACL 2024

  8. arXiv:2403.13300  [pdf, other

    stat.ML cs.LG

    Kernel Multigrid: Accelerate Back-fitting via Sparse Gaussian Process Regression

    Authors: Lu Zou, Liang Ding

    Abstract: Additive Gaussian Processes (GPs) are popular approaches for nonparametric feature selection. The common training method for these models is Bayesian Back-fitting. However, the convergence rate of Back-fitting in training additive GPs is still an open problem. By utilizing a technique called Kernel Packets (KP), we prove that the convergence rate of Back-fitting is no faster than… ▽ More

    Submitted 30 March, 2024; v1 submitted 20 March, 2024; originally announced March 2024.

  9. arXiv:2403.11091  [pdf, other

    cs.SD cs.CV eess.AS

    Multitask frame-level learning for few-shot sound event detection

    Authors: Liang Zou, Genwei Yan, Ruoyu Wang, Jun Du, Meng Lei, Tian Gao, Xin Fang

    Abstract: This paper focuses on few-shot Sound Event Detection (SED), which aims to automatically recognize and classify sound events with limited samples. However, prevailing methods methods in few-shot SED predominantly rely on segment-level predictions, which often providing detailed, fine-grained predictions, particularly for events of brief duration. Although frame-level prediction strategies have been… ▽ More

    Submitted 17 March, 2024; originally announced March 2024.

    Comments: 6 pages, 4 figures, conference

  10. arXiv:2402.15627  [pdf, other

    cs.LG cs.DC

    MegaScale: Scaling Large Language Model Training to More Than 10,000 GPUs

    Authors: Ziheng Jiang, Haibin Lin, Yinmin Zhong, Qi Huang, Yangrui Chen, Zhi Zhang, Yanghua Peng, Xiang Li, Cong Xie, Shibiao Nong, Yulu Jia, Sun He, Hongmin Chen, Zhihao Bai, Qi Hou, Shipeng Yan, Ding Zhou, Yiyao Sheng, Zhuo Jiang, Haohan Xu, Haoran Wei, Zhang Zhang, Pengfei Nie, Leqi Zou, Sida Zhao , et al. (7 additional authors not shown)

    Abstract: We present the design, implementation and engineering experience in building and deploying MegaScale, a production system for training large language models (LLMs) at the scale of more than 10,000 GPUs. Training LLMs at this scale brings unprecedented challenges to training efficiency and stability. We take a full-stack approach that co-designs the algorithmic and system components across model bl… ▽ More

    Submitted 23 February, 2024; originally announced February 2024.

  11. arXiv:2402.13296  [pdf, other

    cs.NE

    Evolutionary Reinforcement Learning: A Systematic Review and Future Directions

    Authors: Yuanguo Lin, Fan Lin, Guorong Cai, Hong Chen, Lixin Zou, Pengcheng Wu

    Abstract: In response to the limitations of reinforcement learning and evolutionary algorithms (EAs) in complex problem-solving, Evolutionary Reinforcement Learning (EvoRL) has emerged as a synergistic solution. EvoRL integrates EAs and reinforcement learning, presenting a promising avenue for training intelligent agents. This systematic review firstly navigates through the technological background of EvoRL… ▽ More

    Submitted 19 February, 2024; originally announced February 2024.

    Comments: 18 pages, 2 figures

  12. arXiv:2312.09911  [pdf, other

    cs.SD eess.AS

    Amphion: An Open-Source Audio, Music and Speech Generation Toolkit

    Authors: Xueyao Zhang, Liumeng Xue, Yicheng Gu, Yuancheng Wang, Haorui He, Chaoren Wang, Xi Chen, Zihao Fang, Haopeng Chen, Junan Zhang, Tze Ying Tang, Lexiao Zou, Mingxuan Wang, Jun Han, Kai Chen, Haizhou Li, Zhizheng Wu

    Abstract: Amphion is an open-source toolkit for Audio, Music, and Speech Generation, targeting to ease the way for junior researchers and engineers into these fields. It presents a unified framework that is inclusive of diverse generation tasks and models, with the added bonus of being easily extendable for new incorporation. The toolkit is designed with beginner-friendly workflows and pre-trained models, a… ▽ More

    Submitted 22 February, 2024; v1 submitted 15 December, 2023; originally announced December 2023.

    Comments: Amphion Website: https://github.com/open-mmlab/Amphion

  13. arXiv:2312.07141  [pdf, other

    cs.CL

    Multilingual large language models leak human stereotypes across language boundaries

    Authors: Yang Trista Cao, Anna Sotnikova, Jieyu Zhao, Linda X. Zou, Rachel Rudinger, Hal Daume III

    Abstract: Multilingual large language models have been increasingly popular for their proficiency in processing and generating text across various languages. Previous research has shown that the presence of stereotypes and biases in monolingual large language models can be attributed to the nature of their training data, which is collected from humans and reflects societal biases. Multilingual language mode… ▽ More

    Submitted 8 May, 2024; v1 submitted 12 December, 2023; originally announced December 2023.

  14. arXiv:2312.01386  [pdf, ps, other

    cs.LG stat.ML

    Regret Optimality of GP-UCB

    Authors: Wenjia Wang, Xiaowei Zhang, Lu Zou

    Abstract: Gaussian Process Upper Confidence Bound (GP-UCB) is one of the most popular methods for optimizing black-box functions with noisy observations, due to its simple structure and superior performance. Its empirical successes lead to a natural, yet unresolved question: Is GP-UCB regret optimal? In this paper, we offer the first generally affirmative answer to this important open question in the Bayesi… ▽ More

    Submitted 3 December, 2023; originally announced December 2023.

    Comments: 23 pages

  15. arXiv:2310.19596  [pdf, other

    cs.CL cs.AI

    LLMaAA: Making Large Language Models as Active Annotators

    Authors: Ruoyu Zhang, Yanzeng Li, Yongliang Ma, Ming Zhou, Lei Zou

    Abstract: Prevalent supervised learning methods in natural language processing (NLP) are notoriously data-hungry, which demand large amounts of high-quality annotated data. In practice, acquiring such data is a costly endeavor. Recently, the superior few-shot performance of large language models (LLMs) has propelled the development of dataset generation, where the training data are solely synthesized from L… ▽ More

    Submitted 31 October, 2023; v1 submitted 30 October, 2023; originally announced October 2023.

    Comments: Findings of EMNLP 2023 camera ready

  16. arXiv:2310.16837   

    cs.LG cs.AI cs.DB cs.SI

    RDBench: ML Benchmark for Relational Databases

    Authors: Zizhao Zhang, Yi Yang, Lutong Zou, He Wen, Tao Feng, Jiaxuan You

    Abstract: Benefiting from high-quality datasets and standardized evaluation metrics, machine learning (ML) has achieved sustained progress and widespread applications. However, while applying machine learning to relational databases (RDBs), the absence of a well-established benchmark remains a significant obstacle to the development of ML. To address this issue, we introduce ML Benchmark For Relational Data… ▽ More

    Submitted 30 October, 2023; v1 submitted 25 October, 2023; originally announced October 2023.

    Comments: Withdrawn by the authors to avoid conflict of interests

  17. arXiv:2310.11160  [pdf, other

    cs.SD eess.AS

    Leveraging Diverse Semantic-based Audio Pretrained Models for Singing Voice Conversion

    Authors: Xueyao Zhang, Yicheng Gu, Haopeng Chen, Zihao Fang, Lexiao Zou, Junan Zhang, Liumeng Xue, Jinchao Zhang, Jie Zhou, Zhizheng Wu

    Abstract: Singing Voice Conversion (SVC) is a technique that enables any singer to perform any song. To achieve this, it is essential to obtain speaker-agnostic representations from the source audio, which poses a significant challenge. A common solution involves utilizing a semantic-based audio pretrained model as a feature extractor. However, the degree to which the extracted features can meet the SVC req… ▽ More

    Submitted 27 May, 2024; v1 submitted 17 October, 2023; originally announced October 2023.

  18. arXiv:2309.16730  [pdf

    cs.LG cs.CY

    Explainable machine learning-based prediction model for diabetic nephropathy

    Authors: Jing-Mei Yin, Yang Li, Jun-Tang Xue, Guo-Wei Zong, Zhong-Ze Fang, Lang Zou

    Abstract: The aim of this study is to analyze the effect of serum metabolites on diabetic nephropathy (DN) and predict the prevalence of DN through a machine learning approach. The dataset consists of 548 patients from April 2018 to April 2019 in Second Affiliated Hospital of Dalian Medical University (SAHDMU). We select the optimal 38 features through a Least absolute shrinkage and selection operator (LASS… ▽ More

    Submitted 24 October, 2023; v1 submitted 27 September, 2023; originally announced September 2023.

  19. arXiv:2309.05201  [pdf, other

    cs.CL

    Two is Better Than One: Answering Complex Questions by Multiple Knowledge Sources with Generalized Links

    Authors: Minhao Zhang, Yongliang Ma, Yanzeng Li, Ruoyu Zhang, Lei Zou, Ming Zhou

    Abstract: Incorporating multiple knowledge sources is proven to be beneficial for answering complex factoid questions. To utilize multiple knowledge bases (KB), previous works merge all KBs into a single graph via entity alignment and reduce the problem to question-answering (QA) over the fused KB. In reality, various link relations between KBs might be adopted in QA over multi-KBs. In addition to the ident… ▽ More

    Submitted 10 September, 2023; originally announced September 2023.

  20. arXiv:2308.04800  [pdf, other

    cs.CL

    ADMUS: A Progressive Question Answering Framework Adaptable to Multiple Knowledge Sources

    Authors: Yirui Zhan, Yanzeng Li, Minhao Zhang, Lei Zou

    Abstract: With the introduction of deep learning models, semantic parsingbased knowledge base question answering (KBQA) systems have achieved high performance in handling complex questions. However, most existing approaches primarily focus on enhancing the model's effectiveness on individual benchmark datasets, disregarding the high costs of adapting the system to disparate datasets in real-world scenarios… ▽ More

    Submitted 9 August, 2023; originally announced August 2023.

  21. arXiv:2307.14603  [pdf, other

    eess.IV cs.CV

    A Weakly Supervised Segmentation Network Embedding Cross-scale Attention Guidance and Noise-sensitive Constraint for Detecting Tertiary Lymphoid Structures of Pancreatic Tumors

    Authors: Bingxue Wang, Liwen Zou, Jun Chen, Yingying Cao, Zhenghua Cai, Yudong Qiu, Liang Mao, Zhongqiu Wang, Jingya Chen, Luying Gui, Xiaoping Yang

    Abstract: The presence of tertiary lymphoid structures (TLSs) on pancreatic pathological images is an important prognostic indicator of pancreatic tumors. Therefore, TLSs detection on pancreatic pathological images plays a crucial role in diagnosis and treatment for patients with pancreatic tumors. However, fully supervised detection algorithms based on deep learning usually require a large number of manual… ▽ More

    Submitted 26 July, 2023; originally announced July 2023.

  22. arXiv:2305.09918  [pdf, ps, other

    cs.IR

    Unconfounded Propensity Estimation for Unbiased Ranking

    Authors: Dan Luo, Lixin Zou, Qingyao Ai, Zhiyu Chen, Chenliang Li, Dawei Yin, Brian D. Davison

    Abstract: The goal of unbiased learning to rank (ULTR) is to leverage implicit user feedback for optimizing learning-to-rank systems. Among existing solutions, automatic ULTR algorithms that jointly learn user bias models (i.e., propensity models) with unbiased rankers have received a lot of attention due to their superior performance and low deployment cost in practice. Despite their theoretical soundness,… ▽ More

    Submitted 8 July, 2023; v1 submitted 16 May, 2023; originally announced May 2023.

    Comments: 11 pages, 5 figures

  23. arXiv:2305.00324  [pdf, other

    stat.ML cs.LG

    Representing Additive Gaussian Processes by Sparse Matrices

    Authors: Lu Zou, Haoyuan Chen, Liang Ding

    Abstract: Among generalized additive models, additive Matérn Gaussian Processes (GPs) are one of the most popular for scalable high-dimensional problems. Thanks to their additive structure and stochastic differential equation representation, back-fitting-based algorithms can reduce the time complexity of computing the posterior mean from $O(n^3)$ to $O(n\log n)$ time where $n$ is the data size. However, gen… ▽ More

    Submitted 29 April, 2023; originally announced May 2023.

  24. arXiv:2303.13844  [pdf, other

    cs.DB

    Efficient Execution of SPARQL Queries with OPTIONAL and UNION Expressions

    Authors: Lei Zou, Yue Pang, M. Tamer Özsu, Jiaqi Chen

    Abstract: The proliferation of RDF datasets has resulted in studies focusing on optimizing SPARQL query processing. Most existing work focuses on basic graph patterns (BGPs) and ignores other vital operators in SPARQL, such as UNION and OPTIONAL. SPARQL queries with these operators, which we abbreviate as SPARQL-UO, pose serious query plan generation challenges. In this paper, we propose techniques for exec… ▽ More

    Submitted 24 March, 2023; originally announced March 2023.

  25. User Retention-oriented Recommendation with Decision Transformer

    Authors: Kesen Zhao, Lixin Zou, Xiangyu Zhao, Maolin Wang, Dawei yin

    Abstract: Improving user retention with reinforcement learning~(RL) has attracted increasing attention due to its significant importance in boosting user engagement. However, training the RL policy from scratch without hurting users' experience is unavoidable due to the requirement of trial-and-error searches. Furthermore, the offline methods, which aim to optimize the policy without online interactions, su… ▽ More

    Submitted 11 March, 2023; originally announced March 2023.

    Comments: 9 pages, 5 figures

  26. arXiv:2303.02967  [pdf, other

    eess.IV cs.CV

    Automated Peripancreatic Vessel Segmentation and Labeling Based on Iterative Trunk Growth and Weakly Supervised Mechanism

    Authors: Liwen Zou, Zhenghua Cai, Liang Mao, Ziwei Nie, Yudong Qiu, Xiaoping Yang

    Abstract: Peripancreatic vessel segmentation and anatomical labeling play extremely important roles to assist the early diagnosis, surgery planning and prognosis for patients with pancreatic tumors. However, most current techniques cannot achieve satisfactory segmentation performance for peripancreatic veins and usually make predictions with poor integrity and connectivity. Besides, unsupervised labeling al… ▽ More

    Submitted 6 March, 2023; originally announced March 2023.

  27. arXiv:2303.02944  [pdf, other

    cs.CV

    CTG-Net: An Efficient Cascaded Framework Driven by Terminal Guidance Mechanism for Dilated Pancreatic Duct Segmentation

    Authors: Liwen Zou, Zhenghua Cai, Yudong Qiu, Luying Gui, Liang Mao, Xiaoping Yang

    Abstract: Pancreatic duct dilation indicates a high risk of various pancreatic diseases. Segmentation of dilated pancreatic ducts on computed tomography (CT) images shows the potential to assist the early diagnosis, surgical planning and prognosis. Because of the ducts' tiny sizes, slender tubular structures and the surrounding distractions, most current researches on pancreatic duct segmentation achieve lo… ▽ More

    Submitted 6 March, 2023; originally announced March 2023.

  28. arXiv:2301.13631  [pdf

    cs.CL cs.AI

    TopoBERT: Plug and Play Toponym Recognition Module Harnessing Fine-tuned BERT

    Authors: Bing Zhou, Lei Zou, Yingjie Hu, Yi Qiang, Daniel Goldberg

    Abstract: Extracting precise geographical information from textual contents is crucial in a plethora of applications. For example, during hazardous events, a robust and unbiased toponym extraction framework can provide an avenue to tie the location concerned to the topic discussed by news media posts and pinpoint humanitarian help requests or damage reports from social media. Early studies have leveraged ru… ▽ More

    Submitted 3 February, 2023; v1 submitted 31 January, 2023; originally announced January 2023.

    Comments: 8 Pages, 6 figures

  29. arXiv:2301.13337   

    cs.CV

    DAFD: Domain Adaptation via Feature Disentanglement for Image Classification

    Authors: Zhize Wu, Changjiang Du, Le Zou, Ming Tan, Tong Xu, Fan Cheng, Fudong Nian, Thomas Weise

    Abstract: A good feature representation is the key to image classification. In practice, image classifiers may be applied in scenarios different from what they have been trained on. This so-called domain shift leads to a significant performance drop in image classification. Unsupervised domain adaptation (UDA) reduces the domain shift by transferring the knowledge learned from a labeled source domain to an… ▽ More

    Submitted 9 January, 2024; v1 submitted 30 January, 2023; originally announced January 2023.

    Comments: Update the experimental results

  30. arXiv:2210.10718  [pdf, other

    cs.IR cs.AI

    Whole Page Unbiased Learning to Rank

    Authors: Haitao Mao, Lixin Zou, Yujia Zheng, Jiliang Tang, Xiaokai Chu, Jiashu Zhao, Qian Wang, Dawei Yin

    Abstract: The page presentation biases in the information retrieval system, especially on the click behavior, is a well-known challenge that hinders improving ranking models' performance with implicit user feedback. Unbiased Learning to Rank~(ULTR) algorithms are then proposed to learn an unbiased ranking model with biased click data. However, most existing algorithms are specifically designed to mitigate p… ▽ More

    Submitted 13 June, 2024; v1 submitted 19 October, 2022; originally announced October 2022.

    Comments: 12 pages, 5 figures

  31. Lexical semantics enhanced neural word embeddings

    Authors: Dongqiang Yang, Ning Li, Li Zou, Hongwei Ma

    Abstract: Current breakthroughs in natural language processing have benefited dramatically from neural language models, through which distributional semantics can leverage neural data representations to facilitate downstream applications. Since neural embeddings use context prediction on word co-occurrences to yield dense vectors, they are inevitably prone to capture more semantic association than semantic… ▽ More

    Submitted 3 October, 2022; originally announced October 2022.

    Journal ref: Knowledge-Based Systems, Volume 252,2022

  32. arXiv:2209.07663  [pdf, other

    cs.IR

    Monolith: Real Time Recommendation System With Collisionless Embedding Table

    Authors: Zhuoran Liu, Leqi Zou, Xuan Zou, Caihua Wang, Biao Zhang, Da Tang, Bolin Zhu, Yijie Zhu, Peng Wu, Ke Wang, Youlong Cheng

    Abstract: Building a scalable and real-time recommendation system is vital for many businesses driven by time-sensitive customer feedback, such as short-videos ranking or online ads. Despite the ubiquitous adoption of production-scale deep learning frameworks like TensorFlow or PyTorch, these general-purpose frameworks fall short of business demands in recommendation scenarios for various reasons: on one ha… ▽ More

    Submitted 27 September, 2022; v1 submitted 15 September, 2022; originally announced September 2022.

    Comments: ORSUM@ACM RecSys 2022

  33. arXiv:2209.02981  [pdf, other

    cs.DB cs.CL

    VGStore: A Multimodal Extension to SPARQL for Querying RDF Scene Graph

    Authors: Yanzeng Li, Zilong Zheng, Wenjuan Han, Lei Zou

    Abstract: Semantic Web technology has successfully facilitated many RDF models with rich data representation methods. It also has the potential ability to represent and store multimodal knowledge bases such as multimodal scene graphs. However, most existing query languages, especially SPARQL, barely explore the implicit multimodal relationships like semantic similarity, spatial relations, etc. We first expl… ▽ More

    Submitted 7 September, 2022; originally announced September 2022.

    Comments: ISWC 2022 Posters, Demos, and Industry Tracks

  34. arXiv:2208.09705  [pdf, other

    cs.CL

    gBuilder: A Scalable Knowledge Graph Construction System for Unstructured Corpus

    Authors: Yanzeng Li, Lei Zou

    Abstract: We design a user-friendly and scalable knowledge graph construction (KGC) system for extracting structured knowledge from the unstructured corpus. Different from existing KGC systems, gBuilder provides a flexible and user-defined pipeline to embrace the rapid development of IE models. More built-in template-based or heuristic operators and programmable operators are available for adapting to data… ▽ More

    Submitted 11 December, 2023; v1 submitted 20 August, 2022; originally announced August 2022.

  35. Approximated Doubly Robust Search Relevance Estimation

    Authors: Lixin Zou, Changying Hao, Hengyi Cai, Suqi Cheng, Shuaiqiang Wang, Wenwen Ye, Zhicong Cheng, Simiu Gu, Dawei Yin

    Abstract: Extracting query-document relevance from the sparse, biased clickthrough log is among the most fundamental tasks in the web search system. Prior art mainly learns a relevance judgment model with semantic features of the query and document and ignores directly counterfactual relevance evaluation from the clicking log. Though the learned semantic matching models can provide relevance signals for tai… ▽ More

    Submitted 16 August, 2022; originally announced August 2022.

    Comments: 10 pages

    Journal ref: CIKM 2022

  36. arXiv:2207.11785  [pdf, ps, other

    cs.IR

    Model-based Unbiased Learning to Rank

    Authors: Dan Luo, Lixin Zou, Qingyao Ai, Zhiyu Chen, Dawei Yin, Brian D. Davison

    Abstract: Unbiased Learning to Rank (ULTR) that learns to rank documents with biased user feedback data is a well-known challenge in information retrieval. Existing methods in unbiased learning to rank typically rely on click modeling or inverse propensity weighting (IPW). Unfortunately, the search engines are faced with severe long-tail query distribution, where neither click modeling nor IPW can handle we… ▽ More

    Submitted 7 February, 2023; v1 submitted 24 July, 2022; originally announced July 2022.

    Comments: accepted in WSDM '23; extended version

  37. arXiv:2207.03680  [pdf, other

    cs.CL

    Crake: Causal-Enhanced Table-Filler for Question Answering over Large Scale Knowledge Base

    Authors: Minhao Zhang, Ruoyu Zhang, Yanzeng Li, Lei Zou

    Abstract: Semantic parsing solves knowledge base (KB) question answering (KBQA) by composing a KB query, which generally involves node extraction (NE) and graph composition (GC) to detect and connect related nodes in a query. Despite the strong causal effects between NE and GC, previous works fail to directly model such causalities in their pipeline, hindering the learning of subtask correlations. Also, the… ▽ More

    Submitted 8 July, 2022; originally announced July 2022.

    Comments: NAACL 2022 Findings

  38. arXiv:2207.03051  [pdf, ps, other

    cs.AI

    A Large Scale Search Dataset for Unbiased Learning to Rank

    Authors: Lixin Zou, Haitao Mao, Xiaokai Chu, Jiliang Tang, Wenwen Ye, Shuaiqiang Wang, Dawei Yin

    Abstract: The unbiased learning to rank (ULTR) problem has been greatly advanced by recent deep learning techniques and well-designed debias algorithms. However, promising results on the existing benchmark datasets may not be extended to the practical scenario due to the following disadvantages observed from those popular benchmark datasets: (1) outdated semantic feature extraction where state-of-the-art la… ▽ More

    Submitted 19 September, 2022; v1 submitted 6 July, 2022; originally announced July 2022.

    Comments: 15 pages, 9 figures

  39. arXiv:2207.01762  [pdf, other

    cs.CL cs.AI cs.IR

    PReGAN: Answer Oriented Passage Ranking with Weakly Supervised GAN

    Authors: Pan Du, Jian-Yun Nie, Yutao Zhu, Hao Jiang, Lixin Zou, Xiaohui Yan

    Abstract: Beyond topical relevance, passage ranking for open-domain factoid question answering also requires a passage to contain an answer (answerability). While a few recent studies have incorporated some reading capability into a ranker to account for answerability, the ranker is still hindered by the noisy nature of the training data typically available in this area, which considers any passage containi… ▽ More

    Submitted 4 July, 2022; originally announced July 2022.

  40. arXiv:2206.11684  [pdf, other

    cs.CL

    Theory-Grounded Measurement of U.S. Social Stereotypes in English Language Models

    Authors: Yang Trista Cao, Anna Sotnikova, Hal Daumé III, Rachel Rudinger, Linda Zou

    Abstract: NLP models trained on text have been shown to reproduce human stereotypes, which can magnify harms to marginalized groups when systems are deployed at scale. We adapt the Agency-Belief-Communion (ABC) stereotype model of Koch et al. (2016) from social psychology as a framework for the systematic study and discovery of stereotypic group-trait associations in language models (LMs). We introduce the… ▽ More

    Submitted 23 June, 2022; originally announced June 2022.

  41. arXiv:2205.11764  [pdf, other

    cs.CL

    D4: a Chinese Dialogue Dataset for Depression-Diagnosis-Oriented Chat

    Authors: Binwei Yao, Chao Shi, Likai Zou, Lingfeng Dai, Mengyue Wu, Lu Chen, Zhen Wang, Kai Yu

    Abstract: In a depression-diagnosis-directed clinical session, doctors initiate a conversation with ample emotional support that guides the patients to expose their symptoms based on clinical diagnosis criteria. Such a dialogue system is distinguished from existing single-purpose human-machine dialog systems, as it combines task-oriented and chit-chats with uniqueness in dialogue topics and procedures. Howe… ▽ More

    Submitted 24 October, 2022; v1 submitted 23 May, 2022; originally announced May 2022.

  42. arXiv:2204.06240  [pdf, other

    cs.LG cs.IR

    CowClip: Reducing CTR Prediction Model Training Time from 12 hours to 10 minutes on 1 GPU

    Authors: Zangwei Zheng, Pengtai Xu, Xuan Zou, Da Tang, Zhen Li, Chenguang Xi, Peng Wu, Leqi Zou, Yijie Zhu, Ming Chen, Xiangzhuo Ding, Fuzhao Xue, Ziheng Qin, Youlong Cheng, Yang You

    Abstract: The click-through rate (CTR) prediction task is to predict whether a user will click on the recommended item. As mind-boggling amounts of data are produced online daily, accelerating CTR prediction model training is critical to ensuring an up-to-date model and reducing the training cost. One approach to increase the training speed is to apply large batch training. However, as shown in computer vis… ▽ More

    Submitted 30 November, 2022; v1 submitted 13 April, 2022; originally announced April 2022.

    Comments: AAAI 2023

  43. arXiv:2203.03833  [pdf, other

    cs.CV cs.AI

    Quasi-Balanced Self-Training on Noise-Aware Synthesis of Object Point Clouds for Closing Domain Gap

    Authors: Yongwei Chen, Zihao Wang, Longkun Zou, Ke Chen, Kui Jia

    Abstract: Semantic analyses of object point clouds are largely driven by releasing of benchmarking datasets, including synthetic ones whose instances are sampled from object CAD models. However, learning from synthetic data may not generalize to practical scenarios, where point clouds are typically incomplete, non-uniformly distributed, and noisy. Such a challenge of Simulation-to-Reality (Sim2Real) domain… ▽ More

    Submitted 19 July, 2022; v1 submitted 7 March, 2022; originally announced March 2022.

    Comments: Accepted by ECCV 2022

  44. arXiv:2202.11878  [pdf, other

    cs.CV

    New Benchmark for Household Garbage Image Recognition

    Authors: Zhize Wu, Huanyi Li, Xiaofeng Wang, Zijun Wu, Le Zou, Lixiang Xu, Ming Tan

    Abstract: Household garbage images are usually faced with complex backgrounds, variable illuminations, diverse angles, and changeable shapes, which bring a great difficulty in garbage image classification. Due to the ability to discover problem-specific features, deep learning and especially convolutional neural networks (CNNs) have been successfully and widely used for image representation learning. Howeve… ▽ More

    Submitted 23 February, 2022; originally announced February 2022.

  45. arXiv:2202.05941   

    cs.CV

    Domain-Invariant Proposals based on a Balanced Domain Classifier for Object Detection

    Authors: Zhize Wu, Xiaofeng Wang, Tong Xu, Xuebin Yang, Le Zou, Lixiang Xu, Thomas Weise

    Abstract: Object recognition from images means to automatically find object(s) of interest and to return their category and location information. Benefiting from research on deep learning, like convolutional neural networks~(CNNs) and generative adversarial networks, the performance in this field has been improved significantly, especially when training and test data are drawn from similar distributions. Ho… ▽ More

    Submitted 5 January, 2024; v1 submitted 11 February, 2022; originally announced February 2022.

    Comments: fixed some issues

  46. arXiv:2111.07187  [pdf

    cs.SI

    Social Media for Emergency Rescue: An Analysis of Rescue Requests on Twitter during Hurricane Harvey

    Authors: Lei Zou, Danqing Liao, Nina S. N. Lam, Michelle Meyer, Nasir G. Gharaibeh, Heng Cai, Bing Zhou, Dongying Li

    Abstract: Social media plays increasingly significant roles in disaster response, but effectively leveraging social media for rescue is challenging. This study analyzed rescue requests on Twitter during the 2017 Hurricane Harvey, in which many residents resorted to social media to call for help. The objectives include (1) understanding the characteristics of rescue-request messages; (2) revealing the spatia… ▽ More

    Submitted 13 November, 2021; originally announced November 2021.

    Comments: 24 pages, 9 figures, 6 tables

  47. arXiv:2111.03446  [pdf

    physics.soc-ph cs.SI

    Revealing the Global Linguistic and Geographical Disparities of Public Awareness to Covid-19 Outbreak through Social Media

    Authors: Binbin Lin, Lei Zou, Nick Duffield, Ali Mostafavi, Heng Cai, Bing Zhou, Jian Tao, Mingzheng Yang, Debayan Mandal, Joynal Abedin

    Abstract: The Covid-19 has presented an unprecedented challenge to public health worldwide. However, residents in different countries showed diverse levels of Covid-19 awareness during the outbreak and suffered from uneven health impacts. This study analyzed the global Twitter data from January 1st to June 30th, 2020, seeking to answer two research questions. What are the linguistic and geographical dispari… ▽ More

    Submitted 8 November, 2021; v1 submitted 29 October, 2021; originally announced November 2021.

  48. 6D-ViT: Category-Level 6D Object Pose Estimation via Transformer-based Instance Representation Learning

    Authors: Lu Zou, Zhangjin Huang, Naijie Gu, Guoping Wang

    Abstract: This paper presents 6D-ViT, a transformer-based instance representation learning network, which is suitable for highly accurate category-level object pose estimation on RGB-D images. Specifically, a novel two-stream encoder-decoder framework is dedicated to exploring complex and powerful instance representations from RGB images, point clouds and categorical shape priors. For this purpose, the whol… ▽ More

    Submitted 30 October, 2021; v1 submitted 10 October, 2021; originally announced October 2021.

    Comments: 13 pages, 12 figures

    Journal ref: IEEE Transactions on Image Processing 2022

  49. A Survey on Reinforcement Learning for Recommender Systems

    Authors: Yuanguo Lin, Yong Liu, Fan Lin, Lixin Zou, Pengcheng Wu, Wenhua Zeng, Huanhuan Chen, Chunyan Miao

    Abstract: Recommender systems have been widely applied in different real-life scenarios to help us find useful information. In particular, Reinforcement Learning (RL) based recommender systems have become an emerging research topic in recent years, owing to the interactive nature and autonomous learning ability. Empirical results show that RL-based recommendation methods often surpass most of supervised lea… ▽ More

    Submitted 10 June, 2023; v1 submitted 22 September, 2021; originally announced September 2021.

    Comments: 21 pages, 4 figures, Accepted by TNNLS 2023

  50. arXiv:2109.08909  [pdf, other

    cs.CV eess.IV math.NA

    Measuring the rogue wave pattern triggered from Gaussian perturbations by deep learning

    Authors: Liwen Zou, XinHang Luo, Delu Zeng, Liming Ling, Li-Chen Zhao

    Abstract: Weak Gaussian perturbations on a plane wave background could trigger lots of rogue waves, due to modulational instability. Numerical simulations showed that these rogue waves seemed to have similar unit structure. However, to the best of our knowledge, there is no relative result to prove that these rogue waves have the similar patterns for different perturbations, partly due to that it is hard to… ▽ More

    Submitted 9 October, 2021; v1 submitted 18 September, 2021; originally announced September 2021.

    Comments: 8 pages, 6 figures