Skip to main content

Showing 1–50 of 173 results for author: Yin, M

  1. arXiv:2407.06645  [pdf, other

    cs.LG cs.CL

    Entropy Law: The Story Behind Data Compression and LLM Performance

    Authors: Mingjia Yin, Chuhan Wu, Yufei Wang, Hao Wang, Wei Guo, Yasheng Wang, Yong Liu, Ruiming Tang, Defu Lian, Enhong Chen

    Abstract: Data is the cornerstone of large language models (LLMs), but not all data is useful for model learning. Carefully selected data can better elicit the capabilities of LLMs with much less computational overhead. Most methods concentrate on evaluating the quality of individual samples in data selection, while the combinatorial effects among samples are neglected. Even if each sample is of perfect qua… ▽ More

    Submitted 10 July, 2024; v1 submitted 9 July, 2024; originally announced July 2024.

  2. arXiv:2407.06309  [pdf, other

    cs.CY cs.AI

    Multimodal Chain-of-Thought Reasoning via ChatGPT to Protect Children from Age-Inappropriate Apps

    Authors: Chuanbo Hu, Bin Liu, Minglei Yin, Yilu Zhou, Xin Li

    Abstract: Mobile applications (Apps) could expose children to inappropriate themes such as sexual content, violence, and drug use. Maturity rating offers a quick and effective method for potential users, particularly guardians, to assess the maturity levels of apps. Determining accurate maturity ratings for mobile apps is essential to protect children's health in today's saturated digital marketplace. Exist… ▽ More

    Submitted 8 July, 2024; originally announced July 2024.

  3. arXiv:2407.03307  [pdf, other

    eess.IV cs.CV

    HoloHisto: End-to-end Gigapixel WSI Segmentation with 4K Resolution Sequential Tokenization

    Authors: Yucheng Tang, Yufan He, Vishwesh Nath, Pengfeig Guo, Ruining Deng, Tianyuan Yao, Quan Liu, Can Cui, Mengmeng Yin, Ziyue Xu, Holger Roth, Daguang Xu, Haichun Yang, Yuankai Huo

    Abstract: In digital pathology, the traditional method for deep learning-based image segmentation typically involves a two-stage process: initially segmenting high-resolution whole slide images (WSI) into smaller patches (e.g., 256x256, 512x512, 1024x1024) and subsequently reconstructing them to their original scale. This method often struggles to capture the complex details and vast scope of WSIs. In this… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

  4. arXiv:2407.00596  [pdf, other

    eess.IV cs.CV

    HATs: Hierarchical Adaptive Taxonomy Segmentation for Panoramic Pathology Image Analysis

    Authors: Ruining Deng, Quan Liu, Can Cui, Tianyuan Yao, Juming Xiong, Shunxing Bao, Hao Li, Mengmeng Yin, Yu Wang, Shilin Zhao, Yucheng Tang, Haichun Yang, Yuankai Huo

    Abstract: Panoramic image segmentation in computational pathology presents a remarkable challenge due to the morphologically complex and variably scaled anatomy. For instance, the intricate organization in kidney pathology spans multiple layers, from regions like the cortex and medulla to functional units such as glomeruli, tubules, and vessels, down to various cell types. In this paper, we propose a novel… ▽ More

    Submitted 30 June, 2024; originally announced July 2024.

    Comments: arXiv admin note: text overlap with arXiv:2402.19286

  5. arXiv:2407.00030  [pdf, other

    cs.DC cs.PF

    On Orchestrating Parallel Broadcasts for Distributed Ledgers

    Authors: Peiyao Sheng, Chenyuan Wu, Dahlia Malkhi, Michael K. Reiter, Chrysoula Stathakopoulou, Michael Wei, Maofan Yin

    Abstract: This paper introduces and develops the concept of ``ticketing'', through which atomic broadcasts are orchestrated by nodes in a distributed system. The paper studies different ticketing regimes that allow parallelism, yet prevent slow nodes from hampering overall progress. It introduces a hybrid scheme which combines managed and unmanaged ticketing regimes, striking a balance between adaptivity an… ▽ More

    Submitted 17 May, 2024; originally announced July 2024.

  6. arXiv:2406.12404  [pdf

    cs.CV

    Scan-to-BIM for As-built Roads: Automatic Road Digital Twinning from Semantically Labeled Point Cloud Data

    Authors: Yuexiong Ding, Mengtian Yin, Ran Wei, Ioannis Brilakis, Muyang Liu, Xiaowei Luo

    Abstract: Creating geometric digital twins (gDT) for as-built roads still faces many challenges, such as low automation level and accuracy, limited asset types and shapes, and reliance on engineering experience. A novel scan-to-building information modeling (scan-to-BIM) framework is proposed for automatic road gDT creation based on semantically labeled point cloud data (PCD), which considers six asset type… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

  7. arXiv:2406.05590  [pdf, other

    cs.CR cs.AI cs.CY cs.LG

    NYU CTF Dataset: A Scalable Open-Source Benchmark Dataset for Evaluating LLMs in Offensive Security

    Authors: Minghao Shao, Sofija Jancheska, Meet Udeshi, Brendan Dolan-Gavitt, Haoran Xi, Kimberly Milner, Boyuan Chen, Max Yin, Siddharth Garg, Prashanth Krishnamurthy, Farshad Khorrami, Ramesh Karri, Muhammad Shafique

    Abstract: Large Language Models (LLMs) are being deployed across various domains today. However, their capacity to solve Capture the Flag (CTF) challenges in cybersecurity has not been thoroughly evaluated. To address this, we develop a novel method to assess LLMs in solving CTF challenges by creating a scalable, open-source benchmark database specifically designed for these applications. This database incl… ▽ More

    Submitted 8 June, 2024; originally announced June 2024.

  8. arXiv:2406.01838  [pdf, other

    cs.LG cs.AI

    Learning the Target Network in Function Space

    Authors: Kavosh Asadi, Yao Liu, Shoham Sabach, Ming Yin, Rasool Fakoor

    Abstract: We focus on the task of learning the value function in the reinforcement learning (RL) setting. This task is often solved by updating a pair of online and target networks while ensuring that the parameters of these two networks are equivalent. We propose Lookahead-Replicate (LR), a new value-function approximation algorithm that is agnostic to this parameter-space equivalence. Instead, the LR algo… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: Accepted to International Conference on Machine Learning (ICML24)

  9. arXiv:2405.20495  [pdf, other

    cs.CL cs.LG

    Transfer Q Star: Principled Decoding for LLM Alignment

    Authors: Souradip Chakraborty, Soumya Suvra Ghosal, Ming Yin, Dinesh Manocha, Mengdi Wang, Amrit Singh Bedi, Furong Huang

    Abstract: Aligning foundation models is essential for their safe and trustworthy deployment. However, traditional fine-tuning methods are computationally intensive and require updating billions of model parameters. A promising alternative, alignment via decoding, adjusts the response distribution directly without model updates to maximize a target reward $r$, thus providing a lightweight and adaptable frame… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

  10. arXiv:2405.17795  [pdf, other

    cs.IR

    Dataset Regeneration for Sequential Recommendation

    Authors: Mingjia Yin, Hao Wang, Wei Guo, Yong Liu, Suojuan Zhang, Sirui Zhao, Defu Lian, Enhong Chen

    Abstract: The sequential recommender (SR) system is a crucial component of modern recommender systems, as it aims to capture the evolving preferences of users. Significant efforts have been made to enhance the capabilities of SR systems. These methods typically follow the model-centric paradigm, which involves developing effective models based on fixed datasets. However, this approach often overlooks potent… ▽ More

    Submitted 3 June, 2024; v1 submitted 27 May, 2024; originally announced May 2024.

  11. arXiv:2405.16345  [pdf

    cs.DB cs.ET cs.IR

    Cypher4BIM: Releasing the Power of Graph for Building Knowledge Discovery

    Authors: Junxiang Zhu, Nicholas Nisbet, Mengtian Yin, Ran Wei, Ioannis Brilakis

    Abstract: Graph is considered a promising way for managing building information. A new graphic form of IFC (Industry Foundation Classes) data has just been developed, referred to as IFC-Graph. However, understanding of IFC-Graph is insufficient, especially for information query. This study aims to explore graphic building information query and develop a graph query language tailored for IFC-Graph. A series… ▽ More

    Submitted 25 May, 2024; originally announced May 2024.

  12. arXiv:2405.12473  [pdf, other

    cs.IR cs.AI

    Learning Partially Aligned Item Representation for Cross-Domain Sequential Recommendation

    Authors: Mingjia Yin, Hao Wang, Wei Guo, Yong Liu, Zhi Li, Sirui Zhao, Defu Lian, Enhong Chen

    Abstract: Cross-domain sequential recommendation (CDSR) aims to uncover and transfer users' sequential preferences across multiple recommendation domains. While significant endeavors have been made, they primarily concentrated on developing advanced transfer modules and aligning user representations using self-supervised learning techniques. However, the problem of aligning item representations has received… ▽ More

    Submitted 3 June, 2024; v1 submitted 20 May, 2024; originally announced May 2024.

  13. arXiv:2404.17069  [pdf, other

    cs.IT cs.LG eess.SP

    Channel Modeling for FR3 Upper Mid-band via Generative Adversarial Networks

    Authors: Yaqi Hu, Mingsheng Yin, Marco Mezzavilla, Hao Guo, Sundeep Rangan

    Abstract: The upper mid-band (FR3) has been recently attracting interest for new generation of mobile networks, as it provides a promising balance between spectrum availability and coverage, which are inherent limitations of the sub 6GHz and millimeter wave bands, respectively. In order to efficiently design and optimize the network, channel modeling plays a key role since FR3 systems are expected to operat… ▽ More

    Submitted 25 April, 2024; originally announced April 2024.

  14. arXiv:2404.13528  [pdf, other

    cs.LG cs.AI cs.DC

    SmartMem: Layout Transformation Elimination and Adaptation for Efficient DNN Execution on Mobile

    Authors: Wei Niu, Md Musfiqur Rahman Sanim, Zhihao Shu, Jiexiong Guan, Xipeng Shen, Miao Yin, Gagan Agrawal, Bin Ren

    Abstract: This work is motivated by recent developments in Deep Neural Networks, particularly the Transformer architectures underlying applications such as ChatGPT, and the need for performing inference on mobile devices. Focusing on emerging transformers (specifically the ones with computationally efficient Swin-like architectures) and large models (e.g., Stable Diffusion and LLMs) based on transformers, w… ▽ More

    Submitted 21 April, 2024; originally announced April 2024.

  15. arXiv:2404.13470  [pdf, other

    cs.DC cs.AI

    GWLZ: A Group-wise Learning-based Lossy Compression Framework for Scientific Data

    Authors: Wenqi Jia, Sian Jin, Jinzhen Wang, Wei Niu, Dingwen Tao, Miao Yin

    Abstract: The rapid expansion of computational capabilities and the ever-growing scale of modern HPC systems present formidable challenges in managing exascale scientific data. Faced with such vast datasets, traditional lossless compression techniques prove insufficient in reducing data size to a manageable level while preserving all information intact. In response, researchers have turned to error-bounded… ▽ More

    Submitted 20 April, 2024; originally announced April 2024.

  16. arXiv:2404.11871  [pdf, other

    cs.CV

    Group-On: Boosting One-Shot Segmentation with Supportive Query

    Authors: Hanjing Zhou, Mingze Yin, JinTai Chen, Danny Chen, Jian Wu

    Abstract: One-shot semantic segmentation aims to segment query images given only ONE annotated support image of the same class. This task is challenging because target objects in the support and query images can be largely different in appearance and pose (i.e., intra-class variation). Prior works suggested that incorporating more annotated support images in few-shot settings boosts performances but increas… ▽ More

    Submitted 17 April, 2024; originally announced April 2024.

  17. arXiv:2404.04057  [pdf, other

    cs.LG cs.AI cs.CV stat.ML

    Score identity Distillation: Exponentially Fast Distillation of Pretrained Diffusion Models for One-Step Generation

    Authors: Mingyuan Zhou, Huangjie Zheng, Zhendong Wang, Mingzhang Yin, Hai Huang

    Abstract: We introduce Score identity Distillation (SiD), an innovative data-free method that distills the generative capabilities of pretrained diffusion models into a single-step generator. SiD not only facilitates an exponentially fast reduction in Fréchet inception distance (FID) during distillation but also approaches or even exceeds the FID performance of the original teacher diffusion models. By refo… ▽ More

    Submitted 24 May, 2024; v1 submitted 5 April, 2024; originally announced April 2024.

    Comments: ICML 2024, PyTorch implementation: https://github.com/mingyuanzhou/SiD

  18. arXiv:2404.00268  [pdf, other

    cs.IR

    A Unified Framework for Adaptive Representation Enhancement and Inversed Learning in Cross-Domain Recommendation

    Authors: Luankang Zhang, Hao Wang, Suojuan Zhang, Mingjia Yin, Yongqiang Han, Jiaqing Zhang, Defu Lian, Enhong Chen

    Abstract: Cross-domain recommendation (CDR), aiming to extract and transfer knowledge across domains, has attracted wide attention for its efficacy in addressing data sparsity and cold-start problems. Despite significant advances in representation disentanglement to capture diverse user preferences, existing methods usually neglect representation enhancement and lack rigorous decoupling constraints, thereby… ▽ More

    Submitted 30 March, 2024; originally announced April 2024.

    Comments: Accepted by DASFAA 2024

  19. arXiv:2403.16812  [pdf, other

    cs.HC cs.AI

    Towards Human-AI Deliberation: Design and Evaluation of LLM-Empowered Deliberative AI for AI-Assisted Decision-Making

    Authors: Shuai Ma, Qiaoyi Chen, Xinru Wang, Chengbo Zheng, Zhenhui Peng, Ming Yin, Xiaojuan Ma

    Abstract: In AI-assisted decision-making, humans often passively review AI's suggestion and decide whether to accept or reject it as a whole. In such a paradigm, humans are found to rarely trigger analytical thinking and face difficulties in communicating the nuances of conflicting opinions to the AI when disagreements occur. To tackle this challenge, we propose Human-AI Deliberation, a novel framework to p… ▽ More

    Submitted 25 March, 2024; originally announced March 2024.

  20. arXiv:2403.12004  [pdf, other

    cs.HC

    The Value, Benefits, and Concerns of Generative AI-Powered Assistance in Writing

    Authors: Zhuoyan Li, Chen Liang, Jing Peng, Ming Yin

    Abstract: Recent advances in generative AI technologies like large language models raise both excitement and concerns about the future of human-AI co-creation in writing. To unpack people's attitude towards and experience with generative AI-powered writing assistants, in this paper, we conduct an experiment to understand whether and how much value people attach to AI assistance, and how the incorporation of… ▽ More

    Submitted 18 March, 2024; originally announced March 2024.

    Comments: CHI 2024

  21. arXiv:2403.11574  [pdf, ps, other

    cs.LG

    Offline Multitask Representation Learning for Reinforcement Learning

    Authors: Haque Ishfaq, Thanh Nguyen-Tang, Songtao Feng, Raman Arora, Mengdi Wang, Ming Yin, Doina Precup

    Abstract: We study offline multitask representation learning in reinforcement learning (RL), where a learner is provided with an offline dataset from different tasks that share a common representation and is asked to learn the shared representation. We theoretically investigate offline multitask low-rank RL, and propose a new algorithm called MORL for offline multitask representation learning. Furthermore,… ▽ More

    Submitted 18 March, 2024; originally announced March 2024.

  22. arXiv:2403.10850  [pdf, other

    cs.RO cs.AI

    GAgent: An Adaptive Rigid-Soft Gripping Agent with Vision Language Models for Complex Lighting Environments

    Authors: Zhuowei Li, Miao Zhang, Xiaotian Lin, Meng Yin, Shuai Lu, Xueqian Wang

    Abstract: This paper introduces GAgent: an Gripping Agent designed for open-world environments that provides advanced cognitive abilities via VLM agents and flexible grasping abilities with variable stiffness soft grippers. GAgent comprises three primary components - Prompt Engineer module, Visual-Language Model (VLM) core and Workflow module. These three modules enhance gripper success rates by recognizing… ▽ More

    Submitted 16 March, 2024; originally announced March 2024.

  23. arXiv:2403.09552  [pdf, other

    cs.HC

    "Are You Really Sure?" Understanding the Effects of Human Self-Confidence Calibration in AI-Assisted Decision Making

    Authors: Shuai Ma, Xinru Wang, Ying Lei, Chuhan Shi, Ming Yin, Xiaojuan Ma

    Abstract: In AI-assisted decision-making, it is crucial but challenging for humans to achieve appropriate reliance on AI. This paper approaches this problem from a human-centered perspective, "human self-confidence calibration". We begin by proposing an analytical framework to highlight the importance of calibrated human self-confidence. In our first study, we explore the relationship between human self-con… ▽ More

    Submitted 14 March, 2024; originally announced March 2024.

  24. arXiv:2403.04530  [pdf, other

    cs.GT econ.TH

    Multi-District School Choice: Playing on Several Fields

    Authors: Yannai A. Gonczarowski, Michael Yin, Shirley Zhang

    Abstract: We extend the seminal model of Pathak and Sönmez (2008) to a setting with multiple school districts, each running its own separate centralized match, and focus on the case of two districts. In our setting, in addition to each student being either sincere or sophisticated, she is also either constrained - able to apply only to schools within her own district of residence - or unconstrained - able t… ▽ More

    Submitted 7 March, 2024; originally announced March 2024.

  25. arXiv:2403.03149  [pdf, other

    cs.CR cs.DC cs.LG

    Robust Federated Learning Mitigates Client-side Training Data Distribution Inference Attacks

    Authors: Yichang Xu, Ming Yin, Minghong Fang, Neil Zhenqiang Gong

    Abstract: Recent studies have revealed that federated learning (FL), once considered secure due to clients not sharing their private data with the server, is vulnerable to attacks such as client-side training data distribution inference, where a malicious client can recreate the victim's data. While various countermeasures exist, they are not practical, often assuming server access to some training data or… ▽ More

    Submitted 4 April, 2024; v1 submitted 5 March, 2024; originally announced March 2024.

    Comments: To appear in The Web Conference 2024 (WWW '24)

  26. arXiv:2403.01791  [pdf, other

    cs.HC cs.AI

    Beyond Recommender: An Exploratory Study of the Effects of Different AI Roles in AI-Assisted Decision Making

    Authors: Shuai Ma, Chenyi Zhang, Xinru Wang, Xiaojuan Ma, Ming Yin

    Abstract: Artificial Intelligence (AI) is increasingly employed in various decision-making tasks, typically as a Recommender, providing recommendations that the AI deems correct. However, recent studies suggest this may diminish human analytical thinking and lead to humans' inappropriate reliance on AI, impairing the synergy in human-AI teams. In contrast, human advisors in group decision-making perform var… ▽ More

    Submitted 4 March, 2024; originally announced March 2024.

  27. arXiv:2402.19286  [pdf, other

    eess.IV cs.CV

    PrPSeg: Universal Proposition Learning for Panoramic Renal Pathology Segmentation

    Authors: Ruining Deng, Quan Liu, Can Cui, Tianyuan Yao, Jialin Yue, Juming Xiong, Lining Yu, Yifei Wu, Mengmeng Yin, Yu Wang, Shilin Zhao, Yucheng Tang, Haichun Yang, Yuankai Huo

    Abstract: Understanding the anatomy of renal pathology is crucial for advancing disease diagnostics, treatment evaluation, and clinical research. The complex kidney system comprises various components across multiple levels, including regions (cortex, medulla), functional units (glomeruli, tubules), and cells (podocytes, mesangial cells in glomerulus). Prior studies have predominantly overlooked the intrica… ▽ More

    Submitted 20 March, 2024; v1 submitted 29 February, 2024; originally announced February 2024.

    Comments: IEEE / CVF Computer Vision and Pattern Recognition Conference 2024

  28. arXiv:2402.11637  [pdf, other

    cs.CR cs.IR cs.LG

    Poisoning Federated Recommender Systems with Fake Users

    Authors: Ming Yin, Yichang Xu, Minghong Fang, Neil Zhenqiang Gong

    Abstract: Federated recommendation is a prominent use case within federated learning, yet it remains susceptible to various attacks, from user to server-side vulnerabilities. Poisoning attacks are particularly notable among user-side attacks, as participants upload malicious model updates to deceive the global model, often intending to promote or demote specific targeted items. This study investigates strat… ▽ More

    Submitted 18 February, 2024; originally announced February 2024.

    Comments: To appear in The Web Conference 2024 (WWW '24)

  29. arXiv:2402.10516  [pdf, other

    q-bio.BM cs.AI cs.LG

    Generative AI for Controllable Protein Sequence Design: A Survey

    Authors: Yiheng Zhu, Zitai Kong, Jialu Wu, Weize Liu, Yuqiang Han, Mingze Yin, Hongxia Xu, Chang-Yu Hsieh, Tingjun Hou

    Abstract: The design of novel protein sequences with targeted functionalities underpins a central theme in protein engineering, impacting diverse fields such as drug discovery and enzymatic engineering. However, navigating this vast combinatorial search space remains a severe challenge due to time and financial constraints. This scenario is rapidly evolving as the transformative advancements in AI, particul… ▽ More

    Submitted 16 February, 2024; originally announced February 2024.

    Comments: 9 pages

  30. arXiv:2402.07250  [pdf, other

    cs.LG cs.AI cs.CE

    DIMON: Learning Solution Operators of Partial Differential Equations on a Diffeomorphic Family of Domains

    Authors: Minglang Yin, Nicolas Charon, Ryan Brody, Lu Lu, Natalia Trayanova, Mauro Maggioni

    Abstract: The solution of a PDE over varying initial/boundary conditions on multiple domains is needed in a wide variety of applications, but it is computationally expensive if the solution is computed de novo whenever the initial/boundary conditions of the domain change. We introduce a general operator learning framework, called DIffeomorphic Mapping Operator learNing (DIMON) to learn approximate PDE solut… ▽ More

    Submitted 11 February, 2024; originally announced February 2024.

  31. arXiv:2401.15603  [pdf, other

    cs.LG cs.SI

    Improving Expressive Power of Spectral Graph Neural Networks with Eigenvalue Correction

    Authors: Kangkang Lu, Yanhua Yu, Hao Fei, Xuan Li, Zixuan Yang, Zirui Guo, Meiyu Liang, Mengran Yin, Tat-Seng Chua

    Abstract: In recent years, spectral graph neural networks, characterized by polynomial filters, have garnered increasing attention and have achieved remarkable performance in tasks such as node classification. These models typically assume that eigenvalues for the normalized Laplacian matrix are distinct from each other, thus expecting a polynomial filter to have a high fitting ability. However, this paper… ▽ More

    Submitted 18 March, 2024; v1 submitted 28 January, 2024; originally announced January 2024.

    Comments: Accepted by AAAI-24

  32. arXiv:2401.10341  [pdf, other

    cs.CV cs.AI

    ELRT: Efficient Low-Rank Training for Compact Convolutional Neural Networks

    Authors: Yang Sui, Miao Yin, Yu Gong, Jinqi Xiao, Huy Phan, Bo Yuan

    Abstract: Low-rank compression, a popular model compression technique that produces compact convolutional neural networks (CNNs) with low rankness, has been well-studied in the literature. On the other hand, low-rank training, as an alternative way to train low-rank CNNs from scratch, has been exploited little yet. Unlike low-rank compression, low-rank training does not need pre-trained full-rank models, an… ▽ More

    Submitted 18 January, 2024; originally announced January 2024.

  33. arXiv:2401.07058  [pdf, other

    cs.HC cs.AI

    Does More Advice Help? The Effects of Second Opinions in AI-Assisted Decision Making

    Authors: Zhuoran Lu, Dakuo Wang, Ming Yin

    Abstract: AI assistance in decision-making has become popular, yet people's inappropriate reliance on AI often leads to unsatisfactory human-AI collaboration performance. In this paper, through three pre-registered, randomized human subject experiments, we explore whether and how the provision of {second opinions} may affect decision-makers' behavior and performance in AI-assisted decision-making. We find t… ▽ More

    Submitted 13 January, 2024; originally announced January 2024.

  34. arXiv:2401.05840  [pdf, other

    cs.HC cs.AI

    Decoding AI's Nudge: A Unified Framework to Predict Human Behavior in AI-assisted Decision Making

    Authors: Zhuoyan Li, Zhuoran Lu, Ming Yin

    Abstract: With the rapid development of AI-based decision aids, different forms of AI assistance have been increasingly integrated into the human decision making processes. To best support humans in decision making, it is essential to quantitatively understand how diverse forms of AI assistance influence humans' decision making behavior. To this end, much of the current research focuses on the end-to-end pr… ▽ More

    Submitted 11 January, 2024; originally announced January 2024.

    Comments: AAAI 2024

  35. arXiv:2312.15877  [pdf, other

    cs.AI cs.LO

    PBCounter: Weighted Model Counting on Pseudo-Boolean Formulas

    Authors: Yong Lai, Zhenghang Xu, Minghao Yin

    Abstract: In Weighted Model Counting (WMC), we assign weights to literals and compute the sum of the weights of the models of a given propositional formula where the weight of an assignment is the product of the weights of its literals. The current WMC solvers work on Conjunctive Normal Form (CNF) formulas. However, CNF is not a natural representation for human-being in many applications. Motivated by the s… ▽ More

    Submitted 25 December, 2023; originally announced December 2023.

  36. arXiv:2312.15717  [pdf, other

    cs.AI cs.CY cs.LG

    Spatial-Temporal Interplay in Human Mobility: A Hierarchical Reinforcement Learning Approach with Hypergraph Representation

    Authors: Zhaofan Zhang, Yanan Xiao, Lu Jiang, Dingqi Yang, Minghao Yin, Pengyang Wang

    Abstract: In the realm of human mobility, the decision-making process for selecting the next-visit location is intricately influenced by a trade-off between spatial and temporal constraints, which are reflective of individual needs and preferences. This trade-off, however, varies across individuals, making the modeling of these spatial-temporal dynamics a formidable challenge. To address the problem, in thi… ▽ More

    Submitted 25 December, 2023; originally announced December 2023.

    Comments: Accepted to AAAI 2024

  37. arXiv:2311.16502  [pdf, other

    cs.CL cs.AI cs.CV

    MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI

    Authors: Xiang Yue, Yuansheng Ni, Kai Zhang, Tianyu Zheng, Ruoqi Liu, Ge Zhang, Samuel Stevens, Dongfu Jiang, Weiming Ren, Yuxuan Sun, Cong Wei, Botao Yu, Ruibin Yuan, Renliang Sun, Ming Yin, Boyuan Zheng, Zhenzhu Yang, Yibo Liu, Wenhao Huang, Huan Sun, Yu Su, Wenhu Chen

    Abstract: We introduce MMMU: a new benchmark designed to evaluate multimodal models on massive multi-discipline tasks demanding college-level subject knowledge and deliberate reasoning. MMMU includes 11.5K meticulously collected multimodal questions from college exams, quizzes, and textbooks, covering six core disciplines: Art & Design, Business, Science, Health & Medicine, Humanities & Social Science, and… ▽ More

    Submitted 13 June, 2024; v1 submitted 27 November, 2023; originally announced November 2023.

    Comments: CVPR 2024 Oral

  38. arXiv:2311.10357  [pdf, ps, other

    quant-ph cs.DS

    Fast algorithms for classical specifications of stabiliser states and Clifford gates

    Authors: Nadish de Silva, Wilfred Salmon, Ming Yin

    Abstract: The stabiliser formalism plays a central role in quantum computing, error correction, and fault-tolerance. Stabiliser states are used to encode computational basis states. Clifford gates are those which can be easily performed fault-tolerantly in the most common error correction schemes. Their mathematical properties are the subject of significant research interest. Conversions between and verif… ▽ More

    Submitted 26 May, 2024; v1 submitted 17 November, 2023; originally announced November 2023.

    Comments: Python implementations available at https://github.com/ndesilva/stabiliser-tools. New in v2: new algorithm for extracting the stabiliser tableau of a Clifford gate matrix that is exponentially faster compared to v1, more thorough complexity analyses. New in v3: new and faster algorithms, comparisons with existing implementations

  39. APGL4SR: A Generic Framework with Adaptive and Personalized Global Collaborative Information in Sequential Recommendation

    Authors: Mingjia Yin, Hao Wang, Xiang Xu, Likang Wu, Sirui Zhao, Wei Guo, Yong Liu, Ruiming Tang, Defu Lian, Enhong Chen

    Abstract: The sequential recommendation system has been widely studied for its promising effectiveness in capturing dynamic preferences buried in users' sequential behaviors. Despite the considerable achievements, existing methods usually focus on intra-sequence modeling while overlooking exploiting global collaborative information by inter-sequence modeling, resulting in inferior recommendation performance… ▽ More

    Submitted 5 November, 2023; originally announced November 2023.

  40. arXiv:2310.18919  [pdf, other

    cs.LG cs.AI stat.ML

    Posterior Sampling with Delayed Feedback for Reinforcement Learning with Linear Function Approximation

    Authors: Nikki Lijing Kuang, Ming Yin, Mengdi Wang, Yu-Xiang Wang, Yi-An Ma

    Abstract: Recent studies in reinforcement learning (RL) have made significant progress by leveraging function approximation to alleviate the sample complexity hurdle for better performance. Despite the success, existing provably efficient algorithms typically rely on the accessibility of immediate feedback upon taking actions. The failure to account for the impact of delay in observations can significantly… ▽ More

    Submitted 3 November, 2023; v1 submitted 29 October, 2023; originally announced October 2023.

  41. arXiv:2310.12026  [pdf, other

    stat.ML cs.LG stat.AP

    Nonparametric Discrete Choice Experiments with Machine Learning Guided Adaptive Design

    Authors: Mingzhang Yin, Ruijiang Gao, Weiran Lin, Steven M. Shugan

    Abstract: Designing products to meet consumers' preferences is essential for a business's success. We propose the Gradient-based Survey (GBS), a discrete choice experiment for multiattribute product design. The experiment elicits consumer preferences through a sequence of paired comparisons for partial profiles. GBS adaptively constructs paired comparison questions based on the respondents' previous choices… ▽ More

    Submitted 18 October, 2023; originally announced October 2023.

  42. arXiv:2310.08824  [pdf, other

    cs.HC stat.ML

    Confounding-Robust Policy Improvement with Human-AI Teams

    Authors: Ruijiang Gao, Mingzhang Yin

    Abstract: Human-AI collaboration has the potential to transform various domains by leveraging the complementary strengths of human experts and Artificial Intelligence (AI) systems. However, unobserved confounding can undermine the effectiveness of this collaboration, leading to biased and unreliable outcomes. In this paper, we propose a novel solution to address unobserved confounding in human-AI collaborat… ▽ More

    Submitted 12 October, 2023; originally announced October 2023.

    Comments: 24 pages

  43. arXiv:2310.07849  [pdf, other

    cs.CL cs.AI

    Synthetic Data Generation with Large Language Models for Text Classification: Potential and Limitations

    Authors: Zhuoyan Li, Hangxiao Zhu, Zhuoran Lu, Ming Yin

    Abstract: The collection and curation of high-quality training data is crucial for developing text classification models with superior performance, but it is often associated with significant costs and time investment. Researchers have recently explored using large language models (LLMs) to generate synthetic datasets as an alternative approach. However, the effectiveness of the LLM-generated synthetic data… ▽ More

    Submitted 12 October, 2023; v1 submitted 11 October, 2023; originally announced October 2023.

    Comments: EMNLP 2023

  44. arXiv:2310.06335  [pdf, other

    cs.DC

    BBCA-CHAIN: Low Latency, High Throughput BFT Consensus on a DAG

    Authors: Dahlia Malkhi, Chrysoula Stathakopoulou, Maofan Yin

    Abstract: This paper presents a partially synchronous BFT consensus protocol powered by BBCA, a lightly modified Byzantine Consistent Broadcast (BCB) primitive. BBCA provides a Complete-Adopt semantic through an added probing interface to allow either aborting the broadcast by correct nodes or exclusively, adopting the message consistently in case of a potential delivery. It does not introduce any extra typ… ▽ More

    Submitted 24 May, 2024; v1 submitted 10 October, 2023; originally announced October 2023.

  45. arXiv:2309.16251  [pdf, other

    cs.HC

    The effect of 3D stereopsis and hand-tool alignment on learning effectiveness and skill transfer of a VR-based simulator for dental training

    Authors: Maximilian Kaluschke, Myat Su Yin, Peter Haddawy, Siriwan Suebnukarn, Gabriel Zachmann

    Abstract: Dental simulators gained prevalence in recent years. Important aspects distinguishing VR hardware configurations are 3D stereoscopic rendering and visual alignment of the user's hands with the virtual tools. New dental simulators are often evaluated without analysing the impact of these simulation aspects. In this paper, we seek to determine the impact of 3D stereoscopic rendering and of hand-tool… ▽ More

    Submitted 28 September, 2023; originally announced September 2023.

    Comments: 26 pages, 15 figures, Accepted at online journal PLoS ONE

    MSC Class: 62A86 (Primary) 62H30 (Secondary) ACM Class: J.3; G.3

  46. arXiv:2308.08858  [pdf, ps, other

    cs.LG cs.AI cs.GT stat.ML

    Improving Sample Efficiency of Model-Free Algorithms for Zero-Sum Markov Games

    Authors: Songtao Feng, Ming Yin, Yu-Xiang Wang, Jing Yang, Yingbin Liang

    Abstract: The problem of two-player zero-sum Markov games has recently attracted increasing interests in theoretical studies of multi-agent reinforcement learning (RL). In particular, for finite-horizon episodic Markov decision processes (MDPs), it has been shown that model-based algorithms can find an $ε$-optimal Nash Equilibrium (NE) with the sample complexity of $O(H^3SAB/ε^2)$, which is optimal in the d… ▽ More

    Submitted 5 June, 2024; v1 submitted 17 August, 2023; originally announced August 2023.

  47. arXiv:2308.02665  [pdf, other

    cs.AI

    Let's Give a Voice to Conversational Agents in Virtual Reality

    Authors: Michele Yin, Gabriel Roccabruna, Abhinav Azad, Giuseppe Riccardi

    Abstract: The dialogue experience with conversational agents can be greatly enhanced with multimodal and immersive interactions in virtual reality. In this work, we present an open-source architecture with the goal of simplifying the development of conversational agents operating in virtual environments. The architecture offers the possibility of plugging in conversational agents of different domains and ad… ▽ More

    Submitted 4 August, 2023; originally announced August 2023.

  48. arXiv:2306.14757  [pdf, other

    cs.DC

    BBCA-LEDGER: High Throughput Consensus meets Low Latency

    Authors: Chrysoula Stathakopoulou, Michael Wei, Maofan Yin, Hongbo Zhang, Dahlia Malkhi

    Abstract: This paper presents BBCA-LEDGER, a Byzantine log replication technology for partially synchronous networks enabling blocks to be broadcast in parallel, such that each broadcast is finalized independently and instantaneously into an individual slot in the log. Every finalized broadcast is eventually committed to the total ordering, so that all network bandwidth has utility in disseminating blocks.… ▽ More

    Submitted 26 June, 2023; originally announced June 2023.

  49. arXiv:2306.14063  [pdf, other

    cs.LG cs.AI

    Offline Policy Evaluation for Reinforcement Learning with Adaptively Collected Data

    Authors: Sunil Madhow, Dan Qiao, Ming Yin, Yu-Xiang Wang

    Abstract: Developing theoretical guarantees on the sample complexity of offline RL methods is an important step towards making data-hungry RL algorithms practically viable. Currently, most results hinge on unrealistic assumptions about the data distribution -- namely that it comprises a set of i.i.d. trajectories collected by a single logging policy. We consider a more general setting where the dataset may… ▽ More

    Submitted 30 April, 2024; v1 submitted 24 June, 2023; originally announced June 2023.

  50. arXiv:2306.07992  [pdf, other

    cs.CV cs.AI cs.CR cs.LG

    Securing Visually-Aware Recommender Systems: An Adversarial Image Reconstruction and Detection Framework

    Authors: Minglei Yin, Bin Liu, Neil Zhenqiang Gong, Xin Li

    Abstract: With rich visual data, such as images, becoming readily associated with items, visually-aware recommendation systems (VARS) have been widely used in different applications. Recent studies have shown that VARS are vulnerable to item-image adversarial attacks, which add human-imperceptible perturbations to the clean images associated with those items. Attacks on VARS pose new security challenges to… ▽ More

    Submitted 11 June, 2023; originally announced June 2023.