Skip to main content

Showing 1–50 of 1,162 results for author: Tan, C

  1. arXiv:2407.11845  [pdf, other

    astro-ph.GA astro-ph.SR

    Asymmetric Kinematics in Young Clusters: The λ Ori Cluster

    Authors: Joseph J. Armstrong, Jonathan C. Tan

    Abstract: Context. Most stars form in clusters or associations but only a small number of these groups are expected to remain bound for longer than a few Myr. Once star formation has ended and the molecular gas around young stellar objects has been expelled via feedback processes, most initially bound young clusters lose the majority of their binding mass and begin to disperse into the Galactic field. Aims.… ▽ More

    Submitted 16 July, 2024; originally announced July 2024.

    Comments: 20 pages, 17 figures, submitted to A&A

  2. arXiv:2407.10058  [pdf, other

    cs.CL cs.AI

    Learning to Refuse: Towards Mitigating Privacy Risks in LLMs

    Authors: Zhenhua Liu, Tong Zhu, Chuanyuan Tan, Wenliang Chen

    Abstract: Large language models (LLMs) exhibit remarkable capabilities in understanding and generating natural language. However, these models can inadvertently memorize private information, posing significant privacy risks. This study addresses the challenge of enabling LLMs to protect specific individuals' private data without the need for complete retraining. We propose \return, a Real-world pErsonal daT… ▽ More

    Submitted 13 July, 2024; originally announced July 2024.

  3. arXiv:2407.09949  [pdf, other

    astro-ph.GA

    The formation of supermassive black holes from Population III.1 seeds. III. Galaxy evolution and black hole growth from semi-analytic modelling

    Authors: Vieri Cammelli, Pierluigi Monaco, Jonathan C. Tan, Jasbir Singh, Fabio Fontanot, Gabriella De Lucia, Michaela Hirschmann, Lizhi Xie

    Abstract: We present an implementation of Pop III.1 seeding of supermassive black holes (SMBHs) in a theoretical model of galaxy formation and evolution to assess the growth the SMBH population and the properties of the host galaxies. The model of Pop III.1 seeding involves SMBH formation at redshifts $z\gtrsim 20$ in dark matter minihalos that are isolated from external radiative feedback, parameterized by… ▽ More

    Submitted 13 July, 2024; originally announced July 2024.

    Comments: Submitted to MNRAS, comments welcome

  4. arXiv:2407.09045  [pdf, other

    cs.IR cs.AI

    Time-Frequency Analysis of Variable-Length WiFi CSI Signals for Person Re-Identification

    Authors: Chen Mao, Chong Tan, Jingqi Hu, Min Zheng

    Abstract: Person re-identification (ReID), as a crucial technology in the field of security, plays an important role in security detection and people counting. Current security and monitoring systems largely rely on visual information, which may infringe on personal privacy and be susceptible to interference from pedestrian appearances and clothing in certain scenarios. Meanwhile, the widespread use of rout… ▽ More

    Submitted 12 July, 2024; originally announced July 2024.

  5. arXiv:2407.07480  [pdf, other

    astro-ph.HE

    The discovery of a nearby 421~s transient with CHIME/FRB/Pulsar

    Authors: Fengqiu Adam Dong, Tracy Clarke, Alice P. Curtin, Ajay Kumar, Ingrid Stairs, Shami Chatterjee, Amanda M. Cook, Emmanuel Fonseca, B. M. Gaensler, Jason W. T. Hessels, Victoria M. Kaspi, Mattias Lazda, Kiyoshi W. Masui, James W. McKee, Bradley W. Meyers, Aaron B. Pearlman, Scott M. Ransom, Paul Scholz, Kaitlyn Shin, Kendrick M. Smith, Chia Min Tan

    Abstract: Neutron stars and white dwarfs are both dense remnants of post-main-sequence stars. Pulsars, magnetars and strongly magnetised white dwarfs have all been seen to been observed to exhibit coherent, pulsed radio emission in relation to their rotational period. Recently, a new type of radio long period transient (LPT) has been discovered. The bright radio emission of LPTs resembles that of radio puls… ▽ More

    Submitted 10 July, 2024; originally announced July 2024.

    Comments: Submitted

  6. arXiv:2407.05410  [pdf, other

    cs.SE cs.DB cs.LG cs.LO

    Synthetic Test Data Generation Using Recurrent Neural Networks: A Position Paper

    Authors: Razieh Behjati, Erik Arisholm, Chao Tan, Margrethe M. Bedregal

    Abstract: Testing in production-like test environments is an essential part of quality assurance processes in many industries. Provisioning of such test environments, for information-intensive services, involves setting up databases that are rich-enough to enable simulating a wide variety of user scenarios. While production data is perhaps the gold-standard here, many organizations, particularly within the… ▽ More

    Submitted 7 July, 2024; originally announced July 2024.

    Comments: This paper was published in the proceedings of RAISE@ICSE in 2019

    Journal ref: Proceedings of the 7th International Workshop on Realizing Artificial Intelligence Synergies in Software Engineering, RAISE@ICSE 2019, (2019), 22-27

  7. arXiv:2407.04069  [pdf, other

    cs.CL cs.AI cs.LG

    A Systematic Survey and Critical Review on Evaluating Large Language Models: Challenges, Limitations, and Recommendations

    Authors: Md Tahmid Rahman Laskar, Sawsan Alqahtani, M Saiful Bari, Mizanur Rahman, Mohammad Abdullah Matin Khan, Haidar Khan, Israt Jahan, Amran Bhuiyan, Chee Wei Tan, Md Rizwan Parvez, Enamul Hoque, Shafiq Joty, Jimmy Huang

    Abstract: Large Language Models (LLMs) have recently gained significant attention due to their remarkable capabilities in performing diverse tasks across various domains. However, a thorough evaluation of these models is crucial before deploying them in real-world applications to ensure they produce reliable performance. Despite the well-established importance of evaluating LLMs in the community, the comple… ▽ More

    Submitted 4 July, 2024; originally announced July 2024.

  8. arXiv:2407.01418  [pdf, other

    cs.RO cs.AI cs.LG

    RoboPack: Learning Tactile-Informed Dynamics Models for Dense Packing

    Authors: Bo Ai, Stephen Tian, Haochen Shi, Yixuan Wang, Cheston Tan, Yunzhu Li, Jiajun Wu

    Abstract: Tactile feedback is critical for understanding the dynamics of both rigid and deformable objects in many manipulation tasks, such as non-prehensile manipulation and dense packing. We introduce an approach that combines visual and tactile sensing for robotic manipulation by learning a neural, tactile-informed dynamics model. Our proposed framework, RoboPack, employs a recurrent graph neural network… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

    Comments: Robotics: Science and Systems (RSS), 2024. Project page: https://robo-pack.github.io/

    ACM Class: I.2.9; I.2.6; I.2.10

  9. arXiv:2407.00050  [pdf, other

    q-bio.BM cs.AI cs.LG

    FoldToken2: Learning compact, invariant and generative protein structure language

    Authors: Zhangyang Gao, Cheng Tan, Stan Z. Li

    Abstract: The equivalent nature of 3D coordinates has posed long term challenges in protein structure representation learning, alignment, and generation. Can we create a compact and invariant language that equivalently represents protein structures? Towards this goal, we propose FoldToken2 to transfer equivariant structures into discrete tokens, while maintaining the recoverability of the original structure… ▽ More

    Submitted 11 June, 2024; originally announced July 2024.

  10. arXiv:2406.16603  [pdf, other

    cond-mat.mtrl-sci

    Bipolarized Weyl semimetals and quantum crystal valley Hall effect in two-dimensional altermagnetic materials

    Authors: Chao-Yang Tan, Ze-Feng Gao, Huan-Cheng Yang, Kai Liu, Peng-Jie Guo, Zhong-Yi Lu

    Abstract: Magnetism and topology are two major areas of condensed matter physics. The combination of magnetism and topology gives rise to more novel physical effects, which have attracted strongly theoretical and experimental attention. Recently, the concept of altermagnetism has been introduced, characterized by a dual nature: real-space antiferromagnetism and reciprocal-space anisotropic spin polarization… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

    Comments: 7 pages, 5 figures

  11. arXiv:2406.15238  [pdf, other

    physics.acc-ph

    Fermilab Booster Beam Emittances from Quadrupole Modes Measured by BPMs

    Authors: C. Y. Tan, M. Balcewicz

    Abstract: The measurement of beam emittances by extracting the quadrupole mode signal from a 4 plate beam position monitor (BPM) was published at least 40 years ago. Unfortunately, in practice, this method suffers from poor signal to noise ratio and requires a lot of tuning to extract out the emittances. In this paper, an improved method where multiple BPMs are used together with better mathematical analysi… ▽ More

    Submitted 21 June, 2024; originally announced June 2024.

    Comments: 15th International Particle Accelerator Conference (IPAC'24)

    Report number: FERMILAB-CONF-24-0179-AD

  12. arXiv:2406.14359  [pdf, other

    cs.NE

    Learning to Transfer for Evolutionary Multitasking

    Authors: Sheng-Hao Wu, Yuxiao Huang, Xingyu Wu, Liang Feng, Zhi-Hui Zhan, Kay Chen Tan

    Abstract: Evolutionary multitasking (EMT) is an emerging approach for solving multitask optimization problems (MTOPs) and has garnered considerable research interest. The implicit EMT is a significant research branch that utilizes evolution operators to enable knowledge transfer (KT) between tasks. However, current approaches in implicit EMT face challenges in adaptability, due to the use of a limited numbe… ▽ More

    Submitted 22 June, 2024; v1 submitted 20 June, 2024; originally announced June 2024.

    Comments: Under review

  13. arXiv:2406.14108  [pdf, other

    math.OC

    Connected Vehicle Data-driven Robust Optimization for Traffic Signal Timing: Modeling Traffic Flow Variability and Errors

    Authors: Chaopeng Tan, Yue Ding, Kaidi Yang, Hong Zhu, Keshuang Tang

    Abstract: Recent advancements in Connected Vehicle (CV) technology have prompted research on leveraging CV data for more effective traffic management. Despite the low penetration rate, such detailed CV data has demonstrated great potential in improving traffic signal performance. However, existing studies share a common shortcoming in that they all ignore traffic flow estimation errors in their modeling pro… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

    Comments: Accepted for podium session of the Conference in Emerging Technologies in Transportation Systems (TRC-30)

  14. arXiv:2406.13434  [pdf, other

    cs.RO

    Tactile Aware Dynamic Obstacle Avoidance in Crowded Environment with Deep Reinforcement Learning

    Authors: Yung Chuen Ng, Qi Wen, Lim, Chun Ye Tan, Zhen Hao Gan, Meng Yee, Chuah

    Abstract: Mobile robots operating in crowded environments require the ability to navigate among humans and surrounding obstacles efficiently while adhering to safety standards and socially compliant mannerisms. This scale of the robot navigation problem may be classified as both a local path planning and trajectory optimization problem. This work presents an array of force sensors that act as a tactile laye… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

  15. arXiv:2406.12266  [pdf, other

    cs.CL

    Towards a Client-Centered Assessment of LLM Therapists by Client Simulation

    Authors: Jiashuo Wang, Yang Xiao, Yanran Li, Changhe Song, Chunpu Xu, Chenhao Tan, Wenjie Li

    Abstract: Although there is a growing belief that LLMs can be used as therapists, exploring LLMs' capabilities and inefficacy, particularly from the client's perspective, is limited. This work focuses on a client-centered assessment of LLM therapists with the involvement of simulated clients, a standard approach in clinical medical education. However, there are two challenges when applying the approach to a… ▽ More

    Submitted 20 June, 2024; v1 submitted 18 June, 2024; originally announced June 2024.

  16. arXiv:2406.10840  [pdf, other

    cs.LG cs.AI q-bio.BM

    CBGBench: Fill in the Blank of Protein-Molecule Complex Binding Graph

    Authors: Haitao Lin, Guojiang Zhao, Odin Zhang, Yufei Huang, Lirong Wu, Zicheng Liu, Siyuan Li, Cheng Tan, Zhifeng Gao, Stan Z. Li

    Abstract: Structure-based drug design (SBDD) aims to generate potential drugs that can bind to a target protein and is greatly expedited by the aid of AI techniques in generative models. However, a lack of systematic understanding persists due to the diverse settings, complex implementation, difficult reproducibility, and task singularity. Firstly, the absence of standardization can lead to unfair compariso… ▽ More

    Submitted 16 June, 2024; originally announced June 2024.

    Comments: 9 pages main context

  17. Massive Dirac Fermions and Strong Shubnikov-de Haas Oscillations in Topological Insulator Sm,Fe:Bi2Se3 Single Crystals

    Authors: Weiyao Zhao, Chi Xuan Trang, Qile Li, Lei Chen, Zengji Yue, Abdulhakim Bake, Cheng Tan, Lan Wang, Mitchell Nancarrow, Mark Edmonds, David Cortie, Xiaolin Wang

    Abstract: Topological insulators (TIs) are emergent materials with unique band structure, which allow the study of quantum effect in solids, as well as contribute to high performance quantum devices. To achieve the better performance of TI, here we present a co-doping strategy using synergistic rare-earth Sm and transition-metal Fe dopants in Bi2Se3 single crystals, which combine the advantages of both tran… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

    Comments: 5 figures

    Journal ref: Physical Review B 104, 085153 (2021)

  18. arXiv:2406.08987  [pdf, other

    cs.NE

    Towards Next Era of Multi-objective Optimization: Large Language Models as Architects of Evolutionary Operators

    Authors: Yuxiao Huang, Shenghao Wu, Wenjie Zhang, Jibin Wu, Liang Feng, Kay Chen Tan

    Abstract: Multi-objective optimization problems (MOPs) are prevalent in various real-world applications, necessitating sophisticated solutions that balance conflicting objectives. Traditional evolutionary algorithms (EAs), while effective, often rely on domain-specific expert knowledge and iterative tuning, which can impede innovation when encountering novel MOPs. Very recently, the emergence of Large Langu… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

    Comments: 14 pages, 5 figures, 5 tables

  19. arXiv:2406.05688  [pdf, other

    cs.CL cs.AI cs.LG

    Peer Review as A Multi-Turn and Long-Context Dialogue with Role-Based Interactions

    Authors: Cheng Tan, Dongxin Lyu, Siyuan Li, Zhangyang Gao, Jingxuan Wei, Siqi Ma, Zicheng Liu, Stan Z. Li

    Abstract: Large Language Models (LLMs) have demonstrated wide-ranging applications across various fields and have shown significant potential in the academic peer-review process. However, existing applications are primarily limited to static review generation based on submitted papers, which fail to capture the dynamic and iterative nature of real-world peer reviews. In this paper, we reformulate the peer-r… ▽ More

    Submitted 9 June, 2024; originally announced June 2024.

    Comments: Under review

  20. arXiv:2406.03198  [pdf, other

    cs.CL cs.HC cs.LG stat.AP stat.ML

    The Impossibility of Fair LLMs

    Authors: Jacy Anthis, Kristian Lum, Michael Ekstrand, Avi Feller, Alexander D'Amour, Chenhao Tan

    Abstract: The need for fair AI is increasingly clear in the era of general-purpose systems such as ChatGPT, Gemini, and other large language models (LLMs). However, the increasing complexity of human-AI interaction and its social impacts have raised questions of how fairness standards could be applied. Here, we review the technical frameworks that machine learning researchers have used to evaluate fairness,… ▽ More

    Submitted 28 May, 2024; originally announced June 2024.

    Comments: Presented at the 1st Human-Centered Evaluation and Auditing of Language Models (HEAL) workshop at CHI 2024

  21. arXiv:2406.02234  [pdf, other

    cs.LG cs.AI math.DS stat.ML

    On the Limitations of Fractal Dimension as a Measure of Generalization

    Authors: Charlie Tan, Inés García-Redondo, Qiquan Wang, Michael M. Bronstein, Anthea Monod

    Abstract: Bounding and predicting the generalization gap of overparameterized neural networks remains a central open problem in theoretical machine learning. Neural network optimization trajectories have been proposed to possess fractal structure, leading to bounds and generalization measures based on notions of fractal dimension on these trajectories. Prominently, both the Hausdorff dimension and the persi… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

    Comments: 17 pages, 6 figures

  22. arXiv:2406.01627  [pdf, other

    q-bio.GN cs.LG

    GenBench: A Benchmarking Suite for Systematic Evaluation of Genomic Foundation Models

    Authors: Zicheng Liu, Jiahui Li, Siyuan Li, Zelin Zang, Cheng Tan, Yufei Huang, Yajing Bai, Stan Z. Li

    Abstract: The Genomic Foundation Model (GFM) paradigm is expected to facilitate the extraction of generalizable representations from massive genomic data, thereby enabling their application across a spectrum of downstream applications. Despite advancements, a lack of evaluation framework makes it difficult to ensure equitable assessment due to experimental settings, model intricacy, benchmark datasets, and… ▽ More

    Submitted 5 June, 2024; v1 submitted 1 June, 2024; originally announced June 2024.

  23. arXiv:2406.01333  [pdf, other

    cs.CL cs.AI

    Probing Language Models for Pre-training Data Detection

    Authors: Zhenhua Liu, Tong Zhu, Chuanyuan Tan, Haonan Lu, Bing Liu, Wenliang Chen

    Abstract: Large Language Models (LLMs) have shown their impressive capabilities, while also raising concerns about the data contamination problems due to privacy issues and leakage of benchmark datasets in the pre-training phase. Therefore, it is vital to detect the contamination by checking whether an LLM has been pre-trained on the target texts. Recent studies focus on the generated texts and compute perp… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: Accepted by ACL-2024 main conference

  24. arXiv:2405.20834  [pdf, other

    cs.CV

    Retrieval Meets Reasoning: Even High-school Textbook Knowledge Benefits Multimodal Reasoning

    Authors: Cheng Tan, Jingxuan Wei, Linzhuang Sun, Zhangyang Gao, Siyuan Li, Bihui Yu, Ruifeng Guo, Stan Z. Li

    Abstract: Large language models equipped with retrieval-augmented generation (RAG) represent a burgeoning field aimed at enhancing answering capabilities by leveraging external knowledge bases. Although the application of RAG with language-only models has been extensively explored, its adaptation into multimodal vision-language models remains nascent. Going beyond mere answer generation, the primary goal of… ▽ More

    Submitted 31 May, 2024; originally announced May 2024.

    Comments: Under review

  25. arXiv:2405.18968  [pdf, other

    cs.AI cs.LG q-bio.QM

    UniIF: Unified Molecule Inverse Folding

    Authors: Zhangyang Gao, Jue Wang, Cheng Tan, Lirong Wu, Yufei Huang, Siyuan Li, Zhirui Ye, Stan Z. Li

    Abstract: Molecule inverse folding has been a long-standing challenge in chemistry and biology, with the potential to revolutionize drug discovery and material science. Despite specified models have been proposed for different small- or macro-molecules, few have attempted to unify the learning process, resulting in redundant efforts. Complementary to recent advancements in molecular structure prediction, su… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

  26. Discovery and follow-up of a quasiperiodically nulling and sub-pulse drifting pulsar with the Murchison Widefield Array

    Authors: G. Grover, N. D. R. Bhat, S. McSweeney, C. P. Lee, B. W. Meyers, C. M. Tan, S. S. Kudale

    Abstract: The phenomenon of pulsar nulling, where pulsars temporarily and stochastically cease their radio emission, is thought to be indicative of a `dying' pulsar, where radio emission ceases entirely. Here we report the discovery of a long-period pulsar, PSR J0452-3418, from the ongoing Southern-sky MWA Rapid Two-meter (SMART) pulsar survey. The pulsar has a rotation period of ${\sim}$1.67\,s and a dispe… ▽ More

    Submitted 28 May, 2024; v1 submitted 26 May, 2024; originally announced May 2024.

    Comments: 16 pages, 9 Figures, 4 Tables, Accepted for ApJ***

  27. arXiv:2405.16041  [pdf, other

    cs.LG cs.AI

    Explainable Molecular Property Prediction: Aligning Chemical Concepts with Predictions via Language Models

    Authors: Zhenzhong Wang, Zehui Lin, Wanyu Lin, Ming Yang, Minggang Zeng, Kay Chen Tan

    Abstract: Providing explainable molecule property predictions is critical for many scientific domains, such as drug discovery and material science. Though transformer-based language models have shown great potential in accurate molecular property prediction, they neither provide chemically meaningful explanations nor faithfully reveal the molecular structure-property relationships. In this work, we develop… ▽ More

    Submitted 31 May, 2024; v1 submitted 24 May, 2024; originally announced May 2024.

  28. arXiv:2405.15252  [pdf, other

    cs.LG

    Fast 3D Molecule Generation via Unified Geometric Optimal Transport

    Authors: Haokai Hong, Wanyu Lin, Kay Chen Tan

    Abstract: This paper proposes a new 3D molecule generation framework, called GOAT, for fast and effective 3D molecule generation based on the flow-matching optimal transport objective. Specifically, we formulate a geometric transport formula for measuring the cost of mapping multi-modal features (e.g., continuous atom coordinates and categorical atom types) between a base distribution and a target data dist… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

  29. arXiv:2405.15032  [pdf, other

    cs.CL

    Aya 23: Open Weight Releases to Further Multilingual Progress

    Authors: Viraat Aryabumi, John Dang, Dwarak Talupuru, Saurabh Dash, David Cairuz, Hangyu Lin, Bharat Venkitesh, Madeline Smith, Jon Ander Campos, Yi Chern Tan, Kelly Marchisio, Max Bartolo, Sebastian Ruder, Acyr Locatelli, Julia Kreutzer, Nick Frosst, Aidan Gomez, Phil Blunsom, Marzieh Fadaee, Ahmet Üstün, Sara Hooker

    Abstract: This technical report introduces Aya 23, a family of multilingual language models. Aya 23 builds on the recent release of the Aya model (Üstün et al., 2024), focusing on pairing a highly performant pre-trained model with the recently released Aya collection (Singh et al., 2024). The result is a powerful multilingual large language model serving 23 languages, expanding state-of-art language modelin… ▽ More

    Submitted 31 May, 2024; v1 submitted 23 May, 2024; originally announced May 2024.

  30. arXiv:2405.14159  [pdf, other

    cs.CL cs.AI

    Super Tiny Language Models

    Authors: Dylan Hillier, Leon Guertler, Cheston Tan, Palaash Agrawal, Chen Ruirui, Bobby Cheng

    Abstract: The rapid advancement of large language models (LLMs) has led to significant improvements in natural language processing but also poses challenges due to their high computational and energy demands. This paper introduces a series of research efforts focused on Super Tiny Language Models (STLMs), which aim to deliver high performance with significantly reduced parameter counts. We explore innovativ… ▽ More

    Submitted 26 June, 2024; v1 submitted 23 May, 2024; originally announced May 2024.

    Comments: 11 pages, 4 figures

    ACM Class: I.2.7

  31. arXiv:2405.11349  [pdf, other

    cs.LG

    Unlock the Power of Algorithm Features: A Generalization Analysis for Algorithm Selection

    Authors: Xingyu Wu, Yan Zhong, Jibin Wu, Yuxiao Huang, Sheng-hao Wu, Kay Chen Tan

    Abstract: In the algorithm selection research, the discussion surrounding algorithm features has been significantly overshadowed by the emphasis on problem features. Although a few empirical studies have yielded evidence regarding the effectiveness of algorithm features, the potential benefits of incorporating algorithm features into algorithm selection models and their suitability for different scenarios r… ▽ More

    Submitted 3 June, 2024; v1 submitted 18 May, 2024; originally announced May 2024.

  32. arXiv:2405.10812  [pdf, other

    q-bio.GN cs.AI

    VQDNA: Unleashing the Power of Vector Quantization for Multi-Species Genomic Sequence Modeling

    Authors: Siyuan Li, Zedong Wang, Zicheng Liu, Di Wu, Cheng Tan, Jiangbin Zheng, Yufei Huang, Stan Z. Li

    Abstract: Similar to natural language models, pre-trained genome language models are proposed to capture the underlying intricacies within genomes with unsupervised sequence modeling. They have become essential tools for researchers and practitioners in biology. However, the hand-crafted tokenization policies used in these models may not encode the most discriminative patterns from the limited vocabulary of… ▽ More

    Submitted 2 June, 2024; v1 submitted 13 May, 2024; originally announced May 2024.

    Comments: ICML 2024. Preprint V2 with 17 pages and 5 figures

  33. arXiv:2405.09046  [pdf, other

    cond-mat.str-el quant-ph

    Entanglement parity effects in the Kane-Fisher problem

    Authors: Chunyu Tan, Yuxiao Hang, Stephan Haas, Hubert Saleur

    Abstract: We study the entanglement of a segment of length $\ell$ in an XXZ chain with one free extremity and the other connected to the rest of the system with a weak bond. We find that the von-Neumann entropy exhibits terms of order $O(1)$ with strong parity effects, that probe the physics associated with the weakened bond and its behavior under the RG (Kane Fisher problem). In contrast with the XX case s… ▽ More

    Submitted 14 May, 2024; originally announced May 2024.

  34. arXiv:2405.08355  [pdf, other

    cs.CL

    Seal-Tools: Self-Instruct Tool Learning Dataset for Agent Tuning and Detailed Benchmark

    Authors: Mengsong Wu, Tong Zhu, Han Han, Chuanyuan Tan, Xiang Zhang, Wenliang Chen

    Abstract: This paper presents a new tool learning dataset Seal-Tools, which contains self-instruct API-like tools. Seal-Tools not only offers a large number of tools, but also includes instances which demonstrate the practical application of tools. Seeking to generate data on a large scale while ensuring reliability, we propose a self-instruct method to generate tools and instances, allowing precise control… ▽ More

    Submitted 14 May, 2024; originally announced May 2024.

    Comments: 14 pages, 10 figures

  35. arXiv:2405.07160  [pdf, ps, other

    math.CA

    Singular Integrals associated with Reflection Groups on Euclidean Space

    Authors: Yongsheng Han, Ji Li, Chaoqiang Tan, Zipeng Wang, Xinfeng Wu

    Abstract: In the field of harmonic analysis, geometric considerations are frequently crucial. Specially, group actions such as translations, dilations and rotations on Euclidean space are instrumental. The objective of this paper is to extend the study of singular integrals to include the effects of group reflections on Euclidean space, and to establish the T1 theorem for these singular integrals.

    Submitted 12 May, 2024; originally announced May 2024.

  36. arXiv:2405.06940  [pdf

    cond-mat.mes-hall cond-mat.mtrl-sci

    Van der Waals Magnetic Electrode Transfer for Two-Dimensional Spintronic Devices

    Authors: Zhongzhong Luo, Zhihao Yu, Xiangqian Lu, Wei Niu, Yao Yu, Yu Yao, Fuguo Tian, Chee Leong Tan, Huabin Sun, Li Gao, Wei Qin, Yong Xu, Qiang Zhao, Xiang-Xiang Song

    Abstract: Two-dimensional (2D) materials are promising candidates for spintronic applications. Maintaining their atomically smooth interfaces during integration of ferromagnetic (FM) electrodes is crucial since conventional metal deposition tends to induce defects at the interfaces. Meanwhile, the difficulties in picking up FM metals with strong adhesion and in achieving conductance match between FM electro… ▽ More

    Submitted 11 May, 2024; originally announced May 2024.

    Journal ref: Nano Lett. (2024)

  37. arXiv:2405.05767  [pdf

    cs.NE

    Large Language Model-Aided Evolutionary Search for Constrained Multiobjective Optimization

    Authors: Zeyi Wang, Songbai Liu, Jianyong Chen, Kay Chen Tan

    Abstract: Evolutionary algorithms excel in solving complex optimization problems, especially those with multiple objectives. However, their stochastic nature can sometimes hinder rapid convergence to the global optima, particularly in scenarios involving constraints. In this study, we employ a large language model (LLM) to enhance evolutionary search for solving constrained multi-objective optimization prob… ▽ More

    Submitted 9 May, 2024; originally announced May 2024.

    Comments: 15 pages, 6 figures, 2024 International Conference on Intelligent Computing

  38. "Community Guidelines Make this the Best Party on the Internet": An In-Depth Study of Online Platforms' Content Moderation Policies

    Authors: Brennan Schaffner, Arjun Nitin Bhagoji, Siyuan Cheng, Jacqueline Mei, Jay L. Shen, Grace Wang, Marshini Chetty, Nick Feamster, Genevieve Lakier, Chenhao Tan

    Abstract: Moderating user-generated content on online platforms is crucial for balancing user safety and freedom of speech. Particularly in the United States, platforms are not subject to legal constraints prescribing permissible content. Each platform has thus developed bespoke content moderation policies, but there is little work towards a comparative understanding of these policies across platforms and t… ▽ More

    Submitted 8 May, 2024; originally announced May 2024.

  39. arXiv:2405.00518  [pdf, ps, other

    math.CO cs.CE nlin.CD

    Graph-Based Multivariate Multiscale Dispersion Entropy: Efficient Implementation and Applications to Real-World Network Data

    Authors: John Stewart Fabila-Carrasco, Chao Tan, Javier Escudero

    Abstract: We introduce Multivariate Multiscale Graph-based Dispersion Entropy (mvDEG), a novel, computationally efficient method for analyzing multivariate time series data in graph and complex network frameworks, and demonstrate its application in real-world data. mvDEG effectively combines temporal dynamics with topological relationships, offering enhanced analysis compared to traditional nonlinear entrop… ▽ More

    Submitted 1 May, 2024; originally announced May 2024.

    Comments: 9 pages, 10 figures

    MSC Class: 60J20; 05C82; 05C85; 94C15

  40. arXiv:2404.19664  [pdf, other

    cs.RO cs.LG

    Towards Generalist Robot Learning from Internet Video: A Survey

    Authors: Robert McCarthy, Daniel C. H. Tan, Dominik Schmidt, Fernando Acero, Nathan Herr, Yilun Du, Thomas G. Thuruthel, Zhibin Li

    Abstract: This survey presents an overview of methods for learning from video (LfV) in the context of reinforcement learning (RL) and robotics. We focus on methods capable of scaling to large internet video datasets and, in the process, extracting foundational knowledge about the world's dynamics and physical human behaviour. Such methods hold great promise for developing general-purpose robots. We open w… ▽ More

    Submitted 7 June, 2024; v1 submitted 30 April, 2024; originally announced April 2024.

    Comments: Updated formatting. Reduced paper length and made other minor improvements

  41. arXiv:2404.19326  [pdf, other

    cs.CV

    LVOS: A Benchmark for Large-scale Long-term Video Object Segmentation

    Authors: Lingyi Hong, Zhongying Liu, Wenchao Chen, Chenzhi Tan, Yuang Feng, Xinyu Zhou, Pinxue Guo, Jinglun Li, Zhaoyu Chen, Shuyong Gao, Wei Zhang, Wenqiang Zhang

    Abstract: Video object segmentation (VOS) aims to distinguish and track target objects in a video. Despite the excellent performance achieved by off-the-shell VOS models, existing VOS benchmarks mainly focus on short-term videos lasting about 5 seconds, where objects remain visible most of the time. However, these benchmarks poorly represent practical applications, and the absence of long-term datasets rest… ▽ More

    Submitted 30 April, 2024; v1 submitted 30 April, 2024; originally announced April 2024.

    Comments: LVOS V2

  42. arXiv:2404.14713  [pdf, other

    eess.SY

    Enhancing High-Speed Cruising Performance of Autonomous Vehicles through Integrated Deep Reinforcement Learning Framework

    Authors: Jinhao Liang, Kaidi Yang, Chaopeng Tan, Jinxiang Wang, Guodong Yin

    Abstract: High-speed cruising scenarios with mixed traffic greatly challenge the road safety of autonomous vehicles (AVs). Unlike existing works that only look at fundamental modules in isolation, this work enhances AV safety in mixed-traffic high-speed cruising scenarios by proposing an integrated framework that synthesizes three fundamental modules, i.e., behavioral decision-making, path-planning, and mot… ▽ More

    Submitted 22 April, 2024; originally announced April 2024.

  43. In-situ process monitoring and adaptive quality enhancement in laser additive manufacturing: a critical review

    Authors: Lequn Chen, Guijun Bi, Xiling Yao, Jinlong Su, Chaolin Tan, Wenhe Feng, Michalis Benakis, Youxiang Chew, Seung Ki Moon

    Abstract: Laser Additive Manufacturing (LAM) presents unparalleled opportunities for fabricating complex, high-performance structures and components with unique material properties. Despite these advancements, achieving consistent part quality and process repeatability remains challenging. This paper provides a comprehensive review of various state-of-the-art in-situ process monitoring techniques, including… ▽ More

    Submitted 21 April, 2024; originally announced April 2024.

    Comments: 107 Pages, 29 Figures. Paper Accepted At Journal of Manufacturing Systems

  44. arXiv:2404.12569  [pdf, other

    cs.LG cs.AI

    Multi-View Subgraph Neural Networks: Self-Supervised Learning with Scarce Labeled Data

    Authors: Zhenzhong Wang, Qingyuan Zeng, Wanyu Lin, Min Jiang, Kay Chen Tan

    Abstract: While graph neural networks (GNNs) have become the de-facto standard for graph-based node classification, they impose a strong assumption on the availability of sufficient labeled samples. This assumption restricts the classification performance of prevailing GNNs on many real-world applications suffering from low-data regimes. Specifically, features extracted from scarce labeled nodes could not p… ▽ More

    Submitted 18 April, 2024; originally announced April 2024.

  45. arXiv:2404.06349  [pdf, other

    cs.LG

    CausalBench: A Comprehensive Benchmark for Causal Learning Capability of Large Language Models

    Authors: Yu Zhou, Xingyu Wu, Beicheng Huang, Jibin Wu, Liang Feng, Kay Chen Tan

    Abstract: Causality reveals fundamental principles behind data distributions in real-world scenarios, and the capability of large language models (LLMs) to understand causality directly impacts their efficacy across explaining outputs, adapting to new evidence, and generating counterfactuals. With the proliferation of LLMs, the evaluation of this capacity is increasingly garnering attention. However, the ab… ▽ More

    Submitted 9 April, 2024; originally announced April 2024.

  46. arXiv:2404.06290  [pdf, other

    cs.NE

    Exploring the True Potential: Evaluating the Black-box Optimization Capability of Large Language Models

    Authors: Beichen Huang, Xingyu Wu, Yu Zhou, Jibin Wu, Liang Feng, Ran Cheng, Kay Chen Tan

    Abstract: Large language models (LLMs) have demonstrated exceptional performance not only in natural language processing tasks but also in a great variety of non-linguistic domains. In diverse optimization scenarios, there is also a rising trend of applying LLMs. However, whether the application of LLMs in the black-box optimization problems is genuinely beneficial remains unexplored. This paper endeavors t… ▽ More

    Submitted 6 July, 2024; v1 submitted 9 April, 2024; originally announced April 2024.

  47. arXiv:2404.06162  [pdf, other

    cs.CL cs.AI cs.LG

    Characterizing Multimodal Long-form Summarization: A Case Study on Financial Reports

    Authors: Tianyu Cao, Natraj Raman, Danial Dervovic, Chenhao Tan

    Abstract: As large language models (LLMs) expand the power of natural language processing to handle long inputs, rigorous and systematic analyses are necessary to understand their abilities and behavior. A salient application is summarization, due to its ubiquity and controversy (e.g., researchers have declared the death of summarization). In this paper, we use financial report summarization as a case study… ▽ More

    Submitted 8 May, 2024; v1 submitted 9 April, 2024; originally announced April 2024.

  48. Wi-Fi-based Personnel Identity Recognition: Addressing Dataset Imbalance with C-DDPMs

    Authors: Jichen Bian, Chong Tan, Peiyao Tang, Min Zheng

    Abstract: Wireless sensing technologies become increasingly prevalent due to the ubiquitous nature of wireless signals and their inherent privacy-friendly characteristics. Device-free personnel identity recognition, a prevalent application in wireless sensing, is susceptibly challenged by imbalanced channel state information (CSI) datasets. This letter proposes a novel method for CSI dataset augmentation th… ▽ More

    Submitted 7 April, 2024; originally announced April 2024.

    Journal ref: IEEE Signal Processing Letters, 2024

  49. arXiv:2404.04326  [pdf, other

    cs.AI cs.CL cs.CY cs.LG

    Hypothesis Generation with Large Language Models

    Authors: Yangqiaoyu Zhou, Haokun Liu, Tejes Srivastava, Hongyuan Mei, Chenhao Tan

    Abstract: Effective generation of novel hypotheses is instrumental to scientific progress. So far, researchers have been the main powerhouse behind hypothesis generation by painstaking data analysis and thinking (also known as the Eureka moment). In this paper, we examine the potential of large language models (LLMs) to generate hypotheses. We focus on hypothesis generation based on data (i.e., labeled exam… ▽ More

    Submitted 5 April, 2024; originally announced April 2024.

    Comments: 26 pages, 6 figures, code link: https://github.com/ChicagoHAI/hypothesis_generation

  50. arXiv:2404.00962  [pdf, other

    cs.LG physics.chem-ph q-bio.BM

    Diffusion-Driven Domain Adaptation for Generating 3D Molecules

    Authors: Haokai Hong, Wanyu Lin, Kay Chen Tan

    Abstract: Can we train a molecule generator that can generate 3D molecules from a new domain, circumventing the need to collect data? This problem can be cast as the problem of domain adaptive molecule generation. This work presents a novel and principled diffusion-based approach, called GADM, that allows shifting a generative model to desired new domains without the need to collect even a single molecule.… ▽ More

    Submitted 1 April, 2024; originally announced April 2024.

    Comments: 11 pages, 3 figures, and 3 tables