Showing 1–50 of 330 results for author: Cheng, R

  1. arXiv:2407.07731  [pdf, other]

    cond-mat.mtrl-sci cond-mat.mes-hall

    Large spin-orbit torque in a-plane $α$-Fe$_{2}$O$_{3}$/Pt bilayers

    Authors: Igor Lyalin, Hantao Zhang, Justin Michel, Daniel Russell, Fengyuan Yang, Ran Cheng, Roland K. Kawakami

    Abstract: Realization of efficient spin-orbit torque switching of the Néel vector in insulating antiferromagnets is a challenge, often complicated by spurious effects. Quantifying the spin-orbit torques in antiferromagnet/heavy metal heterostructures is an important first step towards this goal. Here, we employ magneto-optic techniques to study damping-like spin-orbit torque (DL-SOT) in a-plane $α$-Fe$_2$O…

    Submitted 10 July, 2024; originally announced July 2024.

    Comments: 6 pages, 3 figures

  2. arXiv:2407.00269  [pdf, other]

    physics.optics physics.app-ph

    High-power and narrow-linewidth laser on thin-film lithium niobate enabled by photonic wire bonding

    Authors: Cornelis A. A. Franken, Rebecca Cheng, Keith Powell, Georgios Kyriazidis, Victoria Rosborough, Juergen Musolf, Maximilian Shah, David R. Barton III, Gage Hills, Leif Johansson, Klaus-J. Boller, Marko Lončar

    Abstract: Thin-film lithium niobate (TFLN) has emerged as a promising platform for the realization of high performance chip-scale optical systems, spanning a range of applications from optical communications to microwave photonics. Such applications rely on the integration of multiple components onto a single platform. However, while many of these components have already been demonstrated on the TFLN platfo…

    Submitted 5 July, 2024; v1 submitted 28 June, 2024; originally announced July 2024.

    Comments: 10 pages, 4 figures; updated long-term stability measurements with new and improved data

  3. arXiv:2406.17245  [pdf, other]

    cs.LG cs.AI cs.CL

    Unlocking Continual Learning Abilities in Language Models

    Authors: Wenyu Du, Shuang Cheng, Tongxu Luo, Zihan Qiu, Zeyu Huang, Ka Chun Cheung, Reynold Cheng, Jie Fu

    Abstract: Language models (LMs) exhibit impressive performance and generalization capabilities. However, LMs struggle with the persistent challenge of catastrophic forgetting, which undermines their long-term sustainability in continual learning (CL). Existing approaches usually address the issue by incorporating old task data or task-wise inductive bias into LMs. However, old data and accurate task informa…

    Submitted 24 June, 2024; originally announced June 2024.

    Comments: preprint, 19 pages

  4. arXiv:2406.10802  [pdf, other]

    cs.CL cs.AI

    KGPA: Robustness Evaluation for Large Language Models via Cross-Domain Knowledge Graphs

    Authors: Aihua Pei, Zehua Yang, Shunan Zhu, Ruoxi Cheng, Ju Jia, Lina Wang

    Abstract: Existing frameworks for assessing robustness of large language models (LLMs) overly depend on specific benchmarks, increasing costs and failing to evaluate performance of LLMs in professional domains due to dataset limitations. This paper proposes a framework that systematically evaluates the robustness of LLMs under adversarial attack scenarios by leveraging knowledge graphs (KGs). Our framework…

    Submitted 16 June, 2024; originally announced June 2024.

  5. arXiv:2406.09274  [pdf, other]

    cond-mat.mes-hall cond-mat.mtrl-sci

    Doubled Shapiro steps in a dynamic axion insulator Josephson junction

    Authors: Yu-Hang Li, Ziqian Zhou, Ran Cheng, Hua Jiang, X. C. Xie

    Abstract: Dynamic axion insulators feature a time-dependent axion field that can be induced by antiferromagnetic resonance. Here, we show that a Josephson junction incorporating this dynamic axion insulator between two superconductors exhibits striking doubled Shapiro steps wherein all odd steps are completely suppressed in the joint presence of a DC bias and a static magnetic field. The resistively shu…

    Submitted 13 June, 2024; originally announced June 2024.

  6. arXiv:2406.07365  [pdf, other]

    cs.CL cs.AI

    BvSP: Broad-view Soft Prompting for Few-Shot Aspect Sentiment Quad Prediction

    Authors: Yinhao Bai, Yalan Xie, Xiaoyi Liu, Yuhua Zhao, Zhixin Han, Mengting Hu, Hang Gao, Renhong Cheng

    Abstract: Aspect sentiment quad prediction (ASQP) aims to predict four aspect-based elements, including aspect term, opinion term, aspect category, and sentiment polarity. In practice, unseen aspects, due to distinct data distribution, impose many challenges for a trained neural model. Motivated by this, this work formulates ASQP into the few-shot scenario, which aims for fast adaptation in real application…

    Submitted 11 June, 2024; originally announced June 2024.

    Comments: Accepted to ACL 2024 Main Conference

  7. arXiv:2406.06626  [pdf, other]

    cs.LG cs.AI cs.HC eess.SP

    Benchmarking Neural Decoding Backbones towards Enhanced On-edge iBCI Applications

    Authors: Zhou Zhou, Guohang He, Zheng Zhang, Luziwei Leng, Qinghai Guo, Jianxing Liao, Xuan Song, Ran Cheng

    Abstract: Traditional invasive Brain-Computer Interfaces (iBCIs) typically depend on neural decoding processes conducted on workstations within laboratory settings, which prevents their everyday usage. Implementing these decoding processes on edge devices, such as wearables, introduces considerable challenges related to computational demands, processing speed, and maintaining accuracy. This study seeks…

    Submitted 7 June, 2024; originally announced June 2024.

  8. arXiv:2405.20396  [pdf, other]

    astro-ph.HE astro-ph.SR

    Using the COSMIC Population Synthesis Code to Investigate How Metallicity Affects the Rates of Interacting Binaries

    Authors: Ayanah L. Cason, Nicole M. Lloyd-Ronning, Roseanne M. Cheng

    Abstract: We use COSMIC, a galaxy population synthesis code, to investigate how metallicity affects the rate of formation of massive stars with a closely orbiting compact object companion, the suggested progenitors of radio loud long gamma-ray bursts. We present the evolution time of these systems at different metallicities, and how the formation rates of these systems are anti-correlated with metallicity.…

    Submitted 30 May, 2024; originally announced May 2024.

    Comments: submitted to RNAAS

  9. arXiv:2405.15319  [pdf, other]

    cs.CL cs.AI

    Stacking Your Transformers: A Closer Look at Model Growth for Efficient LLM Pre-Training

    Authors: Wenyu Du, Tongxu Luo, Zihan Qiu, Zeyu Huang, Yikang Shen, Reynold Cheng, Yike Guo, Jie Fu

    Abstract: LLMs are computationally expensive to pre-train due to their large scale. Model growth emerges as a promising approach by leveraging smaller models to accelerate the training of larger ones. However, the viability of these model growth methods in efficient LLM pre-training remains underexplored. This work identifies three critical $\underline{\textit{O}}$bstacles: ($\textit{O}$1) lack of comprehen…

    Submitted 24 May, 2024; originally announced May 2024.

    Comments: Preprint; the project link: https://llm-stacking.github.io/

  10. arXiv:2405.15307  [pdf, other]

    cs.CL

    Before Generation, Align it! A Novel and Effective Strategy for Mitigating Hallucinations in Text-to-SQL Generation

    Authors: Ge Qu, Jinyang Li, Bowen Li, Bowen Qin, Nan Huo, Chenhao Ma, Reynold Cheng

    Abstract: Large Language Models (LLMs) driven by In-Context Learning (ICL) have significantly improved the performance of text-to-SQL. Previous methods generally employ a two-stage reasoning framework, namely 1) schema linking and 2) logical synthesis, making the framework not only effective but also interpretable. Despite these advancements, the inherent bad nature of the generalization of LLMs often resul…

    Submitted 24 May, 2024; originally announced May 2024.

    Comments: Accepted to ACL Findings 2024

  11. arXiv:2405.14517  [pdf, other]

    cs.LG cs.CR

    Identity Inference from CLIP Models using Only Textual Data

    Authors: Songze Li, Ruoxi Cheng, Xiaojun Jia

    Abstract: The widespread usage of large-scale multimodal models like CLIP has heightened concerns about the leakage of personally identifiable information (PII). Existing methods for identity inference in CLIP models, i.e., to detect the presence of a person's PII used for training a CLIP model, require querying the model with full PII, including textual descriptions of the person and corresponding images (…

    Submitted 23 May, 2024; originally announced May 2024.

  12. arXiv:2405.12183  [pdf, other]

    cs.LG cs.AI

    Multi-order Graph Clustering with Adaptive Node-level Weight Learning

    Authors: Ye Liu, Xuelei Lin, Yejia Chen, Reynold Cheng

    Abstract: Current graph clustering methods emphasize individual node and edge connections, while ignoring higher-order organization at the level of motif. Recently, higher-order graph clustering approaches have been designed by motif-based hypergraphs. However, these approaches often suffer from hypergraph fragmentation issue seriously, which degrades the clustering performance greatly. Moreover, real-wor…

    Submitted 20 May, 2024; originally announced May 2024.

  13. arXiv:2405.11028  [pdf, other]

    astro-ph.HE astro-ph.SR

    Simulations of Interacting Binary Systems -- Pathways to Radio Bright GRB Progenitors

    Authors: Angel Hernandez, Roseanne M. Cheng, Nicole M. Lloyd-Ronning, Carl E. Fields

    Abstract: Although the association of gamma-ray bursts (GRBs) with massive stellar death is on firm footing, the nature of the progenitor system and the key ingredients required for a massive star to produce a gamma-ray burst remain open questions. Here, we investigate the evolution of a massive star with a closely orbiting compact object companion using the stellar evolution code MESA. In particular, we ex…

    Submitted 17 May, 2024; originally announced May 2024.

    Comments: Submitted to ApJ - comments welcome

    Report number: LA-UR-24-22983

  14. arXiv:2405.10889  [pdf]

    cond-mat.mes-hall cond-mat.mtrl-sci

    Unconventional Unidirectional Magnetoresistance in vdW Heterostructures

    Authors: I-Hsuan Kao, Junyu Tang, Gabriel Calderon Ortiz, Menglin Zhu, Sean Yuan, Rahul Rao, Jiahan Li, James H. Edgar, Jiaqiang Yan, David G. Mandrus, Kenji Watanabe, Takashi Taniguchi, Jinwoo Hwang, Ran Cheng, Jyoti Katoch, Simranjeet Singh

    Abstract: Electrical readout of magnetic states is a key to realize novel spintronics devices for efficient computing and data storage. Unidirectional magnetoresistance (UMR) in bilayer systems, consisting of a spin source material and a magnetic layer, refers to a change in the longitudinal resistance upon the reversal of magnetization, which typically originates from the interaction of spin-current and ma…

    Submitted 17 May, 2024; originally announced May 2024.

  15. arXiv:2405.10422  [pdf, other]

    cs.NI

    A First Look at Immersive Telepresence on Apple Vision Pro

    Authors: Ruizhi Cheng, Nan Wu, Matteo Varvello, Eugene Chai, Songqing Chen, Bo Han

    Abstract: Due to the widespread adoption of "work-from-home" policies, videoconferencing applications (e.g., Zoom) have become indispensable for remote communication. However, these systems lack immersiveness, leading to the so-called "Zoom fatigue" and degrading communication efficiency. The recent debut of Apple Vision Pro, a mixed reality headset that supports "spatial persona", aims to offer an immersiv…

    Submitted 16 May, 2024; originally announced May 2024.

  16. arXiv:2405.03267  [pdf, other]

    cs.DC cs.DB cs.IR

    Characterizing the Dilemma of Performance and Index Size in Billion-Scale Vector Search and Breaking It with Second-Tier Memory

    Authors: Rongxin Cheng, Yifan Peng, Xingda Wei, Hongrui Xie, Rong Chen, Sijie Shen, Haibo Chen

    Abstract: Vector searches on large-scale datasets are critical to modern online services like web search and RAG, which necessitates storing the datasets and their indexes on secondary storage like SSD. In this paper, we are the first to characterize the trade-off of performance and index size in existing SSD-based graph and cluster indexes: to improve throughput by 5.7$\times$ and 1.7$\times$, these indexes…

    Submitted 7 May, 2024; v1 submitted 6 May, 2024; originally announced May 2024.

  17. arXiv:2404.17341  [pdf, other]

    math.AG

    Free curves in Fano hypersurfaces must have high degree

    Authors: Raymond Cheng

    Abstract: The purpose of this note is to show that the minimal $e$ for which every smooth Fano hypersurface of dimension $n$ contains a free rational curve of degree at most $e$ cannot be bounded by a linear function in $n$ when the base field has positive characteristic. This is done by providing a super-linear bound on the minimal possible degree of a free curve in certain Fermat hypersurfaces.

    Submitted 26 April, 2024; originally announced April 2024.

    Comments: 4 pages, comments welcome!

    MSC Class: 14M22; 14J70 (primary); 14G17; 14J45 (secondary)

  18. A Multi-objective Optimization Benchmark Test Suite for Real-time Semantic Segmentation

    Authors: Yifan Zhao, Zhenyu Liang, Zhichao Lu, Ran Cheng

    Abstract: As one of the emerging challenges in Automated Machine Learning, the Hardware-aware Neural Architecture Search (HW-NAS) tasks can be treated as black-box multi-objective optimization problems (MOPs). An important application of HW-NAS is real-time semantic segmentation, which plays a pivotal role in autonomous driving scenarios. The HW-NAS for real-time semantic segmentation inherently needs to ba…

    Submitted 28 April, 2024; v1 submitted 24 April, 2024; originally announced April 2024.

    Comments: GECCO 2024

  19. arXiv:2404.15622  [pdf, other]

    cs.LG

    FR-NAS: Forward-and-Reverse Graph Predictor for Efficient Neural Architecture Search

    Authors: Haoming Zhang, Ran Cheng

    Abstract: Neural Architecture Search (NAS) has emerged as a key tool in identifying optimal configurations of deep neural networks tailored to specific tasks. However, training and assessing numerous architectures introduces considerable computational overhead. One method of mitigating this is through performance predictors, which offer a means to estimate the potential of an architecture without exhaustive…

    Submitted 23 April, 2024; originally announced April 2024.

    Comments: IJCNN'24

  20. arXiv:2404.10160  [pdf, other]

    cs.AI

    Reinforcement Learning from Multi-role Debates as Feedback for Bias Mitigation in LLMs

    Authors: Ruoxi Cheng, Haoxuan Ma, Shuirong Cao, Jiaqi Li, Aihua Pei, Zhiqiang Wang, Pengliang Ji, Haoyu Wang, Jiaqi Huo

    Abstract: Bias in LLMs can harm user experience and societal outcomes. However, current bias mitigation methods often require intensive human feedback, lack transferability to other topics, or yield overconfident and random outputs. We find that involving LLMs in role-playing scenarios boosts their ability to recognize and mitigate biases. Based on this, we propose Reinforcement Learning from Multi-role Debat…

    Submitted 18 June, 2024; v1 submitted 15 April, 2024; originally announced April 2024.

    Comments: The first three authors contributed equally to this work

  21. arXiv:2404.08233  [pdf, other]

    cs.LG cs.AI cs.NE

    Generalized Population-Based Training for Hyperparameter Optimization in Reinforcement Learning

    Authors: Hui Bai, Ran Cheng

    Abstract: Hyperparameter optimization plays a key role in the machine learning domain. Its significance is especially pronounced in reinforcement learning (RL), where agents continuously interact with and adapt to their environments, requiring dynamic adjustments in their learning trajectories. To cater to this dynamicity, the Population-Based Training (PBT) was introduced, leveraging the collective intelli…

    Submitted 22 April, 2024; v1 submitted 12 April, 2024; originally announced April 2024.

    Comments: IEEE Transactions on Emerging Topics in Computational Intelligence

  22. arXiv:2404.07387  [pdf, other]

    cs.HC cs.AI

    BISCUIT: Scaffolding LLM-Generated Code with Ephemeral UIs in Computational Notebooks

    Authors: Ruijia Cheng, Titus Barik, Alan Leung, Fred Hohman, Jeffrey Nichols

    Abstract: Programmers frequently engage with machine learning tutorials in computational notebooks and have been adopting code generation technologies based on large language models (LLMs). However, they encounter difficulties in understanding and working with code produced by LLMs. To mitigate these challenges, we introduce a novel workflow into computational notebooks that augments LLM-based code generati…

    Submitted 11 July, 2024; v1 submitted 10 April, 2024; originally announced April 2024.

  23. arXiv:2404.06398  [pdf]

    physics.optics physics.app-ph quant-ph

    Integrated electro-optics on thin-film lithium niobate

    Authors: Yaowen Hu, Di Zhu, Shengyuan Lu, Xinrui Zhu, Yunxiang Song, Dylan Renaud, Daniel Assumpcao, Rebecca Cheng, CJ Xin, Matthew Yeh, Hana Warner, Xiangwen Guo, Amirhassan Shams-Ansari, David Barton, Neil Sinclair, Marko Loncar

    Abstract: Electro-optics serves as the crucial bridge between electronics and photonics, unlocking a wide array of applications ranging from communications and computing to sensing and quantum information. Integrated electro-optics approaches in particular enable essential electronic high-speed control for photonics while offering substantial photonic parallelism for electronics. Recent strides in thin-film…

    Submitted 11 April, 2024; v1 submitted 9 April, 2024; originally announced April 2024.

  24. arXiv:2404.06290  [pdf, other]

    cs.NE

    Exploring the True Potential: Evaluating the Black-box Optimization Capability of Large Language Models

    Authors: Beichen Huang, Xingyu Wu, Yu Zhou, Jibin Wu, Liang Feng, Ran Cheng, Kay Chen Tan

    Abstract: Large language models (LLMs) have demonstrated exceptional performance not only in natural language processing tasks but also in a great variety of non-linguistic domains. In diverse optimization scenarios, there is also a rising trend of applying LLMs. However, whether the application of LLMs in the black-box optimization problems is genuinely beneficial remains unexplored. This paper endeavors t…

    Submitted 6 July, 2024; v1 submitted 9 April, 2024; originally announced April 2024.

  25. arXiv:2404.04895  [pdf, other]

    cs.NE

    Tensorized Ant Colony Optimization for GPU Acceleration

    Authors: Luming Yang, Tao Jiang, Ran Cheng

    Abstract: Ant Colony Optimization (ACO) is renowned for its effectiveness in solving Traveling Salesman Problems, yet it faces computational challenges in CPU-based environments, particularly with large-scale instances. In response, we introduce a Tensorized Ant Colony Optimization (TensorACO) to utilize the advancements of GPU acceleration. As the core, TensorACO fully transforms ant system and ant path in…

    Submitted 12 April, 2024; v1 submitted 7 April, 2024; originally announced April 2024.

    Comments: Genetic and Evolutionary Computation Conference (GECCO '24)

  26. arXiv:2404.03032  [pdf]

    cond-mat.mtrl-sci cond-mat.mes-hall

    Even-Odd Layer-Dependent Exchange Bias Effect in MnBi2Te4 Chern Insulator Devices

    Authors: Bo Chen, Xiaoda Liu, Yu-Hang Li, Han Tay, Takashi Taniguchi, Kenji Watanabe, Moses H. W. Chan, Jiaqiang Yan, Fengqi Song, Ran Cheng, Cui-Zu Chang

    Abstract: Magnetic topological materials with coexisting magnetism and non-trivial band structures exhibit many novel quantum phenomena, including the quantum anomalous Hall effect, the axion insulator state, and the Weyl semimetal phase. As a stoichiometric layered antiferromagnetic topological insulator, thin films of MnBi2Te4 show fascinating even-odd layer-dependent physics. In this work, we fabricate a…

    Submitted 3 April, 2024; originally announced April 2024.

    Comments: 23 pages, 4 figures, comments are very much welcome

  27. arXiv:2404.01817  [pdf, other]

    cs.NE

    Tensorized NeuroEvolution of Augmenting Topologies for GPU Acceleration

    Authors: Lishuang Wang, Mengfei Zhao, Enyu Liu, Kebin Sun, Ran Cheng

    Abstract: The NeuroEvolution of Augmenting Topologies (NEAT) algorithm has received considerable recognition in the field of neuroevolution. Its effectiveness is derived from initiating with simple networks and incrementally evolving both their topologies and weights. Although its capability across various challenges is evident, the algorithm's computational efficiency remains an impediment, limiting its sc…

    Submitted 11 April, 2024; v1 submitted 2 April, 2024; originally announced April 2024.

    Comments: Genetic and Evolutionary Computation Conference (GECCO '24)

  28. GPU-accelerated Evolutionary Multiobjective Optimization Using Tensorized RVEA

    Authors: Zhenyu Liang, Tao Jiang, Kebin Sun, Ran Cheng

    Abstract: Evolutionary multiobjective optimization has witnessed remarkable progress during the past decades. However, existing algorithms often encounter computational challenges in large-scale scenarios, primarily attributed to the absence of hardware acceleration. In response, we introduce a Tensorized Reference Vector Guided Evolutionary Algorithm (TensorRVEA) for harnessing the advancements of GPU acce…

    Submitted 11 April, 2024; v1 submitted 1 April, 2024; originally announced April 2024.

    Comments: Genetic and Evolutionary Computation Conference (GECCO '24)

  29. arXiv:2403.13463  [pdf, ps, other]

    math.AG

    Derived categories of quartic double fivefolds

    Authors: Raymond Cheng, Alexander Perry, Xiaolei Zhao

    Abstract: We construct singular quartic double fivefolds whose Kuznetsov component admits a crepant categorical resolution of singularities by a twisted Calabi--Yau threefold. We also construct rational specializations of these fivefolds where such a resolution exists without a twist. This confirms an instance of a higher-dimensional version of Kuznetsov's rationality conjecture, and of a noncommutative ver…

    Submitted 20 March, 2024; originally announced March 2024.

    Comments: 21 pages, comments welcome!

    MSC Class: 14F08; 14E08 (primary); 14M20; 14D06 (secondary)

  30. arXiv:2403.13286  [pdf, other]

    stat.ML cs.DB cs.LG

    A Sampling-based Framework for Hypothesis Testing on Large Attributed Graphs

    Authors: Yun Wang, Chrysanthi Kosyfaki, Sihem Amer-Yahia, Reynold Cheng

    Abstract: Hypothesis testing is a statistical method used to draw conclusions about populations from sample data, typically represented in tables. With the prevalence of graph representations in real-life applications, hypothesis testing in graphs is gaining importance. In this work, we formalize node, edge, and path hypotheses in attributed graphs. We develop a sampling-based hypothesis testing framework,…

    Submitted 19 March, 2024; originally announced March 2024.

  31. arXiv:2403.11073  [pdf]

    cs.CV cs.AI

    Tokensome: Towards a Genetic Vision-Language GPT for Explainable and Cognitive Karyotyping

    Authors: Haoxi Zhang, Xinxu Zhang, Yuanxin Lin, Maiqi Wang, Yi Lai, Yu Wang, Linfeng Yu, Yufeng Xu, Ran Cheng, Edward Szczerbicki

    Abstract: Automatic karyotype analysis is often defined as a visual perception task focused solely on chromosomal object-level modeling. This definition has led most existing methods to overlook componential and holistic information, significantly constraining model performance. Moreover, the lack of interpretability in current technologies hinders clinical adoption. In this paper, we introduce Tokensome, a…

    Submitted 16 March, 2024; originally announced March 2024.

    Comments: Preprint. Work in progress

  32. arXiv:2403.08600  [pdf]

    cs.NI

    Evaluation of Control/User-Plane Denial-of-Service (DoS) Attack on O-RAN Fronthaul Interface

    Authors: Ferlinda Feliana, Ting-Wei Hung, Binbin Chen, Ray-Guang Cheng

    Abstract: The open fronthaul interface defined by O-RAN ALLIANCE aims to support the interoperability between multi-vendor open radio access network (O-RAN) radio units (O-RU) and O-RAN distributed units (O-DU). This paper introduces a new tool that could be used to evaluate Denial-of-Service (DoS) attacks against the open fronthaul interface. We launched an array of control/user planes (C/U-Planes) attacks…

    Submitted 13 March, 2024; originally announced March 2024.

    Comments: Accepted by IEEE INFOCOM Workshop: Next-generation Open and Programmable Radio Access Networks (NG-OPERA)

  33. arXiv:2403.07846  [pdf, other]

    cond-mat.mes-hall cond-mat.mtrl-sci quant-ph

    Topology-induced symmetry breaking: a demonstration in antiferromagnetic magnons on a Möbius strip

    Authors: Kuangyin Deng, Ran Cheng

    Abstract: We propose a mechanism of topology-induced symmetry breaking, where certain local symmetry preserved by the Hamiltonian is broken in the excited eigenstates due to the nontrivial boundary condition. As a demonstration, we study magnon excitations on a Möbius strip comprising two antiferromagnetically coupled spin chains. Even under a simple Hamiltonian respecting local rotational symmetry and w…

    Submitted 12 April, 2024; v1 submitted 12 March, 2024; originally announced March 2024.

  34. arXiv:2403.07145   

    physics.optics physics.app-ph

    Electrically Programmable Pixelated Graphene-Integrated Plasmonic Metasurfaces for Coherent Mid-Infrared Emission

    Authors: Xiu Liu, Yibai Zhong, Zexiao Wang, Tianyi Huang, Sen Lin, Jingyi Zou, Haozhe Wang, Zhien Wang, Zhuo Li, Xiao Luo, Rui Cheng, Jiayu Li, Hyeong Seok Yun, Han Wang, Jing Kong, Xu Zhang, Sheng Shen

    Abstract: Active metasurfaces have recently emerged as compact, lightweight, and efficient platforms for dynamic control of electromagnetic fields and optical responses. However, the complexities associated with their post-fabrication tunability significantly hinder their widespread applications, especially for the mid-infrared range due to material scarcity and design intricacy. Here, we experimentally dem…

    Submitted 6 May, 2024; v1 submitted 11 March, 2024; originally announced March 2024.

    Comments: Needs more updates for the experiments

  35. arXiv:2403.05680  [pdf, other]

    cs.AI cs.CL cs.CV

    How Well Do Multi-modal LLMs Interpret CT Scans? An Auto-Evaluation Framework for Analyses

    Authors: Qingqing Zhu, Benjamin Hou, Tejas S. Mathai, Pritam Mukherjee, Qiao Jin, Xiuying Chen, Zhizheng Wang, Ruida Cheng, Ronald M. Summers, Zhiyong Lu

    Abstract: Automatically interpreting CT scans can ease the workload of radiologists. However, this is challenging mainly due to the scarcity of adequate datasets and reference standards for evaluation. This study aims to bridge this gap by introducing a novel evaluation framework, named "GPTRadScore". This framework assesses the capabilities of multi-modal LLMs, such as GPT-4 with Vision (GPT-4V), Gemini…

    Submitted 18 June, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

  36. arXiv:2403.05307  [pdf, other]

    cs.AI

    Tapilot-Crossing: Benchmarking and Evolving LLMs Towards Interactive Data Analysis Agents

    Authors: Jinyang Li, Nan Huo, Yan Gao, Jiayi Shi, Yingxiu Zhao, Ge Qu, Yurong Wu, Chenhao Ma, Jian-Guang Lou, Reynold Cheng

    Abstract: Interactive Data Analysis, the collaboration between humans and LLM agents, enables real-time data exploration for informed decision-making. The challenges and costs of collecting realistic interactive logs for data analysis hinder the quantitative evaluation of Large Language Model (LLM) agents in this task. To mitigate this issue, we introduce Tapilot-Crossing, a new benchmark to evaluate LLM ag…

    Submitted 8 March, 2024; originally announced March 2024.

    Comments: 30 pages, 7 figures

  37. arXiv:2403.04796  [pdf, other]

    cs.CR eess.SY

    Blockchain-Enhanced UAV Networks for Post-Disaster Communication: A Decentralized Flocking Approach

    Authors: Sana Hafeez, Runze Cheng, Lina Mohjazi, Yao Sun, Muhammad Ali Imran

    Abstract: Unmanned Aerial Vehicles (UAVs) have significant potential for agile communication and relief coordination in post-disaster scenarios, particularly when ground infrastructure is compromised. However, efficiently coordinating and securing flocks of heterogeneous UAVs from different service providers poses significant challenges related to privacy, scalability, lightweight consensus protocols, and c…

    Submitted 4 March, 2024; originally announced March 2024.

    Comments: 11 pages, 9 figures, Digital Communications and Networks Open access

  38. arXiv:2402.17237  [pdf, other]

    cs.CV cs.CL

    Image-Text Matching with Multi-View Attention

    Authors: Rui Cheng, Wanqing Cui

    Abstract: Existing two-stream models for image-text matching show good performance while ensuring retrieval speed and have received extensive attention from industry and academia. These methods use a single representation to encode image and text separately and get a matching score with cosine similarity or the inner product of vectors. However, the performance of the two-stream model is often sub-optimal.…

    Submitted 27 February, 2024; originally announced February 2024.

  39. arXiv:2402.15331  [pdf, other]

    cs.CR eess.SY

    A Blockchain-Enabled Framework of UAV Coordination for Post-Disaster Networks

    Authors: Sana Hafeez, Runze Cheng, Lina Mohjazi, Muhammad Ali Imran, Yao Sun

    Abstract: Emergency communication is critical but challenging after natural disasters when ground infrastructure is devastated. Unmanned aerial vehicles (UAVs) offer enormous potential for agile relief coordination in these scenarios. However, effectively leveraging UAV fleets poses additional challenges around security, privacy, and efficient collaboration across response agencies. This paper presents a ro…

    Submitted 23 February, 2024; originally announced February 2024.

    Comments: 6 pages, 4 figures, IEEE 99th Vehicular Technology Conference: VTC2024-Spring, Singapore

  40. arXiv:2402.13116  [pdf, other]

    cs.CL

    A Survey on Knowledge Distillation of Large Language Models

    Authors: Xiaohan Xu, Ming Li, Chongyang Tao, Tao Shen, Reynold Cheng, Jinyang Li, Can Xu, Dacheng Tao, Tianyi Zhou

    Abstract: In the era of Large Language Models (LLMs), Knowledge Distillation (KD) emerges as a pivotal methodology for transferring advanced capabilities from leading proprietary LLMs, such as GPT-4, to their open-source counterparts like LLaMA and Mistral. Additionally, as open-source LLMs flourish, KD plays a crucial role in both compressing these models, and facilitating their self-improvement by employi…

    Submitted 8 March, 2024; v1 submitted 20 February, 2024; originally announced February 2024.

    Comments: 44 pages

  41. arXiv:2402.12699  [pdf, other]

    cond-mat.mtrl-sci

    Positive temperature-dependent thermal conductivity induced by wavelike phonons in complex Ag-based argyrodites

    Authors: Niuchang Ouyang, Dongyi Shen, Chen Wang, Ruihuan Cheng, Qi Wang, Yue Chen

    Abstract: The phonon transport mechanisms and the anomalous temperature-dependent lattice thermal conductivities (kL) in Ag-based argyrodites have not been fully understood. Herein, we systematically study the phonon thermal transport of five Ag-based crystalline argyrodites Ag7PS6, Ag7AsS6, Ag8SnS6, Ag8GeS6 and Ag9GaS6 utilizing perturbation theory and the unified theory thermal transport model. Our result…

    Submitted 19 February, 2024; originally announced February 2024.

    Comments: 6 pages, 4 figures

  42. arXiv:2402.09884  [pdf, other]

    math.AG math.NT math.RT

    $q$-bic threefolds and their surface of lines

    Authors: Raymond Cheng

    Abstract: For any power $q$ of the positive ground field characteristic, a smooth $q$-bic threefold -- the Fermat threefold of degree $q+1$ for example -- has a smooth surface $S$ of lines which behaves like the Fano surface of a smooth cubic threefold. I develop projective, moduli-theoretic, and degeneration techniques to study the geometry of $S$. Using, in addition, the modular representation theory of t…

    Submitted 15 February, 2024; originally announced February 2024.

    Comments: 47 pages. Comments very welcome!

    MSC Class: 14F10; 14J29; 14G17 (primary); 14J70; 14M12; 14N05 (secondary)

  43. Debiasing Recommendation with Personal Popularity

    Authors: Wentao Ning, Reynold Cheng, Xiao Yan, Ben Kao, Nan Huo, Nur AI Hasan Haldar, Bo Tang

    Abstract: Global popularity (GP) bias is the phenomenon that popular items are recommended much more frequently than they should be, which goes against the goal of providing personalized recommendations and harms user experience and recommendation accuracy. Many methods have been proposed to reduce GP bias but they fail to notice the fundamental problem of GP, i.e., it considers popularity from a \textit{gl…

    Submitted 21 February, 2024; v1 submitted 12 February, 2024; originally announced February 2024.

    Comments: Accepted by WWW'24 as a research full paper

  44. arXiv:2402.06071  [pdf, other]

    cs.HC

    Keyframer: Empowering Animation Design using Large Language Models

    Authors: Tiffany Tseng, Ruijia Cheng, Jeffrey Nichols

    Abstract: Large language models (LLMs) have the potential to impact a wide range of creative domains, but the application of LLMs to animation is underexplored and presents novel challenges such as how users might effectively describe motion in natural language. In this paper, we present Keyframer, a design tool for animating static images (SVGs) with natural language. Informed by interviews with profession…

    Submitted 8 February, 2024; originally announced February 2024.

  45. arXiv:2402.03362  [pdf, other]

    cs.IR cs.AI cs.CL

    NanoNER: Named Entity Recognition for nanobiology using experts' knowledge and distant supervision

    Authors: Martin Lentschat, Cyril Labbé, Ran Cheng

    Abstract: Here we present the training and evaluation of NanoNER, a Named Entity Recognition (NER) model for Nanobiology. NER consists in the identification of specific entities in spans of unstructured texts and is often a primary task in Natural Language Processing (NLP) and Information Extraction. The aim of our model is to recognise entities previously identified by domain experts as constituting the es…

    Submitted 30 January, 2024; originally announced February 2024.

  46. arXiv:2401.14201  [pdf]

    astro-ph.EP astro-ph.IM physics.geo-ph

    Investigating Organic Carbon and Thermal History of CM Carbonaceous Chondrites Using Spectroscopy and Laboratory Techniques

    Authors: Safoura Tanbakouei, Rui-Lin Cheng, Binlong Ye, Josep Ryan Michalski, Ashley J. King

    Abstract: The CM chondrites are characterized as primary accretionary rocks which originate from primitive water-rich asteroids formed during the early Solar System. Here, we study the mineralogy and organic characteristics of eight CM and one ungrouped chondrite to better understand their alteration history: Queen Alexandra Range 93005 (QUE 93005), Murchison, LaPaz Icefield 02333 (LAP 02333), Miller Range…

    Submitted 25 January, 2024; originally announced January 2024.

  47. Demonstrating Mobile Manipulation in the Wild: A Metrics-Driven Approach

    Authors: Max Bajracharya, James Borders, Richard Cheng, Dan Helmick, Lukas Kaul, Dan Kruse, John Leichty, Jeremy Ma, Carolyn Matl, Frank Michel, Chavdar Papazov, Josh Petersen, Krishna Shankar, Mark Tjersland

    Abstract: We present our general-purpose mobile manipulation system consisting of a custom robot platform and key algorithms spanning perception and planning. To extensively test the system in the wild and benchmark its performance, we choose a grocery shopping scenario in an actual, unmodified grocery store. We derive key performance metrics from detailed robot log data collected during six week-long field…

    Submitted 2 January, 2024; originally announced January 2024.

    Comments: Presented at RSS 2023 [Best Demo Paper Award]

  48. arXiv:2312.10890  [pdf, other]

    cs.CV cs.GR

    Low-latency Space-time Supersampling for Real-time Rendering

    Authors: Ruian He, Shili Zhou, Yuqi Sun, Ri Cheng, Weimin Tan, Bo Yan

    Abstract: With the rise of real-time rendering and the evolution of display devices, there is a growing demand for post-processing methods that offer high-resolution content at a high frame rate. Existing techniques often suffer from quality and latency issues due to the disjointed treatment of frame supersampling and extrapolation. In this paper, we recognize the shared context and mechanisms between frame…

    Submitted 17 December, 2023; originally announced December 2023.

    Comments: Accepted to AAAI 2024

  49. arXiv:2312.07180  [pdf, other]

    cs.CV

    Context-Aware Iteration Policy Network for Efficient Optical Flow Estimation

    Authors: Ri Cheng, Ruian He, Xuhao Jiang, Shili Zhou, Weimin Tan, Bo Yan

    Abstract: Existing recurrent optical flow estimation networks are computationally expensive since they use a fixed large number of iterations to update the flow field for each sample. An efficient network should skip iterations when the flow improvement is limited. In this paper, we develop a Context-Aware Iteration Policy Network for efficient optical flow estimation, which determines the optimal number of…

    Submitted 5 January, 2024; v1 submitted 12 December, 2023; originally announced December 2023.

    Comments: 2024, Association for the Advancement of Artificial Intelligence

  50. arXiv:2312.05734  [pdf, ps, other]

    math.FA math.NA

    A Duality Approach to Regularized Learning Problems in Banach Spaces

    Authors: Raymond Cheng, Rui Wang, Yuesheng Xu

    Abstract: Learning methods in Banach spaces are often formulated as regularization problems which minimize the sum of a data fidelity term in a Banach norm and a regularization term in another Banach norm. Due to the infinite dimensional nature of the space, solving such regularization problems is challenging. We construct a direct sum space based on the Banach spaces for the data fidelity term and the regu…

    Submitted 9 December, 2023; originally announced December 2023.