Skip to main content

Showing 1–50 of 335 results for author: Cui, C

  1. arXiv:2407.11462  [pdf, other

    astro-ph.IM astro-ph.HE

    X-ray Sources Classification Using Machine Learning: A Study with EP-WXT Pathfinder LEIA

    Authors: Xiaoxiong Zuo, Yihan Tao, Yuan Liu, Yunfei Xu, Wenda Zhang, Haiwu Pan, Hui Sun, Zhen Zhang, Chenzhou Cui, Weimin Yuan

    Abstract: X-ray observations play a crucial role in time-domain astronomy. The Einstein Probe (EP), a recently launched X-ray astronomical satellite, emerges as a forefront player in the field of time-domain astronomy and high-energy astrophysics. With a focus on systematic surveys in the soft X-ray band, EP aims to discover high-energy transients and monitor variable sources in the universe. To achieve the… ▽ More

    Submitted 16 July, 2024; originally announced July 2024.

  2. arXiv:2407.09979  [pdf, other

    cs.CV

    PFPs: Prompt-guided Flexible Pathological Segmentation for Diverse Potential Outcomes Using Large Vision and Language Models

    Authors: Can Cui, Ruining Deng, Junlin Guo, Quan Liu, Tianyuan Yao, Haichun Yang, Yuankai Huo

    Abstract: The Vision Foundation Model has recently gained attention in medical image analysis. Its zero-shot learning capabilities accelerate AI deployment and enhance the generalizability of clinical applications. However, segmenting pathological images presents a special focus on the flexibility of segmentation targets. For instance, a single click on a Whole Slide Image (WSI) could signify a cell, a func… ▽ More

    Submitted 13 July, 2024; originally announced July 2024.

  3. arXiv:2407.09621  [pdf, other

    cs.MS cs.PF math.NA

    Acceleration of Tensor-Product Operations with Tensor Cores

    Authors: Cu Cui

    Abstract: In this paper, we explore the acceleration of tensor product operations in finite element methods, leveraging the computational power of the NVIDIA A100 GPU Tensor Cores. We provide an accessible overview of the necessary mathematical background and discuss our implementation strategies. Our study focuses on two common programming approaches for NVIDIA Tensor Cores: the C++ Warp Matrix Functions i… ▽ More

    Submitted 12 July, 2024; originally announced July 2024.

    ACM Class: G.1.8; G.4

  4. arXiv:2407.09079  [pdf, other

    cond-mat.mtrl-sci physics.comp-ph

    Crossed real nodal-line phonons in gold monobromide

    Authors: Yilin Han, Yichen Liu, Chaoxi Cui, Cheng-Cheng Liu, Zhi-Ming Yu

    Abstract: Spacetime inversion symmetry can generate intriguing types of spinless excitations in crystalline materials. Here, we propose a topological phase protected by spacetime inversion symmetry - the crossed real nodal line (RNL) in the phonon spectrum of gold monobromide (AuBr). In AuBr, there exist four straight nodal lines, which are linked by a crossed nodal line formed by two lower bands. Remarkabl… ▽ More

    Submitted 12 July, 2024; originally announced July 2024.

  5. arXiv:2407.06302  [pdf, other

    astro-ph.IM

    Towards quantum-enhanced long-baseline optical/near-IR interferometry

    Authors: Jayadev K. Rajagopal, Ryan M. Lau, Isack Padilla, Stephen T. Ridgway, Chaohan Cui, Brittany McClinton, Aqil Sajjad, Stuartt Corder, Mark Rawlings, Fredrik Rantakyro, J. Gabriel Richardson, Amit Ashok, Saikat Guha

    Abstract: Microarcsecond resolutions afforded by an optical-NIR array with kilometer-baselines would enable breakthrough science. However significant technology barriers exist in transporting weakly coherent photon states over these distances: primarily photon loss and phase errors. Quantum telescopy, using entangled states to link spatially separated apertures, offers a possible solution to the loss of pho… ▽ More

    Submitted 8 July, 2024; originally announced July 2024.

    Comments: Proceeding of SPIE Conference "Astronomical Telescopes + Instrumentation" (June 2024)

    Report number: Paper No. 13095-58

  6. arXiv:2407.06116  [pdf

    eess.IV cs.CV cs.LG

    Data-driven Nucleus Subclassification on Colon H&E using Style-transferred Digital Pathology

    Authors: Lucas W. Remedios, Shunxing Bao, Samuel W. Remedios, Ho Hin Lee, Leon Y. Cai, Thomas Li, Ruining Deng, Nancy R. Newlin, Adam M. Saunders, Can Cui, Jia Li, Qi Liu, Ken S. Lau, Joseph T. Roland, Mary K Washington, Lori A. Coburn, Keith T. Wilson, Yuankai Huo, Bennett A. Landman

    Abstract: Understanding the way cells communicate, co-locate, and interrelate is essential to furthering our understanding of how the body functions. H&E is widely available, however, cell subtyping often requires expert knowledge and the use of specialized stains. To reduce the annotation burden, AI has been proposed for the classification of cells on H&E. For example, the recent Colon Nucleus Identificati… ▽ More

    Submitted 15 May, 2024; originally announced July 2024.

    Comments: arXiv admin note: text overlap with arXiv:2401.05602

  7. arXiv:2407.04842  [pdf, other

    cs.CV cs.CL cs.LG

    MJ-Bench: Is Your Multimodal Reward Model Really a Good Judge for Text-to-Image Generation?

    Authors: Zhaorun Chen, Yichao Du, Zichen Wen, Yiyang Zhou, Chenhang Cui, Zhenzhen Weng, Haoqin Tu, Chaoqi Wang, Zhengwei Tong, Qinglan Huang, Canyu Chen, Qinghao Ye, Zhihong Zhu, Yuqing Zhang, Jiawei Zhou, Zhuokai Zhao, Rafael Rafailov, Chelsea Finn, Huaxiu Yao

    Abstract: While text-to-image models like DALLE-3 and Stable Diffusion are rapidly proliferating, they often encounter challenges such as hallucination, bias, and the production of unsafe, low-quality output. To effectively address these issues, it is crucial to align these models with desired behaviors based on feedback from a multimodal judge. Despite their significance, current multimodal judges frequent… ▽ More

    Submitted 5 July, 2024; originally announced July 2024.

    Comments: 42 pages, 13 figures, 33 tables

  8. arXiv:2407.03307  [pdf, other

    eess.IV cs.CV

    HoloHisto: End-to-end Gigapixel WSI Segmentation with 4K Resolution Sequential Tokenization

    Authors: Yucheng Tang, Yufan He, Vishwesh Nath, Pengfeig Guo, Ruining Deng, Tianyuan Yao, Quan Liu, Can Cui, Mengmeng Yin, Ziyue Xu, Holger Roth, Daguang Xu, Haichun Yang, Yuankai Huo

    Abstract: In digital pathology, the traditional method for deep learning-based image segmentation typically involves a two-stage process: initially segmenting high-resolution whole slide images (WSI) into smaller patches (e.g., 256x256, 512x512, 1024x1024) and subsequently reconstructing them to their original scale. This method often struggles to capture the complex details and vast scope of WSIs. In this… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

  9. arXiv:2407.02103  [pdf, ps, other

    astro-ph.EP

    Rossby wave instability in weakly ionized protoplanetary disks. I. azimuthal or vertical B-fields

    Authors: Can Cui, Ashutosh Tripathi, Cong Yu, Min-Kai Lin, Andrew Youdin

    Abstract: Rossby wave instability (RWI) is considered the underlying mechanism to crescent-shaped azimuthal asymmetries, discovered in (sub-)millimeter dust continuum of many protoplanetary disks. Previous works on linear theory were conducted in the hydrodynamic limit. Nevertheless, protoplanetary disks are likely magnetized and weakly ionized. We examine the influence of magnetic fields and non-ideal magn… ▽ More

    Submitted 2 July, 2024; originally announced July 2024.

    Comments: 13 pages, 4 figures, submitted to MNRAS

  10. arXiv:2407.00596  [pdf, other

    eess.IV cs.CV

    HATs: Hierarchical Adaptive Taxonomy Segmentation for Panoramic Pathology Image Analysis

    Authors: Ruining Deng, Quan Liu, Can Cui, Tianyuan Yao, Juming Xiong, Shunxing Bao, Hao Li, Mengmeng Yin, Yu Wang, Shilin Zhao, Yucheng Tang, Haichun Yang, Yuankai Huo

    Abstract: Panoramic image segmentation in computational pathology presents a remarkable challenge due to the morphologically complex and variably scaled anatomy. For instance, the intricate organization in kidney pathology spans multiple layers, from regions like the cortex and medulla to functional units such as glomeruli, tubules, and vessels, down to various cell types. In this paper, we propose a novel… ▽ More

    Submitted 30 June, 2024; originally announced July 2024.

    Comments: arXiv admin note: text overlap with arXiv:2402.19286

  11. arXiv:2406.17226  [pdf, other

    math.OC

    Extended alternating structure-adapted proximal gradient algorithm for nonconvex nonsmooth problems

    Authors: Ying Gao, Chunfeng Cui, Wenxing Zhang, Deren Han

    Abstract: Alternating structure-adapted proximal (ASAP) gradient algorithm (M. Nikolova and P. Tan, SIAM J Optim, 29:2053-2078, 2019) has drawn much attention due to its efficiency in solving nonconvex nonsmooth optimization problems. However, the multiblock nonseparable structure confines the performance of ASAP to far-reaching practical problems, e.g., coupled tensor decomposition. In this paper, we propo… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

  12. arXiv:2406.16865  [pdf, other

    cond-mat.str-el

    Variational Monte Carlo Study of the Doped $t$-$J$ Model on Honeycomb Lattice

    Authors: Can Cui, Jing-Yu Zhao, Zheng-Yu Weng

    Abstract: The ground state of the bipartite $t$-$J$ model must satisfy a specific sign structure, based on which the single-hole and two-hole ground state $Ans\ddot{a}tze$ on honeycomb lattice are constructed and studied by a variational Monte Carlo (VMC) method. The VMC results are in good agreement with the exact diagonalization (ED) calculation. For the single-hole case, the degenerate ground states are… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

    Comments: 14 pages, 11 figures

  13. arXiv:2406.16168  [pdf, other

    cs.LG

    An All-MLP Sequence Modeling Architecture That Excels at Copying

    Authors: Chenwei Cui, Zehao Yan, Gedeon Muhawenayo, Hannah Kerner

    Abstract: Recent work demonstrated Transformers' ability to efficiently copy strings of exponential sizes, distinguishing them from other architectures. We present the Causal Relation Network (CausalRN), an all-MLP sequence modeling architecture that can match Transformers on the copying task. Extending Relation Networks (RNs), we implemented key innovations to support autoregressive sequence modeling while… ▽ More

    Submitted 23 June, 2024; originally announced June 2024.

    Comments: Accepted by ICML 2024 Next Generation of Sequence Modeling Architectures Workshop

  14. arXiv:2406.15787  [pdf, other

    eess.SY

    On Physics-Informed Neural Network Control for Power Electronics

    Authors: Peifeng Hui, Chenggang Cui, Pengfeng Lin, Amer M. Y. M. Ghias, Xitong Niu, Chuanlin Zhang

    Abstract: Considering the growing necessity for precise modeling of power electronics amidst operational and environmental uncertainties, this paper introduces an innovative methodology that ingeniously combines model-driven and data-driven approaches to enhance the stability of power electronics interacting with grid-forming microgrids. By employing the physics-informed neural network (PINN) as a foundatio… ▽ More

    Submitted 22 June, 2024; originally announced June 2024.

  15. arXiv:2406.13920  [pdf, other

    cs.LG cs.SI

    Explainable AI Security: Exploring Robustness of Graph Neural Networks to Adversarial Attacks

    Authors: Tao Wu, Canyixing Cui, Xingping Xian, Shaojie Qiao, Chao Wang, Lin Yuan, Shui Yu

    Abstract: Graph neural networks (GNNs) have achieved tremendous success, but recent studies have shown that GNNs are vulnerable to adversarial attacks, which significantly hinders their use in safety-critical scenarios. Therefore, the design of robust GNNs has attracted increasing attention. However, existing research has mainly been conducted via experimental trial and error, and thus far, there remains a… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

  16. arXiv:2406.13499  [pdf, other

    cs.SI cs.LG

    GraphMU: Repairing Robustness of Graph Neural Networks via Machine Unlearning

    Authors: Tao Wu, Xinwen Cao, Chao Wang, Shaojie Qiao, Xingping Xian, Lin Yuan, Canyixing Cui, Yanbing Liu

    Abstract: Graph Neural Networks (GNNs) have demonstrated significant application potential in various fields. However, GNNs are still vulnerable to adversarial attacks. Numerous adversarial defense methods on GNNs are proposed to address the problem of adversarial attacks. However, these methods can only serve as a defense before poisoning, but cannot repair poisoned GNN. Therefore, there is an urgent need… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

  17. arXiv:2406.12628  [pdf, other

    eess.SY

    Large Language Models based Multi-Agent Framework for Objective Oriented Control Design in Power Electronics

    Authors: Chenggang Cui, Jiaming Liu, Junkang Feng, Peifeng Hui, Amer M. Y. M. Ghias, Chuanlin Zhang

    Abstract: Power electronics, a critical component in modern power systems, face several challenges in control design, including model uncertainties, and lengthy and costly design cycles. This paper is aiming to propose a Large Language Models (LLMs) based multi-agent framework for objective-oriented control design in power electronics. The framework leverages the reasoning capabilities of LLMs and a multi-a… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

    Comments: 6 pages, 6 figures

  18. arXiv:2406.12373  [pdf, other

    cs.CL cs.AI cs.LG

    WebCanvas: Benchmarking Web Agents in Online Environments

    Authors: Yichen Pan, Dehan Kong, Sida Zhou, Cheng Cui, Yifei Leng, Bing Jiang, Hangyu Liu, Yanyi Shang, Shuyan Zhou, Tongshuang Wu, Zhengyang Wu

    Abstract: For web agents to be practically useful, they must adapt to the continuously evolving web environment characterized by frequent updates to user interfaces and content. However, most existing benchmarks only capture the static aspects of the web. To bridge this gap, we introduce WebCanvas, an innovative online evaluation framework for web agents that effectively addresses the dynamic nature of web… ▽ More

    Submitted 16 July, 2024; v1 submitted 18 June, 2024; originally announced June 2024.

    Comments: Our platform, tool and dataset are publically available at https://www.imean.ai/web-canvas/ and https://huggingface.co/datasets/iMeanAI/Mind2Web-Live/

    MSC Class: 68T50 ACM Class: I.2.7

  19. arXiv:2406.11273  [pdf, other

    cond-mat.mes-hall cond-mat.mtrl-sci

    Planar Hall Plateau in Magnetic Weyl Semimetals

    Authors: Lei Li, Chaoxi Cui, Run-Wu Zhang, Zhi-Ming Yu, Yugui Yao

    Abstract: Despite the rapid progress in the study of planar Hall effect (PHE) in recent years, all the previous works only showed that the PHE is connected to local geometric quantities, such as Berry curvature. Here, for the first time, we point out that the PHE in magnetic Weyl semimetals is directly related to a global quantity, namely, the Chern number of the Weyl point. This leads to a remarkable conse… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: 15 pages, 5 figures

  20. arXiv:2406.00225  [pdf, other

    cs.ET cond-mat.mes-hall

    Kinematic Model of Magnetic Domain Wall Motion for Fast, High-Accuracy Simulations

    Authors: Kristi Doleh, Leonard Humphrey, Chandler M. Linseisen, Michael D. Kitcher, Joanna M. Martin, Can Cui, Jean Anne C. Incorvia, Felipe Garcia-Sanchez, Naimul Hassan, Alexander J. Edwards, Joseph S. Friedman

    Abstract: Domain wall (DW) devices have garnered recent interest for diverse applications including memory, logic, and neuromorphic primitives; fast, accurate device models are therefore imperative for large-scale system design and verification. Extant DW motion models are sub-optimal for large-scale system design either over-consuming compute resources with physics-heavy equations or oversimplifying the ph… ▽ More

    Submitted 31 May, 2024; originally announced June 2024.

  21. arXiv:2405.19004  [pdf, ps, other

    math.NA

    An implementation of tensor product patch smoothers on GPU

    Authors: Cu Cui, Paul Grosse-Bley, Guido Kanschat, Robert Strzodka

    Abstract: We present a GPU implementation of vertex-patch smoothers for higher order finite element methods in two and three dimensions. Analysis shows that they are not memory bound with respect to GPU DRAM, but with respect to on-chip scratchpad memory. Multigrid operations are optimized through localization and reorganized local operations in on-chip memory, achieving minimal global data transfer and a c… ▽ More

    Submitted 30 May, 2024; v1 submitted 29 May, 2024; originally announced May 2024.

    MSC Class: 65N55; 65Y20

  22. arXiv:2405.18982  [pdf, other

    math.NA

    Multilevel Interior Penalty Methods on GPUs

    Authors: Cu Cui, Guido Kanschat

    Abstract: We present a matrix-free multigrid method for high-order discontinuous Galerkin (DG) finite element methods with GPU acceleration. A performance analysis is conducted, comparing various data and compute layouts. Smoother implementations are optimized through localization and fast diagonalization techniques. Leveraging conflict-free access patterns in shared memory, arithmetic throughput of up to 3… ▽ More

    Submitted 30 May, 2024; v1 submitted 29 May, 2024; originally announced May 2024.

    MSC Class: 65N55; 65Y20

  23. arXiv:2405.17824  [pdf, other

    cs.CV

    mTREE: Multi-Level Text-Guided Representation End-to-End Learning for Whole Slide Image Analysis

    Authors: Quan Liu, Ruining Deng, Can Cui, Tianyuan Yao, Vishwesh Nath, Yucheng Tang, Yuankai Huo

    Abstract: Multi-modal learning adeptly integrates visual and textual data, but its application to histopathology image and text analysis remains challenging, particularly with large, high-resolution images like gigapixel Whole Slide Images (WSIs). Current methods typically rely on manual region labeling or multi-stage learning to assemble local representations (e.g., patch-level) into global features (e.g.,… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

  24. arXiv:2405.17792  [pdf, other

    hep-ex hep-ph

    JUNO Sensitivity to Invisible Decay Modes of Neutrons

    Authors: JUNO Collaboration, Angel Abusleme, Thomas Adam, Kai Adamowicz, Shakeel Ahmad, Rizwan Ahmed, Sebastiano Aiello, Fengpeng An, Qi An, Giuseppe Andronico, Nikolay Anfimov, Vito Antonelli, Tatiana Antoshkina, João Pedro Athayde Marcondes de André, Didier Auguste, Weidong Bai, Nikita Balashov, Wander Baldini, Andrea Barresi, Davide Basilico, Eric Baussan, Marco Bellato, Marco Beretta, Antonio Bergnoli, Daniel Bick , et al. (635 additional authors not shown)

    Abstract: We explore the bound neutrons decay into invisible particles (e.g., $n\rightarrow 3 ν$ or $nn \rightarrow 2 ν$) in the JUNO liquid scintillator detector. The invisible decay includes two decay modes: $ n \rightarrow { inv} $ and $ nn \rightarrow { inv} $. The invisible decays of $s$-shell neutrons in $^{12}{\rm C}$ will leave a highly excited residual nucleus. Subsequently, some de-excitation mode… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

    Comments: 28 pages, 7 figures, 4 tables

  25. arXiv:2405.15410  [pdf, other

    cond-mat.mes-hall cond-mat.mtrl-sci

    Electric Hall Effect and Quantum Electric Hall Effect

    Authors: Chaoxi Cui, Run-Wu Zhang, Yilin Han, Zhi-Ming Yu, Yugui Yao

    Abstract: Exploring new Hall effect is always a fascinating research topic. The ordinary Hall effect and the quantum Hall effect, initially discovered in two-dimensional (2D) non-magnetic systems, are the phenomena that a transverse current is generated when a system carrying an electron current is placed in a magnetic field perpendicular to the currents. In this work, we propose the electric counterparts o… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

    Comments: 7 pages, 3 figures

  26. arXiv:2405.14622  [pdf, other

    cs.LG cs.CL cs.CV

    Calibrated Self-Rewarding Vision Language Models

    Authors: Yiyang Zhou, Zhiyuan Fan, Dongjie Cheng, Sihan Yang, Zhaorun Chen, Chenhang Cui, Xiyao Wang, Yun Li, Linjun Zhang, Huaxiu Yao

    Abstract: Large Vision-Language Models (LVLMs) have made substantial progress by integrating pre-trained large language models (LLMs) and vision models through instruction tuning. Despite these advancements, LVLMs often exhibit the hallucination phenomenon, where generated text responses appear linguistically plausible but contradict the input image, indicating a misalignment between image and text pairs. T… ▽ More

    Submitted 31 May, 2024; v1 submitted 23 May, 2024; originally announced May 2024.

    Comments: fix some typos and add acknowledgement section in V3

  27. arXiv:2405.11210  [pdf, other

    cs.CE cond-mat.mtrl-sci physics.app-ph physics.chem-ph

    Computational predictions of hydrogen-assisted fatigue crack growth

    Authors: C. Cui, P. Bortot, M. Ortolani, E. Martínez-Pañeda

    Abstract: A new model is presented to predict hydrogen-assisted fatigue. The model combines a phase field description of fracture and fatigue, stress-assisted hydrogen diffusion, and a toughness degradation formulation with cyclic and hydrogen contributions. Hydrogen-assisted fatigue crack growth predictions exhibit an excellent agreement with experiments over all the scenarios considered, spanning multiple… ▽ More

    Submitted 18 May, 2024; originally announced May 2024.

  28. arXiv:2405.09965  [pdf, other

    cs.SE

    Leveraging Large Language Models for Automated Web-Form-Test Generation: An Empirical Study

    Authors: Tao Li, Chenhui Cui, Lei Ma, Dave Towey, Yujie Xie, Rubing Huang

    Abstract: The testing of web forms is an essential activity for ensuring the quality of web applications, which mainly involves evaluating the interactions between users and forms. Automated test-case generation remains a challenge for web-form testing: Due to the complex, multi-level structure of web pages, it can be difficult to automatically capture their inherent contextual information for inclusion in… ▽ More

    Submitted 16 May, 2024; originally announced May 2024.

  29. arXiv:2405.06059  [pdf, other

    cs.CL cs.AI

    A Mixture-of-Experts Approach to Few-Shot Task Transfer in Open-Ended Text Worlds

    Authors: Christopher Z. Cui, Xiangyu Peng, Mark O. Riedl

    Abstract: Open-ended worlds are those in which there are no pre-specified goals or environmental reward signal. As a consequence, an agent must know how to perform a multitude of tasks. However, when a new task is presented to an agent, we expect it to be able to reuse some of what it knows from previous tasks to rapidly learn that new task. We introduce a novel technique whereby policies for different a pr… ▽ More

    Submitted 9 May, 2024; originally announced May 2024.

  30. arXiv:2405.03160  [pdf, ps, other

    math.RA

    Moore Determinant of Dual Quaternion Hermitian Matrices

    Authors: Chunfeng Cui, Liqun Qi, Guangjing Song, Qingwen Wang

    Abstract: In this paper, we extend the Chen and Moore determinants of quaternion Hermitian} matrices to dual quaternion Hermitian matrices. We show the Chen determinant of dual quaternion Hermitian {matrices is invariant under addition, switching, multiplication, and unitary operations at the both hand sides. We then show the Chen and Moore determinants of dual quaternion Hermitian matrices are equal to eac… ▽ More

    Submitted 18 May, 2024; v1 submitted 6 May, 2024; originally announced May 2024.

  31. arXiv:2404.18560  [pdf, other

    math.OC cs.RO

    Non-convex Pose Graph Optimization in SLAM via Proximal Linearized Riemannian ADMM

    Authors: Xin Chen, Chunfeng Cui, Deren Han, Liqun Qi

    Abstract: Pose graph optimization (PGO) is a well-known technique for solving the pose-based simultaneous localization and mapping (SLAM) problem. In this paper, we represent the rotation and translation by a unit quaternion and a three-dimensional vector, and propose a new PGO model based on the von Mises-Fisher distribution. The constraints derived from the unit quaternions are spherical manifolds, and th… ▽ More

    Submitted 29 April, 2024; originally announced April 2024.

  32. arXiv:2404.18416  [pdf, other

    cs.AI cs.CL cs.CV cs.LG

    Capabilities of Gemini Models in Medicine

    Authors: Khaled Saab, Tao Tu, Wei-Hung Weng, Ryutaro Tanno, David Stutz, Ellery Wulczyn, Fan Zhang, Tim Strother, Chunjong Park, Elahe Vedadi, Juanma Zambrano Chaves, Szu-Yeu Hu, Mike Schaekermann, Aishwarya Kamath, Yong Cheng, David G. T. Barrett, Cathy Cheung, Basil Mustafa, Anil Palepu, Daniel McDuff, Le Hou, Tomer Golany, Luyang Liu, Jean-baptiste Alayrac, Neil Houlsby , et al. (42 additional authors not shown)

    Abstract: Excellence in a wide variety of medical applications poses considerable challenges for AI, requiring advanced reasoning, access to up-to-date medical knowledge and understanding of complex multimodal data. Gemini models, with strong general capabilities in multimodal and long-context reasoning, offer exciting possibilities in medicine. Building on these core strengths of Gemini, we introduce Med-G… ▽ More

    Submitted 1 May, 2024; v1 submitted 29 April, 2024; originally announced April 2024.

  33. arXiv:2404.17949  [pdf, other

    cs.CL

    Transfer Learning Enhanced Single-choice Decision for Multi-choice Question Answering

    Authors: Chenhao Cui, Yufan Jiang, Shuangzhi Wu, Zhoujun Li

    Abstract: Multi-choice Machine Reading Comprehension (MMRC) aims to select the correct answer from a set of options based on a given passage and question. The existing methods employ the pre-trained language model as the encoder, share and transfer knowledge through fine-tuning.These methods mainly focus on the design of exquisite mechanisms to effectively capture the relationships among the triplet of pass… ▽ More

    Submitted 27 April, 2024; originally announced April 2024.

    Comments: 10 pages, 1 figures.This article supersedes arXiv:2011.03292

  34. arXiv:2404.16745  [pdf, other

    stat.ME

    Statistical Inference for Covariate-Adjusted and Interpretable Generalized Factor Model with Application to Testing Fairness

    Authors: Jing Ouyang, Chengyu Cui, Kean Ming Tan, Gongjun Xu

    Abstract: In the era of data explosion, statisticians have been developing interpretable and computationally efficient statistical methods to measure latent factors (e.g., skills, abilities, and personalities) using large-scale assessment data. In addition to understanding the latent information, the covariate effect on responses controlling for latent factors is also of great scientific interest and has wi… ▽ More

    Submitted 25 April, 2024; originally announced April 2024.

  35. arXiv:2404.16425  [pdf, other

    astro-ph.HE

    Soft X-ray prompt emission from a high-redshift gamma-ray burst EP240315a

    Authors: Y. Liu, H. Sun, D. Xu, D. S. Svinkin, J. Delaunay, N. R. Tanvir, H. Gao, C. Zhang, Y. Chen, X. -F. Wu, B. Zhang, W. Yuan, J. An, G. Bruni, D. D. Frederiks, G. Ghirlanda, J. -W. Hu, A. Li, C. -K. Li, J. -D. Li, D. B. Malesani, L. Piro, G. Raman, R. Ricci, E. Troja , et al. (170 additional authors not shown)

    Abstract: Long gamma-ray bursts (GRBs) are believed to originate from core collapse of massive stars. High-redshift GRBs can probe the star formation and reionization history of the early universe, but their detection remains rare. Here we report the detection of a GRB triggered in the 0.5--4 keV band by the Wide-field X-ray Telescope (WXT) on board the Einstein Probe (EP) mission, designated as EP240315a,… ▽ More

    Submitted 25 April, 2024; originally announced April 2024.

    Comments: 41 pages, 8 figures, 7 tables

  36. arXiv:2404.15690  [pdf, other

    cs.CL cs.LG

    Neural Proto-Language Reconstruction

    Authors: Chenxuan Cui, Ying Chen, Qinxin Wang, David R. Mortensen

    Abstract: Proto-form reconstruction has been a painstaking process for linguists. Recently, computational models such as RNN and Transformers have been proposed to automate this process. We take three different approaches to improve upon previous methods, including data augmentation to recover missing reflexes, adding a VAE structure to the Transformer model for proto-to-language prediction, and using a neu… ▽ More

    Submitted 24 April, 2024; originally announced April 2024.

  37. arXiv:2404.08948  [pdf, other

    cs.SE

    Large Language Models for Mobile GUI Text Input Generation: An Empirical Study

    Authors: Chenhui Cui, Tao Li, Junjie Wang, Chunyang Chen, Dave Towey, Rubing Huang

    Abstract: Mobile applications (apps) have become an essential part of our daily lives, making ensuring their quality an important activity. GUI testing, a quality assurance method, has frequently been used for mobile apps. When conducting GUI testing, it is important to generate effective text inputs for the text-input components. Some GUIs require these text inputs to move from one page to the next, which… ▽ More

    Submitted 13 April, 2024; originally announced April 2024.

  38. arXiv:2404.06054  [pdf, other

    eess.SP

    Pseudo MIMO (pMIMO): An Energy and Spectral Efficient MIMO-OFDM System

    Authors: Sen Wang, Tianxiong Wang, Shulun Zhao, Zhen Feng, Guangyi Liu, Chunfeng Cui, Chih-Lin I, Jiangzhou Wang

    Abstract: This article introduces an energy and spectral efficient multiple-input multiple-output orthogonal frequency division multiplexing (MIMO-OFDM) transmission scheme designed for the future sixth generation (6G) wireless communication networks. The approach involves connecting each receiving radio frequency (RF) chain with multiple antenna elements and conducting sample-level adjustments for receivin… ▽ More

    Submitted 9 April, 2024; originally announced April 2024.

  39. arXiv:2404.03789  [pdf, other

    cs.CV cs.AI

    Quantifying Uncertainty in Motion Prediction with Variational Bayesian Mixture

    Authors: Juanwu Lu, Can Cui, Yunsheng Ma, Aniket Bera, Ziran Wang

    Abstract: Safety and robustness are crucial factors in developing trustworthy autonomous vehicles. One essential aspect of addressing these factors is to equip vehicles with the capability to predict future trajectories for all moving objects in the surroundings and quantify prediction uncertainties. In this paper, we propose the Sequential Neural Variational Agent (SeNeVA), a generative model that describe… ▽ More

    Submitted 4 April, 2024; originally announced April 2024.

    Comments: Accepted at CVPR 2024

  40. arXiv:2404.02493  [pdf, other

    math.NA

    A Neural Multigrid Solver for Helmholtz Equations with High Wavenumber and Heterogeneous Media

    Authors: Chen Cui, Kai Jiang, Shi Shu

    Abstract: Solving high-wavenumber and heterogeneous Helmholtz equations presents a long-standing challenge in scientific computing. In this paper, we introduce a deep learning-enhanced multigrid solver to address this issue. By conducting error analysis on standard multigrid applied to a discrete Helmholtz equation, we devise a strategy to handle errors with different frequencies separately. For error com… ▽ More

    Submitted 3 April, 2024; originally announced April 2024.

    MSC Class: 65N22; 65N55; 68T07

  41. arXiv:2403.17560  [pdf, other

    cond-mat.supr-con

    Anomalous shift in Andreev reflection from side incidence

    Authors: Runze Li, Chaoxi Cui, Ying Liu, Zhi-Ming Yu, Shengyuan A. Yang

    Abstract: Andreev reflection at a normal-superconductor interface may be accompanied with an anomalous spatial shift. The studies so far are limited to the top incidence configuration. Here, we investigate this effect in the side incidence configuration, with the interface parallel to the principal axis of superconductor. We find that the shift exhibits rich behaviors reflecting the character of pair potent… ▽ More

    Submitted 26 March, 2024; originally announced March 2024.

  42. arXiv:2403.13358  [pdf, other

    cs.RO cs.CV cs.LG

    GeRM: A Generalist Robotic Model with Mixture-of-experts for Quadruped Robot

    Authors: Wenxuan Song, Han Zhao, Pengxiang Ding, Can Cui, Shangke Lyu, Yaning Fan, Donglin Wang

    Abstract: Multi-task robot learning holds significant importance in tackling diverse and complex scenarios. However, current approaches are hindered by performance issues and difficulties in collecting training datasets. In this paper, we propose GeRM (Generalist Robotic Model). We utilize offline reinforcement learning to optimize data utilization strategies to learn from both demonstrations and sub-optima… ▽ More

    Submitted 9 April, 2024; v1 submitted 20 March, 2024; originally announced March 2024.

  43. arXiv:2403.10308  [pdf, other

    math.NA math.RA

    Eigenvalues of Dual Hermitian Matrices with Application in Formation Control

    Authors: Liqun Qi, Chunfeng Cui

    Abstract: We propose a supplement matrix method for computing eigenvalues of a dual Hermitian matrix, and discuss its application in multi-agent formation control. Suppose we have a ring, which can be the real field, the complex field, or the quaternion ring. We study dual number symmetric matrices, dual complex Hermitian matrices and dual quaternion Hermitian matrices in a unified frame of dual Hermitian m… ▽ More

    Submitted 1 April, 2024; v1 submitted 15 March, 2024; originally announced March 2024.

    Comments: arXiv admin note: text overlap with arXiv:2402.12988

  44. arXiv:2403.09094  [pdf, other

    astro-ph.IM astro-ph.GA

    Digitization of Astronomical Photographic Plate of China and Astrometric Measurement of Single-exposure Plates

    Authors: Zheng-Jun Shang, Yong Yu, Liang-Liang Wang, Mei-Ting Yang, Jing Yang, Shi-Yin Shen, Min Liu, Quan-Feng Xu, Chen-Zhou Cui, Dong-Wei Fan, Zheng-Hong Tang, Jian-Hai Zhao

    Abstract: From the mid-19th century to the end of the 20th century, photographic plates served as the primary detectors for astronomical observations. Astronomical photographic observations in China began in 1901, and over a century, a total of approximately 30,000 astronomical photographic plates have been captured. These historical plates play an irreplaceable role in conducting long-term, time-domain ast… ▽ More

    Submitted 14 March, 2024; originally announced March 2024.

    Comments: Accepted for Research in Astronomy and Astrophysics, 17 pages, 14 figures, 6 tables. Database, https://nadc.china-vo.org/res/r100742/

  45. arXiv:2403.08167  [pdf, other

    cs.LG cs.CL q-bio.QM

    MolBind: Multimodal Alignment of Language, Molecules, and Proteins

    Authors: Teng Xiao, Chao Cui, Huaisheng Zhu, Vasant G. Honavar

    Abstract: Recent advancements in biology and chemistry have leveraged multi-modal learning, integrating molecules and their natural language descriptions to enhance drug discovery. However, current pre-training frameworks are limited to two modalities, and designing a unified network to process different modalities (e.g., natural language, 2D molecular graphs, 3D molecular conformations, and 3D proteins) re… ▽ More

    Submitted 2 April, 2024; v1 submitted 12 March, 2024; originally announced March 2024.

  46. arXiv:2403.06570  [pdf, other

    cs.CL

    Improving Speaker Assignment in Speaker-Attributed ASR for Real Meeting Applications

    Authors: Can Cui, Imran Ahamad Sheikh, Mostafa Sadeghi, Emmanuel Vincent

    Abstract: Past studies on end-to-end meeting transcription have focused on model architecture and have mostly been evaluated on simulated meeting data. We present a novel study aiming to optimize the use of a Speaker-Attributed ASR (SA-ASR) system in real-life scenarios, such as the AMI meeting corpus, for improved speaker assignment of speech segments. First, we propose a pipeline tailored to real-life app… ▽ More

    Submitted 11 March, 2024; originally announced March 2024.

    Comments: Submitted to Odyssey 2024

  47. arXiv:2403.01145  [pdf, other

    cond-mat.mtrl-sci

    Mirror real Chern insulator in two and three dimensions

    Authors: Yang Wang, Chaoxi Cui, Run-Wu Zhang, Xiaotian Wang, Zhi-Ming Yu, Gui-Bin Liu, Yugui Yao

    Abstract: A real Chern insulator (RCI) featuring a real Chern number and a second-order boundary mode appears in a two-dimensional (2D) system with the space-time inversion symmetry (PT ). Here, we propose a kind of RCI: mirror real Chern insulator (MRCI) which emerges from the system having additional horizontal mirror symmetry Mz. The MRCI generally is characterized by two independent real Chern numbers,… ▽ More

    Submitted 6 March, 2024; v1 submitted 2 March, 2024; originally announced March 2024.

  48. arXiv:2403.00371  [pdf, other

    cond-mat.mtrl-sci physics.comp-ph

    Quasi-one-dimensional spin transport in altermagnetic $Z^3$ nodal net metals

    Authors: Tingli He, Lei Li, Chaoxi Cui, Run-Wu Zhang, Zhi-Ming Yu, Guodong Liu, Xiaoming Zhang

    Abstract: In three dimensions, quasi-one-dimensional (Q1D) transport has traditionally been associated with systems featuring a Q1D chain structure. Here, based on first-principle calculations, we go beyond the common belief to show that the Q1D transport can also be realized in many three-dimensional (3D) altermagnetic (AM) metals with a topological nodal net in momentum space but lacking Q1D chain structu… ▽ More

    Submitted 3 April, 2024; v1 submitted 1 March, 2024; originally announced March 2024.

  49. arXiv:2402.19286  [pdf, other

    eess.IV cs.CV

    PrPSeg: Universal Proposition Learning for Panoramic Renal Pathology Segmentation

    Authors: Ruining Deng, Quan Liu, Can Cui, Tianyuan Yao, Jialin Yue, Juming Xiong, Lining Yu, Yifei Wu, Mengmeng Yin, Yu Wang, Shilin Zhao, Yucheng Tang, Haichun Yang, Yuankai Huo

    Abstract: Understanding the anatomy of renal pathology is crucial for advancing disease diagnostics, treatment evaluation, and clinical research. The complex kidney system comprises various components across multiple levels, including regions (cortex, medulla), functional units (glomeruli, tubules), and cells (podocytes, mesangial cells in glomerulus). Prior studies have predominantly overlooked the intrica… ▽ More

    Submitted 20 March, 2024; v1 submitted 29 February, 2024; originally announced February 2024.

    Comments: IEEE / CVF Computer Vision and Pattern Recognition Conference 2024

  50. arXiv:2402.15771  [pdf, ps, other

    math.OC

    Inertial Accelerated Stochastic Mirror Descent for Large-Scale Generalized Tensor CP Decomposition

    Authors: Zehui Liu, Qingsong Wang, Chunfeng Cui, Yong Xia

    Abstract: The majority of classic tensor CP decomposition models are designed for squared loss, employing Euclidean distance as a local proximal term. However, the Euclidean distance is unsuitable for the generalized loss function applicable to various types of real-world data, such as integer and binary data. Consequently, algorithms developed under the squared loss are not easily adaptable to handle these… ▽ More

    Submitted 24 February, 2024; originally announced February 2024.