Skip to main content

Showing 1–50 of 571 results for author: Shi, M

  1. arXiv:2407.11213  [pdf, other

    cs.CV

    OpenPSG: Open-set Panoptic Scene Graph Generation via Large Multimodal Models

    Authors: Zijian Zhou, Zheng Zhu, Holger Caesar, Miaojing Shi

    Abstract: Panoptic Scene Graph Generation (PSG) aims to segment objects and recognize their relations, enabling the structured understanding of an image. Previous methods focus on predicting predefined object and relation categories, hence limiting their applications in the open world scenarios. With the rapid development of large multimodal models (LMMs), significant progress has been made in open-set obje… ▽ More

    Submitted 15 July, 2024; originally announced July 2024.

    Comments: ECCV 2024

  2. arXiv:2407.08813  [pdf, other

    eess.IV cs.AI cs.CV

    FairDomain: Achieving Fairness in Cross-Domain Medical Image Segmentation and Classification

    Authors: Yu Tian, Congcong Wen, Min Shi, Muhammad Muneeb Afzal, Hao Huang, Muhammad Osama Khan, Yan Luo, Yi Fang, Mengyu Wang

    Abstract: Addressing fairness in artificial intelligence (AI), particularly in medical AI, is crucial for ensuring equitable healthcare outcomes. Recent efforts to enhance fairness have introduced new methodologies and datasets in medical AI. However, the fairness issue under the setting of domain transfer is almost unexplored, while it is common that clinics rely on different imaging technologies (e.g., di… ▽ More

    Submitted 11 July, 2024; originally announced July 2024.

    Comments: ECCV 2024; Codes available at https://github.com/Harvard-Ophthalmology-AI-Lab/FairDomain

  3. arXiv:2407.08507  [pdf, other

    cs.CV

    Bootstrapping Vision-language Models for Self-supervised Remote Physiological Measurement

    Authors: Zijie Yue, Miaojing Shi, Hanli Wang, Shuai Ding, Qijun Chen, Shanlin Yang

    Abstract: Facial video-based remote physiological measurement is a promising research area for detecting human vital signs (e.g., heart rate, respiration frequency) in a non-contact way. Conventional approaches are mostly supervised learning, requiring extensive collections of facial videos and synchronously recorded photoplethysmography (PPG) signals. To tackle it, self-supervised learning has recently gai… ▽ More

    Submitted 11 July, 2024; originally announced July 2024.

  4. arXiv:2406.18263  [pdf, other

    physics.chem-ph

    A Pre-trained Deep Potential Model for Sulfide Solid Electrolytes with Broad Coverage and High Accuracy

    Authors: Ruoyu Wang, Mingyu Guo, Yuxiang Gao, Xiaoxu Wang, Yuzhi Zhang, Bin Deng, Xin Chen, Mengchao Shi, Linfeng Zhang, Zhicheng Zhong

    Abstract: Solid electrolytes with fast ion transport are one of the key challenges for solid state lithium metal batteries. To improve ion conductivity, chemical doping has been the most effective strategy, and atomistic simulation with machine-learning potential helps find optimized doping by predicting ion conductivity for arbitrary composition. Yet most existing machine-learning models are trained on nar… ▽ More

    Submitted 26 June, 2024; originally announced June 2024.

  5. arXiv:2406.16425  [pdf, other

    cond-mat.str-el cond-mat.mtrl-sci

    Spin order and dynamics in the topological rare-earth germanide semimetals

    Authors: Yuhao Wang, Zhixuan Zhen, Jing Meng, Igor Plokhikh, Delong Wu, Dariusz J. Gawryluk, Yang Xu, Qingfeng Zhan, Ming Shi, Ekaterina Pomjakushina, Toni Shiroka, Tian Shang

    Abstract: The $RE$Al(Si,Ge) ($RE$ = rare earth) family, known to break both the inversion- and time-reversal symmetries, represents one of the most suitable platforms for investigating the interplay between correlated-electron phenomena and topologically nontrivial bands. Here, we report on systematic magnetic, transport, and muon-spin rotation and relaxation ($μ$SR) measurements on (Nd,Sm)AlGe single cryst… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

    Comments: 13 pages, 14 figures

  6. arXiv:2406.16039  [pdf, other

    cs.CV

    CholecInstanceSeg: A Tool Instance Segmentation Dataset for Laparoscopic Surgery

    Authors: Oluwatosin Alabi, Ko Ko Zayar Toe, Zijian Zhou, Charlie Budd, Nicholas Raison, Miaojing Shi, Tom Vercauteren

    Abstract: In laparoscopic and robotic surgery, precise tool instance segmentation is an essential technology for advanced computer-assisted interventions. Although publicly available procedures of routine surgeries exist, they often lack comprehensive annotations for tool instance segmentation. Additionally, the majority of standard datasets for tool segmentation are derived from porcine(pig) surgeries. To… ▽ More

    Submitted 23 June, 2024; originally announced June 2024.

  7. CMDS: Cross-layer Dataflow Optimization for DNN Accelerators Exploiting Multi-bank Memories

    Authors: Man Shi, Steven Colleman, Charlotte VanDeMieroop, Antony Joseph, Maurice Meijer, Wim Dehaene, Marian Verhelst

    Abstract: Deep neural networks (DNN) use a wide range of network topologies to achieve high accuracy within diverse applications. This model diversity makes it impossible to identify a single "dataflow" (execution schedule) to perform optimally across all possible layers and network topologies. Several frameworks support the exploration of the best dataflow for a given DNN layer and hardware. However, switc… ▽ More

    Submitted 14 June, 2024; originally announced June 2024.

    Journal ref: 2023 24th International Symposium on Quality Electronic Design (ISQED)

  8. COAC: Cross-layer Optimization of Accelerator Configurability for Efficient CNN Processing

    Authors: Steven Colleman, Man Shi, Marian Verhelst

    Abstract: To achieve high accuracy, convolutional neural networks (CNNs) are increasingly growing in complexity and diversity in layer types and topologies. This makes it very challenging to efficiently deploy such networks on custom processor architectures for resource-scarce edge devices. Existing mapping exploration frameworks enable searching for the optimal execution schedules or hardware mappings of i… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

    Comments: 14 pages,17 figures.Journal IEEE Transactions on Very Large Scale Integration (VLSI) Systems

    Journal ref: in IEEE Transactions on Very Large Scale Integration (VLSI) Systems, vol. 31, no. 7, pp. 945-958, July 2023

  9. arXiv:2406.04595  [pdf, other

    cs.SD cs.CL eess.AS

    Pitch-Aware RNN-T for Mandarin Chinese Mispronunciation Detection and Diagnosis

    Authors: Xintong Wang, Mingqian Shi, Ye Wang

    Abstract: Mispronunciation Detection and Diagnosis (MDD) systems, leveraging Automatic Speech Recognition (ASR), face two main challenges in Mandarin Chinese: 1) The two-stage models create an information gap between the phoneme or tone classification stage and the MDD stage. 2) The scarcity of Mandarin MDD datasets limits model training. In this paper, we introduce a stateless RNN-T model for Mandarin MDD,… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: Accepted at Interspeech 2024

  10. arXiv:2406.00545  [pdf, ps, other

    cs.CV cs.AI

    Memory-guided Network with Uncertainty-based Feature Augmentation for Few-shot Semantic Segmentation

    Authors: Xinyue Chen, Miaojing Shi

    Abstract: The performance of supervised semantic segmentation methods highly relies on the availability of large-scale training data. To alleviate this dependence, few-shot semantic segmentation (FSS) is introduced to leverage the model trained on base classes with sufficient data into the segmentation of novel classes with few data. FSS methods face the challenge of model generalization on novel classes du… ▽ More

    Submitted 9 June, 2024; v1 submitted 1 June, 2024; originally announced June 2024.

    Comments: Accepted to IEEE International Conference on Multimedia and Expo (ICME) 2024 as an oral presentation

  11. arXiv:2405.17403  [pdf, other

    cs.LG cs.AI

    A Closer Look at Time Steps is Worthy of Triple Speed-Up for Diffusion Model Training

    Authors: Kai Wang, Yukun Zhou, Mingjia Shi, Zhihang Yuan, Yuzhang Shang, Xiaojiang Peng, Hanwang Zhang, Yang You

    Abstract: Training diffusion models is always a computation-intensive task. In this paper, we introduce a novel speed-up method for diffusion model training, called, which is based on a closer look at time steps. Our key findings are: i) Time steps can be empirically divided into acceleration, deceleration, and convergence areas based on the process increment. ii) These time steps are imbalanced, with many… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

    ACM Class: I.2

  12. arXiv:2405.16523  [pdf, other

    cond-mat.supr-con cond-mat.str-el

    Charge transfer and Spin-Valley locking in 4Hb-TaS$_{2}$

    Authors: Avior Almoalem, Roni Gofman, Yuval Nitzav, Ilay Mangel, Irena Feldman, Jahyun Koo, Federico Mazzola, Jun Fujii, Ivana Vobornik, J. Sanchez-Barriga, Oliver J. Clark, Nicholas Clark Plumb, Ming Shi, Binghai Yan, Amit Kanigel

    Abstract: 4Hb-TaS$_2$ is a superconductor that exhibits unique characteristics such as time-reversal symmetry breaking, hidden magnetic memory, and topological edge modes. It is a naturally occurring heterostructure comprising of alternating layers of 1H-TaS$_2$ and 1T-TaS$_2$. The former is a well-known superconductor, while the latter is a correlated insulator with a possible non-trivial magnetic ground s… ▽ More

    Submitted 26 May, 2024; originally announced May 2024.

    Journal ref: npj Quantum Materials 9, 36 (2024)

  13. arXiv:2405.12575  [pdf, other

    cond-mat.mtrl-sci cond-mat.str-el

    Three-dimensional mapping and electronic origin of large altermagnetic splitting near Fermi level in CrSb

    Authors: Guowei Yang, Zhanghuan Li, Sai Yang, Jiyuan Li, Hao Zheng, Weifan Zhu, Saizheng Cao, Wenxuan Zhao, Jiawen Zhang, Mao Ye, Yu Song, Lun-Hui Hu, Lexian Yang, Ming Shi, Huiqiu Yuan, Yongjun Zhang, Yuanfeng Xu, Yang Liu

    Abstract: Recently, a new kind of collinear magnetism, dubbed altermagnetism, has attracted considerable interests. A key characteristic of altermagnet is the momentum-dependent band and spin splitting without net magnetization. However, finding altermagnetic materials with large splitting near the Fermi level, which necessarily requires three-dimensional k-space mapping and is crucial for spintronic applic… ▽ More

    Submitted 21 May, 2024; originally announced May 2024.

    Comments: 16 pages, 4 figures and 1 table

  14. arXiv:2405.11690  [pdf, other

    cs.CV

    InterAct: Capture and Modelling of Realistic, Expressive and Interactive Activities between Two Persons in Daily Scenarios

    Authors: Yinghao Huang, Leo Ho, Dafei Qin, Mingyi Shi, Taku Komura

    Abstract: We address the problem of accurate capture and expressive modelling of interactive behaviors happening between two persons in daily scenarios. Different from previous works which either only consider one person or focus on conversational gestures, we propose to simultaneously model the activities of two persons, and target objective-driven, dynamic, and coherent interactions which often span long… ▽ More

    Submitted 27 May, 2024; v1 submitted 19 May, 2024; originally announced May 2024.

    Comments: The first two authors contributed equally to this work

  15. arXiv:2405.09899  [pdf, other

    quant-ph

    Quantum Metrology with Higher-order Exceptional Points in Atom-cavity Magnonics

    Authors: Minwei Shi, Guzhi Bao, Jinxian Guo, Weiping Zhang

    Abstract: Exceptional points (EPs), early arising from non-Hermitian physics, significantly amplify the system's response to minor perturbations, and act as a useful concept to enhance measurement in metrology. In particular, such a metrological enhancement grows dramatically with the EP's order. However, the Langevin noises intrinsically existing in the non-Hermitian systems diminish this enhancement. In t… ▽ More

    Submitted 16 May, 2024; originally announced May 2024.

  16. arXiv:2405.04761  [pdf

    physics.atom-ph quant-ph

    High sensitivity measurement of ULF, VLF and LF fields with Rydberg-atom sensor

    Authors: Mingwei Lei, Meng Shi

    Abstract: Fields with frequencies below megahertz are challenging for Rydberg-atom-based measurements, due to the low-frequency electric field screening effect that is caused by the alkali-metal atoms adsorbed on the inner surface of the container. In this paper, we investigate on electric fields measurements in the ULF, VLF and LF bands in a Cs vapor cell with built-in parallel electrodes. With optimizatio… ▽ More

    Submitted 7 May, 2024; originally announced May 2024.

    Comments: 6 pages, 4 figures

  17. arXiv:2405.02580  [pdf, other

    cs.SE cs.AI

    PropertyGPT: LLM-driven Formal Verification of Smart Contracts through Retrieval-Augmented Property Generation

    Authors: Ye Liu, Yue Xue, Daoyuan Wu, Yuqiang Sun, Yi Li, Miaolei Shi, Yang Liu

    Abstract: With recent advances in large language models (LLMs), this paper explores the potential of leveraging state-of-the-art LLMs, such as GPT-4, to transfer existing human-written properties (e.g., those from Certora auditing reports) and automatically generate customized properties for unknown code. To this end, we embed existing properties into a vector database and retrieve a reference property for… ▽ More

    Submitted 4 May, 2024; originally announced May 2024.

  18. arXiv:2405.01533  [pdf, other

    cs.CV

    OmniDrive: A Holistic LLM-Agent Framework for Autonomous Driving with 3D Perception, Reasoning and Planning

    Authors: Shihao Wang, Zhiding Yu, Xiaohui Jiang, Shiyi Lan, Min Shi, Nadine Chang, Jan Kautz, Ying Li, Jose M. Alvarez

    Abstract: The advances in multimodal large language models (MLLMs) have led to growing interests in LLM-based autonomous driving agents to leverage their strong reasoning capabilities. However, capitalizing on MLLMs' strong reasoning capabilities for improved planning behavior is challenging since planning requires full 3D situational awareness beyond 2D reasoning. To address this challenge, our work propos… ▽ More

    Submitted 2 May, 2024; originally announced May 2024.

  19. arXiv:2404.17528  [pdf, other

    cs.CV

    Geometry-aware Reconstruction and Fusion-refined Rendering for Generalizable Neural Radiance Fields

    Authors: Tianqi Liu, Xinyi Ye, Min Shi, Zihao Huang, Zhiyu Pan, Zhan Peng, Zhiguo Cao

    Abstract: Generalizable NeRF aims to synthesize novel views for unseen scenes. Common practices involve constructing variance-based cost volumes for geometry reconstruction and encoding 3D descriptors for decoding novel views. However, existing methods show limited generalization ability in challenging conditions due to inaccurate geometry, sub-optimal descriptors, and decoding strategies. We address these… ▽ More

    Submitted 26 April, 2024; originally announced April 2024.

    Comments: Accepted by CVPR 2024. Project page: https://gefucvpr24.github.io

  20. arXiv:2404.15602  [pdf, other

    cs.RO

    Decentralized Multi-Agent Trajectory Planning in Dynamic Environments with Spatiotemporal Occupancy Grid Maps

    Authors: Siyuan Wu, Gang Chen, Moji Shi, Javier Alonso-Mora

    Abstract: This paper proposes a decentralized trajectory planning framework for the collision avoidance problem of multiple micro aerial vehicles (MAVs) in environments with static and dynamic obstacles. The framework utilizes spatiotemporal occupancy grid maps (SOGM), which forecast the occupancy status of neighboring space in the near future, as the environment representation. Based on this representation… ▽ More

    Submitted 23 April, 2024; originally announced April 2024.

    Comments: 6 pages, 6 figures, accepted to the 2024 IEEE International Conference on Robotics and Automation (ICRA2024)

  21. arXiv:2404.15121  [pdf, other

    cs.GR cs.AI cs.CV

    Taming Diffusion Probabilistic Models for Character Control

    Authors: Rui Chen, Mingyi Shi, Shaoli Huang, Ping Tan, Taku Komura, Xuelin Chen

    Abstract: We present a novel character control framework that effectively utilizes motion diffusion probabilistic models to generate high-quality and diverse character animations, responding in real-time to a variety of dynamic user-supplied control signals. At the heart of our method lies a transformer-based Conditional Autoregressive Motion Diffusion Model (CAMDM), which takes as input the character's his… ▽ More

    Submitted 23 April, 2024; originally announced April 2024.

    Comments: Accepted by SIGGRAPH 2024 (Conference Track). Project page and source codes: https://aiganimation.github.io/CAMDM/

  22. arXiv:2404.14848  [pdf, other

    cs.RO

    Evaluating Dynamic Environment Difficulty for Obstacle Avoidance Benchmarking

    Authors: Moji Shi, Gang Chen, Álvaro Serra Gómez, Siyuan Wu, Javier Alonso-Mora

    Abstract: Dynamic obstacle avoidance is a popular research topic for autonomous systems, such as micro aerial vehicles and service robots. Accurately evaluating the performance of dynamic obstacle avoidance methods necessitates the establishment of a metric to quantify the environment's difficulty, a crucial aspect that remains unexplored. In this paper, we propose four metrics to measure the difficulty of… ▽ More

    Submitted 23 April, 2024; originally announced April 2024.

  23. arXiv:2404.07612  [pdf, ps, other

    cs.CY

    Measuring Geographic Diversity of Foundation Models with a Natural Language--based Geo-guessing Experiment on GPT-4

    Authors: Zilong Liu, Krzysztof Janowicz, Kitty Currier, Meilin Shi

    Abstract: Generative AI based on foundation models provides a first glimpse into the world represented by machines trained on vast amounts of multimodal data ingested by these models during training. If we consider the resulting models as knowledge bases in their own right, this may open up new avenues for understanding places through the lens of machines. In this work, we adopt this thinking and select GPT… ▽ More

    Submitted 11 April, 2024; originally announced April 2024.

    Comments: Short paper accepted by AGILE 2024 conference (https://agile-gi.eu/conference-2024)

  24. SCAResNet: A ResNet Variant Optimized for Tiny Object Detection in Transmission and Distribution Towers

    Authors: Weile Li, Muqing Shi, Zhonghua Hong

    Abstract: Traditional deep learning-based object detection networks often resize images during the data preprocessing stage to achieve a uniform size and scale in the feature map. Resizing is done to facilitate model propagation and fully connected classification. However, resizing inevitably leads to object deformation and loss of valuable information in the images. This drawback becomes particularly prono… ▽ More

    Submitted 5 April, 2024; originally announced April 2024.

  25. arXiv:2404.02471  [pdf, other

    cs.IT

    Some bounds on the cardinality of the $b$-symbol weight spectrum of codes

    Authors: Hongwei Zhu, Shitao Li, Minjia Shi, Shu-Tao Xia, Patrick Sole

    Abstract: The size of the Hamming distance spectrum of a code has received great attention in recent research. The main objective of this paper is to extend these significant theories to the $b$-symbol distance spectrum. We examine this question for various types of codes, including unrestricted codes, additive codes, linear codes, and cyclic codes, successively. For the first three cases, we determine the… ▽ More

    Submitted 3 April, 2024; originally announced April 2024.

  26. arXiv:2404.01727  [pdf, other

    cs.RO cs.CV

    Generalizing 6-DoF Grasp Detection via Domain Prior Knowledge

    Authors: Haoxiang Ma, Modi Shi, Boyang Gao, Di Huang

    Abstract: We focus on the generalization ability of the 6-DoF grasp detection method in this paper. While learning-based grasp detection methods can predict grasp poses for unseen objects using the grasp distribution learned from the training set, they often exhibit a significant performance drop when encountering objects with diverse shapes and structures. To enhance the grasp detection methods' generaliza… ▽ More

    Submitted 2 April, 2024; originally announced April 2024.

    Comments: Accepted at CVPR 2024

  27. arXiv:2403.19949  [pdf, other

    cs.CV

    FairCLIP: Harnessing Fairness in Vision-Language Learning

    Authors: Yan Luo, Min Shi, Muhammad Osama Khan, Muhammad Muneeb Afzal, Hao Huang, Shuaihang Yuan, Yu Tian, Luo Song, Ava Kouhana, Tobias Elze, Yi Fang, Mengyu Wang

    Abstract: Fairness is a critical concern in deep learning, especially in healthcare, where these models influence diagnoses and treatment decisions. Although fairness has been investigated in the vision-only domain, the fairness of medical vision-language (VL) models remains unexplored due to the scarcity of medical VL datasets for studying fairness. To bridge this research gap, we introduce the first fair… ▽ More

    Submitted 5 April, 2024; v1 submitted 28 March, 2024; originally announced March 2024.

    Comments: CVPR 2024

  28. arXiv:2403.11713  [pdf, other

    cond-mat.supr-con

    Ac$_3$Ni$_2$O$_7$ and La$_2$$Ae$Ni$_2$O$_6$F ($Ae$ = Sr, Ba): Benchmark Materials for Bilayer Nickelate Superconductivity

    Authors: Siqi Wu, Zihan Yang, Xin Ma, Jianhui Dai, Ming Shi, Hui-Qiu Yuan, Hai-Qing Lin, Chao Cao

    Abstract: We theoretically propose Ac$_3$Ni$_2$O$_7$, La$_2$BaNi$_2$O$_6$F, and La$_2$SrNi$_2$O$_6$F compounds to be benchmark materials for bilayer nickelate superconductivity. The stable phase of Ac$_3$Ni$_2$O$_7$ and La$_2$BaNi$_2$O$_6$F are found to be $I4/mmm$ without the lattice distortion caused by octahedra rotation at ambient pressure, where as the lattice distortion in La$_2$SrNi$_2$O$_6$F can be… ▽ More

    Submitted 18 March, 2024; originally announced March 2024.

  29. arXiv:2403.11511  [pdf, other

    cs.RO cs.CV

    Sim-to-Real Grasp Detection with Global-to-Local RGB-D Adaptation

    Authors: Haoxiang Ma, Ran Qin, Modi shi, Boyang Gao, Di Huang

    Abstract: This paper focuses on the sim-to-real issue of RGB-D grasp detection and formulates it as a domain adaptation problem. In this case, we present a global-to-local method to address hybrid domain gaps in RGB and depth data and insufficient multi-modal feature alignment. First, a self-supervised rotation pre-training strategy is adopted to deliver robust initialization for RGB and depth networks. We… ▽ More

    Submitted 18 March, 2024; originally announced March 2024.

    Comments: Accepted at ICRA 2024

  30. arXiv:2403.06728  [pdf, other

    cs.CV

    Large Model driven Radiology Report Generation with Clinical Quality Reinforcement Learning

    Authors: Zijian Zhou, Miaojing Shi, Meng Wei, Oluwatosin Alabi, Zijie Yue, Tom Vercauteren

    Abstract: Radiology report generation (RRG) has attracted significant attention due to its potential to reduce the workload of radiologists. Current RRG approaches are still unsatisfactory against clinical standards. This paper introduces a novel RRG method, \textbf{LM-RRG}, that integrates large models (LMs) with clinical quality reinforcement learning to generate accurate and comprehensive chest X-ray rad… ▽ More

    Submitted 11 March, 2024; originally announced March 2024.

  31. arXiv:2403.02234  [pdf, other

    cs.CV

    3DTopia: Large Text-to-3D Generation Model with Hybrid Diffusion Priors

    Authors: Fangzhou Hong, Jiaxiang Tang, Ziang Cao, Min Shi, Tong Wu, Zhaoxi Chen, Shuai Yang, Tengfei Wang, Liang Pan, Dahua Lin, Ziwei Liu

    Abstract: We present a two-stage text-to-3D generation system, namely 3DTopia, which generates high-quality general 3D assets within 5 minutes using hybrid diffusion priors. The first stage samples from a 3D diffusion prior directly learned from 3D data. Specifically, it is powered by a text-conditioned tri-plane latent diffusion model, which quickly generates coarse 3D samples for fast prototyping. The sec… ▽ More

    Submitted 6 May, 2024; v1 submitted 4 March, 2024; originally announced March 2024.

    Comments: Code available at https://github.com/3DTopia/3DTopia

  32. arXiv:2402.14438  [pdf, ps, other

    stat.ME

    Efficiency-improved doubly robust estimation with non-confounding predictive covariates

    Authors: Shanshan Luo, Mengchen Shi, Wei Li, Xueli Wang, Zhi Geng

    Abstract: In observational studies, covariates with substantial missing data are often omitted, despite their strong predictive capabilities. These excluded covariates are generally believed not to simultaneously affect both treatment and outcome, indicating that they are not genuine confounders and do not impact the identification of the average treatment effect (ATE). In this paper, we introduce an altern… ▽ More

    Submitted 22 February, 2024; originally announced February 2024.

  33. arXiv:2402.11410  [pdf, ps, other

    cs.LG cs.DS stat.ML

    An Elementary Predictor Obtaining $2\sqrt{T}$ Distance to Calibration

    Authors: Eshwar Ram Arunachaleswaran, Natalie Collina, Aaron Roth, Mirah Shi

    Abstract: Blasiok et al. [2023] proposed distance to calibration as a natural measure of calibration error that unlike expected calibration error (ECE) is continuous. Recently, Qiao and Zheng [2024] gave a non-constructive argument establishing the existence of an online predictor that can obtain $O(\sqrt{T})$ distance to calibration in the adversarial setting, which is known to be impossible for ECE. They… ▽ More

    Submitted 17 February, 2024; originally announced February 2024.

  34. arXiv:2402.11181  [pdf, other

    astro-ph.SR

    Horizontally Polarized Kink Oscillations Supported by Solar Coronal Loops in an Asymmetric Environment

    Authors: Mijie Shi, Bo Li, Shengju Yuan

    Abstract: Kink oscillations are ubiquitously observed in solar coronal loops, their understanding being crucial in the contexts of coronal seismology and atmospheric heating. We study kink modes supported by a straight coronal loop embeded in an asymmetric environment using three-dimensional magnetohydrodynamic (MHD) simulations. We implement the asymmetric effect by setting different exterior densities bel… ▽ More

    Submitted 16 February, 2024; originally announced February 2024.

    Comments: Accepted for publication in A&A

  35. arXiv:2402.11073  [pdf, other

    cs.CL cs.AI

    AFaCTA: Assisting the Annotation of Factual Claim Detection with Reliable LLM Annotators

    Authors: Jingwei Ni, Minjing Shi, Dominik Stammbach, Mrinmaya Sachan, Elliott Ash, Markus Leippold

    Abstract: With the rise of generative AI, automated fact-checking methods to combat misinformation are becoming more and more important. However, factual claim detection, the first step in a fact-checking pipeline, suffers from two key issues that limit its scalability and generalizability: (1) inconsistency in definitions of the task and what a claim is, and (2) the high cost of manual annotation. To addre… ▽ More

    Submitted 2 June, 2024; v1 submitted 16 February, 2024; originally announced February 2024.

    Comments: ACL2024 Main Conference

  36. arXiv:2402.08753  [pdf, ps, other

    cs.GT cs.LG

    Forecasting for Swap Regret for All Downstream Agents

    Authors: Aaron Roth, Mirah Shi

    Abstract: We study the problem of making predictions so that downstream agents who best respond to them will be guaranteed diminishing swap regret, no matter what their utility functions are. It has been known since Foster and Vohra (1997) that agents who best-respond to calibrated forecasts have no swap regret. Unfortunately, the best known algorithms for guaranteeing calibrated forecasts in sequential adv… ▽ More

    Submitted 15 June, 2024; v1 submitted 13 February, 2024; originally announced February 2024.

  37. AdaTreeFormer: Few Shot Domain Adaptation for Tree Counting from a Single High-Resolution Image

    Authors: Hamed Amini Amirkolaee, Miaojing Shi, Lianghua He, Mark Mulligan

    Abstract: The process of estimating and counting tree density using only a single aerial or satellite image is a difficult task in the fields of photogrammetry and remote sensing. However, it plays a crucial role in the management of forests. The huge variety of trees in varied topography severely hinders tree counting models to perform well. The purpose of this paper is to propose a framework that is learn… ▽ More

    Submitted 30 June, 2024; v1 submitted 5 February, 2024; originally announced February 2024.

    Comments: Accepted in ISPRS Journal of Photogrammetry and Remote Sensing

  38. arXiv:2401.16185  [pdf, other

    cs.CR cs.AI cs.SE

    LLM4Vuln: A Unified Evaluation Framework for Decoupling and Enhancing LLMs' Vulnerability Reasoning

    Authors: Yuqiang Sun, Daoyuan Wu, Yue Xue, Han Liu, Wei Ma, Lyuye Zhang, Miaolei Shi, Yang Liu

    Abstract: Large language models (LLMs) have demonstrated significant potential for many downstream tasks, including those requiring human-level intelligence, such as vulnerability detection. However, recent attempts to use LLMs for vulnerability detection are still preliminary, as they lack an in-depth understanding of a subject LLM's vulnerability reasoning capability -- whether it originates from the mode… ▽ More

    Submitted 29 January, 2024; originally announced January 2024.

    Comments: This is a technical report by Nanyang Technological University

  39. arXiv:2401.16163  [pdf

    cond-mat.supr-con

    Direct evidence of interfacial coherent electron-phonon coupling in single-unit-cell FeSe film on Nb-doped SrTiO3

    Authors: Mengdi Zhang, Xiaotong Jiao, Mingxia Shi, Wenfeng Dong, Cui Ding, Yubin Wang, Tingxiao Qin, Haiyun Liu, Lili Wang, Zhenyu Zhang, Qi-Kun Xue, Qihua Xiong

    Abstract: The interface-enhanced superconductivity in monolayer iron selenide (FeSe) films on SrTiO3 has been actively pursued in the past decade. Although a synergistic effect between interfacial charge transfer and interfacial electron-phonon coupling (EPC) is proposed to be responsible for the mechanism, the microscopic nature of the interfacial EPC in the enhancement of superconductivity remains highly… ▽ More

    Submitted 29 January, 2024; originally announced January 2024.

  40. arXiv:2401.12885  [pdf, other

    astro-ph.SR

    Damped kink motions in a system of two solar coronal tubes with elliptic cross-sections

    Authors: Mijie Shi, Bo Li, Shaoxia Chen, Hui Yu, Mingzhe Guo

    Abstract: This study is motivated by observations of coordinated transverse displacements in neighboring solar active region loops, addressing specifically how the behavior of kink motions in straight two-tube equilibria is impacted by tube interactions and tube cross-sectional shapes.We work with linear, ideal, pressureless magnetohydrodynamics. Axially standing kink motions are examined as an initial valu… ▽ More

    Submitted 3 March, 2024; v1 submitted 23 January, 2024; originally announced January 2024.

    Comments: Accepted for publication in A&A

  41. arXiv:2401.08256  [pdf, other

    cs.CV

    Multitask Learning in Minimally Invasive Surgical Vision: A Review

    Authors: Oluwatosin Alabi, Tom Vercauteren, Miaojing Shi

    Abstract: Minimally invasive surgery (MIS) has revolutionized many procedures and led to reduced recovery time and risk of patient injury. However, MIS poses additional complexity and burden on surgical teams. Data-driven surgical vision algorithms are thought to be key building blocks in the development of future MIS systems with improved autonomy. Recent advancements in machine learning and computer visio… ▽ More

    Submitted 16 January, 2024; originally announced January 2024.

  42. arXiv:2312.17264  [pdf, other

    cs.CL cs.IR

    ESGReveal: An LLM-based approach for extracting structured data from ESG reports

    Authors: Yi Zou, Mengying Shi, Zhongjie Chen, Zhu Deng, ZongXiong Lei, Zihan Zeng, Shiming Yang, HongXiang Tong, Lei Xiao, Wenwen Zhou

    Abstract: ESGReveal is an innovative method proposed for efficiently extracting and analyzing Environmental, Social, and Governance (ESG) data from corporate reports, catering to the critical need for reliable ESG information retrieval. This approach utilizes Large Language Models (LLM) enhanced with Retrieval Augmented Generation (RAG) techniques. The ESGReveal system includes an ESG metadata module for ta… ▽ More

    Submitted 25 December, 2023; originally announced December 2023.

  43. arXiv:2312.15492  [pdf, other

    physics.chem-ph cond-mat.mtrl-sci physics.comp-ph

    DPA-2: Towards a universal large atomic model for molecular and material simulation

    Authors: Duo Zhang, Xinzijian Liu, Xiangyu Zhang, Chengqian Zhang, Chun Cai, Hangrui Bi, Yiming Du, Xuejian Qin, Jiameng Huang, Bowen Li, Yifan Shan, Jinzhe Zeng, Yuzhi Zhang, Siyuan Liu, Yifan Li, Junhan Chang, Xinyan Wang, Shuo Zhou, Jianchuan Liu, Xiaoshan Luo, Zhenyu Wang, Wanrun Jiang, Jing Wu, Yudi Yang, Jiyuan Yang , et al. (17 additional authors not shown)

    Abstract: The rapid development of artificial intelligence (AI) is driving significant changes in the field of atomic modeling, simulation, and design. AI-based potential energy models have been successfully used to perform large-scale and long-time simulations with the accuracy of ab initio electronic structure methods. However, the model generation process still hinders applications at scale. We envision… ▽ More

    Submitted 24 December, 2023; originally announced December 2023.

  44. arXiv:2312.10723  [pdf

    cond-mat.supr-con

    Atomic-site dependent pairing gap in monolayer FeSe/SrTiO$_3$(001)- ($\sqrt{13} \times \sqrt{13}$)

    Authors: Cui Ding, Zhongxu Wei, Wenfeng Dong, Hai Feng, Mingxia Shi, Lili Wang, Jin-Feng Jia, Qi-Kun Xue

    Abstract: The interfacial FeSe/TiO$_{2-δ}$ coupling induces high-temperature superconductivity in monolayer FeSe films. Using cryogenic atomically resolved scanning tunneling microscopy/spectroscopy, we obtained atomic-site dependent surface density of states, work function, and pairing gap in the monolayer FeSe on SrTiO$_3$(001)-($\sqrt{13} \times \sqrt{13}$)-33.7$°$ surface. Our results disclosed the out-… ▽ More

    Submitted 28 February, 2024; v1 submitted 17 December, 2023; originally announced December 2023.

  45. arXiv:2312.09482  [pdf, ps, other

    cs.IT

    An open problem and a conjecture on binary linear complementary pairs of codes

    Authors: Shitao Li, Minjia Shi, San Ling

    Abstract: The existence of $q$-ary linear complementary pairs (LCPs) of codes with $q> 2$ has been completely characterized so far. This paper gives a characterization for the existence of binary LCPs of codes. As a result, we solve an open problem proposed by Carlet $et~al.$ (IEEE Trans. Inf. Theory 65(3): 1694-1704, 2019) and a conjecture proposed by Choi $et~al.$ (Cryptogr. Commun. 15(2): 469-486, 2023).

    Submitted 14 December, 2023; originally announced December 2023.

  46. arXiv:2312.01220  [pdf, other

    cs.CV

    Boosting Object Detection with Zero-Shot Day-Night Domain Adaptation

    Authors: Zhipeng Du, Miaojing Shi, Jiankang Deng

    Abstract: Detecting objects in low-light scenarios presents a persistent challenge, as detectors trained on well-lit data exhibit significant performance degradation on low-light data due to low visibility. Previous methods mitigate this issue by exploring image enhancement or object detection techniques with real low-light image datasets. However, the progress is impeded by the inherent difficulties about… ▽ More

    Submitted 27 March, 2024; v1 submitted 2 December, 2023; originally announced December 2023.

    Comments: Accepted to CVPR 2024

  47. arXiv:2312.01151  [pdf

    cs.CY cs.CL cs.SC

    Here Is Not There: Measuring Entailment-Based Trajectory Similarity for Location-Privacy Protection and Beyond

    Authors: Zilong Liu, Krzysztof Janowicz, Kitty Currier, Meilin Shi, Jinmeng Rao, Song Gao, Ling Cai, Anita Graser

    Abstract: While the paths humans take play out in social as well as physical space, measures to describe and compare their trajectories are carried out in abstract, typically Euclidean, space. When these measures are applied to trajectories of actual individuals in an application area, alterations that are inconsequential in abstract space may suddenly become problematic once overlaid with geographic realit… ▽ More

    Submitted 2 December, 2023; originally announced December 2023.

  48. arXiv:2311.16964  [pdf, other

    cond-mat.dis-nn cond-mat.mtrl-sci cs.LG

    Machine learning force-field models for metallic spin glass

    Authors: Menglin Shi, Sheng Zhang, Gia-Wei Chern

    Abstract: Metallic spin glass systems, such as dilute magnetic alloys, are characterized by randomly distributed local moments coupled to each other through a long-range electron-mediated effective interaction. We present a scalable machine learning (ML) framework for dynamical simulations of metallic spin glasses. A Behler-Parrinello type neural-network model, based on the principle of locality, is develop… ▽ More

    Submitted 28 November, 2023; originally announced November 2023.

    Comments: 12 pages, 5 figures

  49. arXiv:2311.16492  [pdf, other

    cs.CV

    VLPrompt: Vision-Language Prompting for Panoptic Scene Graph Generation

    Authors: Zijian Zhou, Miaojing Shi, Holger Caesar

    Abstract: Panoptic Scene Graph Generation (PSG) aims at achieving a comprehensive image understanding by simultaneously segmenting objects and predicting relations among objects. However, the long-tail problem among relations leads to unsatisfactory results in real-world applications. Prior methods predominantly rely on vision information or utilize limited language information, such as object or relation n… ▽ More

    Submitted 19 June, 2024; v1 submitted 27 November, 2023; originally announced November 2023.

    Comments: 22 pages, 9 figures

  50. arXiv:2311.07747  [pdf

    cond-mat.mtrl-sci cond-mat.str-el

    Magnetic-coupled electronic landscape in bilayer-distorted titanium-based kagome metals

    Authors: Yong Hu, Congcong Le, Long Chen, Hanbin Deng, Ying Zhou, Nicholas C. Plumb, Milan Radovic, Ronny Thomale, Andreas P. Schnyder, Jia-Xin Yin, Gang Wang, Xianxin Wu, Ming Shi

    Abstract: Quantum materials whose atoms are arranged on a lattice of corner-sharing triangles, $\textit{i.e.}$, the kagome lattice, have recently emerged as a captivating platform for investigating exotic correlated and topological electronic phenomena. Here, we combine ultra-low temperature angle-resolved photoemission spectroscopy (ARPES) with scanning tunneling microscopy and density functional theory ca… ▽ More

    Submitted 13 November, 2023; originally announced November 2023.

    Report number: RIKEN-iTHEMS-Report-23