Showing 1–50 of 240 results for author: Shang, H

  1. arXiv:2407.08504  [pdf, other]

    cond-mat.mtrl-sci

    Revisiting the Formulation of Charged Defect in Solids

    Authors: Hanzhi Shang, Zeyu Jiang, Yiyang Sun, Damien West, Shengbai Zhang

    Abstract: Defect physics is at the heart of microelectronics. By keeping track of the reference energy in total energy calculations, we explicitly show that the "potential alignment" correction vanishes, and the classic Makov-Payne correction yields accurate results. From linear response theory, we further formulate an accurate expression for the quadrupole correction. Application to numerous defects inclu…

    Submitted 11 July, 2024; originally announced July 2024.

  2. arXiv:2407.02005  [pdf, other]

    cs.CL cs.SD eess.AS

    An End-to-End Speech Summarization Using Large Language Model

    Authors: Hengchao Shang, Zongyao Li, Jiaxin Guo, Shaojun Li, Zhiqiang Rao, Yuanchang Luo, Daimeng Wei, Hao Yang

    Abstract: Abstractive Speech Summarization (SSum) aims to generate human-like text summaries from spoken content. It encounters difficulties in handling long speech input and capturing the intricate cross-modal mapping between long speech inputs and short text summaries. Research on large language models (LLMs) and multimodal information fusion has provided new insights for addressing these challenges. In t…

    Submitted 2 July, 2024; originally announced July 2024.

    Comments: InterSpeech 2024

  3. arXiv:2406.09180  [pdf, other]

    cs.LG

    Detection-Rate-Emphasized Multi-objective Evolutionary Feature Selection for Network Intrusion Detection

    Authors: Zi-Hang Cheng, Haopu Shang, Chao Qian

    Abstract: Network intrusion detection is one of the most important issues in the field of cyber security, and various machine learning techniques have been applied to build intrusion detection systems. However, since the number of features to describe the network connections is often large, where some features are redundant or noisy, feature selection is necessary in such scenarios, which can both improve t…

    Submitted 13 June, 2024; originally announced June 2024.

  4. arXiv:2406.08801  [pdf, other]

    cs.CV

    Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation

    Authors: Mingwang Xu, Hui Li, Qingkun Su, Hanlin Shang, Liwei Zhang, Ce Liu, Jingdong Wang, Yao Yao, Siyu Zhu

    Abstract: The field of portrait image animation, driven by speech audio input, has experienced significant advancements in the generation of realistic and dynamic portraits. This research delves into the complexities of synchronizing facial movements and creating visually appealing, temporally consistent animations within the framework of diffusion-based methodologies. Moving away from traditional paradigms…

    Submitted 16 June, 2024; v1 submitted 13 June, 2024; originally announced June 2024.

    Comments: 20 pages

  5. arXiv:2406.04791  [pdf, other]

    cs.SD eess.AS

    Speaker-Smoothed kNN Speaker Adaptation for End-to-End ASR

    Authors: Shaojun Li, Daimeng Wei, Hengchao Shang, Jiaxin Guo, ZongYao Li, Zhanglin Wu, Zhiqiang Rao, Yuanchang Luo, Xianghui He, Hao Yang

    Abstract: Despite recent improvements in End-to-End Automatic Speech Recognition (E2E ASR) systems, the performance can degrade due to vocal characteristic mismatches between training and testing data, particularly with limited target speaker adaptation data. We propose a novel speaker adaptation approach, Speaker-Smoothed kNN, that leverages k-Nearest Neighbors (kNN) retrieval techniques to improve model out…

    Submitted 1 July, 2024; v1 submitted 7 June, 2024; originally announced June 2024.

    Comments: Accepted to Interspeech 2024

  6. arXiv:2406.04754  [pdf, ps, other]

    math.AP

    Global well-posedness and large time behavior for the Oldroyd-B model

    Authors: Haifeng Shang

    Abstract: This paper studies the global well-posedness and optimal decay estimates for the Oldroyd-B model in $\mathbb R^d$ ($d\geq2$). By utilizing the special structure of this system, we give a simplified proof of the global existence of solutions for the case of initial data small in critical Besov spaces and non-small coupling parameters. Moreover, the optimal decay rates of the solutions under minimal…

    Submitted 7 June, 2024; originally announced June 2024.

  7. arXiv:2406.04745  [pdf, other]

    cs.LG cs.CV

    Confidence-aware Contrastive Learning for Selective Classification

    Authors: Yu-Chang Wu, Shen-Huan Lyu, Haopu Shang, Xiangyu Wang, Chao Qian

    Abstract: Selective classification enables models to make predictions only when they are sufficiently confident, aiming to enhance safety and reliability, which is important in high-stakes scenarios. Previous methods mainly use deep neural networks and focus on modifying the architecture of classification layers to enable the model to estimate the confidence of its prediction. This work provides a generaliz…

    Submitted 7 June, 2024; originally announced June 2024.

    Comments: Accepted by ICML 2024

  8. arXiv:2405.16989  [pdf, other]

    stat.ME

    Uncertainty Learning for High-dimensional Mean-variance Portfolio

    Authors: Han Lin Shang, Ruike Wu, Yanrong Yang

    Abstract: Accounting for uncertainty in data quality is important for accurate statistical inference. We aim for an optimal conservative allocation for a large universe of assets in the mean-variance portfolio (MVP), which is the worst choice within uncertainty in data distribution. Unlike the low dimensional MVP studied in Blanchet et al. (2022, Management Science), the large number of assets raises a challengi…

    Submitted 27 May, 2024; originally announced May 2024.

    Comments: 28 pages, 2 figures, 4 tables

    MSC Class: 91G10; 62P05

  9. arXiv:2405.14744  [pdf, other]

    cs.CY

    Exploring Prosocial Irrationality for LLM Agents: A Social Cognition View

    Authors: Xuan Liu, Jie Zhang, Song Guo, Haoyang Shang, Chengxu Yang, Quanyan Zhu

    Abstract: Large language models (LLMs) have been shown to face hallucination issues because the data they are trained on often contains human bias; whether this is reflected in the decision-making process of LLM agents remains under-explored. As LLM Agents are increasingly employed in intricate social environments, a pressing and natural question emerges: Can LLM Agents leverage hallucinations to mirror human…

    Submitted 23 May, 2024; originally announced May 2024.

  10. arXiv:2405.09164  [pdf]

    quant-ph

    Rapidly Achieving Chemical Accuracy with Quantum Computing Enforced Language Model

    Authors: Honghui Shang, Xiongzhi Zeng, Ming Gong, Yangju Wu, Shaojun Guo, Haoran Qian, Chen Zha, Zhijie Fan, Kai Yan, Xiaobo Zhu, Zhenyu Li, Yi Luo, Jian-Wei Pan, Jinlong Yang

    Abstract: Finding the accurate ground state energy of a many-body system has been a major challenge in quantum chemistry. The integration of classic and quantum computers has shed new light on resolving this outstanding problem. Here we propose QiankunNet-VQE, a transformer-based language model enforced with quantum computing to learn and generate quantum states. It has been implemented using up to 12 qubits a…

    Submitted 15 May, 2024; originally announced May 2024.

  11. arXiv:2405.04904  [pdf, other]

    stat.ME stat.AP

    Dependence-based fuzzy clustering of functional time series

    Authors: Angel Lopez-Oriona, Ying Sun, Han Lin Shang

    Abstract: Time series clustering is an important data mining task with a wide variety of applications. While most methods focus on time series taking values on the real line, very few works consider functional time series. However, functional objects frequently arise in many fields, such as actuarial science, demography or finance. Functional time series are indexed collections of infinite-dimensional curve…

    Submitted 8 May, 2024; originally announced May 2024.

    Comments: 43 pages, 5 figures, 10 tables. arXiv admin note: substantial text overlap with arXiv:2402.08687

    MSC Class: 62R10

  12. arXiv:2404.10542  [pdf, other]

    astro-ph.HE

    Statistical analysis of pulsar flux density distribution

    Authors: H. W. Xu, R. S. Zhao, Erbil Gugercinoglu, H. Liu, D. Li, P. Wang, C. H. Niu, C. Miao, X. Zhu, R. W. Tian, W. L. Li, S. D. Wang, Z. F. Tu, Q. J. Zhi, S. J. Dang, L. H. Shang, S. Xiao

    Abstract: This study presents a comprehensive analysis of the spectral properties of 886 pulsars across a wide frequency range from 20 MHz to 343.5 GHz, including a total of 86 millisecond pulsars. The majority of the pulsars exhibit power-law behavior in their spectra, although some exceptions are observed. Five different spectral models, namely simple power-law, broken power-law, low-frequency turn-over, hi…

    Submitted 16 April, 2024; v1 submitted 16 April, 2024; originally announced April 2024.

    Comments: 39 pages, 17 figures

  13. arXiv:2404.10492  [pdf, other]

    cond-mat.mtrl-sci

    Efficient structural relaxation based on the random phase approximation: Applications to the water clusters

    Authors: Muhammad N. Tahir, Honghui Shang, Jia Li, Xinguo Ren

    Abstract: We report an improved implementation for evaluating the analytical gradients of the random phase approximation (RPA) electron-correlation energy based on atomic orbitals and the localized resolution of identity scheme. The more efficient RPA force calculations allow us to relax structures of medium-size water clusters. Particular attention is paid to the structures and energy orderings of the low-…

    Submitted 16 April, 2024; originally announced April 2024.

  14. arXiv:2403.13340  [pdf, other]

    stat.ME

    Forecasting density-valued functional panel data

    Authors: Cristian F. Jiménez-Varón, Ying Sun, Han Lin Shang

    Abstract: We introduce a statistical method for modeling and forecasting functional panel data, where each element is a density. Density functions are nonnegative and have a constrained integral and thus do not constitute a linear vector space. We implement a center log-ratio transformation to transform densities into unconstrained functions. These functions exhibit cross-sectional correlation and tempora…

    Submitted 20 March, 2024; originally announced March 2024.

  15. arXiv:2403.11430  [pdf, other]

    cs.CL

    A Novel Paradigm Boosting Translation Capabilities of Large Language Models

    Authors: Jiaxin Guo, Hao Yang, Zongyao Li, Daimeng Wei, Hengchao Shang, Xiaoyu Chen

    Abstract: This paper presents a study on strategies to enhance the translation capabilities of large language models (LLMs) in the context of machine translation (MT) tasks. The paper proposes a novel paradigm consisting of three stages: Secondary Pre-training using Extensive Monolingual Data, Continual Pre-training with Interlinear Text Format Documents, and Leveraging Source-Language Consistent Instructio…

    Submitted 15 April, 2024; v1 submitted 17 March, 2024; originally announced March 2024.

    Comments: Accepted in NAACL 2024

  16. arXiv:2403.03574  [pdf, other]

    astro-ph.HE

    Formation of limb-brightened radio jets by angle-dependent energy extraction from rapidly rotating black holes

    Authors: Kouichi Hirotani, Hsien Shang, Ruben Krasnopolsky, Kenichi Nishikawa

    Abstract: By general relativistic magnetohydrodynamic simulations, it is suggested that the rotational energy of a rapidly rotating black hole (BH) is preferentially extracted along the magnetic field lines threading the event horizon in the middle and lower latitudes. Applying this angle-dependent Poynting flux to the jet downstream, we demonstrate that the jets exhibit limb-brightened structures at variou…

    Submitted 6 March, 2024; originally announced March 2024.

    Comments: 19 pages, 8 figures. The Astrophysical Journal in press

  17. arXiv:2403.02118  [pdf, other]

    cs.CY cs.AI cs.CV

    Position: Towards Implicit Prompt For Text-To-Image Models

    Authors: Yue Yang, Yuqi Lin, Hong Liu, Wenqi Shao, Runjian Chen, Hailong Shang, Yu Wang, Yu Qiao, Kaipeng Zhang, Ping Luo

    Abstract: Recent text-to-image (T2I) models have had great success, and many benchmarks have been proposed to evaluate their performance and safety. However, they only consider explicit prompts while neglecting implicit prompts (which hint at a target without explicitly mentioning it). These prompts may get around safety constraints and pose potential threats to the applications of these models. This position pap…

    Submitted 28 May, 2024; v1 submitted 4 March, 2024; originally announced March 2024.

  18. arXiv:2402.02529  [pdf, other]

    astro-ph.SR astro-ph.GA

    A Unified Model for Bipolar Outflows from Young Stars: Kinematic and Mixing Structures in HH 30

    Authors: Tsung-Han Ai, Chun-Fan Liu, Hsien Shang, Doug Johnstone, Ruben Krasnopolsky

    Abstract: The young stellar source HH 30 is a textbook example of an ionic optical jet originating from a disk in an edge-on system shown by the HST. It has a remnant envelope in $^{12}$CO observed by ALMA. The optical jet is characterized by its narrow appearance, large line width at the base, and high temperature inferred from line diagnostics. Three featured structures can be identified, most evident in…

    Submitted 4 February, 2024; originally announced February 2024.

    Comments: 19 pages, 9 figures, ApJ in press

  19. arXiv:2401.16696  [pdf, ps, other]

    nucl-th

    Properties of chiral nucleon-nucleon interaction at N$^3$LO with high cutoffs studied by local projection

    Authors: Haoyu Shang, Rongzhe Hu, Junchen Pei, Furong Xu

    Abstract: The chiral nucleon-nucleon ($NN$) interaction at high cutoffs has been plagued by the presence of spurious bound states. In this work, the chiral $NN$ interaction at N$^3$LO is studied by the local projection method as the cutoff increases. The evolution of short-range behaviors of pion-exchange interactions and contact interactions is intuitively demonstrated. The $P$-channel potentials toward hi…

    Submitted 29 January, 2024; originally announced January 2024.

    Comments: 16 pages, 12 figures

  20. arXiv:2401.13943  [pdf, other]

    stat.AP stat.ME

    Is the age pension in Australia sustainable and fair? Evidence from forecasting the old-age dependency ratio using the Hamilton-Perry model

    Authors: Sizhe Chen, Han Lin Shang, Yang Yang

    Abstract: The age pension aims to assist eligible elderly Australians who meet specific age and residency criteria in maintaining basic living standards. In designing efficient pension systems, government policymakers seek to satisfy the expectations of the overall aging population in Australia. However, the population's unique demographic characteristics at the state and territory level are often overlooked du…

    Submitted 24 January, 2024; originally announced January 2024.

    Comments: 31 pages, 14 figures, 1 table

    MSC Class: 62R10

  21. arXiv:2401.05784  [pdf, other]

    econ.EM stat.ME

    Covariance Function Estimation for High-Dimensional Functional Time Series with Dual Factor Structures

    Authors: Chenlei Leng, Degui Li, Hanlin Shang, Yingcun Xia

    Abstract: We propose a flexible dual functional factor model for modelling high-dimensional functional time series. In this model, a high-dimensional fully functional factor parametrisation is imposed on the observed functional processes, whereas a low-dimensional version (via series approximation) is assumed for the latent functional factors. We extend the classic principal component analysis technique for…

    Submitted 12 January, 2024; v1 submitted 11 January, 2024; originally announced January 2024.

  22. arXiv:2401.05700  [pdf, other]

    cs.CL cs.AI

    R-BI: Regularized Batched Inputs enhance Incremental Decoding Framework for Low-Latency Simultaneous Speech Translation

    Authors: Jiaxin Guo, Zhanglin Wu, Zongyao Li, Hengchao Shang, Daimeng Wei, Xiaoyu Chen, Zhiqiang Rao, Shaojun Li, Hao Yang

    Abstract: Incremental Decoding is an effective framework that enables the use of an offline model in a simultaneous setting without modifying the original model, making it suitable for Low-Latency Simultaneous Speech Translation. However, this framework may introduce errors when the system produces output from incomplete input. To reduce these output errors, several strategies such as Hold-$n$, LA-$n$, and SP-$n$ c…

    Submitted 11 January, 2024; originally announced January 2024.

    Comments: Preprint

  23. UCorrect: An Unsupervised Framework for Automatic Speech Recognition Error Correction

    Authors: Jiaxin Guo, Minghan Wang, Xiaosong Qiao, Daimeng Wei, Hengchao Shang, Zongyao Li, Zhengzhe Yu, Yinglu Li, Chang Su, Min Zhang, Shimin Tao, Hao Yang

    Abstract: Error correction techniques have been used to refine the output sentences from automatic speech recognition (ASR) models and achieve a lower word error rate (WER). Previous works usually adopt end-to-end models and have a strong dependency on Pseudo Paired Data and Original Paired Data. But when only pre-training on Pseudo Paired Data, previous models have a negative effect on correction. While fine-tu…

    Submitted 11 January, 2024; originally announced January 2024.

    Comments: Accepted in ICASSP 2023

  24. arXiv:2401.02882  [pdf, other]

    cs.HC q-bio.TO

    SpatialVisVR: An Immersive, Multiplexed Medical Image Viewer With Contextual Similar-Patient Search

    Authors: Jai Prakash Veerla, Partha Sai Guttikonda, Amir Hajighasemi, Jillur Rahman Saurav, Aarti Darji, Cody T. Reynolds, Mohamed Mohamed, Mohammad S. Nasr, Helen H. Shang, Jacob M. Luber

    Abstract: In contemporary pathology, multiplexed immunofluorescence (mIF) and multiplex immunohistochemistry (mIHC) present both significant opportunities and challenges. These methodologies shed light on intricate tumor microenvironment interactions, emphasizing the need for intuitive visualization tools to analyze vast biological datasets effectively. As electronic health records (EHR) proliferate and phy…

    Submitted 11 May, 2024; v1 submitted 5 January, 2024; originally announced January 2024.

  25. arXiv:2312.14574  [pdf, other]

    cs.CV cs.LG

    MMGPL: Multimodal Medical Data Analysis with Graph Prompt Learning

    Authors: Liang Peng, Songyue Cai, Zongqian Wu, Huifang Shang, Xiaofeng Zhu, Xiaoxiao Li

    Abstract: Prompt learning has demonstrated impressive efficacy in the fine-tuning of multimodal large models to a wide range of downstream tasks. Nonetheless, applying existing prompt learning methods for the diagnosis of neurological disorders still suffers from two issues: (i) existing methods typically treat all patches equally, despite the fact that only a small number of patches in neuroimaging are rele…

    Submitted 27 June, 2024; v1 submitted 22 December, 2023; originally announced December 2023.

  26. arXiv:2312.12587  [pdf, other]

    eess.SP cs.DC q-bio.TO

    Real-Time Diagnostic Integrity Meets Efficiency: A Novel Platform-Agnostic Architecture for Physiological Signal Compression

    Authors: Neel R Vora, Amir Hajighasemi, Cody T. Reynolds, Amirmohammad Radmehr, Mohamed Mohamed, Jillur Rahman Saurav, Abdul Aziz, Jai Prakash Veerla, Mohammad S Nasr, Hayden Lotspeich, Partha Sai Guttikonda, Thuong Pham, Aarti Darji, Parisa Boodaghi Malidarreh, Helen H Shang, Jay Harvey, Kan Ding, Phuc Nguyen, Jacob M Luber

    Abstract: Head-based signals such as EEG, EMG, EOG, and ECG collected by wearable systems will play a pivotal role in clinical diagnosis, monitoring, and treatment of important brain disorders. However, the real-time transmission of the significant corpus of physiological signals over extended periods consumes substantial power and time, limiting the viability of battery-dependent physiological monit…

    Submitted 4 January, 2024; v1 submitted 19 December, 2023; originally announced December 2023.

  27. arXiv:2312.04998  [pdf, other]

    astro-ph.IM

    An Efficient Algorithm for Astrochemical Systems Using Stoichiometry Matrices

    Authors: Kazutaka Motoyama, Ruben Krasnopolsky, Hsien Shang, Kento Aida, Eisaku Sakane

    Abstract: Astrochemical simulations are a powerful tool for revealing chemical evolution in the interstellar medium. Astrochemical calculations require efficient processing of large matrices for the chemical networks. The large chemical reaction networks often present bottlenecks for computation because of time derivatives of chemical abundances. We propose an efficient algorithm using a stoichiometry matri…

    Submitted 8 December, 2023; originally announced December 2023.

    Comments: 10 pages, 7 figures, accepted for publication in ApJS

  28. arXiv:2311.18477  [pdf, other]

    stat.ME stat.AP

    Intraday foreign exchange rate volatility forecasting: univariate and multilevel functional GARCH models

    Authors: Fearghal Kearney, Han Lin Shang, Yuqian Zhao

    Abstract: This paper seeks to predict conditional intraday volatility in foreign exchange (FX) markets using functional Generalized AutoRegressive Conditional Heteroscedasticity (GARCH) models. We contribute to the existing functional GARCH-type models by accounting for the stylised features of long-range and cross-dependence through estimating the models with long-range dependent and multi-level functional…

    Submitted 30 November, 2023; originally announced November 2023.

    Comments: 43 pages, 5 figures, 8 tables

    MSC Class: 62R10

  29. arXiv:2311.18200  [pdf, other]

    cs.CL

    INarIG: Iterative Non-autoregressive Instruct Generation Model For Word-Level Auto Completion

    Authors: Hengchao Shang, Zongyao Li, Daimeng Wei, Jiaxin Guo, Minghan Wang, Xiaoyu Chen, Lizhi Lei, Hao Yang

    Abstract: Computer-aided translation (CAT) aims to enhance human translation efficiency and is still important in scenarios where machine translation cannot meet quality requirements. One fundamental task within this field is Word-Level Auto Completion (WLAC). WLAC predicts a target word given a source sentence, translation context, and a human-typed character sequence. Previous works either employ word cla…

    Submitted 29 November, 2023; originally announced November 2023.

    Comments: EMNLP 2023

  30. arXiv:2311.00401  [pdf, other]

    cs.CV cs.AI

    A Spatial-Temporal Transformer based Framework For Human Pose Assessment And Correction in Education Scenarios

    Authors: Wenyang Hu, Kai Liu, Libin Liu, Huiliang Shang

    Abstract: Human pose assessment and correction play a crucial role in applications across various fields, including computer vision, robotics, sports analysis, healthcare, and entertainment. In this paper, we propose a Spatial-Temporal Transformer based Framework (STTF) for human pose assessment and correction in education scenarios such as physical exercises and science experiments. The framework comprising…

    Submitted 1 November, 2023; originally announced November 2023.

  31. arXiv:2311.00370  [pdf]

    astro-ph.HE astro-ph.GA hep-ph

    Discovery of four pulsars in a pilot survey at intermediate Galactic latitudes with FAST

    Authors: Q. J. Zhi, J. T. Bai, S. Dai, X. Xu, S. J. Dang, L. H. Shang, R. S. Zhao, D. Li, W. W. Zhu, N. Wang, J. P. Yuan, P. Wang, L. Zhang, Y. Feng, J. B. Wang, S. Q. Wang, Q. D. Wu, A. J. Dong, H. Yang, J. Tian, W. Q. Zhong, X. H. Luo, Miroslav D. Filipović, G. J. Qiao

    Abstract: We present the discovery and timing results of four pulsars discovered in a pilot survey at intermediate Galactic latitudes with the Five-hundred-meter Aperture Spherical radio Telescope (FAST). Among these pulsars, two belong to the category of millisecond pulsars (MSPs) with spin periods of less than 20 ms. The other two fall under the classification of "mildly recycled" pulsars, with massive white dwarfs a…

    Submitted 28 December, 2023; v1 submitted 1 November, 2023; originally announced November 2023.

    Comments: 7 pages, 4 figures, 2 tables, accepted to ApJ

  32. arXiv:2310.16480  [pdf, other]

    astro-ph.GA

    Exploring the Formation of Resistive Pseudodisks with the GPU Code Astaroth

    Authors: Miikka S. Väisälä, Hsien Shang, Daniele Galli, Susana Lizano, Ruben Krasnopolsky

    Abstract: Pseudodisks are dense structures formed perpendicular to the direction of the magnetic field during the gravitational collapse of a molecular cloud core. Numerical simulations of the formation of pseudodisks are usually computationally expensive with conventional CPU codes. To demonstrate the proof-of-concept of a fast computing method for this numerically costly problem, we explore the GPU-powere…

    Submitted 25 October, 2023; originally announced October 2023.

    Comments: 29 pages, 1 table, 15 figures, Accepted for publication in the Astrophysical Journal

  33. arXiv:2310.09568  [pdf, other]

    cs.AR

    Wafer-scale Computing: Advancements, Challenges, and Future Perspectives

    Authors: Yang Hu, Xinhan Lin, Huizheng Wang, Zhen He, Xingmao Yu, Jiahao Zhang, Qize Yang, Zheng Xu, Sihan Guan, Jiahao Fang, Haoran Shang, Xinru Tang, Xu Dai, Shaojun Wei, Shouyi Yin

    Abstract: Nowadays, artificial intelligence (AI) technology with large models plays an increasingly important role in both academia and industry. It also brings a rapidly increasing demand for the computing power of the hardware. As the computing demand for AI continues to grow, the growth of hardware computing power has failed to keep up. This has become a significant factor restricting the development of…

    Submitted 14 October, 2023; originally announced October 2023.

    ACM Class: B.7.0; C.1

  34. arXiv:2310.08439  [pdf, other]

    physics.comp-ph cs.DC

    TensorMD: Scalable Tensor-Diagram based Machine Learning Interatomic Potential on Heterogeneous Many-Core Processors

    Authors: Xin Chen, Yucheng Ouyang, Xin Chen, Zhenchuan Chen, Rongfen Lin, Xingyu Gao, Lifang Wang, Fang Li, Yin Liu, Honghui Shang, Haifeng Song

    Abstract: Molecular dynamics simulations have emerged as a potent tool for investigating the physical properties and kinetic behaviors of materials at the atomic scale, particularly in extreme conditions. Ab initio accuracy is now achievable with machine learning based interatomic potentials. With recent advancements in high-performance computing, highly accurate and large-scale simulations become feasible.…

    Submitted 12 October, 2023; v1 submitted 12 October, 2023; originally announced October 2023.

  35. arXiv:2308.14104  [pdf, other]

    cs.LG

    Towards Generalizable Neural Solvers for Vehicle Routing Problems via Ensemble with Transferrable Local Policy

    Authors: Chengrui Gao, Haopu Shang, Ke Xue, Dong Li, Chao Qian

    Abstract: Machine learning has been adapted to help solve NP-hard combinatorial optimization problems. One prevalent way is learning to construct solutions by deep neural networks, which has been receiving more and more attention due to its high efficiency and lower requirement for expert knowledge. However, many neural construction methods for Vehicle Routing Problems (VRPs) focus on synthetic problem insta…

    Submitted 5 May, 2024; v1 submitted 27 August, 2023; originally announced August 2023.

    Comments: Accepted by IJCAI 2024

  36. arXiv:2308.05494  [pdf, other]

    astro-ph.GA astro-ph.SR

    ALMA Survey of Orion Planck Galactic Cold Clumps (ALMASOP): The Warm-Envelope Origin of Hot Corinos

    Authors: Shih-Ying Hsu, Sheng-Yuan Liu, Doug Johnstone, Tie Liu, Leonardo Bronfman, Huei-Ru Vivien Chen, Somnath Dutta, David J. Eden, Neal J. Evans II, Naomi Hirano, Mika Juvela, Yi-Jehng Kuan, Woojin Kwon, Chin-Fei Lee, Chang Won Lee, Jeong-Eun Lee, Shanghuo Li, Chun-Fan Liu, Xunchuan Liu, Qiuyi Luo, Sheng-Li Qin, Mark G. Rawlings, Dipen Sahu, Patricio Sanhueza, Hsien Shang, et al. (2 additional authors not shown)

    Abstract: Hot corinos are of great interest due to their richness in interstellar complex organic molecules (COMs) and the consequent potential prebiotic connection to solar-like planetary systems. Recent surveys have reported an increasing number of hot corino detections in Class 0/I protostars; however, the relationships between their physical properties and the hot-corino signatures remain elusive. In th…

    Submitted 11 August, 2023; v1 submitted 10 August, 2023; originally announced August 2023.

    Comments: 28 pages, 11 figures

  37. arXiv:2308.01454  [pdf, other]

    astro-ph.EP

    TOI-4860 b, a short-period giant planet transiting an M3.5 dwarf

    Authors: J. M. Almenara, X. Bonfils, E. M. Bryant, A. Jordán, G. Hébrard, E. Martioli, A. C. M. Correia, N. Astudillo-Defru, C. Cadieux, L. Arnold, É. Artigau, G. Á. Bakos, S. C. C. Barros, D. Bayliss, F. Bouchy, G. Boué, R. Brahm, A. Carmona, D. Charbonneau, D. R. Ciardi, R. Cloutier, M. Cointepas, N. J. Cook, N. B. Cowan, X. Delfosse, et al. (25 additional authors not shown)

    Abstract: We report the discovery and characterisation of a giant transiting planet orbiting a nearby M3.5V dwarf (d = 80.4 pc, $G$ = 15.1 mag, $K$=11.2 mag, R$_\star$ = 0.358 $\pm$ 0.015 R$_\odot$, M$_\star$ = 0.340 $\pm$ 0.009 M$_\odot$). Using the photometric time series from TESS sectors 10, 36, 46, and 63 and near-infrared spectrophotometry from ExTrA, we measured a planetary radius of 0.77 $\pm$ 0.03…

    Submitted 12 January, 2024; v1 submitted 2 August, 2023; originally announced August 2023.

    Comments: 16 pages, 14 figures, accepted for publication in A&A

  38. arXiv:2307.12746  [pdf, other]

    astro-ph.SR astro-ph.GA astro-ph.HE

    A high-resolution radio study of the L1551 IRS 5 and L1551 NE jets

    Authors: A. Feeney-Johansson, S. J. D. Purser, T. P. Ray, C. Carrasco-González, A. Rodríguez-Kamenetzky, J. Eislöffel, J. Lim, R. Galván-Madrid, S. Lizano, L. F. Rodríguez, H. Shang, P. Ho, M. Hoare

    Abstract: Using observations with e-MERLIN and the VLA, together with archival data from ALMA, we obtain high-resolution radio images of two binary YSOs: L1551 IRS 5 and L1551 NE, covering a wide range of frequencies from 5 to 336 GHz, and resolving emission from the radio jet on scales of only ~15 au. By comparing these observations to those from a previous epoch, it is shown that there is a high degree of…

    Submitted 24 July, 2023; v1 submitted 24 July, 2023; originally announced July 2023.

    Comments: 13 pages, 7 figures, accepted for publication in A&A

    Journal ref: A&A 677, A97 (2023)

  39. arXiv:2307.09343  [pdf, other]

    quant-ph

    Solving Schrödinger Equation with a Language Model

    Authors: Honghui Shang, Chu Guo, Yangjun Wu, Zhenyu Li, Jinlong Yang

    Abstract: Accurately solving the Schrödinger equation for intricate systems remains a prominent challenge in physical sciences. A paradigm-shifting approach to address this challenge involves the application of artificial intelligence techniques. In this study, we introduce a machine-learning model named QiankunNet, based on the transformer architecture employed in language models. By incorporating the atte…

    Submitted 4 April, 2024; v1 submitted 18 July, 2023; originally announced July 2023.

  40. arXiv:2307.01047  [pdf, other]

    cs.CV

    Cross-modal Place Recognition in Image Databases using Event-based Sensors

    Authors: Xiang Ji, Jiaxin Wei, Yifu Wang, Huiliang Shang, Laurent Kneip

    Abstract: Visual place recognition is an important problem towards global localization in many robotics tasks. One of the biggest challenges is that it may suffer from illumination or appearance changes in surrounding environments. Event cameras are interesting alternatives to frame-based sensors as their high dynamic range enables robust perception in difficult illumination conditions. However, current eve…

    Submitted 3 July, 2023; originally announced July 2023.

  41. arXiv:2306.17019  [pdf, other]

    eess.IV cs.CV q-bio.TO

    Histopathology Slide Indexing and Search: Are We There Yet?

    Authors: Helen H. Shang, Mohammad Sadegh Nasr, Jai Prakash Veerla, Parisa Boodaghi Malidarreh, MD Jillur Rahman Saurav, Amir Hajighasemi, Manfred Huber, Chace Moleta, Jitin Makker, Jacob M. Luber

    Abstract: The search and retrieval of digital histopathology slides is an important task that has yet to be solved. In this case study, we investigate the clinical readiness of three state-of-the-art histopathology slide search engines, Yottixel, SISH, and RetCCL, on three patients with solid tumors. We provide a qualitative assessment of each model's performance in providing retrieval results that are reli…

    Submitted 4 January, 2024; v1 submitted 29 June, 2023; originally announced June 2023.

  42. arXiv:2306.16989  [pdf]

    q-bio.TO cs.CV eess.IV

    The State of Applying Artificial Intelligence to Tissue Imaging for Cancer Research and Early Detection

    Authors: Michael Robben, Amir Hajighasemi, Mohammad Sadegh Nasr, Jai Prakesh Veerla, Anne M. Alsup, Biraaj Rout, Helen H. Shang, Kelli Fowlds, Parisa Boodaghi Malidarreh, Paul Koomey, MD Jillur Rahman Saurav, Jacob M. Luber

    Abstract: Artificial intelligence represents a new frontier in human medicine that could save more lives and reduce the costs, thereby increasing accessibility. As a consequence, the rate of advancement of AI in cancer medical imaging and more particularly tissue pathology has exploded, opening it to ethical and technical questions that could impede its adoption into existing systems. In order to chart the…

    Submitted 29 June, 2023; originally announced June 2023.

    Journal ref: F1000Research 2023, 12:1436

  43. arXiv:2306.16705  [pdf, other]

    quant-ph cs.AI

    NNQS-Transformer: an Efficient and Scalable Neural Network Quantum States Approach for Ab initio Quantum Chemistry

    Authors: Yangjun Wu, Chu Guo, Yi Fan, Pengyu Zhou, Honghui Shang

    Abstract: Neural network quantum state (NNQS) has emerged as a promising candidate for quantum many-body problems, but its practical applications are often hindered by the high cost of sampling and local energy calculation. We develop a high-performance NNQS method for ab initio electronic structure calculations. The major innovations include: (1) A transformer based architecture as the quantum wav…

    Submitted 1 November, 2023; v1 submitted 29 June, 2023; originally announced June 2023.

    Comments: Accepted by SC'23, fix Table1 CCSD references

  44. ALMA Survey of Orion Planck Galactic Cold Clumps (ALMASOP): A forming quadruple system with continuum `ribbons' and intricate outflows

    Authors: Qiu-yi Luo, Tie Liu, Aaron T. Lee, Stella S. R. Offner, James di Francesco, Doug Johnstone, Mika Juvela, Paul F. Goldsmith, Sheng-Li Qin, Xiaofeng Mai, Xun-chuan Liu, Patricio Sanhueza, Feng-Wei Xu, Ken'ichi Tatematsu, Somnath Dutta, Huei-Ru Vivien Chen, Shanghuo Li, Aiyuan Yang, Sheng-Yuan Liu, Chin-Fei Lee, Naomi Hirano, Chang Won Lee, Dipen Sahu, Hsien Shang, Shih-Ying Hsu, et al. (9 additional authors not shown)

    Abstract: One of the most poorly understood aspects of low-mass star formation is how multiple-star systems are formed. Here we present the results of Atacama Large Millimeter/submillimeter Array (ALMA) Band-6 observations towards a forming quadruple protostellar system, G206.93-16.61E2, in the Orion B molecular cloud. ALMA 1.3 mm continuum emission reveals four compact objects, of which two are Class I you…

    Submitted 13 June, 2023; originally announced June 2023.

    Comments: The paper was accepted by APJL

  45. arXiv:2306.06780  [pdf, other]

    eess.IV cs.CV q-bio.QM

    Multimodal Pathology Image Search Between H&E Slides and Multiplexed Immunofluorescent Images

    Authors: Amir Hajighasemi, MD Jillur Rahman Saurav, Mohammad S Nasr, Jai Prakash Veerla, Aarti Darji, Parisa Boodaghi Malidarreh, Michael Robben, Helen H Shang, Jacob M Luber

    Abstract: We present an approach for multimodal pathology image search, using dynamic time warping (DTW) on Variational Autoencoder (VAE) latent space that is fed into a ranked choice voting scheme to retrieve multiplexed immunofluorescent imaging (mIF) that is most similar to a query H&E slide. Through training the VAE and applying DTW, we align and compare mIF and H&E slides. Our method improves different…

    Submitted 11 June, 2023; originally announced June 2023.

  46. arXiv:2306.01318  [pdf, other]

    cs.CL cs.LG

    Text Style Transfer Back-Translation

    Authors: Daimeng Wei, Zhanglin Wu, Hengchao Shang, Zongyao Li, Minghan Wang, Jiaxin Guo, Xiaoyu Chen, Zhengzhe Yu, Hao Yang

    Abstract: Back Translation (BT) is widely used in the field of machine translation, as it has been proved effective for enhancing translation quality. However, BT mainly improves the translation of inputs that share a similar style (to be more specific, translation-like inputs), since the source side of BT data is machine-translated. For natural inputs, BT brings only slight improvements and sometimes even…

    Submitted 2 June, 2023; originally announced June 2023.

    Comments: acl2023, 14 pages, 4 figures, 19 tables

  47. arXiv:2305.19749  [pdf, other]

    stat.ME stat.AP

    Forecasting high-dimensional functional time series: Application to sub-national age-specific mortality

    Authors: Cristian F. Jiménez-Varón, Ying Sun, Han Lin Shang

    Abstract: We study the modeling and forecasting of high-dimensional functional time series (HDFTS), which can be cross-sectionally correlated and temporally dependent. We introduce a decomposition of the HDFTS into two distinct components: a deterministic component and a residual component that varies over time. The decomposition is derived through the estimation of two-way functional analysis of variance.…

    Submitted 13 February, 2024; v1 submitted 31 May, 2023; originally announced May 2023.

    Comments: 31 pages, 6 figures

    MSC Class: 62R10; 91D20

  48. arXiv:2305.16531  [pdf, other]

    stat.ME stat.AP

    Forecasting intraday financial time series with sieve bootstrapping and dynamic updating

    Authors: Han Lin Shang, Kaiying Ji

    Abstract: Intraday financial data often take the form of a collection of curves that can be observed sequentially over time, such as intraday stock price curves. These curves can be viewed as a time series of functions observed on equally spaced and dense grids. Due to the curse of dimensionality, high-dimensional data poses challenges from a statistical aspect; however, it also provides opportunities to an…

    Submitted 25 May, 2023; originally announced May 2023.

    Comments: 25 pages, 10 figures, 2 tables

    MSC Class: 62M10; 62M20

  49. arXiv:2305.03893  [pdf]

    q-bio.GN stat.AP

    Generalizability of PRS313 for breast cancer risk amongst non-Europeans in a Los Angeles biobank

    Authors: Helen Shang, Yi Ding, Vidhya Venkateswaran, Kristin Boulier, Nikhita Kathuria-Prakash, Parisa Boodaghi Malidarreh, Jacob M. Luber, Bogdan Pasaniuc

    Abstract: Polygenic risk scores (PRS) summarize the combined effect of common risk variants and are associated with breast cancer risk in patients without identifiable monogenic risk factors. One of the most well-validated PRSs in breast cancer to date is PRS313, which was developed from a Northern European biobank but has shown attenuated performance in non-European ancestries. We further investigate the g…

    Submitted 5 May, 2023; originally announced May 2023.

    Comments: 27 pages, 2 figures

  50. arXiv:2304.09423  [pdf, other]

    cs.CV

    ASM: Adaptive Skinning Model for High-Quality 3D Face Modeling

    Authors: Kai Yang, Hong Shang, Tianyang Shi, Xinghan Chen, Jingkai Zhou, Zhongqian Sun, Wei Yang

    Abstract: The research fields of parametric face model and 3D face reconstruction have been extensively studied. However, a critical question remains unanswered: how to tailor the face model for specific reconstruction settings. We argue that reconstruction with multi-view uncalibrated images demands a new model with stronger capacity. Our study shifts attention from data-dependent 3D Morphable Models (3DMM…

    Submitted 8 October, 2023; v1 submitted 19 April, 2023; originally announced April 2023.

    Comments: 18 pages