
Showing 1–50 of 1,084 results for author: Feng, S

  1. arXiv:2407.11086  [pdf, other]

    cs.LG cs.AI physics.chem-ph

    Pre-training with Fractional Denoising to Enhance Molecular Property Prediction

    Authors: Yuyan Ni, Shikun Feng, Xin Hong, Yuancheng Sun, Wei-Ying Ma, Zhi-Ming Ma, Qiwei Ye, Yanyan Lan

    Abstract: Deep learning methods have been considered promising for accelerating molecular screening in drug discovery and material design. Due to the limited availability of labelled data, various self-supervised molecular pre-training methods have been presented. While many existing methods utilize common pre-training tasks in computer vision (CV) and natural language processing (NLP), they often overlook…

    Submitted 14 July, 2024; originally announced July 2024.

  2. arXiv:2407.05165  [pdf, other]

    cs.SE

    Feedback-Driven Automated Whole Bug Report Reproduction for Android Apps

    Authors: Dingbang Wang, Yu Zhao, Sidong Feng, Zhaoxu Zhang, William G. J. Halfond, Chunyang Chen, Xiaoxia Sun, Jiangfan Shi, Tingting Yu

    Abstract: In software development, bug report reproduction is a challenging task. This paper introduces ReBL, a novel feedback-driven approach that leverages GPT-4, a large-scale language model, to automatically reproduce Android bug reports. Unlike traditional methods, ReBL bypasses the use of Step to Reproduce (S2R) entities. Instead, it leverages the entire textual bug report and employs innovative promp…

    Submitted 6 July, 2024; originally announced July 2024.

    Comments: Accepted by ISSTA 2024

  3. arXiv:2407.04984  [pdf]

    cond-mat.mtrl-sci cond-mat.mes-hall

    Prolonged Phase Segregation of Mixed-Halide Perovskite Nanocrystals in the Dark

    Authors: Xueying Ma, Yuhui Ye, Yang Xiao, Shengnan Feng, Chunfeng Zhang, Keyu Xia, Fengrui Hu, Min Xiao, Xiaoyong Wang

    Abstract: A critical issue hindering the potential applications of semiconductor mixed-halide perovskites is the phase segregation effect, wherein localized regions enriched with one type of halide anions would be formed upon continuous photogeneration of the excited-state charge carriers. These unexpected phases are capable of remixing again in the dark under the entropic driving force, the process of whic…

    Submitted 6 July, 2024; originally announced July 2024.

  4. arXiv:2407.04813  [pdf, other]

    astro-ph.EP astro-ph.GA astro-ph.SR

    FAUST XVII: Super deuteration in the planet forming system IRS 63 where the streamer strikes the disk

    Authors: L. Podio, C. Ceccarelli, C. Codella, G. Sabatini, D. Segura-Cox, N. Balucani, A. Rimola, P. Ugliengo, C. J. Chandler, N. Sakai, B. Svoboda, J. Pineda, M. De Simone, E. Bianchi, P. Caselli, A. Isella, Y. Aikawa, M. Bouvier, E. Caux, L. Chahine, S. B. Charnley, N. Cuello, F. Dulieu, L. Evans, D. Fedele, et al. (33 additional authors not shown)

    Abstract: Recent observations suggest that planets formation starts early, in protostellar disks of $\le10^5$ yrs, which are characterized by strong interactions with the environment, e.g., through accretion streamers and molecular outflows. To investigate the impact of such phenomena on disk physical and chemical properties it is key to understand what chemistry planets inherit from their natal environment…

    Submitted 5 July, 2024; originally announced July 2024.

    Comments: 12 pages, 10 figures, accepted for publication on A&A

  5. arXiv:2407.04549  [pdf, other]

    cs.CL cs.AI

    Spontaneous Reward Hacking in Iterative Self-Refinement

    Authors: Jane Pan, He He, Samuel R. Bowman, Shi Feng

    Abstract: Language models are capable of iteratively improving their outputs based on natural language feedback, thus enabling in-context optimization of user preference. In place of human users, a second language model can be used as an evaluator, providing feedback along with numerical ratings which the generator attempts to optimize. However, because the evaluator is an imperfect proxy of user preference…

    Submitted 5 July, 2024; originally announced July 2024.

  6. arXiv:2407.03053  [pdf]

    physics.optics physics.app-ph

    Visible, Near-, and Mid-infrared Computational Spectrometer Enabled by Single-Spinning Film Encoder

    Authors: Junren Wen, Weiming Shi, Cheng Gao, Yujie Liu, Shuaibo Feng, Yu Shao, Haiqi Gao, Yuchuan Shao, Yueguang Zhang, Weidong Shen, Chenying Yang

    Abstract: Computational spectrometers are pivotal in enabling low-cost, in-situ and rapid spectral analysis, with potential applications in chemistry, biology, and environmental science. However, filter-based spectral encoding approaches typically use filter arrays, complicating the manufacturing process and hindering device consistency. By capitalizing on the polarization separation effect under oblique in…

    Submitted 3 July, 2024; originally announced July 2024.

  7. arXiv:2407.02646  [pdf, other]

    cs.AI cs.CL

    A Practical Review of Mechanistic Interpretability for Transformer-Based Language Models

    Authors: Daking Rai, Yilun Zhou, Shi Feng, Abulhair Saparov, Ziyu Yao

    Abstract: Mechanistic interpretability (MI) is an emerging sub-field of interpretability that seeks to understand a neural network model by reverse-engineering its internal computations. Recently, MI has garnered significant attention for interpreting transformer-based language models (LMs), resulting in many novel insights yet introducing new challenges. However, there has not been work that comprehensivel…

    Submitted 2 July, 2024; originally announced July 2024.

    Comments: 11 pages, 11 figures, Preprint

    ACM Class: I.2.7

  8. arXiv:2407.02056  [pdf, other]

    cs.CL cs.AI

    Integrate the Essence and Eliminate the Dross: Fine-Grained Self-Consistency for Free-Form Language Generation

    Authors: Xinglin Wang, Yiwei Li, Shaoxiong Feng, Peiwen Yuan, Boyuan Pan, Heda Wang, Yao Hu, Kan Li

    Abstract: Self-consistency (SC), leveraging multiple samples from LLMs, shows significant gains on various reasoning tasks but struggles with free-form generation due to the difficulty of aggregating answers. Its variants, UCS and USC, rely on sample selection or voting mechanisms to improve output quality. These methods, however, face limitations due to their inability to fully utilize the nuanced consensu…

    Submitted 2 July, 2024; originally announced July 2024.

    Comments: Accepted to ACL2024 Main Conference

  9. arXiv:2407.00627  [pdf, ps, other]

    hep-ph

    Rotation effect on the spectral function of heavy vector mesons in holographic QCD

    Authors: Xiao-Long Wang, Sheng-Qin Feng

    Abstract: Exploring heavy vector mesons of the $ J / ψ$ and $ Υ( 1 S )$ is crucial for understanding the quark gluon plasma (QGP) formed in heavy ion collisions. The influences of rotational effect on the properties of the $ J / ψ$ and the $ Υ( 1 S )$ are investigated by incorporating rotation medium into the holographic QCD. It is found that temperature, chemical potential, and rotational radius effects en…

    Submitted 30 June, 2024; originally announced July 2024.

    Comments: 17 pages, 7 figures

  10. arXiv:2406.17797  [pdf, other]

    physics.chem-ph cs.AI cs.LG

    MoleculeCLA: Rethinking Molecular Benchmark via Computational Ligand-Target Binding Analysis

    Authors: Shikun Feng, Jiaxin Zheng, Yinjun Jia, Yanwen Huang, Fengfeng Zhou, Wei-Ying Ma, Yanyan Lan

    Abstract: Molecular representation learning is pivotal for various molecular property prediction tasks related to drug discovery. Robust and accurate benchmarks are essential for refining and validating current methods. Existing molecular property benchmarks derived from wet experiments, however, face limitations such as data volume constraints, unbalanced label distribution, and noisy labels. To address th…

    Submitted 12 June, 2024; originally announced June 2024.

  11. arXiv:2406.16588  [pdf, other]

    eess.SY cs.FL

    Switching Controller Synthesis for Hybrid Systems Against STL Formulas

    Authors: Han Su, Shenghua Feng, Sinong Zhan, Naijun Zhan

    Abstract: Switching controllers play a pivotal role in directing hybrid systems (HSs) towards the desired objective, embodying a ``correct-by-construction'' approach to HS design. Identifying these objectives is thus crucial for the synthesis of effective switching controllers. While most of existing works focus on safety and liveness, few of them consider timing constraints. In this paper, we delves into t…

    Submitted 24 June, 2024; originally announced June 2024.

  12. arXiv:2406.15992  [pdf, other]

    cs.CL

    Can LLM Graph Reasoning Generalize beyond Pattern Memorization?

    Authors: Yizhuo Zhang, Heng Wang, Shangbin Feng, Zhaoxuan Tan, Xiaochuang Han, Tianxing He, Yulia Tsvetkov

    Abstract: Large language models (LLMs) demonstrate great potential for problems with implicit graphical structures, while recent works seek to enhance the graph reasoning capabilities of LLMs through specialized instruction tuning. The resulting 'graph LLMs' are evaluated with in-distribution settings only, thus it remains underexplored whether LLMs are learning generalizable graph reasoning skills or merel…

    Submitted 22 June, 2024; originally announced June 2024.

    Comments: 16 pages, 6 figures, Code and data will be publicly available at https://github.com/MatthewYZhang/NLGift

    ACM Class: I.2.7

  13. arXiv:2406.15951  [pdf, other]

    cs.CL

    Modular Pluralism: Pluralistic Alignment via Multi-LLM Collaboration

    Authors: Shangbin Feng, Taylor Sorensen, Yuhan Liu, Jillian Fisher, Chan Young Park, Yejin Choi, Yulia Tsvetkov

    Abstract: While existing alignment paradigms have been integral in developing large language models (LLMs), LLMs often learn an averaged human preference and struggle to model diverse preferences across cultures, demographics, and communities. We propose Modular Pluralism, a modular framework based on multi-LLM collaboration for pluralistic alignment: it "plugs into" a base LLM a pool of smaller but special…

    Submitted 22 June, 2024; originally announced June 2024.

  14. arXiv:2406.15948  [pdf, other]

    cs.CL

    Teaching LLMs to Abstain across Languages via Multilingual Feedback

    Authors: Shangbin Feng, Weijia Shi, Yike Wang, Wenxuan Ding, Orevaoghene Ahia, Shuyue Stella Li, Vidhisha Balachandran, Sunayana Sitaram, Yulia Tsvetkov

    Abstract: Multilingual LLMs often have knowledge disparities across languages, with larger gaps in under-resourced languages. Teaching LLMs to abstain in the face of knowledge gaps is thus a promising strategy to mitigate hallucinations in multilingual settings. However, previous studies on LLM abstention primarily focus on English; we find that directly applying existing solutions beyond English results in…

    Submitted 22 June, 2024; originally announced June 2024.

  15. arXiv:2406.15352  [pdf, other]

    cs.CL

    A SMART Mnemonic Sounds like "Glue Tonic": Mixing LLMs with Student Feedback to Make Mnemonic Learning Stick

    Authors: Nishant Balepur, Matthew Shu, Alexander Hoyle, Alison Robey, Shi Feng, Seraphina Goldfarb-Tarrant, Jordan Boyd-Graber

    Abstract: Keyword mnemonics are memorable explanations that link new terms to simpler keywords. Prior works generate mnemonics for students, but they do not guide models toward mnemonics students prefer and aid learning. We build SMART, a mnemonic generator trained on feedback from real students learning new terms. To train SMART, we first fine-tune LLaMA-2 on a curated set of user-written mnemonics. We the…

    Submitted 21 June, 2024; originally announced June 2024.

    Comments: In-Progress Preprint

  16. arXiv:2406.14103  [pdf, other]

    cs.AI

    Two-Stage Depth Enhanced Learning with Obstacle Map For Object Navigation

    Authors: Yanwei Zheng, Shaopu Feng, Bowen Huang, Changrui Li, Xiao Zhang, Dongxiao Yu

    Abstract: The task that requires an agent to navigate to a given object through only visual observation is called visual object navigation (VON). The main bottlenecks of VON are strategies exploration and prior knowledge exploitation. Traditional strategies exploration ignores the differences of searching and navigating stages, using the same reward in two stages, which reduces navigation performance and tr…

    Submitted 20 June, 2024; originally announced June 2024.

  17. arXiv:2406.11633  [pdf, other]

    cs.CV

    DocGenome: An Open Large-scale Scientific Document Benchmark for Training and Testing Multi-modal Large Language Models

    Authors: Renqiu Xia, Song Mao, Xiangchao Yan, Hongbin Zhou, Bo Zhang, Haoyang Peng, Jiahao Pi, Daocheng Fu, Wenjie Wu, Hancheng Ye, Shiyang Feng, Bin Wang, Chao Xu, Conghui He, Pinlong Cai, Min Dou, Botian Shi, Sheng Zhou, Yongwei Wang, Bin Wang, Junchi Yan, Fei Wu, Yu Qiao

    Abstract: Scientific documents record research findings and valuable human knowledge, comprising a vast corpus of high-quality data. Leveraging multi-modality data extracted from these documents and assessing large models' abilities to handle scientific document-oriented tasks is therefore meaningful. Despite promising advancements, large models still perform poorly on multi-page scientific document extract…

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: Homepage of DocGenome: https://unimodal4reasoning.github.io/DocGenome_page 22 pages, 11 figures

  18. arXiv:2406.11568  [pdf, other]

    cs.CL cs.SD eess.AS q-bio.NC

    Towards an End-to-End Framework for Invasive Brain Signal Decoding with Large Language Models

    Authors: Sheng Feng, Heyang Liu, Yu Wang, Yanfeng Wang

    Abstract: In this paper, we introduce a groundbreaking end-to-end (E2E) framework for decoding invasive brain signals, marking a significant advancement in the field of speech neuroprosthesis. Our methodology leverages the comprehensive reasoning abilities of large language models (LLMs) to facilitate direct decoding. By fully integrating LLMs, we achieve results comparable to the state-of-the-art cascade m…

    Submitted 17 June, 2024; originally announced June 2024.

  19. arXiv:2406.10447  [pdf, other]

    cs.CV

    The BabyView dataset: High-resolution egocentric videos of infants' and young children's everyday experiences

    Authors: Bria Long, Violet Xiang, Stefan Stojanov, Robert Z. Sparks, Zi Yin, Grace E. Keene, Alvin W. M. Tan, Steven Y. Feng, Chengxu Zhuang, Virginia A. Marchman, Daniel L. K. Yamins, Michael C. Frank

    Abstract: Human children far exceed modern machine learning algorithms in their sample efficiency, achieving high performance in key domains with much less data than current models. This ''data gap'' is a key challenge both for building intelligent artificial systems and for understanding human development. Egocentric video capturing children's experience -- their ''training data'' -- is a key ingredient fo…

    Submitted 14 June, 2024; originally announced June 2024.

    Comments: 9 pages, 2 figures, 4 tables and SI. Submitted to NeurIPS Datasets and Benchmarks

  20. arXiv:2406.09881  [pdf, other]

    cs.CL

    A Unified Data Augmentation Framework for Low-Resource Multi-Domain Dialogue Generation

    Authors: Yongkang Liu, Ercong Nie, Shi Feng, Zheng Hua, Zifeng Ding, Daling Wang, Yifei Zhang, Hinrich Schütze

    Abstract: Current state-of-the-art dialogue systems heavily rely on extensive training datasets. However, challenges arise in domains where domain-specific training datasets are insufficient or entirely absent. To tackle this challenge, we propose a novel data \textbf{A}ugmentation framework for \textbf{M}ulti-\textbf{D}omain \textbf{D}ialogue \textbf{G}eneration, referred to as \textbf{AMD$^2$G}. The AMD…

    Submitted 28 June, 2024; v1 submitted 14 June, 2024; originally announced June 2024.

    Comments: 17 pages, ECML-PKDD

    Journal ref: 2024 European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases

  21. arXiv:2406.09486  [pdf, other]

    cs.CV cs.AI

    SeMOPO: Learning High-quality Model and Policy from Low-quality Offline Visual Datasets

    Authors: Shenghua Wan, Ziyuan Chen, Le Gan, Shuai Feng, De-Chuan Zhan

    Abstract: Model-based offline reinforcement Learning (RL) is a promising approach that leverages existing data effectively in many real-world applications, especially those involving high-dimensional inputs like images and videos. To alleviate the distribution shift issue in offline RL, existing model-based methods heavily rely on the uncertainty of learned dynamics. However, the model uncertainty estimatio…

    Submitted 13 June, 2024; originally announced June 2024.

    Comments: 23 pages, 10 figures

  22. arXiv:2406.08698  [pdf, other]

    astro-ph.HE hep-ph

    Constraints on Ultra Heavy Dark Matter Properties from Dwarf Spheroidal Galaxies with LHAASO Observations

    Authors: Zhen Cao, F. Aharonian, Q. An, Axikegu, Y. X. Bai, Y. W. Bao, D. Bastieri, X. J. Bi, Y. J. Bi, J. T. Cai, Q. Cao, W. Y. Cao, Zhe Cao, J. Chang, J. F. Chang, A. M. Chen, E. S. Chen, Liang Chen, Lin Chen, Long Chen, M. J. Chen, M. L. Chen, Q. H. Chen, S. H. Chen, S. Z. Chen, et al. (255 additional authors not shown)

    Abstract: In this work we try to search for signals generated by ultra-heavy dark matter at the Large High Altitude Air Shower Observatory (LHAASO) data. We look for possible gamma-ray by dark matter annihilation or decay from 16 dwarf spheroidal galaxies in the field of view of LHAASO. Dwarf spheroidal galaxies are among the most promising targets for indirect detection of dark matter which have low fluxes…

    Submitted 12 June, 2024; originally announced June 2024.

    Comments: 17 pages, 12 figures, accepted by PRL

  23. arXiv:2406.07850  [pdf, other]

    cs.CL cs.AI

    Dynamic Stochastic Decoding Strategy for Open-Domain Dialogue Generation

    Authors: Yiwei Li, Fei Mi, Yitong Li, Yasheng Wang, Bin Sun, Shaoxiong Feng, Kan Li

    Abstract: Stochastic sampling strategies such as top-k and top-p have been widely used in dialogue generation task. However, as an open-domain chatting system, there will be two different conversation scenarios, i.e. chit-chat and knowledge-based question answering. In the former situation, responses diversity is essential due to the one-to-many nature in dialogue. The latter, on the other hand, requires le…

    Submitted 11 June, 2024; originally announced June 2024.

    Comments: ACL 2024 Findings

  24. arXiv:2406.05135  [pdf]

    cs.RO math.OC

    Smart Navigation System for Parking Assignment at Large Events: Incorporating Heterogeneous Driver Characteristics

    Authors: Xi Cheng, Gaofeng Su, Siyuan Feng, Ke Liu, Chen Zhu, Hui Lin, Jilin Song, Jianan Chen

    Abstract: Parking challenges escalate significantly during large events such as concerts or sports games, yet few studies address dynamic parking lot assignments for such occasions. This paper introduces a smart navigation system designed to optimize parking assignments swiftly during large events, utilizing a mixed search algorithm that accounts for the heterogeneous characteristics of drivers. We conducte…

    Submitted 14 May, 2024; originally announced June 2024.

  25. arXiv:2406.00922  [pdf, other]

    cs.CL cs.AI

    MEDIQ: Question-Asking LLMs for Adaptive and Reliable Clinical Reasoning

    Authors: Shuyue Stella Li, Vidhisha Balachandran, Shangbin Feng, Jonathan Ilgen, Emma Pierson, Pang Wei Koh, Yulia Tsvetkov

    Abstract: In high-stakes domains like clinical reasoning, AI assistants powered by large language models (LLMs) are yet to be reliable and safe. We identify a key obstacle towards reliability: existing LLMs are trained to answer any question, even with incomplete context in the prompt or insufficient parametric knowledge. We propose to change this paradigm to develop more careful LLMs that ask follow-up que…

    Submitted 4 June, 2024; v1 submitted 2 June, 2024; originally announced June 2024.

    Comments: 29 pages, 12 figures

  26. arXiv:2405.19062  [pdf, other]

    cs.LG cs.AI

    SIG: Efficient Self-Interpretable Graph Neural Network for Continuous-time Dynamic Graphs

    Authors: Lanting Fang, Yulian Yang, Kai Wang, Shanshan Feng, Kaiyu Feng, Jie Gui, Shuliang Wang, Yew-Soon Ong

    Abstract: While dynamic graph neural networks have shown promise in various applications, explaining their predictions on continuous-time dynamic graphs (CTDGs) is difficult. This paper investigates a new research task: self-interpretable GNNs for CTDGs. We aim to predict future links within the dynamic graph while simultaneously providing causal explanations for these predictions. There are two key challen…

    Submitted 29 May, 2024; originally announced May 2024.

    Comments: 19 pages

  27. arXiv:2405.18549  [pdf, other]

    cs.LG cs.DB cs.SC

    Learning from Uncertain Data: From Possible Worlds to Possible Models

    Authors: Jiongli Zhu, Su Feng, Boris Glavic, Babak Salimi

    Abstract: We introduce an efficient method for learning linear models from uncertain data, where uncertainty is represented as a set of possible variations in the data, leading to predictive multiplicity. Our approach leverages abstract interpretation and zonotopes, a type of convex polytope, to compactly represent these dataset variations, enabling the symbolic execution of gradient descent on all possible…

    Submitted 28 May, 2024; originally announced May 2024.

  28. arXiv:2405.16835  [pdf]

    cond-mat.mtrl-sci physics.chem-ph

    Superionic surface Li-ion transport in carbonaceous materials

    Authors: Jianbin Zhou, Shen Wang, Chaoshan Wu, Ji Qi, Hongli Wan, Shen Lai, Shijie Feng, Tsz Wai Ko, Zhaohui Liang, Ke Zhou, Nimrod Harpak, Nick Solan, Mengchen Liu, Zeyu Hui, Paulina J. Ai, Kent Griffith, Chunsheng Wang, Shyue Ping Ong, Yan Yao, Ping Liu

    Abstract: Unlike Li-ion transport in the bulk of carbonaceous materials, little is known about Li-ion diffusion on their surface. In this study, we have discovered an ultra-fast Li-ion transport phenomenon on the surface of carbonaceous materials, particularly when they have limited Li insertion capacity along with a high surface area. This is exemplified by a carbon black, Ketjen Black (KB). An ionic condu…

    Submitted 27 May, 2024; originally announced May 2024.

    Comments: 21 pages, 6 figures

  29. arXiv:2405.16778  [pdf, other]

    cond-mat.supr-con

    Unusual switch from low-temperature T-quadratic resistivity in the underdoped pseudogap phase of cuprate superconductors to low-temperature T-linear resistivity in the overdoped strange-metal phase

    Authors: Xingyu Ma, Minghuan Zeng, Huaiming Guo, Shiping Feng

    Abstract: The transport experiments demonstrate a dramatic switch from the low-temperature T-linear resistivity in the overdoped strange-metal phase to the T-quadratic resistivity in the underdoped pseudogap phase of cuprate superconductors, however, a consensus on the origin of this switch is still lacking. Here the low-temperature resistivity in the underdoped pseudogap phase of cuprate superconductors is…

    Submitted 26 May, 2024; originally announced May 2024.

  30. arXiv:2405.16709  [pdf, other]

    cond-mat.str-el cond-mat.mtrl-sci

    Spin-orbit coupling controlled two-dimensional magnetism in chromium trihalides

    Authors: Inhee Lee, Jiefu Cen, Oleksandr Molchanov, Shi Feng, Warren L. Huey, Johan van Tol, Joshua E. Goldberger, Nandini Trivedi, Hae-Young Kee, P. Chris Hammel

    Abstract: CrX$_3$ (X = Cl, Br, I) have the same crystal structure and Hamiltonian but different ligand spin-orbit coupling (SOC) constant $λ_X$, providing excellent material platform exploring for exotic two-dimensional (2D) spin orders. Their microscopic mechanism underlying 2D spin physics and Hamiltonian remain unestablished, along with experimental corroboration of Kitaev exchange interaction, central t…

    Submitted 26 May, 2024; originally announced May 2024.

    Comments: 18 pages, 4 figures

  31. arXiv:2405.12735  [pdf, other]

    astro-ph.GA

    Multiple chemical tracers finally unveil the intricate NGC\,1333 IRAS\,4A outflow system. FAUST XVI

    Authors: Layal Chahine, Cecilia Ceccarelli, Marta De Simone, Claire J. Chandler, Claudio Codella, Linda Podio, Ana López-Sepulcre, Nami Sakai, Laurent Loinard, Mathilde Bouvier, Paola Caselli, Charlotte Vastel, Eleonora Bianchi, Nicolás Cuello, Francesco Fontani, Doug Johnstone, Giovanni Sabatini, Tomoyuki Hanawa, Ziwei E. Zhang, Yuri Aikawa, Gemma Busquet, Emmanuel Caux, Aurore Durán, Eric Herbst, François Ménard, et al. (32 additional authors not shown)

    Abstract: The exploration of outflows in protobinary systems presents a challenging yet crucial endeavour, offering valuable insights into the dynamic interplay between protostars and their evolution. In this study, we examine the morphology and dynamics of jets and outflows within the IRAS\,4A protobinary system. This analysis is based on ALMA observations of SiO(5--4), H$_2$CO(3$_{0,3}$--2$_{0,3}$), and H…

    Submitted 21 May, 2024; originally announced May 2024.

  32. arXiv:2405.12278  [pdf, other]

    cond-mat.str-el quant-ph

    Emergent Majorana metal from a chiral spin liquid

    Authors: Penghao Zhu, Shi Feng, Kang Wang, Tao Xiang, Nandini Trivedi

    Abstract: We propose a novel mechanism to explain the emergence of an intermediate gapless spin liquid phase (IGP) in the antiferromagnetic Kitaev model in an externally applied magnetic field, sandwiched between the well-known gapped chiral spin liquid (CSL) and the gapped partially polarized (PP) phase. We propose in moderate fields $π$-fluxes nucleate in the ground state and can trap Majorana zero modes.…

    Submitted 20 May, 2024; originally announced May 2024.

    Comments: 6+13 pages, 4+7 figures

  33. arXiv:2405.11826  [pdf, other]

    astro-ph.IM hep-ex physics.ins-det

    Data quality control system and long-term performance monitor of the LHAASO-KM2A

    Authors: Zhen Cao, F. Aharonian, Axikegu, Y. X. Bai, Y. W. Bao, D. Bastieri, X. J. Bi, Y. J. Bi, W. Bian, A. V. Bukevich, Q. Cao, W. Y. Cao, Zhe Cao, J. Chang, J. F. Chang, A. M. Chen, E. S. Chen, H. X. Chen, Liang Chen, Lin Chen, Long Chen, M. J. Chen, M. L. Chen, Q. H. Chen, S. Chen, et al. (263 additional authors not shown)

    Abstract: The KM2A is the largest sub-array of the Large High Altitude Air Shower Observatory (LHAASO). It consists of 5216 electromagnetic particle detectors (EDs) and 1188 muon detectors (MDs). The data recorded by the EDs and MDs are used to reconstruct primary information of cosmic ray and gamma-ray showers. This information is used for physical analysis in gamma-ray astronomy and cosmic ray physics. To…

    Submitted 13 June, 2024; v1 submitted 20 May, 2024; originally announced May 2024.

    Comments: 15 pages, 9 figures

  34. arXiv:2405.10558  [pdf, other]

    cs.SI

    CACL: Community-Aware Heterogeneous Graph Contrastive Learning for Social Media Bot Detection

    Authors: Sirry Chen, Shuo Feng, Songsong Liang, Chen-Chen Zong, Jing Li, Piji Li

    Abstract: Social media bot detection is increasingly crucial with the rise of social media platforms. Existing methods predominantly construct social networks as graph and utilize graph neural networks (GNNs) for bot detection. However, most of these methods focus on how to improve the performance of GNNs while neglecting the community structure within social networks. Moreover, GNNs based methods still fac…

    Submitted 3 June, 2024; v1 submitted 17 May, 2024; originally announced May 2024.

    Comments: Accepted by ACL 2024 findings

  35. arXiv:2405.10343  [pdf, other]

    q-bio.BM cs.AI cs.LG

    UniCorn: A Unified Contrastive Learning Approach for Multi-view Molecular Representation Learning

    Authors: Shikun Feng, Yuyan Ni, Minghao Li, Yanwen Huang, Zhi-Ming Ma, Wei-Ying Ma, Yanyan Lan

    Abstract: Recently, a noticeable trend has emerged in developing pre-trained foundation models in the domains of CV and NLP. However, for molecular pre-training, there lacks a universal model capable of effectively applying to various categories of molecular tasks, since existing prevalent pre-training methods exhibit effectiveness for specific types of downstream tasks. Furthermore, the lack of profound un…

    Submitted 15 May, 2024; originally announced May 2024.

  36. arXiv:2405.09220  [pdf, other]

    cs.LG cs.AI cs.CL

    ALPINE: Unveiling the Planning Capability of Autoregressive Learning in Language Models

    Authors: Siwei Wang, Yifei Shen, Shi Feng, Haoran Sun, Shang-Hua Teng, Wei Chen

    Abstract: In this paper, we present the findings of our Project ALPINE which stands for ``Autoregressive Learning for Planning In NEtworks." Project ALPINE initiates a theoretical investigation into the development of planning capabilities in Transformer-based language models through their autoregressive learning mechanisms, aiming to identify any potential limitations in their planning abilities. We abstra…

    Submitted 27 May, 2024; v1 submitted 15 May, 2024; originally announced May 2024.

  37. arXiv:2405.08306  [pdf, other]

    math.OC eess.SY

    Flight Path Optimization with Optimal Control Method

    Authors: Gaofeng Su, Xi Cheng, Siyuan Feng, Ke Liu, Jilin Song, Jianan Chen, Chen Zhu, Hui Lin

    Abstract: This paper is based on a crucial issue in the aviation world: how to optimize the trajectory and controls given to the aircraft in order to optimize flight time and fuel consumption. This study aims to provide elements of a response to this problem and to define, under certain simplifying assumptions, an optimal response, using Constrained Finite Time Optimal Control(CFTOC). The first step is to d…

    Submitted 14 May, 2024; originally announced May 2024.

  38. arXiv:2405.08298  [pdf, other]

    cs.LG

    Deep Reinforcement Learning for Real-Time Ground Delay Program Revision and Corresponding Flight Delay Assignments

    Authors: Ke Liu, Fan Hu, Hui Lin, Xi Cheng, Jianan Chen, Jilin Song, Siyuan Feng, Gaofeng Su, Chen Zhu

    Abstract: This paper explores the optimization of Ground Delay Programs (GDP), a prevalent Traffic Management Initiative used in Air Traffic Management (ATM) to reconcile capacity and demand discrepancies at airports. Employing Reinforcement Learning (RL) to manage the inherent uncertainties in the national airspace system-such as weather variability, fluctuating flight demands, and airport arrival rates-we…

    Submitted 13 May, 2024; originally announced May 2024.

  39. arXiv:2405.08293  [pdf, other]

    cs.LG

    Airport Delay Prediction with Temporal Fusion Transformers

    Authors: Ke Liu, Kaijing Ding, Xi Cheng, Jianan Chen, Siyuan Feng, Hui Lin, Jilin Song, Chen Zhu

    Abstract: Since flight delay hurts passengers, airlines, and airports, its prediction becomes crucial for the decision-making of all stakeholders in the aviation industry and thus has been attempted by various previous research. However, previous delay predictions are often categorical and at a highly aggregated level. To improve that, this study proposes to apply the novel Temporal Fusion Transformer model…

    Submitted 13 May, 2024; originally announced May 2024.

  40. arXiv:2405.07691  [pdf, other]

    astro-ph.HE

    Discovery of Very-high-energy Gamma-ray Emissions from the Low Luminosity AGN NGC 4278 by LHAASO

    Authors: Zhen Cao, F. Aharonian, Q. An, Axikegu, Y. X. Bai, Y. W. Bao, D. Bastieri, X. J. Bi, Y. J. Bi, J. T. Cai, Q. Cao, W. Y. Cao, Zhe Cao, J. Chang, J. F. Chang, A. M. Chen, E. S. Chen, Liang Chen, Lin Chen, Long Chen, M. J. Chen, M. L. Chen, Q. H. Chen, S. H. Chen, S. Z. Chen, et al. (255 additional authors not shown)

    Abstract: The first source catalog of the Large High Altitude Air Shower Observatory reported the detection of a very-high-energy gamma-ray source, 1LHAASO J1219+2915. In this paper, a further detailed study of the spectral and temporal behavior of this point-like source has been carried out. The best-fit position of the TeV source ($\rm{RA}=185.05^{\circ}\pm0.04^{\circ}$, $\rm{Dec}=29.25^{\circ}\pm0.03^{\circ}$) i…

    Submitted 13 May, 2024; originally announced May 2024.

    Comments: 11 pages, 5 figures

  41. arXiv:2405.07229  [pdf, other]

    cs.MM

    MM-InstructEval: Zero-Shot Evaluation of (Multimodal) Large Language Models on Multimodal Reasoning Tasks

    Authors: Xiaocui Yang, Wenfang Wu, Shi Feng, Ming Wang, Daling Wang, Yang Li, Qi Sun, Yifei Zhang, Xiaoming Fu, Soujanya Poria

    Abstract: The rising popularity of multimodal large language models (MLLMs) has sparked a significant increase in research dedicated to evaluating these models. However, current evaluation studies predominantly concentrate on the ability of models to comprehend and reason within a unimodal (vision-only) context, overlooking critical performance evaluations in complex multimodal reasoning tasks that integrat…

    Submitted 12 May, 2024; originally announced May 2024.

    Comments: Under review, the new version of MM-BigBench: arXiv:2310.09036

  42. arXiv:2405.07090  [pdf, other]

    cs.HC

    MUD: Towards a Large-Scale and Noise-Filtered UI Dataset for Modern Style UI Modeling

    Authors: Sidong Feng, Suyu Ma, Han Wang, David Kong, Chunyang Chen

    Abstract: The importance of computational modeling of mobile user interfaces (UIs) is undeniable. However, these require a high-quality UI dataset. Existing datasets are often outdated, collected years ago, and are frequently noisy with mismatches in their visual representation. This presents challenges in modeling UI understanding in the wild. This paper introduces a novel approach to automatically mine UI…

    Submitted 11 May, 2024; originally announced May 2024.

  43. arXiv:2405.06705  [pdf, other]

    cs.CL cs.AI

    LLMs can Find Mathematical Reasoning Mistakes by Pedagogical Chain-of-Thought

    Authors: Zhuoxuan Jiang, Haoyuan Peng, Shanshan Feng, Fan Li, Dongsheng Li

    Abstract: Self-correction is emerging as a promising approach to mitigate the issue of hallucination in Large Language Models (LLMs). To facilitate effective self-correction, recent research has proposed mistake detection as its initial step. However, current literature suggests that LLMs often struggle with reliably identifying reasoning mistakes when using simplistic prompting strategies. To address this…

    Submitted 9 May, 2024; originally announced May 2024.

    Comments: To appear at IJCAI 2024

  44. arXiv:2405.06230  [pdf]

    eess.IV

    Fire in SRRN: Next-Gen 3D Temperature Field Reconstruction Technology

    Authors: Shenxiang Feng, Xiaojian Hao, Xiaodong Huang, Pan Pei, Tong Wei, Chenyang Xu

    Abstract: In aerospace and energy engineering, accurate 3D combustion field temperature measurement is critical. The resolution of traditional methods based on algebraic iteration is limited by the initial voxel division. This study introduces a novel method for reconstructing three-dimensional temperature fields using the Spatial Radiation Representation Network (SRRN). This method utilizes the flame therm…

    Submitted 9 May, 2024; originally announced May 2024.

  45. arXiv:2404.18999  [pdf, other]

    astro-ph.GA

    CO Observations of Early-mid Stage Major-mergers in MaNGA Survey

    Authors: Qingzheng Yu, Taotao Fang, Cong Kevin Xu, Shuai Feng, Siyi Feng, Yu Gao, Xue-Jian Jiang, Ute Lisenfeld

    Abstract: We present a study of the molecular gas in early-mid stage major-mergers, with a sample of 43 major-merger galaxy pairs selected from the Mapping Nearby Galaxies at Apache Point Observatory (MaNGA) survey and a control sample of 195 isolated galaxies selected from the xCOLD GASS survey. Adopting kinematic asymmetry as a new effective indicator to describe the merger stage, we aim to study the role…

    Submitted 29 April, 2024; originally announced April 2024.

    Comments: 25 pages, 12 figures, 5 tables, accepted for publication in ApJS

  46. arXiv:2404.18696  [pdf, ps, other]

    physics.optics physics.app-ph

    Enhanced second harmonic generation in high-$Q$ all-dielectric metasurfaces with backward frequency conversion

    Authors: Xu Tu, Siqi Feng, Jiajun Li, Yangguang Xing, Feng Wu, Tingting Liu, Shuyuan Xiao

    Abstract: Here we employ the quasi-bound state in the continuum (quasi-BIC) resonance in all-dielectric metasurfaces for efficient nonlinear processes in consideration of the backward frequency conversion. We theoretically study the second-harmonic generation (SHG) from symmetry-broken AlGaAs metasurfaces and reveal the efficiency enhancement empowered by high-$Q$ quasi-BIC resonances. By introducing the co…

    Submitted 11 June, 2024; v1 submitted 29 April, 2024; originally announced April 2024.

    Journal ref: Physical Review A 109 (6), 063522 (2024)

  47. arXiv:2404.16034  [pdf, ps, other]

    math.PR

    Central limit theorems associated with the hierarchical Dirichlet process

    Authors: Shui Feng, J. E. Paguyo

    Abstract: The Dirichlet process is a discrete random measure specified by a concentration parameter and a base distribution, and is used as a prior distribution in Bayesian nonparametrics. The hierarchical Dirichlet process generalizes the Dirichlet process by randomizing the base distribution through a draw from another Dirichlet process. It is motivated by the study of groups of clustered data, where the…

    Submitted 24 April, 2024; originally announced April 2024.

    Comments: 30 pages. Comments welcome

    MSC Class: 60G57; 62F15

  48. arXiv:2404.14701  [pdf, other]

    cs.LG

    Deep neural networks for choice analysis: Enhancing behavioral regularity with gradient regularization

    Authors: Siqi Feng, Rui Yao, Stephane Hess, Ricardo A. Daziano, Timothy Brathwaite, Joan Walker, Shenhao Wang

    Abstract: Deep neural networks (DNNs) frequently present behaviorally irregular patterns, significantly limiting their practical potentials and theoretical validity in travel behavior modeling. This study proposes strong and weak behavioral regularities as novel metrics to evaluate the monotonicity of individual demand functions (a.k.a. law of demand), and further designs a constrained optimization framewor…

    Submitted 22 April, 2024; originally announced April 2024.

  49. arXiv:2404.13748  [pdf, other]

    eess.SY math.NA

    Application of Kalman Filter in Stochastic Differential Equations

    Authors: Wencheng Bao, Shi Feng, Kaiwen Zhang

    Abstract: In areas such as finance, engineering, and science, we often face situations that change quickly and unpredictably. These situations are tough to handle and require special tools and methods capable of understanding and predicting what might happen next. Stochastic Differential Equations (SDEs) are renowned for modeling and analyzing real-world dynamical systems. However, obtaining the parameters,…

    Submitted 21 April, 2024; originally announced April 2024.

    Comments: 18 pages, 14 figures

  50. arXiv:2404.13076  [pdf, other]

    cs.CL cs.AI

    LLM Evaluators Recognize and Favor Their Own Generations

    Authors: Arjun Panickssery, Samuel R. Bowman, Shi Feng

    Abstract: Self-evaluation using large language models (LLMs) has proven valuable not only in benchmarking but also methods like reward modeling, constitutional AI, and self-refinement. But new biases are introduced due to the same LLM acting as both the evaluator and the evaluatee. One such bias is self-preference, where an LLM evaluator scores its own outputs higher than others' while human annotators cons…

    Submitted 15 April, 2024; originally announced April 2024.