subscribe to arXiv mailings

DART: Deep Adversarial Automated Red Teaming for LLM Safety

Authors: Bojian Jiang, Yi Jing, Tianhao Shen, Qing Yang, Deyi Xiong

Abstract: Manual Red teaming is a commonly-used method to identify vulnerabilities in large language models (LLMs), which, is costly and unscalable. In contrast, automated red teaming uses a Red LLM to automatically generate adversarial prompts to the Target LLM, offering a scalable way for safety vulnerability detection. However, the difficulty of building a powerful automated Red LLM lies in the fact that… ▽ More Manual Red teaming is a commonly-used method to identify vulnerabilities in large language models (LLMs), which, is costly and unscalable. In contrast, automated red teaming uses a Red LLM to automatically generate adversarial prompts to the Target LLM, offering a scalable way for safety vulnerability detection. However, the difficulty of building a powerful automated Red LLM lies in the fact that the safety vulnerabilities of the Target LLM are dynamically changing with the evolution of the Target LLM. To mitigate this issue, we propose a Deep Adversarial Automated Red Teaming (DART) framework in which the Red LLM and Target LLM are deeply and dynamically interacting with each other in an iterative manner. In each iteration, in order to generate successful attacks as many as possible, the Red LLM not only takes into account the responses from the Target LLM, but also adversarially adjust its attacking directions by monitoring the global diversity of generated attacks across multiple iterations. Simultaneously, to explore dynamically changing safety vulnerabilities of the Target LLM, we allow the Target LLM to enhance its safety via an active learning based data selection mechanism. Experimential results demonstrate that DART significantly reduces the safety risk of the target LLM. For human evaluation on Anthropic Harmless dataset, compared to the instruction-tuning target LLM, DART eliminates the violation risks by 53.4\%. We will release the datasets and codes of DART soon. △ Less

Submitted 4 July, 2024; originally announced July 2024.

arXiv:2407.03409 [pdf, other]

From Halos to Galaxies. IX. Estimate of Halo Assembly History for SDSS Galaxy Groups

Authors: Cheqiu Lyu, Yingjie Peng, Yipeng Jing, Xiaohu Yang, Luis C. Ho, Alvio Renzini, Dingyi Zhao, Filippo Mannucci, Houjun Mo, Kai Wang, Bitao Wang, Bingxiao Xu, Jing Dou, Anna R. Gallazzi, Qiusheng Gu, Roberto Maiolino, Enci Wang, Feng Yuan

Abstract: The properties of the galaxies are tightly connected to their host halo mass and halo assembly history. Accurate measurement of the halo assembly history in observation is challenging but crucial to the understanding of galaxy formation and evolution. The stellar-to-halo mass ratio ($M_*/M_{\mathrm{h}}$) for the centrals has often been used to indicate the halo assembly time $t_{\mathrm{h,50}}$ of… ▽ More The properties of the galaxies are tightly connected to their host halo mass and halo assembly history. Accurate measurement of the halo assembly history in observation is challenging but crucial to the understanding of galaxy formation and evolution. The stellar-to-halo mass ratio ($M_*/M_{\mathrm{h}}$) for the centrals has often been used to indicate the halo assembly time $t_{\mathrm{h,50}}$ of the group, where $t_{\mathrm{h,50}}$ is the lookback time at which a halo has assembled half of its present-day virial mass. Using mock data from the semi-analytic models, we find that $M_*/M_{\mathrm{h}}$ shows a significant scatter with $t_{\mathrm{h,50}}$, with a strong systematic difference between the group with a star-forming central (blue group) and passive central (red group). To improve the accuracy, we develop machine-learning models to estimate $t_{\mathrm{h,50}}$ for galaxy groups using only observable quantities in the mocks. Since star-formation quenching will decouple the co-growth of the dark matter and baryon, we train our models separately for blue and red groups. Our models have successfully recovered $t_{\mathrm{h,50}}$, within an accuracy of $\sim$ 1.09 Gyr. With careful calibrations of individual observable quantities in the mocks with SDSS observations, we apply the trained models to the SDSS Yang et al. groups and derive the $t_{\mathrm{h,50}}$ for each group for the first time. The derived SDSS $t_{\mathrm{h,50}}$ distributions are in good agreement with that in the mocks, in particular for blue groups. The derived halo assembly history, together with the halo mass, make an important step forward in studying the halo-galaxy connections in observation. △ Less

Submitted 3 July, 2024; originally announced July 2024.

Comments: 18 pages, 7 figures. Accepted by ApJ

arXiv:2406.08278 [pdf, other]

doi 10.1088/1674-4527/ad5398

HiFAST : An HI Data Calibration and Imaging Pipeline for FAST II. Flux Density Calibration

Authors: Ziming Liu, Jie Wang, Yingjie Jing, Zhi-Yu Zhang, Chen Xu, Tiantian Liang, Qingze Chen, Ningyu Tang, Qingliang Yang

Abstract: Accurate flux density calibration is essential for precise analysis and interpretation of observations across different observation modes and instruments. In this research, we firstly introduce the flux calibration model incorporated in HIFAST pipeline, designed for processing HI 21-cm spectra. Furthermore, we investigate different calibration techniques and assess the dependence of the gain param… ▽ More Accurate flux density calibration is essential for precise analysis and interpretation of observations across different observation modes and instruments. In this research, we firstly introduce the flux calibration model incorporated in HIFAST pipeline, designed for processing HI 21-cm spectra. Furthermore, we investigate different calibration techniques and assess the dependence of the gain parameter on the time and environmental factors. A comparison is carried out in various observation modes (e.g. tracking and scanning modes) to determine the flux density gain ($G$), revealing insignificant discrepancies in $G$ among different methods. Long-term monitoring data shows a linear correlation between $G$ and atmospheric temperature. After subtracting the $G$--Temperature dependence, the dispersion of $G$ is reduced to $<$3% over a one-year time scale. The stability of the receiver response of FAST is considered sufficient to facilitate HI observations that can accommodate a moderate error in flux calibration (e.g., $>\sim5\%$) when utilizing a constant $G$ for calibration purposes. Our study will serve as a useful addition to the results provided by Jiang et al. (2020). Detailed measurement of $G$ for the 19 beams of FAST, covering the frequency range 1000 MHz -- 1500 MHz can be found on the HIFAST homepage: https://hifast.readthedocs.io/fluxgain. △ Less

Submitted 12 June, 2024; originally announced June 2024.

Comments: 14 pages, 15 figures, accepted by RAA

arXiv:2406.02215 [pdf, other]

doi 10.1088/1674-4527/ad5397

Observation of HI around three satellite galaxies of the M31 with the FAST: Andromeda II, NGC 205, and NGC 185

Authors: Ziming Liu, Jie Wang, Yingjie Jing, Chen Xu, Tiantian Liang, Qingze Chen, Zerui Liu, Zhipeng Hou, Yougang Wang

Abstract: With the exceptional sensitivity of the Five-hundred-meter Aperture Spherical radio Telescope (FAST), we conducted observations of the neutral hydrogen (HI) in the Circular Galactical Medium (CGM) of Andromeda's (M31) satellite galaxies, specifically Andromeda II, NGC 205, and NGC 185. Initially, three drift scans were executed for these satellites, with a detection limit of $4\times10^{18}$ cm… ▽ More With the exceptional sensitivity of the Five-hundred-meter Aperture Spherical radio Telescope (FAST), we conducted observations of the neutral hydrogen (HI) in the Circular Galactical Medium (CGM) of Andromeda's (M31) satellite galaxies, specifically Andromeda II, NGC 205, and NGC 185. Initially, three drift scans were executed for these satellites, with a detection limit of $4\times10^{18}$ cm$^{-2}$ ( approximately $1.88\times10^3 M_{\odot}$ of HI mass), followed by a more in-depth scan of a specific region. We discovered a C-shaped HI arc structure sharing a position and line-of-sight velocity similar to a stellar ring structure around Andromeda II, hinting at a potential connection with Andromeda II. In the context of NGC 205, we identified two mass concentrations in the northeast direction, which could be indicative of tidal streams resulting from the interaction between this galaxy and M31. These new lumps discovered could be very helpful in solving the missing interstellar medium (ISM) problem for NGC 205. Observations regarding NGC 185 are consistent with previous studies, and we did not detect any additional HI material around this galaxy. These observational results enhance our understanding of the evolution of these satellite galaxies and provide insight into their historical interactions with the M31 galaxy. △ Less

Submitted 4 June, 2024; originally announced June 2024.

Comments: 9 pages, 7 figures, accepted by RAA

arXiv:2405.16484 [pdf, other]

Accurate Measurement of the Lensing Magnification by BOSS CMASS Galaxies and Its Implications for Cosmology and Dark Matter

Authors: Kun Xu, Y. P. Jing, Hongyu Gao, Xiaolin Luo, Ming Li

Abstract: Magnification serves as an independent and complementary gravitational lensing measurement to shear. We develop a novel method to achieve an accurate and robust magnification measurement around BOSS CMASS galaxies across physical scales of $0.016h^{-1}{\rm Mpc} < r_{\rm p} < 10h^{-1}{\rm Mpc}$. We first measure the excess total flux density $δM$ of the source galaxies in deep DECaLS photometric ca… ▽ More Magnification serves as an independent and complementary gravitational lensing measurement to shear. We develop a novel method to achieve an accurate and robust magnification measurement around BOSS CMASS galaxies across physical scales of $0.016h^{-1}{\rm Mpc} < r_{\rm p} < 10h^{-1}{\rm Mpc}$. We first measure the excess total flux density $δM$ of the source galaxies in deep DECaLS photometric catalog that are lensed by CMASS galaxies. We convert $δM$ to magnification $μ$ by establishing the $δμ-δM$ relation using a deeper photometric sample. By comparing magnification measurements in three optical bands ($grz$), we constrain the dust attenuation curve and its radial distribution, discovering a steep attenuation curve in the circumgalactic medium of CMASS galaxies. We further compare dust-corrected magnification measurements to model predictions from high-resolution dark matter-only (DMO) simulations in WMAP and Planck cosmologies, as well as the hydrodynamic simulation \texttt{TNG300-1}, using precise galaxy-halo connections from the Photometric objects Around Cosmic webs method and the accurate ray-tracing algorithm \texttt{P3MLens}. For $r_{\rm p} > 70h^{-1}$ kpc, our magnification measurements are in good agreement with both WMAP and Planck cosmologies. However, at $r_{\rm p} < 70h^{-1}$ kpc, we observe an excess magnification signal, which is higher than the DMO model in Planck cosmology at $2.8σ$ and would be exacerbated if significant baryon feedback is included. Implications of the potential small scale discrepancy for the nature of dark matter and for the processes governing galaxy formation are discussed. △ Less

Submitted 9 July, 2024; v1 submitted 26 May, 2024; originally announced May 2024.

Comments: 25 pages, 19 figures. Main results in Figure 9 (dust) and Figure 18 (matter). Accepted for publication in ApJ

arXiv:2405.05554 [pdf, other]

RELICS: a REactor neutrino LIquid xenon Coherent elastic Scattering experiment

Authors: Chang Cai, Guocai Chen, Jiangyu Chen, Rundong Fang, Fei Gao, Xiaoran Guo, Jiheng Guo, Tingyi He, Chengjie Jia, Gaojun Jin, Yipin Jing, Gaojun Ju, Yang Lei, Jiayi Li, Kaihang Li, Meng Li, Minhua Li, Shengchao Li, Siyin Li, Tao Li, Qing Lin, Jiajun Liu, Minghao Liu, Sheng Lv, Guang Luo , et al. (24 additional authors not shown)

Abstract: Coherent elastic neutrino-nucleus scattering (CEvNS) provides a unique probe for neutrino properties Beyond the Standard Model (BSM) physics. REactor neutrino LIquid xenon Coherent Scattering experiment (RELICS), a proposed reactor neutrino program using liquid xenon time projection chamber (LXeTPC) technology, aims to investigate the CEvNS process of antineutrinos off xenon atomic nuclei. In this… ▽ More Coherent elastic neutrino-nucleus scattering (CEvNS) provides a unique probe for neutrino properties Beyond the Standard Model (BSM) physics. REactor neutrino LIquid xenon Coherent Scattering experiment (RELICS), a proposed reactor neutrino program using liquid xenon time projection chamber (LXeTPC) technology, aims to investigate the CEvNS process of antineutrinos off xenon atomic nuclei. In this work, the design of the experiment is studied and optimized based on Monte Carlo (MC) simulations. To achieve a sufficiently low energy threshold for CEvNS detection, an ionization-only analysis channel is adopted for RELICS. A high emission rate of delayed electrons after a big ionization signal is the major background, leading to an analysis threshold of 120 photo-electrons in the CEvNS search. The second largest background, nuclear recoils induced by cosmic-ray neutrons, is suppressed via a passive water shield. The physics potential of RELICS is explored with a 32 kg-yr exposure at a baseline of 25 m from a reactor core with a 3 GW thermal power. In an energy range of 120 to 240 PE, we expect 4902.4 CEvNS and 1318.4 background events. The sensitivity of RELICS to the weak mixing angle is investigated at a low momentum transfer. Our study shows that RELICS can further improve the constraints on the non-standard neutrino interaction (NSI) compared to the current best results. △ Less

Submitted 12 June, 2024; v1 submitted 9 May, 2024; originally announced May 2024.

arXiv:2403.20014 [pdf, other]

PURPLE: Making a Large Language Model a Better SQL Writer

Authors: Tonghui Ren, Yuankai Fan, Zhenying He, Ren Huang, Jiaqi Dai, Can Huang, Yinan Jing, Kai Zhang, Yifan Yang, X. Sean Wang

Abstract: Large Language Model (LLM) techniques play an increasingly important role in Natural Language to SQL (NL2SQL) translation. LLMs trained by extensive corpora have strong natural language understanding and basic SQL generation abilities without additional tuning specific to NL2SQL tasks. Existing LLMs-based NL2SQL approaches try to improve the translation by enhancing the LLMs with an emphasis on us… ▽ More Large Language Model (LLM) techniques play an increasingly important role in Natural Language to SQL (NL2SQL) translation. LLMs trained by extensive corpora have strong natural language understanding and basic SQL generation abilities without additional tuning specific to NL2SQL tasks. Existing LLMs-based NL2SQL approaches try to improve the translation by enhancing the LLMs with an emphasis on user intention understanding. However, LLMs sometimes fail to generate appropriate SQL due to their lack of knowledge in organizing complex logical operator composition. A promising method is to input the LLMs with demonstrations, which include known NL2SQL translations from various databases. LLMs can learn to organize operator compositions from the input demonstrations for the given task. In this paper, we propose PURPLE (Pre-trained models Utilized to Retrieve Prompts for Logical Enhancement), which improves accuracy by retrieving demonstrations containing the requisite logical operator composition for the NL2SQL task on hand, thereby guiding LLMs to produce better SQL translation. PURPLE achieves a new state-of-the-art performance of 80.5% exact-set match accuracy and 87.8% execution match accuracy on the validation set of the popular NL2SQL benchmark Spider. PURPLE maintains high accuracy across diverse benchmarks, budgetary constraints, and various LLMs, showing robustness and cost-effectiveness. △ Less

Submitted 29 March, 2024; originally announced March 2024.

Comments: 12 pages, accepted by ICDE 2024 (40th IEEE International Conference on Data Engineering)

arXiv:2403.19275 [pdf, other]

Knowledge Boundary and Persona Dynamic Shape A Better Social Media Agent

Authors: Junkai Zhou, Liang Pang, Ya Jing, Jia Gu, Huawei Shen, Xueqi Cheng

Abstract: Constructing personalized and anthropomorphic agents holds significant importance in the simulation of social networks. However, there are still two key problems in existing works: the agent possesses world knowledge that does not belong to its personas, and it cannot eliminate the interference of diverse persona information on current actions, which reduces the personalization and anthropomorphis… ▽ More Constructing personalized and anthropomorphic agents holds significant importance in the simulation of social networks. However, there are still two key problems in existing works: the agent possesses world knowledge that does not belong to its personas, and it cannot eliminate the interference of diverse persona information on current actions, which reduces the personalization and anthropomorphism of the agent. To solve the above problems, we construct the social media agent based on personalized knowledge and dynamic persona information. For personalized knowledge, we add external knowledge sources and match them with the persona information of agents, thereby giving the agent personalized world knowledge. For dynamic persona information, we use current action information to internally retrieve the persona information of the agent, thereby reducing the interference of diverse persona information on the current action. To make the agent suitable for social media, we design five basic modules for it: persona, planning, action, memory and reflection. To provide an interaction and verification environment for the agent, we build a social media simulation sandbox. In the experimental verification, automatic and human evaluations demonstrated the effectiveness of the agent we constructed. △ Less

Submitted 2 April, 2024; v1 submitted 28 March, 2024; originally announced March 2024.

arXiv:2403.17745 [pdf, other]

Leave No Patient Behind: Enhancing Medication Recommendation for Rare Disease Patients

Authors: Zihao Zhao, Yi Jing, Fuli Feng, Jiancan Wu, Chongming Gao, Xiangnan He

Abstract: Medication recommendation systems have gained significant attention in healthcare as a means of providing tailored and effective drug combinations based on patients' clinical information. However, existing approaches often suffer from fairness issues, as recommendations tend to be more accurate for patients with common diseases compared to those with rare conditions. In this paper, we propose a no… ▽ More Medication recommendation systems have gained significant attention in healthcare as a means of providing tailored and effective drug combinations based on patients' clinical information. However, existing approaches often suffer from fairness issues, as recommendations tend to be more accurate for patients with common diseases compared to those with rare conditions. In this paper, we propose a novel model called Robust and Accurate REcommendations for Medication (RAREMed), which leverages the pretrain-finetune learning paradigm to enhance accuracy for rare diseases. RAREMed employs a transformer encoder with a unified input sequence approach to capture complex relationships among disease and procedure codes. Additionally, it introduces two self-supervised pre-training tasks, namely Sequence Matching Prediction (SMP) and Self Reconstruction (SR), to learn specialized medication needs and interrelations among clinical codes. Experimental results on two real-world datasets demonstrate that RAREMed provides accurate drug sets for both rare and common disease patients, thereby mitigating unfairness in medication recommendation systems. △ Less

Submitted 26 March, 2024; originally announced March 2024.

arXiv:2403.16208 [pdf, ps, other]

Convergence analysis of OT-Flow for sample generation

Authors: Yang Jing, Lei Li

Abstract: Deep generative models aim to learn the underlying distribution of data and generate new ones. Despite the diversity of generative models and their high-quality generation performance in practice, most of them lack rigorous theoretical convergence proofs. In this work, we aim to establish some convergence results for OT-Flow, one of the deep generative models. First, by reformulating the framework… ▽ More Deep generative models aim to learn the underlying distribution of data and generate new ones. Despite the diversity of generative models and their high-quality generation performance in practice, most of them lack rigorous theoretical convergence proofs. In this work, we aim to establish some convergence results for OT-Flow, one of the deep generative models. First, by reformulating the framework of OT-Flow model, we establish the $Γ$-convergence of the formulation of OT-flow to the corresponding optimal transport (OT) problem as the regularization term parameter $α$ goes to infinity. Second, since the loss function will be approximated by Monte Carlo method in training, we established the convergence between the discrete loss function and the continuous one when the sample number $N$ goes to infinity as well. Meanwhile, the approximation capability of the neural network provides an upper bound for the discrete loss function of the minimizers. The proofs in both aspects provide convincing assurances for OT-Flow. △ Less

Submitted 24 March, 2024; originally announced March 2024.

arXiv:2403.07165 [pdf, other]

Localized interfacial Phonon Modes at the Electronic Axion Domain Wall

Authors: Abhinava Chatterjee, Mourad Oudich, Yun Jing, Chao-Xing Liu

Abstract: The most salient feature of electronic topological states of matter is the existence of exotic electronic modes localized at the surface or interface of a sample. In this work, in an electronic topological system, we demonstrate the existence of localized phonon modes at the domain wall between topologically trivial and non-trivial regions, in addition to the localized interfacial electronic state… ▽ More The most salient feature of electronic topological states of matter is the existence of exotic electronic modes localized at the surface or interface of a sample. In this work, in an electronic topological system, we demonstrate the existence of localized phonon modes at the domain wall between topologically trivial and non-trivial regions, in addition to the localized interfacial electronic states. In particular, we consider a theoretical model for the Dirac semimetal with a gap opened by external strains and study the phonon dynamics, which couples to electronic degrees of freedom via strong electron-phonon interaction. By treating the phonon modes as a pseudo-gauge field, we find that the axion type of terms for phonon dynamics can emerge in gapped Dirac semimetal model and lead to interfacial phonon modes localized at the domain wall between trivial and non-trivial regimes that possess the axion parameters 0 and π, respectively. We also discuss the physical properties and possible experimental probe of such interfacial phonon modes. △ Less

Submitted 11 March, 2024; originally announced March 2024.

arXiv:2403.00211 [pdf, other]

Trustworthy Self-Attention: Enabling the Network to Focus Only on the Most Relevant References

Authors: Yu Jing, Tan Yujuan, Ren Ao, Liu Duo

Abstract: The prediction of optical flow for occluded points is still a difficult problem that has not yet been solved. Recent methods use self-attention to find relevant non-occluded points as references for estimating the optical flow of occluded points based on the assumption of self-similarity. However, they rely on visual features of a single image and weak constraints, which are not sufficient to cons… ▽ More The prediction of optical flow for occluded points is still a difficult problem that has not yet been solved. Recent methods use self-attention to find relevant non-occluded points as references for estimating the optical flow of occluded points based on the assumption of self-similarity. However, they rely on visual features of a single image and weak constraints, which are not sufficient to constrain the trained network to focus on erroneous and weakly relevant reference points. We make full use of online occlusion recognition information to construct occlusion extended visual features and two strong constraints, allowing the network to learn to focus only on the most relevant references without requiring occlusion ground truth to participate in the training of the network. Our method adds very few network parameters to the original framework, making it very lightweight. Extensive experiments show that our model has the greatest cross-dataset generalization. Our method achieves much greater error reduction, 18.6%, 16.2%, and 20.1% for all points, non-occluded points, and occluded points respectively from the state-of-the-art GMA-base method, MATCHFlow(GMA), on Sintel Albedo pass. Furthermore, our model achieves state-of-the-art performance on the Sintel bench-marks, ranking \#1 among all published methods on Sintel clean pass. The code will be open-source. △ Less

Submitted 26 March, 2024; v1 submitted 29 February, 2024; originally announced March 2024.

Comments: Correct Figure 1

arXiv:2402.17144 [pdf, other]

Metasql: A Generate-then-Rank Framework for Natural Language to SQL Translation

Authors: Yuankai Fan, Zhenying He, Tonghui Ren, Can Huang, Yinan Jing, Kai Zhang, X. Sean Wang

Abstract: The Natural Language Interface to Databases (NLIDB) empowers non-technical users with database access through intuitive natural language (NL) interactions. Advanced approaches, utilizing neural sequence-to-sequence models or large-scale language models, typically employ auto-regressive decoding to generate unique SQL queries sequentially. While these translation models have greatly improved the ov… ▽ More The Natural Language Interface to Databases (NLIDB) empowers non-technical users with database access through intuitive natural language (NL) interactions. Advanced approaches, utilizing neural sequence-to-sequence models or large-scale language models, typically employ auto-regressive decoding to generate unique SQL queries sequentially. While these translation models have greatly improved the overall translation accuracy, surpassing 70% on NLIDB benchmarks, the use of auto-regressive decoding to generate single SQL queries may result in sub-optimal outputs, potentially leading to erroneous translations. In this paper, we propose Metasql, a unified generate-then-rank framework that can be flexibly incorporated with existing NLIDBs to consistently improve their translation accuracy. Metasql introduces query metadata to control the generation of better SQL query candidates and uses learning-to-rank algorithms to retrieve globally optimized queries. Specifically, Metasql first breaks down the meaning of the given NL query into a set of possible query metadata, representing the basic concepts of the semantics. These metadata are then used as language constraints to steer the underlying translation model toward generating a set of candidate SQL queries. Finally, Metasql ranks the candidates to identify the best matching one for the given NL query. Extensive experiments are performed to study Metasql on two public NLIDB benchmarks. The results show that the performance of the translation models can be effectively improved using Metasql. △ Less

Submitted 26 February, 2024; originally announced February 2024.

arXiv:2402.15140 [pdf, other]

A Relation-Interactive Approach for Message Passing in Hyper-relational Knowledge Graphs

Authors: Yonglin Jing

Abstract: Hyper-relational knowledge graphs (KGs) contain additional key-value pairs, providing more information about the relations. In many scenarios, the same relation can have distinct key-value pairs, making the original triple fact more recognizable and specific. Prior studies on hyper-relational KGs have established a solid standard method for hyper-relational graph encoding. In this work, we propose… ▽ More Hyper-relational knowledge graphs (KGs) contain additional key-value pairs, providing more information about the relations. In many scenarios, the same relation can have distinct key-value pairs, making the original triple fact more recognizable and specific. Prior studies on hyper-relational KGs have established a solid standard method for hyper-relational graph encoding. In this work, we propose a message-passing-based graph encoder with global relation structure awareness ability, which we call ReSaE. Compared to the prior state-of-the-art approach, ReSaE emphasizes the interaction of relations during message passing process and optimizes the readout structure for link prediction tasks. Overall, ReSaE gives a encoding solution for hyper-relational KGs and ensures stronger performance on downstream link prediction tasks. Our experiments demonstrate that ReSaE achieves state-of-the-art performance on multiple link prediction benchmarks. Furthermore, we also analyze the influence of different model structures on model performance. △ Less

Submitted 1 March, 2024; v1 submitted 23 February, 2024; originally announced February 2024.

arXiv:2402.14312 [pdf, other]

doi 10.61977/ati2024008

The Jiao Tong University Spectroscopic Telescope Project

Authors: JUST Team, Chengze Liu, Ying Zu, Fabo Feng, Zhaoyu Li, Yu Yu, Hua Bai, Xiangqun Cui, Bozhong Gu, Yizhou Gu, Jiaxin Han, Yonghui Hou, Zhongwen Hu, Hangxin Ji, Yipeng Jing, Wei Li, Zhaoxiang Qi, Xianyu Tan, Cairang Tian, Dehua Yang, Xiangyan Yuan, Chao Zhai, Congcong Zhang, Jun Zhang, Haotong Zhang , et al. (6 additional authors not shown)

Abstract: The Jiao Tong University Spectroscopic Telescope (JUST) is a 4.4-meter f/6.0 segmentedmirror telescope dedicated to spectroscopic observations. The JUST primary mirror is composed of 18 hexagonal segments, each with a diameter of 1.1 m. JUST provides two Nasmyth platforms for placing science instruments. One Nasmyth focus fits a field of view of 10 arcmin and the other has an extended field of vie… ▽ More The Jiao Tong University Spectroscopic Telescope (JUST) is a 4.4-meter f/6.0 segmentedmirror telescope dedicated to spectroscopic observations. The JUST primary mirror is composed of 18 hexagonal segments, each with a diameter of 1.1 m. JUST provides two Nasmyth platforms for placing science instruments. One Nasmyth focus fits a field of view of 10 arcmin and the other has an extended field of view of 1.2 deg with correction optics. A tertiary mirror is used to switch between the two Nasmyth foci. JUST will be installed at a site at Lenghu in Qinghai Province, China, and will conduct spectroscopic observations with three types of instruments to explore the dark universe, trace the dynamic universe, and search for exoplanets: (1) a multi-fiber (2000 fibers) medium-resolution spectrometer (R=4000-5000) to spectroscopically map galaxies and large-scale structure; (2) an integral field unit (IFU) array of 500 optical fibers and/or a long-slit spectrograph dedicated to fast follow-ups of transient sources for multimessenger astronomy; (3) a high-resolution spectrometer (R~100000) designed to identify Jupiter analogs and Earth-like planets, with the capability to characterize the atmospheres of hot exoplanets. △ Less

Submitted 29 February, 2024; v1 submitted 22 February, 2024; originally announced February 2024.

Comments: 28 pages, 6 figures

arXiv:2401.17364 [pdf, other]

doi 10.1007/s11433-023-2333-8

HiFAST: an HI data calibration and imaging pipeline for FAST

Authors: Yingjie Jing, Jie Wang, Chen Xu, Ziming Liu, Qingze Chen, Tiantian Liang, Jinlong Xu, Yixian Cao, Jing Wang, Huijie Hu, Chuan-Peng Zhang, Qi Guo, Liang Gao, Mei Ai, Hengqian Gan, Xuyang Gao, Jinlin Han, Ligang Hou, Zhipeng Hou, Peng Jiang, Xu Kong, Fujia Li, Zerui Liu, Li Shao, Hengxing Pan , et al. (8 additional authors not shown)

Abstract: The Five-hundred-meter Aperture Spherical radio Telescope (FAST) has the largest aperture and a 19-beam L-band receiver, making it powerful for investigating the neutral hydrogen atomic gas (HI) in the universe. We present HiFAST (https://hifast.readthedocs.io), a dedicated, modular, and self-contained calibration and imaging pipeline for processing the HI data of FAST. The pipeline consists of fr… ▽ More The Five-hundred-meter Aperture Spherical radio Telescope (FAST) has the largest aperture and a 19-beam L-band receiver, making it powerful for investigating the neutral hydrogen atomic gas (HI) in the universe. We present HiFAST (https://hifast.readthedocs.io), a dedicated, modular, and self-contained calibration and imaging pipeline for processing the HI data of FAST. The pipeline consists of frequency-dependent noise diode calibration, baseline fitting, standing wave removal using an FFT-based method, flux density calibration, stray radiation correction, and gridding to produce data cubes. These modules can be combined as needed to process the data from most FAST observation modes: tracking, drift scanning, On-The-Fly mapping, and most of their variants. With HiFAST, the RMS noises of the calibrated spectra from all 19 beams were only slightly (~ 5%) higher than the theoretical expectation. The results for the extended source M33 and the point sources are consistent with the results from Arecibo. The moment maps (0,1 and 2) of M33 agree well with the results from the Arecibo Galaxy Environment Survey (AGES) with a fractional difference of less than 10%. For a common sample of 221 sources with signal-to-noise ratio S/N >10 from the Arecibo Legacy Fast ALFA (ALFALFA) survey, the mean value of fractional difference in the integrated flux density, $S_{\mathrm{int}}$, between the two datasets is approximately 0.005 %, with a dispersion of 15.4%. Further checks on the integrated flux density of 23 sources with seven observations indicate that the variance in the flux density of the source with luminous objects ($S_\mathrm{int}$ $ > 2.5$ Jy km s$^{-1}$) is less than 5%. Our tests suggest that the FAST telescope, with the efficient, precise, and user-friendly pipeline HiFAST, will yield numerous significant scientific findings in the investigation of the HI in the universe. △ Less

Submitted 30 January, 2024; originally announced January 2024.

Comments: Accepted by SCPMA. 21 pages, 14 figures. The pipeline is accessible at https://hifast.readthedocs.io

arXiv:2401.14730 [pdf, other]

ELUCID VIII: Simulating the Coma Galaxy Cluster to Calibrate Model and Understand Feedback

Authors: Xiong Luo, Huiyuan Wang, Weiguang Cui, Houjun Mo, RenJie Li, Yipeng Jing, Neal Katz, Romeel Davé, Xiaohu Yang, Yangyao Chen, Hao Li, Shuiyao Huang

Abstract: We conducted an investigation of the Coma cluster of galaxies by running a series of constrained hydrodynamic simulations with GIZMO-SIMBA and GADGET-3, based on initial conditions reconstructed from the SDSS survey volume in the ELUCID project. We compared simulation predictions and observations for galaxies, ICM and IGM in and around the Coma cluster to constrain galaxy formation physics. Our re… ▽ More We conducted an investigation of the Coma cluster of galaxies by running a series of constrained hydrodynamic simulations with GIZMO-SIMBA and GADGET-3, based on initial conditions reconstructed from the SDSS survey volume in the ELUCID project. We compared simulation predictions and observations for galaxies, ICM and IGM in and around the Coma cluster to constrain galaxy formation physics. Our results demonstrate that this type of constrained investigation allows us to probe in more detail the implemented physical processes, because the comparison between simulations and observations is free of cosmic variance and hence can be conducted in a ''one-to-one'' manner. We found that an increase in the earlier star formation rate and the supernova feedback of the original GIZMO-SIMBA model is needed to match observational data on stellar, ISM and ICM metallicity. The simulations without AGN feedback can well reproduce the observational ICM electron density, temperature, and entropy profiles, ICM substructures, and the IGM temperature-density relation, while the ones with AGN feedback usually fail. However, one requires something like AGN feedback to reproduce a sufficiently large population of quiescent galaxies, particularly in low-density regions. The constrained simulations of the Coma cluster thus provide a test bed to understand processes that drive galaxy formation and evolution. △ Less

Submitted 26 March, 2024; v1 submitted 26 January, 2024; originally announced January 2024.

Comments: 21 pages, 9 figures

arXiv:2401.11997 [pdf, other]

PAC.V. The Roles of Mass and Environment in the Quenching of Galaxies

Authors: Yun Zheng, Kun Xu, Y. P. Jing, Donghai Zhao, Hongyu Gao, Xiaolin Luo, Jianxin Han, Yu Yu, Ming Li

Abstract: The roles that mass and environment play in the galaxy quenching are still under debate. Leveraging the Photometric objects Around Cosmic webs (PAC) method, we analyze the excess surface distribution $\bar{n}_2w_{\rm{p}}(r_{\rm{p}})$ of photometric galaxies in different color (rest-frame $u-r$) within the stellar mass range of $10^{9.0}M_{\odot}\sim10^{11.0}M_{\odot}$ around spectroscopic massive… ▽ More The roles that mass and environment play in the galaxy quenching are still under debate. Leveraging the Photometric objects Around Cosmic webs (PAC) method, we analyze the excess surface distribution $\bar{n}_2w_{\rm{p}}(r_{\rm{p}})$ of photometric galaxies in different color (rest-frame $u-r$) within the stellar mass range of $10^{9.0}M_{\odot}\sim10^{11.0}M_{\odot}$ around spectroscopic massive central galaxies ($10^{10.9}\sim10^{11.7}M_{\odot}$) at the redshift interval $0<z_s<0.7$, utilizing data from the Hyper SuprimeCam Subaru Strategic Program and the spectroscopic samples of Slogan Digital Sky Survey (i.e. Main, LOWZ and CMASS samples). We find that both mass and environment quenching contribute to the evolution of companion galaxies. To isolate the environment effect, we quantify the quenched fraction excess (QFE) of companion galaxies encircling massive central galaxies within $0.01h^{-1}{\rm{Mpc}}<r_{\rm{p}}<20h^{-1}\rm{Mpc}$, representing the surplus quenched fraction relative to the average. We find that the high density halo environment affects the star formation quenching up to about three times of the virial radius, and this effect becomes stronger at lower redshift. We also find that even after being scaled by the virial radius, the environment quenching efficiency is higher for more massive halos or for companion galaxies of higher stellar mass, though the trends are quite weak. We present a fitting formula that comprehensively captures the QFE across central and companion stellar mass bins, halo-centric distance bins, and redshift bins, offering a valuable tool for constraining galaxy formation models. Furthermore, we have made a quantitative comparison with Illustris-TNG that underscores some important differences, particularly in the excessive quenching of low-mass companion galaxies ($<10^{9.5}M_{\odot}$) by TNG. △ Less

Submitted 22 January, 2024; originally announced January 2024.

Comments: 23 pages, 14 figures. Submitted to ApJ. Comments welcome :-)

arXiv:2401.05879 [pdf]

YOIO: You Only Iterate Once by mining and fusing multiple necessary global information in the optical flow estimation

Authors: Yu Jing, Tan Yujuan, Ren Ao, Liu Duo

Abstract: Occlusions pose a significant challenge to optical flow algorithms that even rely on global evidences. We consider an occluded point to be one that is imaged in the reference frame but not in the next. Estimating the motion of these points is extremely difficult, particularly in the two-frame setting. Previous work only used the current frame as the only input, which could not guarantee providing… ▽ More Occlusions pose a significant challenge to optical flow algorithms that even rely on global evidences. We consider an occluded point to be one that is imaged in the reference frame but not in the next. Estimating the motion of these points is extremely difficult, particularly in the two-frame setting. Previous work only used the current frame as the only input, which could not guarantee providing correct global reference information for occluded points, and had problems such as long calculation time and poor accuracy in predicting optical flow at occluded points. To enable both high accuracy and efficiency, We fully mine and utilize the spatiotemporal information provided by the frame pair, design a loopback judgment algorithm to ensure that correct global reference information is obtained, mine multiple necessary global information, and design an efficient refinement module that fuses these global information. Specifically, we propose a YOIO framework, which consists of three main components: an initial flow estimator, a multiple global information extraction module, and a unified refinement module. We demonstrate that optical flow estimates in the occluded regions can be significantly improved in only one iteration without damaging the performance in non-occluded regions. Compared with GMA, the optical flow prediction accuracy of this method in the occluded area is improved by more than 10%, and the occ_out area exceeds 15%, while the calculation time is 27% shorter. This approach, running up to 18.9fps with 436*1024 image resolution, obtains new state-of-the-art results on the challenging Sintel dataset among all published and unpublished approaches that can run in real-time, suggesting a new paradigm for accurate and efficient optical flow estimation. △ Less

Submitted 11 January, 2024; originally announced January 2024.

Comments: arXiv admin note: text overlap with arXiv:2104.02409 by other authors

arXiv:2401.00565 [pdf, other]

doi 10.3847/1538-4357/ad3b96

Photometric Objects Around Cosmic Webs (PAC). VI. High Satellite Fraction of Quasars

Authors: Shanquan Gui, Kun Xu, Y. P. Jing, Donghai Zhao, Hongyu Gao

Abstract: The Photometric objects Around Cosmic webs (PAC) approach developed in Xu et al. (2022b) has the advantage of making full use of spectroscopic and deeper photometric surveys. With the merits of PAC, the excess surface density $\bar{n}_2w_{\rm{p}}$ of neighboring galaxies can be measured down to stellar mass $10^{10.80}\,M_{\odot}$ around quasars at redshift $0.8<z_{\rm{s}}<1.0$, with the data from… ▽ More The Photometric objects Around Cosmic webs (PAC) approach developed in Xu et al. (2022b) has the advantage of making full use of spectroscopic and deeper photometric surveys. With the merits of PAC, the excess surface density $\bar{n}_2w_{\rm{p}}$ of neighboring galaxies can be measured down to stellar mass $10^{10.80}\,M_{\odot}$ around quasars at redshift $0.8<z_{\rm{s}}<1.0$, with the data from the Sloan Digital Sky Survey IV (SDSS-IV) extended Baryon Oscillation Spectroscopic Survey (eBOSS) and the Dark Energy Spectroscopic Instrument (DESI) Legacy Imaging Surveys. We find that $\bar{n}_2w_{\rm{p}}$ generally increases quite steeply with the decrease of the separation. Using subhalo abundance matching method, we can accurately model the $\bar{n}_2w_{\rm{p}}$ both on small and large scales. We show that the steep increase of the $\bar{n}_2w_{\rm{p}}$ towards the quasars requires that a large fraction $f_{\mathrm{sate}}=0.29_{-0.06}^{+0.05}$ of quasars should be satellites in massive halos, and find that this fraction measurement is insensitive to the assumptions of our modeling. This high satellite fraction indicates that the subhalos have nearly the same probability to host quasars as the halos for the same (infall) halo mass, and the large scale environment has negligible effect on the quasar activity. We show that even with this high satellite fraction, each massive halo on average does not host more than one satellite quasar due to the sparsity of quasars. △ Less

Submitted 15 May, 2024; v1 submitted 31 December, 2023; originally announced January 2024.

Comments: 15 pages, 11 figures, 2 tables, accepted for publication in the Astrophysical Journal

Journal ref: The Astrophysical Journal, 967:17 (13pp), 2024 May 20

arXiv:2312.13139 [pdf, other]

Unleashing Large-Scale Video Generative Pre-training for Visual Robot Manipulation

Authors: Hongtao Wu, Ya Jing, Chilam Cheang, Guangzeng Chen, Jiafeng Xu, Xinghang Li, Minghuan Liu, Hang Li, Tao Kong

Abstract: Generative pre-trained models have demonstrated remarkable effectiveness in language and vision domains by learning useful representations. In this paper, we extend the scope of this effectiveness by showing that visual robot manipulation can significantly benefit from large-scale video generative pre-training. We introduce GR-1, a straightforward GPT-style model designed for multi-task language-c… ▽ More Generative pre-trained models have demonstrated remarkable effectiveness in language and vision domains by learning useful representations. In this paper, we extend the scope of this effectiveness by showing that visual robot manipulation can significantly benefit from large-scale video generative pre-training. We introduce GR-1, a straightforward GPT-style model designed for multi-task language-conditioned visual robot manipulation. GR-1 takes as inputs a language instruction, a sequence of observation images, and a sequence of robot states. It predicts robot actions as well as future images in an end-to-end manner. Thanks to a flexible design, GR-1 can be seamlessly finetuned on robot data after pre-trained on a large-scale video dataset. We perform extensive experiments on the challenging CALVIN benchmark and a real robot. On CALVIN benchmark, our method outperforms state-of-the-art baseline methods and improves the success rate from 88.9% to 94.9%. In the setting of zero-shot unseen scene generalization, GR-1 improves the success rate from 53.3% to 85.4%. In real robot experiments, GR-1 also outperforms baseline methods and shows strong potentials in generalization to unseen scenes and objects. We provide inaugural evidence that a unified GPT-style transformer, augmented with large-scale video generative pre-training, exhibits remarkable generalization to multi-task visual robot manipulation. Project page: https://GR1-Manipulation.github.io △ Less

Submitted 21 December, 2023; v1 submitted 20 December, 2023; originally announced December 2023.

Comments: Project page: https://GR1-Manipulation.github.io

arXiv:2312.06970 [pdf]

Engineering Moiré Meta-crystals with Conventional Photonic and Phononic Structures

Authors: Mourad Oudich, Xianghong Kong, Tan Zhang, Chengwei Qiu, Yun Jing

Abstract: Recent discoveries on Mott insulating and unconventional superconducting states in twisted bilayer graphene with Moiré superlattices have reshaped the landscape of ''twistronics'' and paved the way for developing high-temperature superconductors and new devices for quantum computing and sensing. Meanwhile, artificially structured photonic and phononic metamaterials/crystals (or meta-crystals) have… ▽ More Recent discoveries on Mott insulating and unconventional superconducting states in twisted bilayer graphene with Moiré superlattices have reshaped the landscape of ''twistronics'' and paved the way for developing high-temperature superconductors and new devices for quantum computing and sensing. Meanwhile, artificially structured photonic and phononic metamaterials/crystals (or meta-crystals) have become a fertile playground for emulating quantum-mechanical features of condensed matter systems, revealing new routes for robust control of classical waves. Drawing inspiration from the success of twisted bilayer graphene, this perspective casts an overarching framework of the emerging Moiré photonic and phononic meta-crystals that promise novel classical-wave devices. We begin with the fundamentals of Moiré superlattices, before highlighting recent works that exploit twist angle and interlayer coupling as new ingredients to engineer and tailor the band structures and effective material properties of photonic and phononic meta-crystals. We finally discuss future directions and promises of this emerging area in materials science and wave physics. △ Less

Submitted 11 December, 2023; originally announced December 2023.

Comments: 4 figures

arXiv:2312.06097 [pdf, other]

doi 10.1007/s11433-023-2219-7

The FAST all sky HI survey (FASHI): The first release of catalog

Authors: Chuan-Peng Zhang, M. Zhu, P. Jiang, C. Cheng, J. Wang, J. Wang, J. -L. Xu, X. -L. Liu, N. -P. Yu, L. Qian, H. Yu, M. Ai, Y. Jing, C. Xu, Z. Liu, X. Guan, C. Sun, Q. Yang, M. Huang, Q. Hao, FAST Collaboration

Abstract: The FAST All Sky HI survey (FASHI) was designed to cover the entire sky observable by the Five-hundred-meter Aperture Spherical radio Telescope (FAST), spanning approximately 22000 square degrees of declination between -14 deg and +66 deg, and in the frequency range of 1050-1450 MHz, with the expectation of eventually detecting more than 100000 HI sources. Between August 2020 and June 2023, FASHI… ▽ More The FAST All Sky HI survey (FASHI) was designed to cover the entire sky observable by the Five-hundred-meter Aperture Spherical radio Telescope (FAST), spanning approximately 22000 square degrees of declination between -14 deg and +66 deg, and in the frequency range of 1050-1450 MHz, with the expectation of eventually detecting more than 100000 HI sources. Between August 2020 and June 2023, FASHI had covered more than 7600 square degrees, which is approximately 35% of the total sky observable by FAST. It has a median detection sensitivity of around 0.76 mJy/beam and a spectral line velocity resolution of ~6.4 km/s at a frequency of ~1.4 GHz. As of now, a total of 41741 extragalactic HI sources have been detected in the frequency range 1305.5-1419.5 MHz, corresponding to a redshift limit of z<0.09. By cross-matching FASHI sources with the Siena Galaxy Atlas (SGA) and the Sloan Digital Sky Survey (SDSS) catalogs, we found that 16972 (40.7%) sources have spectroscopic redshifts and 10975 (26.3%) sources have only photometric redshifts. Most of the remaining 13794 (33.0%) HI sources are located in the direction of the Galactic plane, making their optical counterparts difficult to identify due to high extinction or high contamination of Galactic stellar sources. Based on current survey results, the FASHI survey is an unprecedented blind extragalactic HI survey. It has higher spectral and spatial resolution and broader coverage than the Arecibo Legacy Fast ALFA Survey (ALFALFA). When completed, FASHI will provide the largest extragalactic HI catalog and an objective view of HI content and large-scale structure in the local universe. △ Less

Submitted 10 December, 2023; originally announced December 2023.

Comments: 22 pages, 12 figures, published in SCPMA. All catalogs are available at https://zcp521.github.io/fashi and https://fast.bao.ac.cn/cms/article/271/

Journal ref: Sci. China-Phys. Mech. Astron. 67, 219511 (2024)

arXiv:2311.09829 [pdf, other]

FollowEval: A Multi-Dimensional Benchmark for Assessing the Instruction-Following Capability of Large Language Models

Authors: Yimin Jing, Renren Jin, Jiahao Hu, Huishi Qiu, Xiaohua Wang, Peng Wang, Deyi Xiong

Abstract: The effective assessment of the instruction-following ability of large language models (LLMs) is of paramount importance. A model that cannot adhere to human instructions might be not able to provide reliable and helpful responses. In pursuit of this goal, various benchmarks have been constructed to evaluate the instruction-following capacity of these models. However, these benchmarks are limited… ▽ More The effective assessment of the instruction-following ability of large language models (LLMs) is of paramount importance. A model that cannot adhere to human instructions might be not able to provide reliable and helpful responses. In pursuit of this goal, various benchmarks have been constructed to evaluate the instruction-following capacity of these models. However, these benchmarks are limited to a single language and are constructed using automated approaches, which restricts their applicability and the quality of the test examples they contain. To bridge this gap, we introduce the FollowEval benchmark in this paper. This benchmark is composed of instances in both English and Chinese, and all test examples are crafted by human experts. Furthermore, the FollowEval benchmark is designed to assess LLMs across five critical dimensions of instruction following: string manipulation, commonsense reasoning, logical reasoning, spatial reasoning, and response constraints. To enhance the complexity and present a sufficient challenge, each test example is designed to evaluate more than one dimension. We have evaluated various LLMs using the FollowEval benchmark and found that their performance significantly lags behind that of humans. This highlights the considerable room for improvement in the instruction-following ability of these models. △ Less

Submitted 16 November, 2023; originally announced November 2023.

Comments: Work in progress

arXiv:2311.04796 [pdf, other]

Bar-driven Gas Dynamics of M31

Authors: Zi-Xuan Feng, Zhi Li, Juntai Shen, Ortwin Gerhard, Roberto Saglia, Matias Blana, Hui Li, Yingjie Jing

Abstract: The large-scale gaseous shocks in the bulge of M31 can be naturally explained by a rotating stellar bar. We use gas dynamical models to provide an independent measurement of the bar pattern speed in M31. The gravitational potentials of our simulations are from a set of made-to-measure models constrained by stellar photometry and kinematics. If the inclination of the gas disk is fixed at… ▽ More The large-scale gaseous shocks in the bulge of M31 can be naturally explained by a rotating stellar bar. We use gas dynamical models to provide an independent measurement of the bar pattern speed in M31. The gravitational potentials of our simulations are from a set of made-to-measure models constrained by stellar photometry and kinematics. If the inclination of the gas disk is fixed at $i = 77^{\circ}$, we find that a low pattern speed of $16-20\;\rm km\;s^{-1}\;kpc^{-1}$ is needed to match the observed position and amplitude of the shock features, as shock positions are too close to the bar major axis in high $Ω_{b}$ models. The pattern speed can increase to $20-30\;\rm km\;s^{-1}\;kpc^{-1}$ if the inner gas disk has a slightly smaller inclination angle compared with the outer one. Including sub-grid physics such as star formation and stellar feedback has minor effects on the shock amplitude, and does not change the shock position significantly. If the inner gas disk is allowed to follow a varying inclination similar to the HI and ionized gas observations, the gas models with a pattern speed of $38\;\rm km\;s^{-1}\;kpc^{-1}$, which is consistent with stellar-dynamical models, can match both the shock features and the central gas features. △ Less

Submitted 13 November, 2023; v1 submitted 8 November, 2023; originally announced November 2023.

Comments: 26 pages, 16 figures. To appear on ApJ

arXiv:2311.01378 [pdf, other]

Vision-Language Foundation Models as Effective Robot Imitators

Authors: Xinghang Li, Minghuan Liu, Hanbo Zhang, Cunjun Yu, Jie Xu, Hongtao Wu, Chilam Cheang, Ya Jing, Weinan Zhang, Huaping Liu, Hang Li, Tao Kong

Abstract: Recent progress in vision language foundation models has shown their ability to understand multimodal data and resolve complicated vision language tasks, including robotics manipulation. We seek a straightforward way of making use of existing vision-language models (VLMs) with simple fine-tuning on robotics data. To this end, we derive a simple and novel vision-language manipulation framework, dub… ▽ More Recent progress in vision language foundation models has shown their ability to understand multimodal data and resolve complicated vision language tasks, including robotics manipulation. We seek a straightforward way of making use of existing vision-language models (VLMs) with simple fine-tuning on robotics data. To this end, we derive a simple and novel vision-language manipulation framework, dubbed RoboFlamingo, built upon the open-source VLMs, OpenFlamingo. Unlike prior works, RoboFlamingo utilizes pre-trained VLMs for single-step vision-language comprehension, models sequential history information with an explicit policy head, and is slightly fine-tuned by imitation learning only on language-conditioned manipulation datasets. Such a decomposition provides RoboFlamingo the flexibility for open-loop control and deployment on low-performance platforms. By exceeding the state-of-the-art performance with a large margin on the tested benchmark, we show RoboFlamingo can be an effective and competitive alternative to adapt VLMs to robot control. Our extensive experimental results also reveal several interesting conclusions regarding the behavior of different pre-trained VLMs on manipulation tasks. We believe RoboFlamingo has the potential to be a cost-effective and easy-to-use solution for robotics manipulation, empowering everyone with the ability to fine-tune their own robotics policy. △ Less

Submitted 4 February, 2024; v1 submitted 2 November, 2023; originally announced November 2023.

Comments: Fix typos. Project page: https://roboflamingo.github.io

arXiv:2310.10733 [pdf, other]

From Halos to Galaxies. VII. The Connections Between Stellar Mass Growth History, Quenching History and Halo Assembly History for Central Galaxies

Authors: Cheqiu Lyu, Yingjie Peng, Yipeng Jing, Xiaohu Yang, Luis C. Ho, Alvio Renzini, Bitao Wang, Kai Wang, Bingxiao Xu, Dingyi Zhao, Jing Dou, Qiusheng Gu, Roberto Maiolino, Filippo Mannucci, Feng Yuan

Abstract: The assembly of galaxies over cosmic time is tightly connected to the assembly of their host dark matter halos. We investigate the stellar mass growth history and the chemical enrichment history of central galaxies in SDSS-MaNGA. We find that the derived stellar metallicity of passive central galaxies is always higher than that of the star-forming ones. This stellar metallicity enhancement becomes… ▽ More The assembly of galaxies over cosmic time is tightly connected to the assembly of their host dark matter halos. We investigate the stellar mass growth history and the chemical enrichment history of central galaxies in SDSS-MaNGA. We find that the derived stellar metallicity of passive central galaxies is always higher than that of the star-forming ones. This stellar metallicity enhancement becomes progressively larger towards low-mass galaxies (at a given epoch) and earlier epochs (at a given stellar mass), which suggests strangulation as the primary mechanism for star formation quenching in central galaxies not only in the local universe, but also very likely at higher redshifts up to $z\sim3$. We show that at the same present-day stellar mass, passive central galaxies assembled half of their final stellar mass $\sim 2$ Gyr earlier than star-forming central galaxies, which agrees well with semi-analytic model. Exploring semi-analytic model, we find that this is because passive central galaxies reside in, on average, more massive halos with a higher halo mass increase rate across cosmic time. As a consequence, passive central galaxies are assembled faster and also quenched earlier than their star-forming counterparts. While at the same present-day halo mass, different halo assembly history also produces very different final stellar mass of the central galaxy within, and halos assembled earlier host more massive centrals with a higher quenched fraction, in particular around the "golden halo mass" at $10^{12}\mathrm{M_\odot}$. Our results call attention back to the dark matter halo as a key driver of galaxy evolution. △ Less

Submitted 16 October, 2023; originally announced October 2023.

Comments: 19 pages, 11 figures. Accepted by ApJ

arXiv:2309.05962 [pdf, other]

Neutral Hydrogen content of dwarf galaxies in different environments

Authors: Hui-Jie Hu, Qi Guo, Pablo Renard, Hang Yang, Zheng Zheng, Yingjie Jing, Hao Chen, Hui Li

Abstract: Environments play an important role in galaxy formation and evolution, particularly in regulating the content of neutral gas. However, current HI surveys have limitations in their depth, which prevents them from adequately studying low HI content galaxies in high-density regions. In this study, we address this issue by employing the Five-hundred-meter Aperture Spherical radio Telescope (FAST) with… ▽ More Environments play an important role in galaxy formation and evolution, particularly in regulating the content of neutral gas. However, current HI surveys have limitations in their depth, which prevents them from adequately studying low HI content galaxies in high-density regions. In this study, we address this issue by employing the Five-hundred-meter Aperture Spherical radio Telescope (FAST) with extensive integration times to complement the relatively shallow Arecibo Legacy Fast Arecibo L-band Feed Array (ALFALFA) HI survey. This approach allows us to explore the gas content of dwarf galaxies across various environments. We observe a positive relationship between HI mass and stellar mass in dwarf galaxies, with a well-defined upper boundary for HI mass that holds true in both observations and simulations. Furthermore, we find a decrease in the HI-to-stellar mass ratio ($\rm M_{\rm HI}/M_*$) as the density of the environment increases, irrespective of whether it is determined by the proximity to the nearest group or the projected number density. Comparing our observations to simulations, we note a steeper slope in the relationship, indicating a gradual gas-stripping process in the observational data. Additionally, we find that the scaling relation between the $\rm M_{\rm HI}/M_*$ and optical properties can be improved by incorporating galaxy environments. △ Less

Submitted 12 September, 2023; originally announced September 2023.

Comments: 14 pages, 10 figures, 2 table, accepted for publication in RAA

arXiv:2309.05073 [pdf, other]

FreeMan: Towards Benchmarking 3D Human Pose Estimation under Real-World Conditions

Authors: Jiong Wang, Fengyu Yang, Wenbo Gou, Bingliang Li, Danqi Yan, Ailing Zeng, Yijun Gao, Junle Wang, Yanqing Jing, Ruimao Zhang

Abstract: Estimating the 3D structure of the human body from natural scenes is a fundamental aspect of visual perception. 3D human pose estimation is a vital step in advancing fields like AIGC and human-robot interaction, serving as a crucial technique for understanding and interacting with human actions in real-world settings. However, the current datasets, often collected under single laboratory condition… ▽ More Estimating the 3D structure of the human body from natural scenes is a fundamental aspect of visual perception. 3D human pose estimation is a vital step in advancing fields like AIGC and human-robot interaction, serving as a crucial technique for understanding and interacting with human actions in real-world settings. However, the current datasets, often collected under single laboratory conditions using complex motion capture equipment and unvarying backgrounds, are insufficient. The absence of datasets on variable conditions is stalling the progress of this crucial task. To facilitate the development of 3D pose estimation, we present FreeMan, the first large-scale, multi-view dataset collected under the real-world conditions. FreeMan was captured by synchronizing 8 smartphones across diverse scenarios. It comprises 11M frames from 8000 sequences, viewed from different perspectives. These sequences cover 40 subjects across 10 different scenarios, each with varying lighting conditions. We have also established an semi-automated pipeline containing error detection to reduce the workload of manual check and ensure precise annotation. We provide comprehensive evaluation baselines for a range of tasks, underlining the significant challenges posed by FreeMan. Further evaluations of standard indoor/outdoor human sensing datasets reveal that FreeMan offers robust representation transferability in real and complex scenes. Code and data are available at https://wangjiongw.github.io/freeman. △ Less

Submitted 3 April, 2024; v1 submitted 10 September, 2023; originally announced September 2023.

Comments: CVPR2024 camera ready version. 19 pages, 16 figures. Project page: https://wangjiongw.github.io/freeman/ ; API: https://github.com/wangjiongw/FreeMan_API

arXiv:2309.03802 [pdf, other]

The DESI One-Percent Survey: A concise model for galactic conformity of ELGs

Authors: Hongyu Gao, Y. P. Jing, Kun Xu, Donghai Zhao, Shanquan Gui, Yun Zheng, Xiaolin Luo, Jessica Nicole Aguilar, Steven Ahlen, David Brooks, Todd Claybaugh, Shaun Cole, Axel de la Macorra, Jaime E. Forero-Romero, Satya Gontcho A Gontcho, Mustapha Ishak, Andrew Lambert, Martin Landriau, Marc Manera, Aaron Meisner, Ramon Miquel, Jundan Nie, Mehdi Rezaie, Graziano Rossi, Eusebio Sanchez , et al. (5 additional authors not shown)

Abstract: Galactic conformity is the phenomenon in which a galaxy of a certain physical property is correlated with its neighbors of the same property, implying a possible causal relationship. The observed auto correlations of emission line galaxies (ELGs) from the highly complete DESI One-Percent survey exhibit a strong clustering signal on small scales, providing clear evidence for the conformity effect o… ▽ More Galactic conformity is the phenomenon in which a galaxy of a certain physical property is correlated with its neighbors of the same property, implying a possible causal relationship. The observed auto correlations of emission line galaxies (ELGs) from the highly complete DESI One-Percent survey exhibit a strong clustering signal on small scales, providing clear evidence for the conformity effect of ELGs. Building upon the original subhalo abundance matching (SHAM) method developed by Gao et al. (2022, 2023), we propose a concise conformity model to improve the ELG-halo connection. In this model, the number of satellite ELGs is boosted by a factor of $\sim 5$ in the halos whose central galaxies are ELGs. We show that the mean ELG satellite number in such central halos is still smaller than 1, and the model does not significantly increase the overall satellite fraction. With this model, we can well recover the ELG auto correlations to the smallest scales explored with the current data (i.e. $r_{\mathrm{p}} > 0.03$ $\mathrm{Mpc}\,h^{-1}$ in real space and at $s > 0.3$ $\mathrm{Mpc}\,h^{-1}$ in redshift space), while the cross correlations between luminous red galaxies (LRGs) and ELGs are nearly unchanged. Although our SHAM model has only 8 parameters, we further verify that it can accurately describe the ELG clustering in the entire redshift range from $z = 0.8$ to $1.6$. We therefore expect that this method can be used to generate high-quality ELG lightcone mocks for DESI. △ Less

Submitted 7 November, 2023; v1 submitted 7 September, 2023; originally announced September 2023.

Comments: 18 pages, 10 figures, accepted by ApJ

arXiv:2308.03624 [pdf, other]

MOMA-Force: Visual-Force Imitation for Real-World Mobile Manipulation

Authors: Taozheng Yang, Ya Jing, Hongtao Wu, Jiafeng Xu, Kuankuan Sima, Guangzeng Chen, Qie Sima, Tao Kong

Abstract: In this paper, we present a novel method for mobile manipulators to perform multiple contact-rich manipulation tasks. While learning-based methods have the potential to generate actions in an end-to-end manner, they often suffer from insufficient action accuracy and robustness against noise. On the other hand, classical control-based methods can enhance system robustness, but at the cost of extens… ▽ More In this paper, we present a novel method for mobile manipulators to perform multiple contact-rich manipulation tasks. While learning-based methods have the potential to generate actions in an end-to-end manner, they often suffer from insufficient action accuracy and robustness against noise. On the other hand, classical control-based methods can enhance system robustness, but at the cost of extensive parameter tuning. To address these challenges, we present MOMA-Force, a visual-force imitation method that seamlessly combines representation learning for perception, imitation learning for complex motion generation, and admittance whole-body control for system robustness and controllability. MOMA-Force enables a mobile manipulator to learn multiple complex contact-rich tasks with high success rates and small contact forces. In a real household setting, our method outperforms baseline methods in terms of task success rates. Moreover, our method achieves smaller contact forces and smaller force variances compared to baseline methods without force imitation. Overall, we offer a promising approach for efficient and robust mobile manipulation in the real world. Videos and more details can be found on \url{https://visual-force-imitation.github.io} △ Less

Submitted 7 August, 2023; originally announced August 2023.

Comments: IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2023

arXiv:2308.03620 [pdf, other]

Exploring Visual Pre-training for Robot Manipulation: Datasets, Models and Methods

Authors: Ya Jing, Xuelin Zhu, Xingbin Liu, Qie Sima, Taozheng Yang, Yunhai Feng, Tao Kong

Abstract: Visual pre-training with large-scale real-world data has made great progress in recent years, showing great potential in robot learning with pixel observations. However, the recipes of visual pre-training for robot manipulation tasks are yet to be built. In this paper, we thoroughly investigate the effects of visual pre-training strategies on robot manipulation tasks from three fundamental perspec… ▽ More Visual pre-training with large-scale real-world data has made great progress in recent years, showing great potential in robot learning with pixel observations. However, the recipes of visual pre-training for robot manipulation tasks are yet to be built. In this paper, we thoroughly investigate the effects of visual pre-training strategies on robot manipulation tasks from three fundamental perspectives: pre-training datasets, model architectures and training methods. Several significant experimental findings are provided that are beneficial for robot learning. Further, we propose a visual pre-training scheme for robot manipulation termed Vi-PRoM, which combines self-supervised learning and supervised learning. Concretely, the former employs contrastive learning to acquire underlying patterns from large-scale unlabeled data, while the latter aims learning visual semantics and temporal dynamics. Extensive experiments on robot manipulations in various simulation environments and the real robot demonstrate the superiority of the proposed scheme. Videos and more details can be found on \url{https://explore-pretrain-robot.github.io}. △ Less

Submitted 7 August, 2023; originally announced August 2023.

Comments: IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2023

arXiv:2308.00962 [pdf, other]

Spin wave amplification through superradiance

Authors: X. R. Wang, X. Gong, K. Y. Jing

Abstract: Superradiance is a phenomenon of multiple facets that occurs in classical and quantum physics under extreme conditions. Here we present its manifestation in spin waves under an easily realized condition. We show that an interface between a current-free (normal) ferromagnetic (FM) region and a current-flow (pumped) FM region can be a spin wave super-mirror whose reflection coefficient is larger tha… ▽ More Superradiance is a phenomenon of multiple facets that occurs in classical and quantum physics under extreme conditions. Here we present its manifestation in spin waves under an easily realized condition. We show that an interface between a current-free (normal) ferromagnetic (FM) region and a current-flow (pumped) FM region can be a spin wave super-mirror whose reflection coefficient is larger than 1. The super-reflection is the consequence of current-induced spectrum inversion where phase and group velocities of spin waves are in the opposite directions. An incident spin wave activates a backward propagating refractive wave inside pumped FM region. The refractive spin wave re-enters the normal FM region to constructively interfere with the reflective wave. It appears that the pumped FM region coherently emits reflective waves, leading to a super-reflection. The process resembles superradiance of a spinning black hole through the Hawking radiation process, or Dicke superradiance of cavity photons inside population inverted media. △ Less

Submitted 2 August, 2023; originally announced August 2023.

arXiv:2308.00870 [pdf]

On the importance of low-frequency signals in functional and molecular photoacoustic computed tomography

Authors: Tri Vu, Paul Klippel, Aidan J. Canning, Chenshuo Ma, Huijuan Zhang, Ludmila A. Kasatkina, Yuqi Tang, Jun Xia, Vladislav V. Verkhusha, Tuan Vo-Dinh, Yun Jing, Junjie Yao

Abstract: In photoacoustic computed tomography (PACT) with short-pulsed laser excitation, wideband acoustic signals are generated in biological tissues with frequencies related to the effective shapes and sizes of the optically absorbing targets. Low-frequency photoacoustic signal components correspond to slowly varying spatial features and are often omitted during imaging due to the limited detection bandw… ▽ More In photoacoustic computed tomography (PACT) with short-pulsed laser excitation, wideband acoustic signals are generated in biological tissues with frequencies related to the effective shapes and sizes of the optically absorbing targets. Low-frequency photoacoustic signal components correspond to slowly varying spatial features and are often omitted during imaging due to the limited detection bandwidth of the ultrasound transducer, or during image reconstruction as undesired background that degrades image contrast. Here we demonstrate that low-frequency photoacoustic signals, in fact, contain functional and molecular information, and can be used to enhance structural visibility, improve quantitative accuracy, and reduce spare-sampling artifacts. We provide an in-depth theoretical analysis of low-frequency signals in PACT, and experimentally evaluate their impact on several representative PACT applications, such as mapping temperature in photothermal treatment, measuring blood oxygenation in a hypoxia challenge, and detecting photoswitchable molecular probes in deep organs. Our results strongly suggest that low-frequency signals are important for functional and molecular PACT. △ Less

Submitted 1 August, 2023; originally announced August 2023.

arXiv:2307.16356 [pdf, other]

Interleaved Training for Massive MIMO Downlink via Exploring Spatial Correlation

Authors: Cheng Zhang, Chang Liu, Yindi Jing, Minjie Ding, Yongming Huang

Abstract: Interleaved training has been studied for single-user and multi-user massive MIMO downlink with either fully-digital or hybrid beamforming. However, the impact of channel correlation on its average training overhead is rarely addressed. In this paper, we explore the channel correlation to improve the interleaved training for single-user massive MIMO downlink. For the beam-domain interleaved traini… ▽ More Interleaved training has been studied for single-user and multi-user massive MIMO downlink with either fully-digital or hybrid beamforming. However, the impact of channel correlation on its average training overhead is rarely addressed. In this paper, we explore the channel correlation to improve the interleaved training for single-user massive MIMO downlink. For the beam-domain interleaved training, we propose a modified scheme by optimizing the beam training codebook. The basic antenna-domain interleaved training is also improved by dynamically adjusting the training order of the base station (BS) antennas during the training process based on the values of the already trained channels. Exact and simplified approximate expressions of the average training length are derived in closed-form for the basic and modified beam-domain schemes and the basic antenna-domain scheme in correlated channels. For the modified antenna-domain scheme, a deep neural network (DNN)-based approximation is provided for fast performance evaluation. Analytical results and simulations verify the accuracy of our derived training length expressions and explicitly reveal the impact of system parameters on the average training length. In addition, the modified beam/antenna-domain schemes are shown to have a shorter average training length compared to the basic schemes. △ Less

Submitted 16 January, 2024; v1 submitted 30 July, 2023; originally announced July 2023.

Comments: 14 pages (double column), 8 figures. The paper has been accepted by IEEE Transactions on Wireless Communications

arXiv:2307.12334 [pdf, other]

doi 10.3847/1538-4357/acf835

Toward a Physical Understanding of Galaxy-Halo Alignment

Authors: Kun Xu, Y. P. Jing, Donghai Zhao

Abstract: We investigate the alignment of galaxy and halo orientations using the TNG300-1 hydrodynamical simulation. Our analysis reveals that the distribution of the 2D misalignment angle $θ_{\rm{2D}}$ can be well described by a truncated shifted exponential (TSE) distribution with only {\textit{one}} free parameter across different redshifts and galaxy/halo properties. We demonstrate that the galaxy-ellip… ▽ More We investigate the alignment of galaxy and halo orientations using the TNG300-1 hydrodynamical simulation. Our analysis reveals that the distribution of the 2D misalignment angle $θ_{\rm{2D}}$ can be well described by a truncated shifted exponential (TSE) distribution with only {\textit{one}} free parameter across different redshifts and galaxy/halo properties. We demonstrate that the galaxy-ellipticity (GI) correlations of galaxies can be reproduced by perturbing halo orientations with the obtained $θ_{\rm{2D}}$ distribution, with only a small bias ($<3^{\circ}$) possibly arising from unaccounted couplings between $θ_{\rm{2D}}$ and other factors. We find that both the 2D and 3D misalignment angles $θ_{\rm{2D}}$ and $θ_{\rm{3D}}$ decrease with ex situ stellar mass fraction $F_{\rm{acc}}$, halo mass $M_{\rm{vir}}$ and stellar mass $M_{*}$, while increasing with disk-to-total stellar mass fraction $F_{\rm{disk}}$ and redshift. These dependences are in good agreement with our recent observational study based on the BOSS galaxy samples. Our results suggest that $F_{\rm{acc}}$ is a key factor in determining the galaxy-halo alignment. Grouping galaxies by $F_{\rm{acc}}$ nearly eliminates the dependence of $θ_{\rm{3D}}$ on $M_{\rm{vir}}$ for all three principle axes, and also reduces the redshift dependence. For $θ_{\rm{2D}}$, we find a more significant redshift dependence than for $θ_{\rm{3D}}$ even after controlling $F_{\rm{acc}}$, which may be attributed to the evolution of galaxy and halo shapes. Our findings present a valuable model for observational studies and enhance our understanding of galaxy-halo alignment. △ Less

Submitted 5 November, 2023; v1 submitted 23 July, 2023; originally announced July 2023.

Comments: 19 pages, 12 figures, Published in ApJ

Journal ref: The Astrophysical Journal, Volume 957, 2023, Number 1

arXiv:2307.10730 [pdf, other]

Joint Port Selection Based Channel Acquisition for FDD Cell-Free Massive MIMO

Authors: Cheng Zhang, Pengguang Du, Minjie Ding, Yindi Jing, Yongming Huang

Abstract: In frequency division duplexing (FDD) cell-free massive MIMO, the acquisition of the channel state information (CSI) is very challenging because of the large overhead required for the training and feedback of the downlink channels of multiple cooperating base stations (BSs). In this paper, for systems with partial uplink-downlink channel reciprocity, and a general spatial domain channel model with… ▽ More In frequency division duplexing (FDD) cell-free massive MIMO, the acquisition of the channel state information (CSI) is very challenging because of the large overhead required for the training and feedback of the downlink channels of multiple cooperating base stations (BSs). In this paper, for systems with partial uplink-downlink channel reciprocity, and a general spatial domain channel model with variations in the average port power and correlation among port coefficients, we propose a joint-port-selection-based CSI acquisition and feedback scheme for the downlink transmission with zero-forcing precoding. The scheme uses an eigenvalue-decomposition-based transformation to reduce the feedback overhead by exploring the port correlation. We derive the sum-rate of the system for any port selection. Based on the sum-rate result, we propose a low-complexity greedy-search-based joint port selection (GS-JPS) algorithm. Moreover, to adapt to fast time-varying scenarios, a supervised deep learning-enhanced joint port selection (DL-JPS) algorithm is proposed. Simulations verify the effectiveness of our proposed schemes and their advantage over existing port-selection channel acquisition schemes. △ Less

Submitted 12 January, 2024; v1 submitted 20 July, 2023; originally announced July 2023.

Comments: 15 pages, 11 figures. The paper has been accepted by IEEE TRANSACTIONS ON COMMUNICATIONS

arXiv:2307.03066 [pdf, ps, other]

Kemperman's inequality and Freiman's lemma via few translates

Authors: Yifan Jing, Akshat Mudgal

Abstract: Let $G$ be a connected compact group equipped with the normalised Haar measure $μ$. Our first result shows that given $α, β>0$, there is a constant $c = c(α,β)>0$ such that for any compact sets $A,B\subseteq G$ with $ αμ(B)\geqμ(A)\geq μ(B) $ and $ μ(A)+μ(B)\leq 1-β$, there exist $b_1,\dots b_c\in B$ such that \[ μ(A\cdot \{b_1,\dots,b_c\})\geq μ(A)+μ(B).\] A special case of this, that is, when… ▽ More Let $G$ be a connected compact group equipped with the normalised Haar measure $μ$. Our first result shows that given $α, β>0$, there is a constant $c = c(α,β)>0$ such that for any compact sets $A,B\subseteq G$ with $ αμ(B)\geqμ(A)\geq μ(B) $ and $ μ(A)+μ(B)\leq 1-β$, there exist $b_1,\dots b_c\in B$ such that \[ μ(A\cdot \{b_1,\dots,b_c\})\geq μ(A)+μ(B).\] A special case of this, that is, when $G=\mathbb{T}^d$, confirms a recent conjecture of Bollobás, Leader and Tiba. We also prove a quantitatively stronger version of such a result in the discrete setting of $\mathbb{R}^d$. Thus, given $d \in \mathbb{N}$, we show that there exists $c = c(d) >0$ such that for any finite, non-empty set $A \subseteq \mathbb{R}^d$ which is not contained in a translate of a hyperplane, one can find $a_1, \dots, a_c \in A$ satisfying \[ |A+ \{a_1, \dots, a_c\}| \geq (d+1)|A| - O_d(1). \] The main term here is optimal and recovers the bounds given by Freiman's lemma up to the $O_d(1)$ error term. △ Less

Submitted 10 July, 2023; v1 submitted 6 July, 2023; originally announced July 2023.

Comments: 18 pages; typos corrected, the error term in Theorem 1.2 improved

arXiv:2307.02993 [pdf, ps, other]

doi 10.1103/PhysRevLett.132.220402

Biorthogonal Dynamical Quantum Phase Transitions in Non-Hermitian Systems

Authors: Yecheng Jing, Jian-Jun Dong, Yu-Yu Zhang, Zi-Xiang Hu

Abstract: By utilizing biorthogonal bases, we develop a comprehensive framework for studying biorthogonal dynamical quantum phase transitions in non-Hermitian systems. With the help of the previously overlooked associated state, we define the automatically normalized biorthogonal Loschmidt echo. This approach is capable of handling arbitrary non-Hermitian systems with complex eigenvalues and naturally elimi… ▽ More By utilizing biorthogonal bases, we develop a comprehensive framework for studying biorthogonal dynamical quantum phase transitions in non-Hermitian systems. With the help of the previously overlooked associated state, we define the automatically normalized biorthogonal Loschmidt echo. This approach is capable of handling arbitrary non-Hermitian systems with complex eigenvalues and naturally eliminates the negative value of Loschmidt rate obtained without the biorthogonal bases. Taking the non-Hermitian Su-Schrieffer-Heeger model as a concrete example, a $1/2$ change of dynamical topological order parameter in biorthogonal bases is observed which is not shown in self-normal bases. Furthermore, we discover that the periodicity of biorthogonal dynamical quantum phase transitions depends on whether the two-level subsystem at the critical momentum oscillates or reaches a steady state. △ Less

Submitted 31 May, 2024; v1 submitted 6 July, 2023; originally announced July 2023.

Comments: 7 pages, 4 figures; Supplemental Material

Journal ref: Phys. Rev. Lett. 132, 220402 (2024)

arXiv:2306.09407 [pdf, other]

doi 10.1038/s41550-023-02035-4

Evidence for baryon acoustic oscillations from galaxy-ellipticity correlations

Authors: Kun Xu, Y. P. Jing, Gong-Bo Zhao, Antonio J. Cuesta

Abstract: The Baryon Acoustic Oscillations (BAO) feature in the clustering of galaxies or quasars provides a ``standard ruler" for distance measurements in cosmology. In this work, we report a $2\sim3σ$ signal of the BAO dip feature in the galaxy density-ellipticity (GI) cross-correlation functions using the spectroscopic sample of the Baryon Oscillation Spectroscopic Survey (BOSS) CMASS, combined with the… ▽ More The Baryon Acoustic Oscillations (BAO) feature in the clustering of galaxies or quasars provides a ``standard ruler" for distance measurements in cosmology. In this work, we report a $2\sim3σ$ signal of the BAO dip feature in the galaxy density-ellipticity (GI) cross-correlation functions using the spectroscopic sample of the Baryon Oscillation Spectroscopic Survey (BOSS) CMASS, combined with the deep DESI Legacy Imaging Surveys for precise galaxy shape measurements. We measure the GI correlation functions and model them using the linear alignment model. We constrain the distance $D_V/r_{\mathrm{d}}$ to redshift $0.57$ to a precision of $3\sim5\%$, depending on the details of modeling. The GI measurement reduces the uncertainty of distance measurement by $\sim10\%$ on top of that derived from the galaxy-galaxy (GG) correlation. More importantly, for future large and deep galaxy surveys, the independent GI measurements can help sort out the systematics in the BAO studies. △ Less

Submitted 27 July, 2023; v1 submitted 15 June, 2023; originally announced June 2023.

Comments: Main text 3 figures + supplementary 5 figures. Published in Nature Astronomy

arXiv:2306.07610 [pdf, other]

Soft Language Clustering for Multilingual Model Pre-training

Authors: Jiali Zeng, Yufan Jiang, Yongjing Yin, Yi Jing, Fandong Meng, Binghuai Lin, Yunbo Cao, Jie Zhou

Abstract: Multilingual pre-trained language models have demonstrated impressive (zero-shot) cross-lingual transfer abilities, however, their performance is hindered when the target language has distant typology from source languages or when pre-training data is limited in size. In this paper, we propose XLM-P, which contextually retrieves prompts as flexible guidance for encoding instances conditionally. Ou… ▽ More Multilingual pre-trained language models have demonstrated impressive (zero-shot) cross-lingual transfer abilities, however, their performance is hindered when the target language has distant typology from source languages or when pre-training data is limited in size. In this paper, we propose XLM-P, which contextually retrieves prompts as flexible guidance for encoding instances conditionally. Our XLM-P enables (1) lightweight modeling of language-invariant and language-specific knowledge across languages, and (2) easy integration with other multilingual pre-training methods. On the tasks of XTREME including text classification, sequence labeling, question answering, and sentence retrieval, both base- and large-size language models pre-trained with our proposed method exhibit consistent performance improvement. Furthermore, it provides substantial advantages for low-resource languages in unsupervised sentence retrieval and for target languages that differ greatly from the source language in cross-lingual transfer. △ Less

Submitted 13 June, 2023; originally announced June 2023.

arXiv:2306.06317 [pdf, other]

The DESI One-Percent survey: constructing galaxy-halo connections for ELGs and LRGs using auto and cross correlations

Authors: Hongyu Gao, Y. P. Jing, Shanquan Gui, Kun Xu, Yun Zheng, Donghai Zhao, Jessica Nicole Aguilar, Steven Ahlen, David Brooks, Todd Claybaugh, Kyle Dawson, Axel de la Macorra, Peter Doel, Kevin Fanning, Jaime E. Forero-Romero, Satya Gontcho A Gontcho, Julien Guy, Klaus Honscheid, Robert Kehoe, Martin Landriau, Marc Manera, Aaron Meisner, Ramon Miquel, John Moustakas, Jeffrey A. Newman , et al. (9 additional authors not shown)

Abstract: In the current Dark Energy Spectroscopic Instrument (DESI) survey, emission line galaxies (ELGs) and luminous red galaxies (LRGs) are essential for mapping the dark matter distribution at $z \sim 1$. We measure the auto and cross correlation functions of ELGs and LRGs at $0.8<z\leq 1.0$ from the DESI One-Percent survey. Following Gao et al. (2022), we construct the galaxy-halo connections for ELGs… ▽ More In the current Dark Energy Spectroscopic Instrument (DESI) survey, emission line galaxies (ELGs) and luminous red galaxies (LRGs) are essential for mapping the dark matter distribution at $z \sim 1$. We measure the auto and cross correlation functions of ELGs and LRGs at $0.8<z\leq 1.0$ from the DESI One-Percent survey. Following Gao et al. (2022), we construct the galaxy-halo connections for ELGs and LRGs simultaneously. With the stellar-halo mass relation (SHMR) for the whole galaxy population (i.e. normal galaxies), LRGs can be selected directly by stellar mass, while ELGs can also be selected randomly based on the observed number density of each stellar mass, once the probability $P_{\mathrm{sat}}$ of a satellite galaxy becoming an ELG is determined. We demonstrate that the observed small scale clustering prefers a halo mass-dependent $P_{\mathrm{sat}}$ model rather than a constant. With this model, we can well reproduce the auto correlations of LRGs and the cross correlations between LRGs and ELGs at $r_{\mathrm{p}}>0.1$ $\mathrm{Mpc}\,h^{-1}$. We can also reproduce the auto correlations of ELGs at $r_{\mathrm{p}}>0.3$ $\mathrm{Mpc}\,h^{-1}$ ($s>1$ $\mathrm{Mpc}\,h^{-1}$) in real (redshift) space. Although our model has only seven parameters, we show that it can be extended to higher redshifts and reproduces the observed auto correlations of ELGs in the whole range of $0.8<z<1.6$, which enables us to generate a lightcone ELG mock for DESI. With the above model, we further derive halo occupation distributions (HODs) for ELGs which can be used to produce ELG mocks in coarse simulations without resolving subhalos. △ Less

Submitted 18 July, 2023; v1 submitted 9 June, 2023; originally announced June 2023.

Comments: 27 pages, 16 figures, accepted by ApJ

arXiv:2306.06308 [pdf, other]

doi 10.5281/zenodo.7964161

The Early Data Release of the Dark Energy Spectroscopic Instrument

Authors: DESI Collaboration, A. G. Adame, J. Aguilar, S. Ahlen, S. Alam, G. Aldering, D. M. Alexander, R. Alfarsy, C. Allende Prieto, M. Alvarez, O. Alves, A. Anand, F. Andrade-Oliveira, E. Armengaud, J. Asorey, S. Avila, A. Aviles, S. Bailey, A. Balaguera-Antolínez, O. Ballester, C. Baltay, A. Bault, J. Bautista, J. Behera, S. F. Beltran , et al. (240 additional authors not shown)

Abstract: The Dark Energy Spectroscopic Instrument (DESI) completed its five-month Survey Validation in May 2021. Spectra of stellar and extragalactic targets from Survey Validation constitute the first major data sample from the DESI survey. This paper describes the public release of those spectra, the catalogs of derived properties, and the intermediate data products. In total, the public release includes… ▽ More The Dark Energy Spectroscopic Instrument (DESI) completed its five-month Survey Validation in May 2021. Spectra of stellar and extragalactic targets from Survey Validation constitute the first major data sample from the DESI survey. This paper describes the public release of those spectra, the catalogs of derived properties, and the intermediate data products. In total, the public release includes good-quality spectral information from 466,447 objects targeted as part of the Milky Way Survey, 428,758 as part of the Bright Galaxy Survey, 227,318 as part of the Luminous Red Galaxy sample, 437,664 as part of the Emission Line Galaxy sample, and 76,079 as part of the Quasar sample. In addition, the release includes spectral information from 137,148 objects that expand the scope beyond the primary samples as part of a series of secondary programs. Here, we describe the spectral data, data quality, data products, Large-Scale Structure science catalogs, access to the data, and references that provide relevant background to using these spectra. △ Less

Submitted 15 June, 2023; v1 submitted 9 June, 2023; originally announced June 2023.

Comments: 43 pages, 7 figures, 17 tables, submitted to AJ, DESI EDR references added

arXiv:2306.06307 [pdf, other]

doi 10.5281/zenodo.7858207

Validation of the Scientific Program for the Dark Energy Spectroscopic Instrument

Authors: DESI Collaboration, A. G. Adame, J. Aguilar, S. Ahlen, S. Alam, G. Aldering, D. M. Alexander, R. Alfarsy, C. Allende Prieto, M. Alvarez, O. Alves, A. Anand, F. Andrade-Oliveira, E. Armengaud, J. Asorey, S. Avila, A. Aviles, S. Bailey, A. Balaguera-Antolínez, O. Ballester, C. Baltay, A. Bault, J. Bautista, J. Behera, S. F. Beltran , et al. (239 additional authors not shown)

Abstract: The Dark Energy Spectroscopic Instrument (DESI) was designed to conduct a survey covering 14,000 deg$^2$ over five years to constrain the cosmic expansion history through precise measurements of Baryon Acoustic Oscillations (BAO). The scientific program for DESI was evaluated during a five month Survey Validation (SV) campaign before beginning full operations. This program produced deep spectra of… ▽ More The Dark Energy Spectroscopic Instrument (DESI) was designed to conduct a survey covering 14,000 deg$^2$ over five years to constrain the cosmic expansion history through precise measurements of Baryon Acoustic Oscillations (BAO). The scientific program for DESI was evaluated during a five month Survey Validation (SV) campaign before beginning full operations. This program produced deep spectra of tens of thousands of objects from each of the stellar (MWS), bright galaxy (BGS), luminous red galaxy (LRG), emission line galaxy (ELG), and quasar target classes. These SV spectra were used to optimize redshift distributions, characterize exposure times, determine calibration procedures, and assess observational overheads for the five-year program. In this paper, we present the final target selection algorithms, redshift distributions, and projected cosmology constraints resulting from those studies. We also present a `One-Percent survey' conducted at the conclusion of Survey Validation covering 140 deg$^2$ using the final target selection algorithms with exposures of a depth typical of the main survey. The Survey Validation indicates that DESI will be able to complete the full 14,000 deg$^2$ program with spectroscopically-confirmed targets from the MWS, BGS, LRG, ELG, and quasar programs with total sample sizes of 7.2, 13.8, 7.46, 15.7, and 2.87 million, respectively. These samples will allow exploration of the Milky Way halo, clustering on all scales, and BAO measurements with a statistical precision of 0.28% over the redshift interval $z<1.1$, 0.39% over the redshift interval $1.1<z<1.9$, and 0.46% over the redshift interval $1.9<z<3.5$. △ Less

Submitted 12 January, 2024; v1 submitted 9 June, 2023; originally announced June 2023.

Comments: 42 pages, 18 figures, accepted by AJ

arXiv:2306.04311 [pdf, other]

Unraveling the Complexity of Dwarf Galaxy Dynamics: A study of Binary Orbital Motions

Authors: Wenting Wang, Ling Zhu, Yipeng Jing, Robert J. J. Grand, Zhaozhou Li, Xiaoting Fu, Lu Li, Jiaxin Han, Ting S. Li, Fabo Feng, Carlos Frenk

Abstract: We investigate the impact of binary orbital motions on the dynamical modeling of dwarf galaxies with intrinsic line-of-sight velocity dispersions ($σ_{v_r}$) of 1 to 9 km/s. Using dwarf galaxies from the Auriga level-2 and level-3 simulations, we apply the Jeans Anisotropic Multi-Gaussian Expansion modelling to tracer stars before and after including binaries to recover the dynamical masses. The r… ▽ More We investigate the impact of binary orbital motions on the dynamical modeling of dwarf galaxies with intrinsic line-of-sight velocity dispersions ($σ_{v_r}$) of 1 to 9 km/s. Using dwarf galaxies from the Auriga level-2 and level-3 simulations, we apply the Jeans Anisotropic Multi-Gaussian Expansion modelling to tracer stars before and after including binaries to recover the dynamical masses. The recovered total masses within the half-mass radius of tracers, $M(<r_\mathrm{half})$, are always inflated due to binary motions, with greater inflations occurring for smaller $σ_{v_r}$. However, many dwarf galaxies experience central density deflated due to binary motions, with little dependences on $σ_{v_r}$. This is due to the negative radial gradients in the velocity dispersion profiles, with the fractional inflation in $σ_{v_r}$ due to binaries more significant in outskirts. An extreme binary fraction of 70% can lead to central density deflation of up to 10-20% at 3 km/s$<σ_{v_r}<$8 km/s, with $M(<r_\mathrm{half})$ inflated by 4% at 9 km/s and up to 15% at 3 km/s. A lower binary fraction of 36% leads to similar deflations, with the inflations decreasing to approximately 10% at 3 km/s and becoming statistically insignificant. The choice of binary orbit distribution models does not result in significant differences, and observational errors tend to slightly weaken the deflations in the recovered central density. Two observations separated by one year to exclude binaries lead to almost zero inflations/deflations for a binary fraction of 36% over 3 km/s$<σ_{v_r}<$9 km/s. For $σ_{v_r}\sim$1 km/s to 3 km/s, a binary fraction of 70% (36%) still results in 60% (30%) to 10% (1%) of inflations in $M(<r_\mathrm{half})$, even with two-epoch observation. △ Less

Submitted 29 August, 2023; v1 submitted 7 June, 2023; originally announced June 2023.

Comments: accepted by ApJ, comments welcome

arXiv:2305.19900 [pdf, other]

doi 10.1103/PhysRevAccelBeams.26.064201

Improved Calibration of RF Cavities for Relativistic Electron Beams: Effects of Secondary Corrections and Experimental Verification

Authors: K. Shih, I. Petrushina, V. N. Litvinenko, I. Pinayev, J. Ma, G. Wang, Y. Jing, Y. Wu

Abstract: In the aspect of longitudinal beam bunching, the bunching strength can be controlled by the RF cavity phase and voltage. However, these machine parameters are different from those that interact with the beam itself. In order to gain control of the beam-cavity interaction, cavity calibration must be performed. Furthermore, it relies on fitting the beam energy gain versus cavity phase to a calibrati… ▽ More In the aspect of longitudinal beam bunching, the bunching strength can be controlled by the RF cavity phase and voltage. However, these machine parameters are different from those that interact with the beam itself. In order to gain control of the beam-cavity interaction, cavity calibration must be performed. Furthermore, it relies on fitting the beam energy gain versus cavity phase to a calibration function. Under the conventional assumption of relativistic beam conditions, the calibration function is a first harmonic sinusoidal function (a sinusoidal function with a period of 2π). However, this expression is insufficient for a high-voltage bunching cavity. Due to beam acceleration inside the cavity, an energy bias and a second harmonic function should be included to modify the conventional calibration function, even for a relativistic electron beam. In this paper, we will derive this modification and provide a comparison to both the Coherent Electron Cooling Experiment and the IMPACT-T simulation, respectively. △ Less

Submitted 31 May, 2023; originally announced May 2023.

Journal ref: Physical Review ACCELERATORS AND BEAMS 2023

arXiv:2305.16982 [pdf, other]

TranSFormer: Slow-Fast Transformer for Machine Translation

Authors: Bei Li, Yi Jing, Xu Tan, Zhen Xing, Tong Xiao, Jingbo Zhu

Abstract: Learning multiscale Transformer models has been evidenced as a viable approach to augmenting machine translation systems. Prior research has primarily focused on treating subwords as basic units in developing such systems. However, the incorporation of fine-grained character-level features into multiscale Transformer has not yet been explored. In this work, we present a \textbf{S}low-\textbf{F}ast… ▽ More Learning multiscale Transformer models has been evidenced as a viable approach to augmenting machine translation systems. Prior research has primarily focused on treating subwords as basic units in developing such systems. However, the incorporation of fine-grained character-level features into multiscale Transformer has not yet been explored. In this work, we present a \textbf{S}low-\textbf{F}ast two-stream learning model, referred to as Tran\textbf{SF}ormer, which utilizes a ``slow'' branch to deal with subword sequences and a ``fast'' branch to deal with longer character sequences. This model is efficient since the fast branch is very lightweight by reducing the model width, and yet provides useful fine-grained features for the slow branch. Our TranSFormer shows consistent BLEU improvements (larger than 1 BLEU point) on several machine translation benchmarks. △ Less

Submitted 26 May, 2023; originally announced May 2023.

Comments: Accepted by Findings of ACL2023

arXiv:2305.13694 [pdf, other]

doi 10.1093/mnras/stae121

Assessing Mass Loss and Stellar-to-Halo Mass Ratio of Satellite Galaxies: A Galaxy-Galaxy Lensing Approach Utilizing DECaLS DR8 Data

Authors: Chunxiang Wang, Ran Li, Huanyuan Shan, Weiwei Xu, Ji Yao, Yingjie Jing, Liang Gao, Nan Li, Yushan Xie, Kai Zhu, Hang Yang, Qingze Chen

Abstract: The galaxy-galaxy lensing technique allows us to measure the subhalo mass of satellite galaxies, studying their mass loss and evolution within galaxy clusters and providing direct observational validation for theories of galaxy formation. In this study, we use the weak gravitational lensing observations from DECaLS DR8, in combination with the redMaPPer galaxy cluster catalog from Sloan Digital Sk… ▽ More The galaxy-galaxy lensing technique allows us to measure the subhalo mass of satellite galaxies, studying their mass loss and evolution within galaxy clusters and providing direct observational validation for theories of galaxy formation. In this study, we use the weak gravitational lensing observations from DECaLS DR8, in combination with the redMaPPer galaxy cluster catalog from Sloan Digital Sky Survey data (SDSS) DR8 to accurately measure the dark matter halo mass of satellite galaxies. We confirm a significant increase in the stellar-to-halo mass ratio of satellite galaxies with their halo-centric radius, indicating clear evidence of mass loss due to tidal stripping. Additionally, we find that this mass loss is strongly dependent on the mass of the satellite galaxies, with satellite galaxies above $10^{11}~{\rm M_{\odot}/h}$ experiencing more pronounced mass loss compared to lower mass satellites, reaching 86\% at projected halo-centric radius $0.5R_{\rm 200c}$. The average mass loss rate, when not considering halo-centric radius, displays a U-shaped variation with stellar mass, with galaxies of approximately $4\times10^{10}~{\rm M_{\odot}/h}$ exhibiting the least mass loss, around 60\%. We compare our results with state-of-the-art hydrodynamical numerical simulations and find that the satellite galaxy stellar-to-halo mass ratio in the outskirts of galaxy clusters is higher compared to the predictions of the Illustris-TNG project about factor 5. Furthermore, the Illustris-TNG project's numerical simulations did not predict the observed dependence of satellite galaxy mass loss rate on satellite galaxy mass. △ Less

Submitted 31 January, 2024; v1 submitted 23 May, 2023; originally announced May 2023.

Comments: 14 pages, 9 figures

arXiv:2305.08313 [pdf]

doi 10.1103/PhysRevLett.131.157201

Realization of a Z classified chiral-symmetric higher-order topological insulator in a coupling-inversion acoustic crystal

Authors: Dongyi Wang, Yuanchen Deng, Mourad Oudich, Wladimir A. Benalcazar, Guancong Ma, Yun Jing

Abstract: Higher-order topological band theory has transformed the landscape of topological phases in quantum and classical systems. Here, we experimentally demonstrate a two-dimensional (2D) higher-order topological phase (HOTP), referred to as the multiple chiral topological phase (MCTP), which is protected by a multipole chiral number (MCN). Our realization differs from previous HOTPs in that it possesse… ▽ More Higher-order topological band theory has transformed the landscape of topological phases in quantum and classical systems. Here, we experimentally demonstrate a two-dimensional (2D) higher-order topological phase (HOTP), referred to as the multiple chiral topological phase (MCTP), which is protected by a multipole chiral number (MCN). Our realization differs from previous HOTPs in that it possesses a larger-than-unity MCN, which arises when the nearest-neighbor couplings (NNCs) are weaker than long-range couplings (LRCs). Our phase has an MCN of 4, protecting the existence of 4 mid-gap topological corner modes (TCMs) at each corner. The multiple TCMs demonstrated here could lead to enhanced quantum-inspired devices for sensing and computing. Our study also highlights the rich and untapped potential of LRC manipulation for future research in topological phases. △ Less

Submitted 12 October, 2023; v1 submitted 14 May, 2023; originally announced May 2023.

arXiv:2304.14593 [pdf, other]

Deep Graph Reprogramming

Authors: Yongcheng Jing, Chongbin Yuan, Li Ju, Yiding Yang, Xinchao Wang, Dacheng Tao

Abstract: In this paper, we explore a novel model reusing task tailored for graph neural networks (GNNs), termed as "deep graph reprogramming". We strive to reprogram a pre-trained GNN, without amending raw node features nor model parameters, to handle a bunch of cross-level downstream tasks in various domains. To this end, we propose an innovative Data Reprogramming paradigm alongside a Model Reprogramming… ▽ More In this paper, we explore a novel model reusing task tailored for graph neural networks (GNNs), termed as "deep graph reprogramming". We strive to reprogram a pre-trained GNN, without amending raw node features nor model parameters, to handle a bunch of cross-level downstream tasks in various domains. To this end, we propose an innovative Data Reprogramming paradigm alongside a Model Reprogramming paradigm. The former one aims to address the challenge of diversified graph feature dimensions for various tasks on the input side, while the latter alleviates the dilemma of fixed per-task-per-model behavior on the model side. For data reprogramming, we specifically devise an elaborated Meta-FeatPadding method to deal with heterogeneous input dimensions, and also develop a transductive Edge-Slimming as well as an inductive Meta-GraPadding approach for diverse homogenous samples. Meanwhile, for model reprogramming, we propose a novel task-adaptive Reprogrammable-Aggregator, to endow the frozen model with larger expressive capacities in handling cross-domain tasks. Experiments on fourteen datasets across node/graph classification/regression, 3D object recognition, and distributed action recognition, demonstrate that the proposed methods yield gratifying results, on par with those by re-training from scratch. △ Less

Submitted 27 April, 2023; originally announced April 2023.

Comments: CVPR 2023 Highlight

Showing 1–50 of 447 results for author: Jing, Y