subscribe to arXiv mailings

Turbo: Informativity-Driven Acceleration Plug-In for Vision-Language Large Models

Authors: Chen Ju, Haicheng Wang, Haozhe Cheng, Xu Chen, Zhonghua Zhai, Weilin Huang, Jinsong Lan, Shuai Xiao, Bo Zheng

Abstract: Vision-Language Large Models (VLMs) recently become primary backbone of AI, due to the impressive performance. However, their expensive computation costs, i.e., throughput and delay, impede potentials in the real-world scenarios. To achieve acceleration for VLMs, most existing methods focus on the model perspective: pruning, distillation, quantization, but completely overlook the data-perspective… ▽ More Vision-Language Large Models (VLMs) recently become primary backbone of AI, due to the impressive performance. However, their expensive computation costs, i.e., throughput and delay, impede potentials in the real-world scenarios. To achieve acceleration for VLMs, most existing methods focus on the model perspective: pruning, distillation, quantization, but completely overlook the data-perspective redundancy. To fill the overlook, this paper pioneers the severity of data redundancy, and designs one plug-and-play Turbo module guided by information degree to prune inefficient tokens from visual or textual data. In pursuit of efficiency-performance trade-offs, information degree takes two crucial factors into consideration: mutual redundancy and semantic value. Concretely, the former evaluates data duplication between sequential tokens; while the latter evaluates each token by its contribution to the overall semantics. As a result, tokens with high information degree carry less redundancy and stronger semantics. For VLMs' calculation, Turbo works as a user-friendly plug-in that sorts data referring to information degree, utilizing only top-level ones to save costs. Its advantages are multifaceted, e.g., being generally compatible to various VLMs across understanding and generation, simple use without re-training and trivial engineering efforts. On multiple VLMs benchmarks, we fully experiment to demonstrate the good acceleration of Turbo, under negligible performance drop. △ Less

Submitted 16 July, 2024; originally announced July 2024.

Comments: ECCV 2024. The first two authors share the same contribution. arXiv admin note: substantial text overlap with arXiv:2312.07408

arXiv:2405.16598 [pdf, other]

Regularized Projection Matrix Approximation with Applications to Community Detection

Authors: Zheng Zhai, Mingxin Wu, Xiaohui Li

Abstract: This paper introduces a regularized projection matrix approximation framework aimed at recovering cluster information from the affinity matrix. The model is formulated as a projection approximation problem incorporating an entrywise penalty function. We explore three distinct penalty functions addressing bounded, positive, and sparse scenarios, respectively, and derive the Alternating Direction Me… ▽ More This paper introduces a regularized projection matrix approximation framework aimed at recovering cluster information from the affinity matrix. The model is formulated as a projection approximation problem incorporating an entrywise penalty function. We explore three distinct penalty functions addressing bounded, positive, and sparse scenarios, respectively, and derive the Alternating Direction Method of Multipliers (ADMM) algorithm to solve the problem. Then, we provide a theoretical analysis establishing the convergence properties of the proposed algorithm. Extensive numerical experiments on both synthetic and real-world datasets demonstrate that our regularized projection matrix approximation approach significantly outperforms state-of-the-art methods in terms of clustering performance. △ Less

Submitted 26 May, 2024; originally announced May 2024.

arXiv:2405.13495 [pdf, other]

Euclid. V. The Flagship galaxy mock catalogue: a comprehensive simulation for the Euclid mission

Authors: Euclid Collaboration, F. J. Castander, P. Fosalba, J. Stadel, D. Potter, J. Carretero, P. Tallada-Crespí, L. Pozzetti, M. Bolzonella, G. A. Mamon, L. Blot, K. Hoffmann, M. Huertas-Company, P. Monaco, E. J. Gonzalez, G. De Lucia, C. Scarlata, M. -A. Breton, L. Linke, C. Viglione, S. -S. Li, Z. Zhai, Z. Baghkhani, K. Pardede, C. Neissner , et al. (344 additional authors not shown)

Abstract: We present the Flagship galaxy mock, a simulated catalogue of billions of galaxies designed to support the scientific exploitation of the Euclid mission. Euclid is a medium-class mission of the European Space Agency optimised to determine the properties of dark matter and dark energy on the largest scales of the Universe. It probes structure formation over more than 10 billion years primarily from… ▽ More We present the Flagship galaxy mock, a simulated catalogue of billions of galaxies designed to support the scientific exploitation of the Euclid mission. Euclid is a medium-class mission of the European Space Agency optimised to determine the properties of dark matter and dark energy on the largest scales of the Universe. It probes structure formation over more than 10 billion years primarily from the combination of weak gravitational lensing and galaxy clustering data. The breath of Euclid's data will also foster a wide variety of scientific analyses. The Flagship simulation was developed to provide a realistic approximation to the galaxies that will be observed by Euclid and used in its scientific analyses. We ran a state-of-the-art N-body simulation with four trillion particles, producing a lightcone on the fly. From the dark matter particles, we produced a catalogue of 16 billion haloes in one octant of the sky in the lightcone up to redshift z=3. We then populated these haloes with mock galaxies using a halo occupation distribution and abundance matching approach, calibrating the free parameters of the galaxy mock against observed correlations and other basic galaxy properties. Modelled galaxy properties include luminosity and flux in several bands, redshifts, positions and velocities, spectral energy distributions, shapes and sizes, stellar masses, star formation rates, metallicities, emission line fluxes, and lensing properties. We selected a final sample of 3.4 billion galaxies with a magnitude cut of H_E<26, where we are complete. We have performed a comprehensive set of validation tests to check the similarity to observational data and theoretical models. In particular, our catalogue is able to closely reproduce the main characteristics of the weak lensing and galaxy clustering samples to be used in the mission's main cosmological analysis. (abridged) △ Less

Submitted 22 May, 2024; originally announced May 2024.

Comments: Paper submitted as part of the A&A special issue `Euclid on Sky', which contains Euclid key reference papers and first results from the Euclid Early Release Observations

arXiv:2404.17571 [pdf, other]

Tunnel Try-on: Excavating Spatial-temporal Tunnels for High-quality Virtual Try-on in Videos

Authors: Zhengze Xu, Mengting Chen, Zhao Wang, Linyu Xing, Zhonghua Zhai, Nong Sang, Jinsong Lan, Shuai Xiao, Changxin Gao

Abstract: Video try-on is a challenging task and has not been well tackled in previous works. The main obstacle lies in preserving the details of the clothing and modeling the coherent motions simultaneously. Faced with those difficulties, we address video try-on by proposing a diffusion-based framework named "Tunnel Try-on." The core idea is excavating a "focus tunnel" in the input video that gives close-u… ▽ More Video try-on is a challenging task and has not been well tackled in previous works. The main obstacle lies in preserving the details of the clothing and modeling the coherent motions simultaneously. Faced with those difficulties, we address video try-on by proposing a diffusion-based framework named "Tunnel Try-on." The core idea is excavating a "focus tunnel" in the input video that gives close-up shots around the clothing regions. We zoom in on the region in the tunnel to better preserve the fine details of the clothing. To generate coherent motions, we first leverage the Kalman filter to construct smooth crops in the focus tunnel and inject the position embedding of the tunnel into attention layers to improve the continuity of the generated videos. In addition, we develop an environment encoder to extract the context information outside the tunnels as supplementary cues. Equipped with these techniques, Tunnel Try-on keeps the fine details of the clothing and synthesizes stable and smooth videos. Demonstrating significant advancements, Tunnel Try-on could be regarded as the first attempt toward the commercial-level application of virtual try-on in videos. △ Less

Submitted 26 April, 2024; originally announced April 2024.

Comments: Project Page: https://mengtingchen.github.io/tunnel-try-on-page/

arXiv:2404.06395 [pdf, other]

MiniCPM: Unveiling the Potential of Small Language Models with Scalable Training Strategies

Authors: Shengding Hu, Yuge Tu, Xu Han, Chaoqun He, Ganqu Cui, Xiang Long, Zhi Zheng, Yewei Fang, Yuxiang Huang, Weilin Zhao, Xinrong Zhang, Zheng Leng Thai, Kaihuo Zhang, Chongyi Wang, Yuan Yao, Chenyang Zhao, Jie Zhou, Jie Cai, Zhongwu Zhai, Ning Ding, Chao Jia, Guoyang Zeng, Dahai Li, Zhiyuan Liu, Maosong Sun

Abstract: The burgeoning interest in developing Large Language Models (LLMs) with up to trillion parameters has been met with concerns regarding resource efficiency and practical expense, particularly given the immense cost of experimentation. This scenario underscores the importance of exploring the potential of Small Language Models (SLMs) as a resource-efficient alternative. In this context, we introduce… ▽ More The burgeoning interest in developing Large Language Models (LLMs) with up to trillion parameters has been met with concerns regarding resource efficiency and practical expense, particularly given the immense cost of experimentation. This scenario underscores the importance of exploring the potential of Small Language Models (SLMs) as a resource-efficient alternative. In this context, we introduce MiniCPM, specifically the 1.2B and 2.4B non-embedding parameter variants, not only excel in their respective categories but also demonstrate capabilities on par with 7B-13B LLMs. While focusing on SLMs, our approach exhibits scalability in both model and data dimensions for future LLM research. Regarding model scaling, we employ extensive model wind tunnel experiments for stable and optimal scaling. For data scaling, we introduce a Warmup-Stable-Decay (WSD) learning rate scheduler (LRS), conducive to continuous training and domain adaptation. We present an in-depth analysis of the intriguing training dynamics that occurred in the WSD LRS. With WSD LRS, we are now able to efficiently study data-model scaling law without extensive retraining experiments on both axes of model and data, from which we derive the much higher compute optimal data-model ratio than Chinchilla Optimal. Additionally, we introduce MiniCPM family, including MiniCPM-DPO, MiniCPM-MoE and MiniCPM-128K, whose excellent performance further cementing MiniCPM's foundation in diverse SLM applications. MiniCPM models are available publicly at https://github.com/OpenBMB/MiniCPM . △ Less

Submitted 3 June, 2024; v1 submitted 9 April, 2024; originally announced April 2024.

Comments: revise according to peer review

arXiv:2404.00629 [pdf, other]

Against The Achilles' Heel: A Survey on Red Teaming for Generative Models

Authors: Lizhi Lin, Honglin Mu, Zenan Zhai, Minghan Wang, Yuxia Wang, Renxi Wang, Junjie Gao, Yixuan Zhang, Wanxiang Che, Timothy Baldwin, Xudong Han, Haonan Li

Abstract: Generative models are rapidly gaining popularity and being integrated into everyday applications, raising concerns over their safety issues as various vulnerabilities are exposed. Faced with the problem, the field of red teaming is experiencing fast-paced growth, which highlights the need for a comprehensive organization covering the entire pipeline and addressing emerging topics for the community… ▽ More Generative models are rapidly gaining popularity and being integrated into everyday applications, raising concerns over their safety issues as various vulnerabilities are exposed. Faced with the problem, the field of red teaming is experiencing fast-paced growth, which highlights the need for a comprehensive organization covering the entire pipeline and addressing emerging topics for the community. Our extensive survey, which examines over 120 papers, introduces a taxonomy of fine-grained attack strategies grounded in the inherent capabilities of language models. Additionally, we have developed the searcher framework that unifies various automatic red teaming approaches. Moreover, our survey covers novel areas including multimodal attacks and defenses, risks around multilingual models, overkill of harmless queries, and safety of downstream applications. We hope this survey can provide a systematic perspective on the field and unlock new areas of research. △ Less

Submitted 31 March, 2024; originally announced April 2024.

arXiv:2403.15082

Cell Variational Information Bottleneck Network

Authors: Zhonghua Zhai, Chen Ju, Jinsong Lan, Shuai Xiao

Abstract: In this work, we propose Cell Variational Information Bottleneck Network (cellVIB), a convolutional neural network using information bottleneck mechanism, which can be combined with the latest feedforward network architecture in an end-to-end training method. Our Cell Variational Information Bottleneck Network is constructed by stacking VIB cells, which generate feature maps with uncertainty. As l… ▽ More In this work, we propose Cell Variational Information Bottleneck Network (cellVIB), a convolutional neural network using information bottleneck mechanism, which can be combined with the latest feedforward network architecture in an end-to-end training method. Our Cell Variational Information Bottleneck Network is constructed by stacking VIB cells, which generate feature maps with uncertainty. As layers going deeper, the regularization effect will gradually increase, instead of directly adding excessive regular constraints to the output layer of the model as in Deep VIB. Under each VIB cell, the feedforward process learns an independent mean term and an standard deviation term, and predicts the Gaussian distribution based on them. The feedback process is based on reparameterization trick for effective training. This work performs an extensive analysis on MNIST dataset to verify the effectiveness of each VIB cells, and provides an insightful analysis on how the VIB cells affect mutual information. Experiments conducted on CIFAR-10 also prove that our cellVIB is robust against noisy labels during training and against corrupted images during testing. Then, we validate our method on PACS dataset, whose results show that the VIB cells can significantly improve the generalization performance of the basic model. Finally, in a more complex representation learning task, face recognition, our network structure has also achieved very competitive results. △ Less

Submitted 29 March, 2024; v1 submitted 22 March, 2024; originally announced March 2024.

Comments: Found errors in the article, therefore postponing publication for now

arXiv:2403.12965 [pdf, other]

Wear-Any-Way: Manipulable Virtual Try-on via Sparse Correspondence Alignment

Authors: Mengting Chen, Xi Chen, Zhonghua Zhai, Chen Ju, Xuewen Hong, Jinsong Lan, Shuai Xiao

Abstract: This paper introduces a novel framework for virtual try-on, termed Wear-Any-Way. Different from previous methods, Wear-Any-Way is a customizable solution. Besides generating high-fidelity results, our method supports users to precisely manipulate the wearing style. To achieve this goal, we first construct a strong pipeline for standard virtual try-on, supporting single/multiple garment try-on and… ▽ More This paper introduces a novel framework for virtual try-on, termed Wear-Any-Way. Different from previous methods, Wear-Any-Way is a customizable solution. Besides generating high-fidelity results, our method supports users to precisely manipulate the wearing style. To achieve this goal, we first construct a strong pipeline for standard virtual try-on, supporting single/multiple garment try-on and model-to-model settings in complicated scenarios. To make it manipulable, we propose sparse correspondence alignment which involves point-based control to guide the generation for specific locations. With this design, Wear-Any-Way gets state-of-the-art performance for the standard setting and provides a novel interaction form for customizing the wearing style. For instance, it supports users to drag the sleeve to make it rolled up, drag the coat to make it open, and utilize clicks to control the style of tuck, etc. Wear-Any-Way enables more liberated and flexible expressions of the attires, holding profound implications in the fashion industry. △ Less

Submitted 19 March, 2024; originally announced March 2024.

Comments: Project Page: https://mengtingchen.github.io/wear-any-way-page/

arXiv:2403.10754 [pdf, other]

doi 10.1093/mnras/stae762

CSST large-scale structure analysis pipeline: I. constructing reference mock galaxy redshift surveys

Authors: Yizhou Gu, Xiaohu Yang, Jiaxin Han, Yirong Wang, Qingyang Li, Zhenlin Tan, Wenkang Jiang, Yaru Wang, Jiaqi Wang, Antonios Katsianis, Xiaoju Xu, Haojie Xu, Wensheng Hong, Houjun Mo, Run Wen, Xianzhong Zheng, Feng Shi, Pengjie Zhang, Zhongxu Zhai, Chengze Liu, Wenting Wang, Ying Zu, Hong Guo, Youcai Zhang, Yi Lu , et al. (7 additional authors not shown)

Abstract: In this paper, we set out to construct a set of reference mock galaxy redshift surveys (MGRSs) for the future Chinese Space-station Survey Telescope (CSST) observation, where subsequent survey selection effects can be added and evaluated. This set of MGRSs is generated using the dark matter subhalos extracted from a high-resolution Jiutian $N$-body simulation of the standard $Λ$CDM cosmogony with… ▽ More In this paper, we set out to construct a set of reference mock galaxy redshift surveys (MGRSs) for the future Chinese Space-station Survey Telescope (CSST) observation, where subsequent survey selection effects can be added and evaluated. This set of MGRSs is generated using the dark matter subhalos extracted from a high-resolution Jiutian $N$-body simulation of the standard $Λ$CDM cosmogony with $Ω_m=0.3111$, $Ω_Λ=0.6889$, and $σ_8=0.8102$. The simulation has a boxsize of $1~h^{-1} {\rm Gpc}$, and consists of $6144^3$ particles with mass resolution $3.723 \times 10^{8} h^{-1} M_\odot $. In order to take into account the effect of redshift evolution, we first use all 128 snapshots in the Jiutian simulation to generate a light-cone halo/subhalo catalog. Next, galaxy luminosities are assigned to the main and subhalo populations using the subhalo abundance matching (SHAM) method with the DESI $z$-band luminosity functions at different redshifts. Multi-band photometries, as well as images, are then assigned to each mock galaxy using a 3-dimensional parameter space nearest neighbor sampling of the DESI LS observational galaxies and groups. Finally, the CSST and DESI LS survey geometry and magnitude limit cuts are applied to generate the required MGRSs. As we have checked, this set of MGRSs can generally reproduce the observed galaxy luminosity/mass functions within 0.1 dex for galaxies with $L > 10^8 L_\odot$ (or $M_* > 10^{8.5} M_\odot$) and within 1-$σ$ level for galaxies with $L < 10^8L_\odot$ (or $M_* < 10^{8.5} M_\odot$). Together with the CSST slitless spectra and redshifts for our DESI LS seed galaxies that are under construction, we will set out to test various slitless observational selection effects in subsequent probes. △ Less

Submitted 15 March, 2024; originally announced March 2024.

Comments: 13 pages, 9 figures, accepted for publication in MNRAS

arXiv:2403.06653 [pdf, other]

UAV-Enabled Asynchronous Federated Learning

Authors: Zhiyuan Zhai, Xiaojun Yuan, Xin Wang, Huiyuan Yang

Abstract: To exploit unprecedented data generation in mobile edge networks, federated learning (FL) has emerged as a promising alternative to the conventional centralized machine learning (ML). However, there are some critical challenges for FL deployment. One major challenge called straggler issue severely limits FL's coverage where the device with the weakest channel condition becomes the bottleneck o… ▽ More To exploit unprecedented data generation in mobile edge networks, federated learning (FL) has emerged as a promising alternative to the conventional centralized machine learning (ML). However, there are some critical challenges for FL deployment. One major challenge called straggler issue severely limits FL's coverage where the device with the weakest channel condition becomes the bottleneck of the model aggregation performance. Besides, the huge uplink communication overhead compromises the effectiveness of FL, which is particularly pronounced in large-scale systems. To address the straggler issue, we propose the integration of an unmanned aerial vehicle (UAV) as the parameter server (UAV-PS) to coordinate the FL implementation. We further employ over-the-air computation technique that leverages the superposition property of wireless channels for efficient uplink communication. Specifically, in this paper, we develop a novel UAV-enabled over-the-air asynchronous FL (UAV-AFL) framework which supports the UAV-PS in updating the model continuously to enhance the learning performance. Moreover, we conduct a convergence analysis to quantitatively capture the impact of model asynchrony, device selection and communication errors on the UAV-AFL learning performance. Based on this, a unified communication-learning problem is formulated to maximize asymptotical learning performance by optimizing the UAV-PS trajectory, device selection and over-the-air transceiver design. Simulation results demonstrate that the proposed scheme achieves substantially learning efficiency improvement compared with the state-of-the-art approaches. △ Less

Submitted 11 March, 2024; originally announced March 2024.

arXiv:2402.14877 [pdf, other]

Machine-learning prediction of tipping and collapse of the Atlantic Meridional Overturning Circulation

Authors: Shirin Panahi, Ling-Wei Kong, Mohammadamin Moradi, Zheng-Meng Zhai, Bryan Glaz, Mulugeta Haile, Ying-Cheng Lai

Abstract: Recent research on the Atlantic Meridional Overturning Circulation (AMOC) raised concern about its potential collapse through a tipping point due to the climate-change caused increase in the freshwater input into the North Atlantic. The predicted time window of collapse is centered about the middle of the century and the earliest possible start is approximately two years from now. More generally,… ▽ More Recent research on the Atlantic Meridional Overturning Circulation (AMOC) raised concern about its potential collapse through a tipping point due to the climate-change caused increase in the freshwater input into the North Atlantic. The predicted time window of collapse is centered about the middle of the century and the earliest possible start is approximately two years from now. More generally, anticipating a tipping point at which the system transitions from one stable steady state to another is relevant to a broad range of fields. We develop a machine-learning approach to predicting tipping in noisy dynamical systems with a time-varying parameter and test it on a number of systems including the AMOC, ecological networks, an electrical power system, and a climate model. For the AMOC, our prediction based on simulated fingerprint data and real data of the sea surface temperature places the time window of a potential collapse between the years 2040 and 2065. △ Less

Submitted 21 February, 2024; originally announced February 2024.

Comments: 6 pages, 3 figures

arXiv:2402.14131 [pdf, other]

doi 10.1063/5.0189564

Random forests for detecting weak signals and extracting physical information: a case study of magnetic navigation

Authors: Mohammadamin Moradi, Zheng-Meng Zhai, Aaron Nielsen, Ying-Cheng Lai

Abstract: It was recently demonstrated that two machine-learning architectures, reservoir computing and time-delayed feed-forward neural networks, can be exploited for detecting the Earth's anomaly magnetic field immersed in overwhelming complex signals for magnetic navigation in a GPS-denied environment. The accuracy of the detected anomaly field corresponds to a positioning accuracy in the range of 10 to… ▽ More It was recently demonstrated that two machine-learning architectures, reservoir computing and time-delayed feed-forward neural networks, can be exploited for detecting the Earth's anomaly magnetic field immersed in overwhelming complex signals for magnetic navigation in a GPS-denied environment. The accuracy of the detected anomaly field corresponds to a positioning accuracy in the range of 10 to 40 meters. To increase the accuracy and reduce the uncertainty of weak signal detection as well as to directly obtain the position information, we exploit the machine-learning model of random forests that combines the output of multiple decision trees to give optimal values of the physical quantities of interest. In particular, from time-series data gathered from the cockpit of a flying airplane during various maneuvering stages, where strong background complex signals are caused by other elements of the Earth's magnetic field and the fields produced by the electronic systems in the cockpit, we demonstrate that the random-forest algorithm performs remarkably well in detecting the weak anomaly field and in filtering the position of the aircraft. With the aid of the conventional inertial navigation system, the positioning error can be reduced to less than 10 meters. We also find that, contrary to the conventional wisdom, the classic Tolles-Lawson model for calibrating and removing the magnetic field generated by the body of the aircraft is not necessary and may even be detrimental for the success of the random-forest method. △ Less

Submitted 21 February, 2024; originally announced February 2024.

Comments: 12 pages, 11 figures

Journal ref: APL Machine Learning 2 (1), 016118 (2024)

arXiv:2402.12193 [pdf, other]

A Chinese Dataset for Evaluating the Safeguards in Large Language Models

Authors: Yuxia Wang, Zenan Zhai, Haonan Li, Xudong Han, Lizhi Lin, Zhenxuan Zhang, Jingru Zhao, Preslav Nakov, Timothy Baldwin

Abstract: Many studies have demonstrated that large language models (LLMs) can produce harmful responses, exposing users to unexpected risks when LLMs are deployed. Previous studies have proposed comprehensive taxonomies of the risks posed by LLMs, as well as corresponding prompts that can be used to examine the safety mechanisms of LLMs. However, the focus has been almost exclusively on English, and little… ▽ More Many studies have demonstrated that large language models (LLMs) can produce harmful responses, exposing users to unexpected risks when LLMs are deployed. Previous studies have proposed comprehensive taxonomies of the risks posed by LLMs, as well as corresponding prompts that can be used to examine the safety mechanisms of LLMs. However, the focus has been almost exclusively on English, and little has been explored for other languages. Here we aim to bridge this gap. We first introduce a dataset for the safety evaluation of Chinese LLMs, and then extend it to two other scenarios that can be used to better identify false negative and false positive examples in terms of risky prompt rejections. We further present a set of fine-grained safety assessment criteria for each risk type, facilitating both manual annotation and automatic evaluation in terms of LLM response harmfulness. Our experiments on five LLMs show that region-specific risks are the prevalent type of risk, presenting the major issue with all Chinese LLMs we experimented with. Our data is available at https://github.com/Libr-AI/do-not-answer. Warning: this paper contains example data that may be offensive, harmful, or biased. △ Less

Submitted 26 May, 2024; v1 submitted 19 February, 2024; originally announced February 2024.

Comments: 14 pages

arXiv:2312.07408 [pdf, other]

Turbo: Informativity-Driven Acceleration Plug-In for Vision-Language Models

Authors: Chen Ju, Haicheng Wang, Zeqian Li, Xu Chen, Zhonghua Zhai, Weilin Huang, Shuai Xiao

Abstract: Vision-Language Large Models (VLMs) have become primary backbone of AI, due to the impressive performance. However, their expensive computation costs, i.e., throughput and delay, impede potentials in real-world scenarios. To achieve acceleration for VLMs, most existing methods focus on the model perspective: pruning, distillation, quantification, but completely overlook the data-perspective redund… ▽ More Vision-Language Large Models (VLMs) have become primary backbone of AI, due to the impressive performance. However, their expensive computation costs, i.e., throughput and delay, impede potentials in real-world scenarios. To achieve acceleration for VLMs, most existing methods focus on the model perspective: pruning, distillation, quantification, but completely overlook the data-perspective redundancy. To fill the overlook, this paper pioneers the severity of data redundancy, and designs one plug-and-play Turbo module guided by information degree to prune inefficient tokens from visual or textual data. In pursuit of efficiency-performance trade-offs, information degree takes two key factors into consideration: mutual redundancy and semantic value. Concretely, the former evaluates the data duplication between sequential tokens; while the latter evaluates each token by its contribution to the overall semantics. As a result, tokens with high information degree carry less redundancy and stronger semantics. For VLMs' calculation, Turbo works as a user-friendly plug-in that sorts data referring to information degree, utilizing only top-level ones to save costs. Its advantages are multifaceted, e.g., being generally compatible to various VLMs across understanding and generation, simple use without retraining and trivial engineering efforts. On multiple public VLMs benchmarks, we conduct extensive experiments to reveal the gratifying acceleration of Turbo, under negligible performance drop. △ Less

Submitted 12 December, 2023; originally announced December 2023.

arXiv:2312.04889 [pdf, other]

KwaiAgents: Generalized Information-seeking Agent System with Large Language Models

Authors: Haojie Pan, Zepeng Zhai, Hao Yuan, Yaojia Lv, Ruiji Fu, Ming Liu, Zhongyuan Wang, Bing Qin

Abstract: Driven by curiosity, humans have continually sought to explore and understand the world around them, leading to the invention of various tools to satiate this inquisitiveness. Despite not having the capacity to process and memorize vast amounts of information in their brains, humans excel in critical thinking, planning, reflection, and harnessing available tools to interact with and interpret the… ▽ More Driven by curiosity, humans have continually sought to explore and understand the world around them, leading to the invention of various tools to satiate this inquisitiveness. Despite not having the capacity to process and memorize vast amounts of information in their brains, humans excel in critical thinking, planning, reflection, and harnessing available tools to interact with and interpret the world, enabling them to find answers efficiently. The recent advancements in large language models (LLMs) suggest that machines might also possess the aforementioned human-like capabilities, allowing them to exhibit powerful abilities even with a constrained parameter count. In this paper, we introduce KwaiAgents, a generalized information-seeking agent system based on LLMs. Within KwaiAgents, we propose an agent system that employs LLMs as its cognitive core, which is capable of understanding a user's query, behavior guidelines, and referencing external documents. The agent can also update and retrieve information from its internal memory, plan and execute actions using a time-aware search-browse toolkit, and ultimately provide a comprehensive response. We further investigate the system's performance when powered by LLMs less advanced than GPT-4, and introduce the Meta-Agent Tuning (MAT) framework, designed to ensure even an open-sourced 7B or 13B model performs well among many agent systems. We exploit both benchmark and human evaluations to systematically validate these capabilities. Extensive experiments show the superiority of our agent system compared to other autonomous agents and highlight the enhanced generalized agent-abilities of our fine-tuned LLMs. △ Less

Submitted 10 January, 2024; v1 submitted 8 December, 2023; originally announced December 2023.

arXiv:2311.09142 [pdf, other]

Machine-learning parameter tracking with partial state observation

Authors: Zheng-Meng Zhai, Mohammadamin Moradi, Bryan Glaz, Mulugeta Haile, Ying-Cheng Lai

Abstract: Complex and nonlinear dynamical systems often involve parameters that change with time, accurate tracking of which is essential to tasks such as state estimation, prediction, and control. Existing machine-learning methods require full state observation of the underlying system and tacitly assume adiabatic changes in the parameter. Formulating an inverse problem and exploiting reservoir computing,… ▽ More Complex and nonlinear dynamical systems often involve parameters that change with time, accurate tracking of which is essential to tasks such as state estimation, prediction, and control. Existing machine-learning methods require full state observation of the underlying system and tacitly assume adiabatic changes in the parameter. Formulating an inverse problem and exploiting reservoir computing, we develop a model-free and fully data-driven framework to accurately track time-varying parameters from partial state observation in real time. In particular, with training data from a subset of the dynamical variables of the system for a small number of known parameter values, the framework is able to accurately predict the parameter variations in time. Low- and high-dimensional, Markovian and non-Markovian nonlinear dynamical systems are used to demonstrate the power of the machine-learning based parameter-tracking framework. Pertinent issues affecting the tracking performance are addressed. △ Less

Submitted 15 November, 2023; originally announced November 2023.

Comments: 5 pages, 4 figures

arXiv:2311.05848 [pdf, other]

Emulating power spectra for pre- and post-reconstructed galaxy samples

Authors: Yuting Wang, Ruiyang Zhao, Zhongxu Zhai, Kazuya Koyama, Will J. Percival, Hong Guo, Yin Li, Gong-Bo Zhao, Takahiro Nishimichi, Héctor Gil-Marín, Yonghao Feng, Hanyu Zhang, Yi Wu

Abstract: The small-scale linear information in galaxy samples typically lost during non-linear growth can be restored to a certain level by the density field reconstruction, which has been demonstrated for improving the precision of the baryon acoustic oscillations (BAO) measurements. As proposed in the literature, a joint analysis of the power spectrum before and after the reconstruction enables an effici… ▽ More The small-scale linear information in galaxy samples typically lost during non-linear growth can be restored to a certain level by the density field reconstruction, which has been demonstrated for improving the precision of the baryon acoustic oscillations (BAO) measurements. As proposed in the literature, a joint analysis of the power spectrum before and after the reconstruction enables an efficient extraction of information carried by high-order statistics. However, the statistics of the post-reconstruction density field are difficult to model. In this work, we circumvent this issue by developing an accurate emulator for the pre-reconstructed, post-reconstructed, and cross power spectra ($P_{\rm pre}$, $P_{\rm post}$, $P_{\rm cross}$) up to $k=0.5~h~{\rm Mpc^{-1}}$ based on the \textsc{Dark Quest} N-body simulations. The accuracy of the emulator is at percent level, namely, the error of the emulated monopole and quadrupole of the power spectra is less than $1\%$ and $10\%$ of the ground truth, respectively. A fit to an example power spectra using the emulator shows that the constraints on cosmological parameters get largely improved using $P_{\rm pre}$+$P_{\rm post}$+$P_{\rm cross}$ with $k_{\rm max}=0.25~h~{\rm Mpc^{-1}}$, compared to that derived from $P_{\rm pre}$ alone, namely, the constraints on ($Ω_m$, $H_0$, $σ_8$) are tightened by $\sim41 \%-55\%$, and the uncertainties of the derived BAO and RSD parameters ($α_{\perp}$, $α_{||}$, $fσ_8$) shrink by $\sim 28\%-54\%$, respectively. This highlights the complementarity among $P_{\rm pre}$, $P_{\rm post}$ and $P_{\rm cross}$, which demonstrates the efficiency and practicability of a joint $P_{\rm pre}$, $P_{\rm post}$ and $P_{\rm cross}$ analysis for cosmological implications. △ Less

Submitted 25 February, 2024; v1 submitted 9 November, 2023; originally announced November 2023.

Comments: 19 pages, 11 figures, 2 tables; accepted for publication in ApJ

arXiv:2310.10814 [pdf, ps, other]

A saturated 1-system of curves on the surface of genus 3

Authors: Zhaoshen Zhai

Abstract: We construct a saturated system of 33 essential simple closed curves that are pairwise non-homotopic and intersect at most once on the oriented, closed surface of genus 3. We construct a saturated system of 33 essential simple closed curves that are pairwise non-homotopic and intersect at most once on the oriented, closed surface of genus 3. △ Less

Submitted 21 December, 2023; v1 submitted 16 October, 2023; originally announced October 2023.

Comments: Added future work on the maximality of the 1-system

MSC Class: 57K20

arXiv:2310.05075 [pdf, other]

Decentralized Federated Learning via MIMO Over-the-Air Computation: Consensus Analysis and Performance Optimization

Authors: Zhiyuan Zhai, Xiaojun Yuan, Xin Wang

Abstract: Decentralized federated learning (DFL), inherited from distributed optimization, is an emerging paradigm to leverage the explosively growing data from wireless devices in a fully distributed manner.DFL enables joint training of machine learning model under device to device (D2D) communication fashion without the coordination of a parameter server. However, the deployment of wireless DFL is facing… ▽ More Decentralized federated learning (DFL), inherited from distributed optimization, is an emerging paradigm to leverage the explosively growing data from wireless devices in a fully distributed manner.DFL enables joint training of machine learning model under device to device (D2D) communication fashion without the coordination of a parameter server. However, the deployment of wireless DFL is facing some pivotal challenges. Communication is a critical bottleneck due to the required extensive message exchange between neighbor devices to share the learned model. Besides, consensus becomes increasingly difficult as the number of devices grows because there is no available central server to perform coordination. To overcome these difficulties, this paper proposes employing over-the-air computation (Aircomp) to improve communication efficiency by exploiting the superposition property of analog waveform in multi-access channels, and introduce the mixing matrix mechanism to promote consensus using the spectral property of symmetric doubly stochastic matrix. Specifically, we develop a novel multiple-input multiple-output over-the-air DFL (MIMO OA-DFL) framework to study over-the-air DFL problem over MIMO multiple access channels. We conduct a general convergence analysis to quantitatively capture the influence of aggregation weight and communication error on the MIMO OA-DFL performance in \emph{ad hoc} networks. The result shows that the communication error together with the spectral gap of mixing matrix has a significant impact on the learning performance. Based on this, a joint communication-learning optimization problem is formulated to optimize transceiver beamformers and mixing matrix. Extensive numerical experiments are performed to reveal the characteristics of different topologies and demonstrate the substantial learning performance enhancement of our proposed algorithm. △ Less

Submitted 8 October, 2023; originally announced October 2023.

arXiv:2309.13518 [pdf, other]

Review of computational methods for estimating cell potency from single-cell RNA-seq data, with a detailed analysis of discrepancies between method description and code implementation

Authors: Qingyang Wang, Zhiqian Zhai, Dongyuan Song, Jingyi Jessica Li

Abstract: In single-cell RNA sequencing (scRNA-seq) data analysis, a critical challenge is to infer hidden dynamic cellular processes from measured static cell snapshots. To tackle this challenge, many computational methods have been developed from distinct perspectives. Besides the common perspectives of inferring trajectories (or pseudotime) and RNA velocity, another important perspective is to estimate t… ▽ More In single-cell RNA sequencing (scRNA-seq) data analysis, a critical challenge is to infer hidden dynamic cellular processes from measured static cell snapshots. To tackle this challenge, many computational methods have been developed from distinct perspectives. Besides the common perspectives of inferring trajectories (or pseudotime) and RNA velocity, another important perspective is to estimate the differentiation potential of cells, which is commonly referred to as "cell potency." In this review, we provide a comprehensive summary of 11 computational methods that estimate cell potency from scRNA-seq data under different assumptions, some of which are even conceptually contradictory. We divide these methods into three categories: mean-based, entropy-based, and correlation-based methods, depending on how a method summarizes gene expression levels of a cell or cell type into a potency measure. Our review focuses on the key similarities and differences of the methods within each category and between the categories, providing a high-level intuition of each method. Moreover, we use a unified set of mathematical notations to detail the 11 methods' methodologies and summarize their usage complexities, including the number of ad-hoc parameters, the number of required inputs, and the existence of discrepancies between the method description in publications and the method implementation in software packages. Realizing the conceptual contradictions of existing methods and the difficulty of fair benchmarking without single-cell-level ground truths, we conclude that accurate estimation of cell potency from scRNA-seq data remains an open challenge. △ Less

Submitted 23 September, 2023; originally announced September 2023.

arXiv:2309.11470 [pdf, other]

doi 10.1038/s41467-023-41379-3

Model-free tracking control of complex dynamical trajectories with machine learning

Authors: Zheng-Meng Zhai, Mohammadamin Moradi, Ling-Wei Kong, Bryan Glaz, Mulugeta Haile, Ying-Cheng Lai

Abstract: Nonlinear tracking control enabling a dynamical system to track a desired trajectory is fundamental to robotics, serving a wide range of civil and defense applications. In control engineering, designing tracking control requires complete knowledge of the system model and equations. We develop a model-free, machine-learning framework to control a two-arm robotic manipulator using only partially obs… ▽ More Nonlinear tracking control enabling a dynamical system to track a desired trajectory is fundamental to robotics, serving a wide range of civil and defense applications. In control engineering, designing tracking control requires complete knowledge of the system model and equations. We develop a model-free, machine-learning framework to control a two-arm robotic manipulator using only partially observed states, where the controller is realized by reservoir computing. Stochastic input is exploited for training, which consists of the observed partial state vector as the first and its immediate future as the second component so that the neural machine regards the latter as the future state of the former. In the testing (deployment) phase, the immediate-future component is replaced by the desired observational vector from the reference trajectory. We demonstrate the effectiveness of the control framework using a variety of periodic and chaotic signals, and establish its robustness against measurement noise, disturbances, and uncertainties. △ Less

Submitted 20 September, 2023; originally announced September 2023.

Comments: 16 pages, 8 figures

Journal ref: Nat Commun 14, 5698 (2023)

arXiv:2309.02408 [pdf, ps, other]

Wiener type regularity for non-linear integro-differential equations

Authors: Shaoguang Shi, Guanglan Wang, Zhichun Zhai

Abstract: The primary purpose of this paper is to study the Wiener-type regularity criteria for non-linear equations driven by integro-differential operators, whose model is the fractional $p-$Laplace equation. In doing so, with the help of tools from potential analysis, such as fractional relative Sobolev capacities, Wiener type integrals, Wolff potentials, $(α,p)-$barriers, and $(α,p)-$balayages, we first… ▽ More The primary purpose of this paper is to study the Wiener-type regularity criteria for non-linear equations driven by integro-differential operators, whose model is the fractional $p-$Laplace equation. In doing so, with the help of tools from potential analysis, such as fractional relative Sobolev capacities, Wiener type integrals, Wolff potentials, $(α,p)-$barriers, and $(α,p)-$balayages, we first prove the characterizations of the fractional thinness and the Perron boundary regularity. Then, we establish a Wiener test and a generalized fractional Wiener criterion. Furthermore, we also prove the continuity of the fractional superharmonic function, the fractional resolutivity, a connection between $(α,p)-$potentials and $(α,p)-$Perron solutions, and the existence of a capacitary function for an arbitrary condenser. △ Less

Submitted 5 September, 2023; originally announced September 2023.

Comments: 27 pages, any comments are welcome

arXiv:2308.11104 [pdf]

doi 10.1002/adfm.202302214

Revealing unusual bandgap shifts with temperature and bandgap renormalization effect in phase-stabilized metal halide perovskite thin films

Authors: Haochen Zhang, Zhixuan Bi, Zehua Zhai, Han Gao, Yuwei Liu, Meiling Jin, Meng Ye, Xuanzhang Li, Haowen Liu, Yuegang Zhang, Xiang Li, Hairen Tan, Yong Xu, Luyi Yang

Abstract: Hybrid organic-inorganic metal halide perovskites are emerging materials in photovoltaics, whose bandgap is one of the most crucial parameters governing their light harvesting performance. Here we present the temperature and photocarrier density dependence of the bandgap in two phase-stabilized perovskite thin films (MA0.3FA0.7PbI3 and MA0.3FA0.7Pb0.5Sn0.5I3) using photoluminescence and absorption… ▽ More Hybrid organic-inorganic metal halide perovskites are emerging materials in photovoltaics, whose bandgap is one of the most crucial parameters governing their light harvesting performance. Here we present the temperature and photocarrier density dependence of the bandgap in two phase-stabilized perovskite thin films (MA0.3FA0.7PbI3 and MA0.3FA0.7Pb0.5Sn0.5I3) using photoluminescence and absorption spectroscopy. Contrasting bandgap shifts with temperature are observed between the two perovskites. Using X-ray diffraction and in situ high-pressure photoluminescence spectroscopy, we show that thermal expansion plays only a minor role in the large bandgap blueshift, which is attributed to the enhanced structural stability of our samples. Our first-principles calculations further demonstrate the significant impact of thermally induced lattice distortions on the bandgap widening. We propose that the anomalous trends are caused by the competition between static and dynamic distortions. Additionally, both the bandgap renormalization and band filling effects are directly observed for the first time in fluence-dependent photoluminescence measurements and are employed to estimate the exciton effective mass. Our results provide new insights into the basic understanding of thermal and charge-accumulation effects on the band structure of hybrid perovskite thin films. △ Less

Submitted 28 November, 2023; v1 submitted 21 August, 2023; originally announced August 2023.

arXiv:2308.03057 [pdf]

doi 10.1021/acs.nanolett.3c01734

Spin Coherence and Spin Relaxation in Hybrid Organic-Inorganic Lead and Mixed Lead-Tin Perovskites

Authors: Haochen Zhang, Zehua Zhai, Zhixuan Bi, Han Gao, Meng Ye, Yong Xu, Hairen Tan, Luyi Yang

Abstract: Metal halide perovskites make up a promising class of materials for semiconductor spintronics. Here we report a systematic investigation of coherent spin precession, spin dephasing and spin relaxation of electrons and holes in two hybrid organic-inorganic perovskites MA0.3FA0.7PbI3 and MA0.3FA0.7Pb0.5Sn0.5I3 using time-resolved Faraday rotation spectroscopy. With applied in-plane magnetic fields,… ▽ More Metal halide perovskites make up a promising class of materials for semiconductor spintronics. Here we report a systematic investigation of coherent spin precession, spin dephasing and spin relaxation of electrons and holes in two hybrid organic-inorganic perovskites MA0.3FA0.7PbI3 and MA0.3FA0.7Pb0.5Sn0.5I3 using time-resolved Faraday rotation spectroscopy. With applied in-plane magnetic fields, we observe robust Larmor spin precession of electrons and holes that persists for hundreds of picoseconds. The spin dephasing and relaxation processes are likely to be sensitive to the defect levels. Temperature-dependent measurements give further insights into the spin relaxation channels. The extracted electron Landé g-factors (3.75 and 4.36) are the biggest among the reported values in inorganic or hybrid perovskites. Both the electron and hole g-factors shift dramatically with temperature, which we propose to originate from thermal lattice vibration effects on the band structure. These results lay the foundation for further design and use of lead- and tin-based perovskites for spintronic applications. △ Less

Submitted 1 September, 2023; v1 submitted 6 August, 2023; originally announced August 2023.

Journal ref: Nano Letters 23, 7917-7920 (2023)

arXiv:2306.05722 [pdf, other]

Ridge Estimation with Nonlinear Transformations

Authors: Zheng Zhai, Hengchao Chen, Zhigang Yao

Abstract: Ridge estimation is an important manifold learning technique. The goal of this paper is to examine the effects of nonlinear transformations on the ridge sets. The main result proves the inclusion relationship between ridges: $\cR(f\circ p)\subseteq \cR(p)$, provided that the transformation $f$ is strictly increasing and concave on the range of the function $p$. Additionally, given an underlying tr… ▽ More Ridge estimation is an important manifold learning technique. The goal of this paper is to examine the effects of nonlinear transformations on the ridge sets. The main result proves the inclusion relationship between ridges: $\cR(f\circ p)\subseteq \cR(p)$, provided that the transformation $f$ is strictly increasing and concave on the range of the function $p$. Additionally, given an underlying true manifold $\cM$, we show that the Hausdorff distance between $\cR(f\circ p)$ and its projection onto $\cM$ is smaller than the Hausdorff distance between $\cR(p)$ and the corresponding projection. This motivates us to apply an increasing and concave transformation before the ridge estimation. In specific, we show that the power transformations $f^{q}(y)=y^q/q,-\infty<q\leq 1$ are increasing and concave on $\RR_+$, and thus we can use such power transformations when $p$ is strictly positive. Numerical experiments demonstrate the advantages of the proposed methods. △ Less

Submitted 4 August, 2023; v1 submitted 9 June, 2023; originally announced June 2023.

arXiv:2305.15430 [pdf, other]

doi 10.1109/LSP.2023.3298282

Bounded Projection Matrix Approximation with Applications to Community Detection

Authors: Zheng Zhai, Hengchao Chen, Qiang Sun

Abstract: Community detection is an important problem in unsupervised learning. This paper proposes to solve a projection matrix approximation problem with an additional entrywise bounded constraint. Algorithmically, we introduce a new differentiable convex penalty and derive an alternating direction method of multipliers (ADMM) algorithm. Theoretically, we establish the convergence properties of the propos… ▽ More Community detection is an important problem in unsupervised learning. This paper proposes to solve a projection matrix approximation problem with an additional entrywise bounded constraint. Algorithmically, we introduce a new differentiable convex penalty and derive an alternating direction method of multipliers (ADMM) algorithm. Theoretically, we establish the convergence properties of the proposed algorithm. Numerical experiments demonstrate the superiority of our algorithm over its competitors, such as the semi-definite relaxation method and spectral clustering. △ Less

Submitted 21 May, 2023; originally announced May 2023.

arXiv:2305.03972 [pdf, other]

Category-Oriented Representation Learning for Image to Multi-Modal Retrieval

Authors: Zida Cheng, Chen Ju, Shuai Xiao, Xu Chen, Zhonghua Zhai, Xiaoyi Zeng, Weilin Huang, Junchi Yan

Abstract: The rise of multi-modal search requests from users has highlighted the importance of multi-modal retrieval (i.e. image-to-text or text-to-image retrieval), yet the more complex task of image-to-multi-modal retrieval, crucial for many industry applications, remains under-explored. To address this gap and promote further research, we introduce and define the concept of Image-to-Multi-Modal Retrieval… ▽ More The rise of multi-modal search requests from users has highlighted the importance of multi-modal retrieval (i.e. image-to-text or text-to-image retrieval), yet the more complex task of image-to-multi-modal retrieval, crucial for many industry applications, remains under-explored. To address this gap and promote further research, we introduce and define the concept of Image-to-Multi-Modal Retrieval (IMMR), a process designed to retrieve rich multi-modal (i.e. image and text) documents based on image queries. We focus on representation learning for IMMR and analyze three key challenges for it: 1) skewed data and noisy label in real-world industrial data, 2) the information-inequality between image and text modality of documents when learning representations, 3) effective and efficient training in large-scale industrial contexts. To tackle the above challenges, we propose a novel framework named organizing categories and learning by classification for retrieval (OCLEAR). It consists of three components: 1) a novel category-oriented data governance scheme coupled with a large-scale classification-based learning paradigm, which handles the skewed and noisy data from a data perspective. 2) model architecture specially designed for multi-modal learning, where information-inequality between image and text modality of documents is considered for modality fusion. 3) a hybrid parallel training approach for tackling large-scale training in industrial scenario. The proposed framework achieves SOTA performance on public datasets and has been deployed in a real-world industrial e-commence system, leading to significant business growth. Code will be made publicly available. △ Less

Submitted 9 June, 2024; v1 submitted 6 May, 2023; originally announced May 2023.

arXiv:2304.04420 [pdf, other]

Feature Representation Learning with Adaptive Displacement Generation and Transformer Fusion for Micro-Expression Recognition

Authors: Zhijun Zhai, Jianhui Zhao, Chengjiang Long, Wenju Xu, Shuangjiang He, Huijuan Zhao

Abstract: Micro-expressions are spontaneous, rapid and subtle facial movements that can neither be forged nor suppressed. They are very important nonverbal communication clues, but are transient and of low intensity thus difficult to recognize. Recently deep learning based methods have been developed for micro-expression (ME) recognition using feature extraction and fusion techniques, however, targeted feat… ▽ More Micro-expressions are spontaneous, rapid and subtle facial movements that can neither be forged nor suppressed. They are very important nonverbal communication clues, but are transient and of low intensity thus difficult to recognize. Recently deep learning based methods have been developed for micro-expression (ME) recognition using feature extraction and fusion techniques, however, targeted feature learning and efficient feature fusion still lack further study according to the ME characteristics. To address these issues, we propose a novel framework Feature Representation Learning with adaptive Displacement Generation and Transformer fusion (FRL-DGT), in which a convolutional Displacement Generation Module (DGM) with self-supervised learning is used to extract dynamic features from onset/apex frames targeted to the subsequent ME recognition task, and a well-designed Transformer Fusion mechanism composed of three Transformer-based fusion modules (local, global fusions based on AU regions and full-face fusion) is applied to extract the multi-level informative features after DGM for the final ME prediction. The extensive experiments with solid leave-one-subject-out (LOSO) evaluation results have demonstrated the superiority of our proposed FRL-DGT to state-of-the-art methods. △ Less

Submitted 10 April, 2023; originally announced April 2023.

arXiv:2303.17095 [pdf, other]

doi 10.1093/mnras/stad1793

Small scale clustering of BOSS galaxies: dependence on luminosity, color, age, stellar mass, specific star formation rate and other properties

Authors: Zhongxu Zhai, Will J. Percival, Hong Guo

Abstract: We measure and analyze galaxy clustering and the dependence on luminosity, color, age, stellar mass and specific star formation rate using Baryon Oscillation Spectroscopic Survey (BOSS) galaxies at $0.48<z<0.62$. We fit the monopole and quadrupole moments of the two-point correlation function (2PCF) and its projection on scales of $0.1$ -- $60.2h^{-1}$Mpc, after having split the catalog in a varie… ▽ More We measure and analyze galaxy clustering and the dependence on luminosity, color, age, stellar mass and specific star formation rate using Baryon Oscillation Spectroscopic Survey (BOSS) galaxies at $0.48<z<0.62$. We fit the monopole and quadrupole moments of the two-point correlation function (2PCF) and its projection on scales of $0.1$ -- $60.2h^{-1}$Mpc, after having split the catalog in a variety of ways. We find that the clustering dependence is consistent with previous well-established results showing the broad trends expected: For example, that brighter, redder, older, more massive and quenched galaxies are more strongly clustered. We also investigate the dependence on additional parameters previously derived from stellar population synthesis model fits to the spectra. We find that galaxy clustering depends on look-back formation time at a low level, while it has little dependence on metallicity. To understand the physics behind these trends, we fit the clustering with a simulation-based emulator to simultaneously model cosmology and galaxy bias using a Halo Occupation Distribution framework. After marginalizing parameters determining the background cosmology, galaxy bias, and a scaling parameter to decouple halo velocity field, we find that the growth rate of large scale structure as determined by the redshift-space distortions is consistent with previous analysis using the full sample, and we do not find evidence that cosmological constraints depend systematically on galaxy selection. This demonstrates that cosmological inference using small scale clustering measurements is robust to changes in the catalog selection. △ Less

Submitted 17 July, 2023; v1 submitted 29 March, 2023; originally announced March 2023.

Comments: 17 pages, 13+2 figures, comments welcome, updated to match the published version

arXiv:2303.09762 [pdf, other]

doi 10.1088/1475-7516/2023/07/054

Aemulus $ν$: Precise Predictions for Matter and Biased Tracer Power Spectra in the Presence of Neutrinos

Authors: Joseph DeRose, Nickolas Kokron, Arka Banerjee, Shi-Fan Chen, Martin White, Risa Wechsler, Kate Storey-Fisher, Jeremy Tinker, Zhongxu Zhai

Abstract: We present the Aemulus $ν$ simulations: a suite of 150 $(1.05 h^{-1}\rm Gpc)^3$ $N$-body simulations with a mass resolution of $3.51\times 10^{10} \frac{Ω_{cb}}{0.3} ~ h^{-1} M_{\odot}$ in a $wν$CDM cosmological parameter space. The simulations have been explicitly designed to span a broad range in $σ_8$ to facilitate investigations of tension between large scale structure and cosmic microwave bac… ▽ More We present the Aemulus $ν$ simulations: a suite of 150 $(1.05 h^{-1}\rm Gpc)^3$ $N$-body simulations with a mass resolution of $3.51\times 10^{10} \frac{Ω_{cb}}{0.3} ~ h^{-1} M_{\odot}$ in a $wν$CDM cosmological parameter space. The simulations have been explicitly designed to span a broad range in $σ_8$ to facilitate investigations of tension between large scale structure and cosmic microwave background cosmological probes. Neutrinos are treated as a second particle species to ensure accuracy to $0.5\, \rm eV$, the maximum neutrino mass that we have simulated. By employing Zel'dovich control variates, we increase the effective volume of our simulations by factors of $10-10^5$ depending on the statistic in question. As a first application of these simulations, we build new hybrid effective field theory and matter power spectrum surrogate models, demonstrating that they achieve $\le 1\%$ accuracy for $k\le 1\, h\,\rm Mpc^{-1}$ and $0\le z \le 3$, and $\le 2\%$ accuracy for $k\le 4\, h\,\rm Mpc^{-1}$ for the matter power spectrum. We publicly release the trained surrogate models, and estimates of the surrogate model errors in the hope that they will be broadly applicable to a range of cosmological analyses for many years to come. △ Less

Submitted 24 July, 2023; v1 submitted 16 March, 2023; originally announced March 2023.

Comments: 37 pages, 15 figures, matching version accepted by JCAP

arXiv:2303.05717 [pdf, other]

The effective volume of supernovae samples and sample variance

Authors: Zhongxu Zhai, Will J. Percival, Zhejie Ding

Abstract: The source of the tension between local SN Ia based Hubble constant measurements and those from the CMB or BAO+BBN measurements is one of the most interesting unknowns of modern cosmology. Sample variance forms a key component of the error on the local measurements, and will dominate the error budget in the future as more SNe Ia are observed. Many methods have been proposed to estimate sample vari… ▽ More The source of the tension between local SN Ia based Hubble constant measurements and those from the CMB or BAO+BBN measurements is one of the most interesting unknowns of modern cosmology. Sample variance forms a key component of the error on the local measurements, and will dominate the error budget in the future as more SNe Ia are observed. Many methods have been proposed to estimate sample variance in many contexts, and we compared results from a number of approximate methods to estimates from N-body simulations in a previous paper, confirming that sample variance for the Pantheon SNe Ia sample does not solve the Hubble tension. We now extend this analysis to include the more accurate analytic method based on calculating correlations between the radial peculiar velocities of SNe Ia, comparing this technique with results from numerical simulations. We consider the dependence of these errors on the linear power spectrum and how non-linear velocities contribute to the error. Using this technique, and matching sample variance errors from more approximate methods, we can define an effective volume for SNe Ia samples, finding that the Pantheon sample is equivalent to a top-hat sphere of radius $\sim220~h^{-1}$Mpc. We use this link between sample-variance errors to compute $ΔH_{0}$ for idealised surveys with particular angular distributions of SNe Ia. For example, a half-sky survey at the Pantheon depth has the potential to suppress the sample variance of $H_{0}$ to $\sim0.1$ km s$^{-1}$Mpc$^{-1}$, a significant improvement compared with the current result. Finally, we consider the strength of large-scale velocity power spectrum required to explain the Hubble tension using sample variance, finding it requires an extreme model well beyond that allowed by other observations. △ Less

Submitted 26 February, 2024; v1 submitted 10 March, 2023; originally announced March 2023.

Comments: 12 pages, 6 figures. Analyses updated and extended, minor to moderate updates due to referees' comments, replaced to match the accepted version by PRD

arXiv:2302.11621 [pdf, other]

doi 10.1093/mnras/stad2351

Isolating the linear signal when making redshift space distortion measurements

Authors: Michael J. Chapman, Zhongxu Zhai, Will J. Percival

Abstract: Constraints on the linear growth rate, $fσ_8$, using small scale redshift space distortion measurements have a significant statistical advantage over those made on large scales. However, these measurements need to carefully disentangle the linear and non-linear information when interpreting redshift space distortions in terms of $fσ_8$. It is particularly important to do this given that some previ… ▽ More Constraints on the linear growth rate, $fσ_8$, using small scale redshift space distortion measurements have a significant statistical advantage over those made on large scales. However, these measurements need to carefully disentangle the linear and non-linear information when interpreting redshift space distortions in terms of $fσ_8$. It is particularly important to do this given that some previous measurements found a significant deviation from the expectation based on the $Λ$CDM model constrained by Planck CMB data. We construct a new emulator-based model for small scale galaxy clustering with scaling parameters for both the linear and non-linear velocities of galaxies, allowing us to isolate the linear growth rate. We train the emulator using simulations from the AbacusCosmos suite, and apply it to data from the extended Baryon Oscillation Spectroscopic Survey (eBOSS) luminous red galaxy sample. We obtain a value of $fσ_8(z=0.737)=0.368\pm0.041$, in 2.3-$σ$ tension with the Planck 2018 $Λ$CDM expectation, and find less dependence on the minimum measurement scale than previous analyses. △ Less

Submitted 22 February, 2023; originally announced February 2023.

Comments: 14 pages, 9 figures, submitted to MNRAS

arXiv:2301.12965 [pdf, other]

Quadratic Matrix Factorization with Applications to Manifold Learning

Authors: Zheng Zhai, Hengchao Chen, Qiang Sun

Abstract: Matrix factorization is a popular framework for modeling low-rank data matrices. Motivated by manifold learning problems, this paper proposes a quadratic matrix factorization (QMF) framework to learn the curved manifold on which the dataset lies. Unlike local linear methods such as the local principal component analysis, QMF can better exploit the curved structure of the underlying manifold. Algor… ▽ More Matrix factorization is a popular framework for modeling low-rank data matrices. Motivated by manifold learning problems, this paper proposes a quadratic matrix factorization (QMF) framework to learn the curved manifold on which the dataset lies. Unlike local linear methods such as the local principal component analysis, QMF can better exploit the curved structure of the underlying manifold. Algorithmically, we propose an alternating minimization algorithm to optimize QMF and establish its theoretical convergence properties. Moreover, to avoid possible over-fitting, we then propose a regularized QMF algorithm and discuss how to tune its regularization parameter. Finally, we elaborate how to apply the regularized QMF to manifold learning problems. Experiments on a synthetic manifold learning dataset and two real datasets, including the MNIST handwritten dataset and a cryogenic electron microscopy dataset, demonstrate the superiority of the proposed method over its competitors. △ Less

Submitted 30 January, 2023; originally announced January 2023.

arXiv:2212.08699 [pdf, other]

doi 10.1093/mnras/stad1591

Phenomenological power spectrum models for H$α$ emission line galaxies from the Nancy Grace Roman Space Telescope

Authors: Kevin S. McCarthy, Zhongxu Zhai, Yun Wang

Abstract: The High Latitude Spectroscopic Survey (HLSS) is the reference baseline spectroscopic survey for NASA's Nancy Grace Roman space telescope, measuring redshifts of $\sim 10$M H$α$ emission line galaxies over a $2000$ deg$^2$ footprint at $z=1-2$. In this work, we use a realistic Roman galaxy mock catalogue to explore optimal phenomenological modeling of the measured power spectrum. We consider two m… ▽ More The High Latitude Spectroscopic Survey (HLSS) is the reference baseline spectroscopic survey for NASA's Nancy Grace Roman space telescope, measuring redshifts of $\sim 10$M H$α$ emission line galaxies over a $2000$ deg$^2$ footprint at $z=1-2$. In this work, we use a realistic Roman galaxy mock catalogue to explore optimal phenomenological modeling of the measured power spectrum. We consider two methods for modeling the redshift-space distortions (Kaiser squashing and another with a window function on $β$ that selects out the coherent radial infall pairwise velocities, $M_A$ and $M_B$, respectively), two models for the nonlinear impact of baryons that smears the BAO signal (a fixed ratio between the smearing scales in the perpendicular and parallel dimensions and another where these smearing scales are kept as a free parameters, P$_{dw}(k|k_*)$ and P$_{dw}(k|Σ_\perp,Σ_\parallel)$, respectively), and two analytical emulations of nonlinear growth (one employing the halo model and another formulated from simulated galaxy clustering of a semi-analytical model, $F_{HM}$ and $F_{SAM}$, respectively). We find that the best model combination employing $F_{HM}$ is $P_{dw}(k|k_*)*F_{HM}*M_B$, while the best combination employing $F_{SAM}$ is $P_{dw}(k|k_*)*F_{SAM}*M_B$, which leads to unbiased measurements of cosmological parameters. We compare these to the Effective Field Theory of Large-Scale Structure perturbation theory model $P_{EFT}(k|Θ)$, and find that our simple phenomenological models are comparable across the entire redshift range for $k_{max}=0.25$ and $0.3$ $h$/Mpc. We expect the tools that we have developed to be useful in probing dark energy and testing gravity using Roman in an accurate and robust manner. △ Less

Submitted 23 May, 2023; v1 submitted 16 December, 2022; originally announced December 2022.

Comments: 17 pages, 3 figures, 4 tables, Accepted to MNRAS 23 May 2023

arXiv:2212.04197 [pdf, other]

HyperEnclave: An Open and Cross-platform Trusted Execution Environment

Authors: Yuekai Jia, Shuang Liu, Wenhao Wang, Yu Chen, Zhengde Zhai, Shoumeng Yan, Zhengyu He

Abstract: A number of trusted execution environments (TEEs) have been proposed by both academia and industry. However, most of them require specific hardware or firmware changes and are bound to specific hardware vendors (such as Intel, AMD, ARM, and IBM). In this paper, we propose HyperEnclave, an open and cross-platform process-based TEE that relies on the widely-available virtualization extension to crea… ▽ More A number of trusted execution environments (TEEs) have been proposed by both academia and industry. However, most of them require specific hardware or firmware changes and are bound to specific hardware vendors (such as Intel, AMD, ARM, and IBM). In this paper, we propose HyperEnclave, an open and cross-platform process-based TEE that relies on the widely-available virtualization extension to create the isolated execution environment. In particular, HyperEnclave is designed to support the flexible enclave operation modes to fulfill the security and performance demands under various enclave workloads. We provide the enclave SDK to run existing SGX programs on HyperEnclave with little or no source code changes. We have implemented HyperEnclave on commodity AMD servers and deployed the system in a world-leading FinTech company to support real-world privacy-preserving computations. The evaluation on both micro-benchmarks and application benchmarks shows the design of HyperEnclave introduces only a small overhead. △ Less

Submitted 8 December, 2022; originally announced December 2022.

Journal ref: In 2022 USENIX Annual Technical Conference (USENIX ATC 22), pages 437-454, Carlsbad, CA, July 2022. USENIX Association

arXiv:2212.01215 [pdf, other]

Olive Branch Learning: A Topology-Aware Federated Learning Framework for Space-Air-Ground Integrated Network

Authors: Qingze Fang, Zhiwei Zhai, Shuai Yu, Qiong Wu, Xiaowen Gong, Xu Chen

Abstract: The space-air-ground integrated network (SAGIN), one of the key technologies for next-generation mobile communication systems, can facilitate data transmission for users all over the world, especially in some remote areas where vast amounts of informative data are collected by Internet of remote things (IoRT) devices to support various data-driven artificial intelligence (AI) services. However, tr… ▽ More The space-air-ground integrated network (SAGIN), one of the key technologies for next-generation mobile communication systems, can facilitate data transmission for users all over the world, especially in some remote areas where vast amounts of informative data are collected by Internet of remote things (IoRT) devices to support various data-driven artificial intelligence (AI) services. However, training AI models centrally with the assistance of SAGIN faces the challenges of highly constrained network topology, inefficient data transmission, and privacy issues. To tackle these challenges, we first propose a novel topology-aware federated learning framework for the SAGIN, namely Olive Branch Learning (OBL). Specifically, the IoRT devices in the ground layer leverage their private data to perform model training locally, while the air nodes in the air layer and the ring-structured low earth orbit (LEO) satellite constellation in the space layer are in charge of model aggregation (synchronization) at different scales.To further enhance communication efficiency and inference performance of OBL, an efficient Communication and Non-IID-aware Air node-Satellite Assignment (CNASA) algorithm is designed by taking the data class distribution of the air nodes as well as their geographic locations into account. Furthermore, we extend our OBL framework and CNASA algorithm to adapt to more complex multi-orbit satellite networks. We analyze the convergence of our OBL framework and conclude that the CNASA algorithm contributes to the fast convergence of the global model. Extensive experiments based on realistic datasets corroborate the superior performance of our algorithm over the benchmark policies. △ Less

Submitted 2 December, 2022; originally announced December 2022.

Comments: accepted by IEEE Transactions on Wireless Communications, Dec. 2022

arXiv:2211.09955 [pdf, other]

Emergence of a stochastic resonance in machine learning

Authors: Zheng-Meng Zhai, Ling-Wei Kong, Ying-Cheng Lai

Abstract: Can noise be beneficial to machine-learning prediction of chaotic systems? Utilizing reservoir computers as a paradigm, we find that injecting noise to the training data can induce a stochastic resonance with significant benefits to both short-term prediction of the state variables and long-term prediction of the attractor of the system. A key to inducing the stochastic resonance is to include the… ▽ More Can noise be beneficial to machine-learning prediction of chaotic systems? Utilizing reservoir computers as a paradigm, we find that injecting noise to the training data can induce a stochastic resonance with significant benefits to both short-term prediction of the state variables and long-term prediction of the attractor of the system. A key to inducing the stochastic resonance is to include the amplitude of the noise in the set of hyperparameters for optimization. By so doing, the prediction accuracy, stability and horizon can be dramatically improved. The stochastic resonance phenomenon is demonstrated using two prototypical high-dimensional chaotic systems. △ Less

Submitted 15 November, 2022; originally announced November 2022.

Comments: 7 pages, 4 figures

arXiv:2211.03921 [pdf, ps, other]

Towards the assignments for $1^{1}D_{2}$ and $1^{3}D_{2}$ meson nonets

Authors: Xue Chao Feng, Ke Wei Wei, Jie Wu, Xue Zhen Zhai, ShiZhuo Wang

Abstract: In this work, we investigate the mass spectrum of $1^{1}D_{2}$ and $1^{3}D_{2}$ meson nonets in the framework of the meson mass matrix and Regge phenomenology. The results are compared with the values from different phenomenological models and may be useful for the assignment of the $1^{1}D_{2}$ and $1^{3}D_{2}$ meson nonets in the future. In this work, we investigate the mass spectrum of $1^{1}D_{2}$ and $1^{3}D_{2}$ meson nonets in the framework of the meson mass matrix and Regge phenomenology. The results are compared with the values from different phenomenological models and may be useful for the assignment of the $1^{1}D_{2}$ and $1^{3}D_{2}$ meson nonets in the future. △ Less

Submitted 7 November, 2022; originally announced November 2022.

Comments: 13 pages, 6 figures

arXiv:2211.00732 [pdf, other]

Kuaipedia: a Large-scale Multi-modal Short-video Encyclopedia

Authors: Haojie Pan, Zepeng Zhai, Yuzhou Zhang, Ruiji Fu, Ming Liu, Yangqiu Song, Zhongyuan Wang, Bing Qin

Abstract: Online encyclopedias, such as Wikipedia, have been well-developed and researched in the last two decades. One can find any attributes or other information of a wiki item on a wiki page edited by a community of volunteers. However, the traditional text, images and tables can hardly express some aspects of an wiki item. For example, when we talk about ``Shiba Inu'', one may care more about ``How to… ▽ More Online encyclopedias, such as Wikipedia, have been well-developed and researched in the last two decades. One can find any attributes or other information of a wiki item on a wiki page edited by a community of volunteers. However, the traditional text, images and tables can hardly express some aspects of an wiki item. For example, when we talk about ``Shiba Inu'', one may care more about ``How to feed it'' or ``How to train it not to protect its food''. Currently, short-video platforms have become a hallmark in the online world. Whether you're on TikTok, Instagram, Kuaishou, or YouTube Shorts, short-video apps have changed how we consume and create content today. Except for producing short videos for entertainment, we can find more and more authors sharing insightful knowledge widely across all walks of life. These short videos, which we call knowledge videos, can easily express any aspects (e.g. hair or how-to-feed) consumers want to know about an item (e.g. Shiba Inu), and they can be systematically analyzed and organized like an online encyclopedia. In this paper, we propose Kuaipedia, a large-scale multi-modal encyclopedia consisting of items, aspects, and short videos lined to them, which was extracted from billions of videos of Kuaishou (Kwai), a well-known short-video platform in China. We first collected items from multiple sources and mined user-centered aspects from millions of users' queries to build an item-aspect tree. Then we propose a new task called ``multi-modal item-aspect linking'' as an expansion of ``entity linking'' to link short videos into item-aspect pairs and build the whole short-video encyclopedia. Intrinsic evaluations show that our encyclopedia is of large scale and highly accurate. We also conduct sufficient extrinsic experiments to show how Kuaipedia can help fundamental applications such as entity typing and entity linking. △ Less

Submitted 11 August, 2023; v1 submitted 28 October, 2022; originally announced November 2022.

arXiv:2210.03203 [pdf, other]

doi 10.3847/1538-4357/ad0ce8

The Aemulus Project VI: Emulation of beyond-standard galaxy clustering statistics to improve cosmological constraints

Authors: Kate Storey-Fisher, Jeremy Tinker, Zhongxu Zhai, Joseph DeRose, Risa H. Wechsler, Arka Banerjee

Abstract: There is untapped cosmological information in galaxy redshift surveys in the non-linear regime. In this work, we use the AEMULUS suite of cosmological $N$-body simulations to construct Gaussian process emulators of galaxy clustering statistics at small scales ($0.1-50 \: h^{-1}\,\mathrm{Mpc}$) in order to constrain cosmological and galaxy bias parameters. In addition to standard statistics -- the… ▽ More There is untapped cosmological information in galaxy redshift surveys in the non-linear regime. In this work, we use the AEMULUS suite of cosmological $N$-body simulations to construct Gaussian process emulators of galaxy clustering statistics at small scales ($0.1-50 \: h^{-1}\,\mathrm{Mpc}$) in order to constrain cosmological and galaxy bias parameters. In addition to standard statistics -- the projected correlation function $w_\mathrm{p}(r_\mathrm{p})$, the redshift-space monopole of the correlation function $ξ_0(s)$, and the quadrupole $ξ_2(s)$ -- we emulate statistics that include information about the local environment, namely the underdensity probability function $P_\mathrm{U}(s)$ and the density-marked correlation function $M(s)$. This extends the model of AEMULUS III for redshift-space distortions by including new statistics sensitive to galaxy assembly bias. In recovery tests, we find that the beyond-standard statistics significantly increase the constraining power on cosmological parameters of interest: including $P_\mathrm{U}(s)$ and $M(s)$ improves the precision of our constraints on $Ω_m$ by 27%, $σ_8$ by 19%, and the growth of structure parameter, $f σ_8$, by 12% compared to standard statistics. We additionally find that scales below $\sim6 \: h^{-1}\,\mathrm{Mpc}$ contain as much information as larger scales. The density-sensitive statistics also contribute to constraining halo occupation distribution parameters and a flexible environment-dependent assembly bias model, which is important for extracting the small-scale cosmological information as well as understanding the galaxy-halo connection. This analysis demonstrates the potential of emulating beyond-standard clustering statistics at small scales to constrain the growth of structure as a test of cosmic acceleration. △ Less

Submitted 12 March, 2024; v1 submitted 6 October, 2022; originally announced October 2022.

Comments: Published in the Astrophysical Journal; updated to match journal version

Journal ref: ApJ 961 (2), 208 (2024)

arXiv:2207.02373 [pdf, other]

doi 10.1103/PhysRevD.106.103527

Sample Variance for Supernovae Distance Measurements and the Hubble tension

Authors: Zhongxu Zhai, Will J. Percival

Abstract: Recent local measurements of the Hubble constant made using supernovae have delivered a value that differs by $\sim$5$σ$ (statistical error) from predictions using the Cosmic Microwave Background (CMB), or using Baryon Acoustic Oscillations (BAO) and Big-Bang Nucleosynthesis (BBN) constraints, which are themselves consistent. The effective volume covered by the supernovae is small compared to the… ▽ More Recent local measurements of the Hubble constant made using supernovae have delivered a value that differs by $\sim$5$σ$ (statistical error) from predictions using the Cosmic Microwave Background (CMB), or using Baryon Acoustic Oscillations (BAO) and Big-Bang Nucleosynthesis (BBN) constraints, which are themselves consistent. The effective volume covered by the supernovae is small compared to the other probes, and it is therefore interesting to consider whether sample variance (often also called cosmic variance) is a significant contributor to the offset. We consider four ways of calculating the sample variance: (i) perturbation theory applied to the luminosity distance, which is the most common method considered in the literature; (ii) perturbation of cosmological parameters, as is commonly used to alleviate super-sample covariance in sets of N-body simulations; (iii) a new method based on the variance between perturbed spherical top-hat regions; (iv) using numerical N-body simulations. All give consistent results showing that, for the Pantheon supernova sample, sample variance can only lead to fluctuations in $H_0$ of order $\pm1$ km s$^{-1}$Mpc$^{-1}$ or less. While this is not in itself a new result, the agreement between the methods used adds to its robustness. Furthermore, it is instructive to see how the different methods fit together. We also investigate the internal variance of the $H_{0}$ measurement using SH0ES and Pantheon data. By searching for an offset between measurements in opposite hemispheres, we find that the direction coincident with the CMB dipole has a higher $H_{0}$ measurement than the opposite hemisphere by roughly 4 km s$^{-1}$Mpc$^{-1}$. We compare this with a large number of simulations and find that the size of this asymmetry is statistically likely, but the preference of direction may indicate that further calibration is needed. △ Less

Submitted 5 July, 2022; originally announced July 2022.

Comments: 13 pages, 5 figures; comments welcome

arXiv:2203.12799 [pdf, other]

Energy-Efficient UAV-Mounted RIS Assisted Mobile Edge Computing

Authors: Zhiyuan Zhai, Xinhong Dai, Bin Duo, Xin Wang, Xiaojun Yuan

Abstract: Unmanned aerial vehicle (UAV) and reconfigurable intelligent surface (RIS) have been recently applied in the field of mobile edge computing (MEC) to improve the data exchange environment by proactively changing the wireless channels through maneuverable location deployment and intelligent signals reflection, respectively. Nevertheless, they may suffer from inherent limitations in practical scenari… ▽ More Unmanned aerial vehicle (UAV) and reconfigurable intelligent surface (RIS) have been recently applied in the field of mobile edge computing (MEC) to improve the data exchange environment by proactively changing the wireless channels through maneuverable location deployment and intelligent signals reflection, respectively. Nevertheless, they may suffer from inherent limitations in practical scenarios. UAV-mounted RIS (U-RIS), as a promising integrated approach, can combine the advantages of UAV and RIS to break the limit. Inspired by this, we consider a novel U-RIS assisted MEC system, where a U-RIS is deployed to assist the communication between the ground users and an MEC server. The joint UAV trajectory, RIS passive beamforming and MEC resource allocation design is developed to maximize the energy efficiency (EE) of the system. To tackle the intractable non-convex problem, we divide it into two subproblems and solve them iteratively based on successive convex approximation (SCA) and the Dinkelbach method. Finally we obtain a high-performance suboptimal solution. Simulation results show that the proposed algorithm significantly improves the energy efficiency of the MEC system. △ Less

Submitted 23 March, 2022; originally announced March 2022.

arXiv:2203.08999 [pdf, other]

doi 10.3847/1538-4357/acc65b

The Aemulus Project V: Cosmological constraint from small-scale clustering of BOSS galaxies

Authors: Zhongxu Zhai, Jeremy L. Tinker, Arka Banerjee, Joseph DeRose, Hong Guo, Yao-Yuan Mao, Sean McLaughlin, Kate Storey-Fisher, Risa H. Wechsler

Abstract: We analyze clustering measurements of BOSS galaxies using a simulation-based emulator of two-point statistics. We focus on the monopole and quadrupole of the redshift-space correlation function, and the projected correlation function, at scales of $0.1\sim60~h^{-1}$Mpc. Although our simulations are based on $w$CDM with general relativity (GR), we include a scaling parameter of the halo velocity fi… ▽ More We analyze clustering measurements of BOSS galaxies using a simulation-based emulator of two-point statistics. We focus on the monopole and quadrupole of the redshift-space correlation function, and the projected correlation function, at scales of $0.1\sim60~h^{-1}$Mpc. Although our simulations are based on $w$CDM with general relativity (GR), we include a scaling parameter of the halo velocity field, $γ_f$, defined as the amplitude of the halo velocity field relative to the GR prediction. We divide the BOSS data into three redshift bins. After marginalizing over other cosmological parameters, galaxy bias parameters, and the velocity scaling parameter, we find $fσ_{8}(z=0.25) = 0.413\pm0.031$, $fσ_{8}(z=0.4) = 0.470\pm0.026$ and $fσ_{8}(z=0.55) = 0.396\pm0.022$. Compared with Planck observations using a flat $Λ$CDM model, our results are lower by $1.9σ$, $0.3σ$ and $3.4σ$ respectively. These results are consistent with other recent simulation-based results at non-linear scales, including weak lensing measurements of BOSS LOWZ galaxies, two-point clustering of eBOSS LRGs, and an independent clustering analysis of BOSS LOWZ. All these results are generally consistent with a combination of $γ_f^{1/2}σ_8\approx 0.75$. We note, however, that the BOSS data is well fit assuming GR, i.e. $γ_f=1$. We cannot rule out an unknown systematic error in the galaxy bias model at non-linear scales, but near-future data and modeling will enhance our understanding of the galaxy--halo connection, and provide a strong test of new physics beyond the standard model. △ Less

Submitted 21 March, 2023; v1 submitted 16 March, 2022; originally announced March 2022.

Comments: 30 pages, 23 figures and 4 tables; ApJ accepted, updated to the match the accepted version

arXiv:2203.00230 [pdf, other]

doi 10.1088/1361-6587/ac7975

Chandrasekhar-Kendall-Woltjer-Taylor state in a resistive plasma

Authors: Ze-Yu Zhai, Yang-Guang Yang, Xiao-Liang Xia, Qun Wang

Abstract: We give a criterion for the Chandrasekhar-Kendall-Woltjer-Taylor (CKWT) state in a resistive plasma. We find that the lowest momentum (longest wavelength) of the initial helicity amplitudes of magnetic fields are the key to the CKWT state which can be reached if one helicity is favored over the other. This indicates that the imbalance between two helicities at the lowest momentum or longest wavele… ▽ More We give a criterion for the Chandrasekhar-Kendall-Woltjer-Taylor (CKWT) state in a resistive plasma. We find that the lowest momentum (longest wavelength) of the initial helicity amplitudes of magnetic fields are the key to the CKWT state which can be reached if one helicity is favored over the other. This indicates that the imbalance between two helicities at the lowest momentum or longest wavelength in the initial conditions is essential to the CKWT state. A few examples of initial conditions for helicity amplitudes are taken to support the above statement both analytically and numerically. △ Less

Submitted 28 February, 2022; originally announced March 2022.

Comments: RevTex 4.1, 19 pages, 3 figures

Report number: USTC-ICTS/PCFT-22-02

Journal ref: Plasma Phys. Control. Fusion 64 (2022) 095003

arXiv:2202.05168 [pdf]

doi 10.1016/j.apsusc.2022.152550

Surface engineering for cellulose as a boosted Layer-by-Layer assembly: excellent flame retardancy and improved durability with introduction of bio-based "molecular glue"

Authors: Can Fu, Xiaoli Xu, Guang-Zhong Yin, Baoyun Xu, Pingyang Li, Bo Ai, Zhongjie Zhai, Fei GaO, Jinguo Zhai, De-Yi Wang

Abstract: Layer-by-Layer (LbL) assembly was attractive as a versatile tool to address the flammability of cotton, while the washing fastness of LbL coating stayed an issue. Aiming to tackle this issue, LbL layers consisted of phenylphosphonic acid (PHA) and 3-aminopropyltriethoxysilane (APTES) was deposited on polydopamine (PDA)-coated cotton. The prepared cotton reached 31.4% of limiting oxygen index (LOI)… ▽ More Layer-by-Layer (LbL) assembly was attractive as a versatile tool to address the flammability of cotton, while the washing fastness of LbL coating stayed an issue. Aiming to tackle this issue, LbL layers consisted of phenylphosphonic acid (PHA) and 3-aminopropyltriethoxysilane (APTES) was deposited on polydopamine (PDA)-coated cotton. The prepared cotton reached 31.4% of limiting oxygen index (LOI), and extinguished immediately after removing the ignitor. Peak of heat release rate (pHRR) attenuated around 36 % compared with pure cotton. A combined barrier and quenching mechanisms were proposed. Moreover, enhanced washing durability (24.1% of LOI) was achieved even after 50 detergent laundering cycles. A facile, boosted LbL approach with proposed π-π stacking interactions between PDA abundant aromatic structures and benzene ring in PHA from LbL layers, is first to put forward to construct durable efficient flame retardant (FR) cotton. This work attempted to enlighten more thoughts and design for durable FR cotton fabrics. △ Less

Submitted 22 January, 2022; originally announced February 2022.

arXiv:2201.07346 [pdf, ps, other]

Problems related to Waring-Goldbach problem involving cubes of primes

Authors: Zhichun Zhai

Abstract: In this note, we try to understand the recent development on the Waring-Goldbach problem involving cubes of primes. Especially, we want to determine whether integers that are either primes, squares of primes, cubes of primes, or a cube of an even number can be written as the sum of four cubes of primes. Meanwhile, we raise some problems that may deepen our understanding of the problem about the su… ▽ More In this note, we try to understand the recent development on the Waring-Goldbach problem involving cubes of primes. Especially, we want to determine whether integers that are either primes, squares of primes, cubes of primes, or a cube of an even number can be written as the sum of four cubes of primes. Meanwhile, we raise some problems that may deepen our understanding of the problem about the sum of four cubes of primes. Moreover, some examples suggest that almost all the cubes of integers can be written as the sum of cubes of four integers. △ Less

Submitted 3 January, 2022; originally announced January 2022.

Comments: The idea to write this note was motived when watching the documentary "Chosen by Mathematics" in the Christmas holiday. No rigorous proof was provided. Some problems were raised

arXiv:2201.00765 [pdf, ps, other]

Fractional Besov Trace/Extension Type Inequalities via the Caffarelli-Silvestre extension

Authors: Pengtao Li, Rui Hu, Zhichun Zhai

Abstract: Let $u(\cdot,\cdot)$ be the Caffarelli-Silvestre extension of $f.$ The first goal of this article is to establish the fractional trace type inequalities involving the Caffarelli-Silvestre extension $u(\cdot,\cdot)$ of $f.$ In doing so, firstly, we establish the fractional Sobolev/ logarithmic Sobolev/ Hardy trace inequalities in terms of $\nabla_{(x,t)}u(x,t).$ Then, we prove the fractional anisot… ▽ More Let $u(\cdot,\cdot)$ be the Caffarelli-Silvestre extension of $f.$ The first goal of this article is to establish the fractional trace type inequalities involving the Caffarelli-Silvestre extension $u(\cdot,\cdot)$ of $f.$ In doing so, firstly, we establish the fractional Sobolev/ logarithmic Sobolev/ Hardy trace inequalities in terms of $\nabla_{(x,t)}u(x,t).$ Then, we prove the fractional anisotropic Sobolev/ logarithmic Sobolev/ Hardy trace inequalities in terms of $ {\partial_{t} u(x,t)}$ or $(-Δ)^{-γ/2}u(x,t)$ only. Moreover, based on an estimate of the Fourier transform of the Caffarelli-Silvestre extension kernel and the sharp affine weighted $L^p$ Sobolev inequality, we prove that the $\dot{H}^{-β/2}(\mathbb{R}^n)$ norm of $f(x)$ can be controlled by the product of the weighted $L^p-$affine energy and the weighted $L^p-$norm of ${\partial_{t} u(x,t)}.$ The second goal of this article is to characterize non-negative measures $μ$ on $\mathbb{R}^{n+1}_+$ such that the embeddings $$\|u(\cdot,\cdot)\|_{L^{q_0,p_0}_μ(\mathbb{R}^{n+1})}\lesssim \|f\|_{\dotΛ^{p,q}_β(\mathbb{R}^n)}$$ hold for some $p_0$ and $q_0$ depending on $p$ and $q$ which are classified in three different cases: (1). $p=q\in (n/(n+β),1];$ (2) $(p,q)\in (1,n/β)\times (1,\infty);$ (3). $(p,q)\in (1,n/β)\times\{\infty\}.$ For case (1), the embeddings can be characterized in terms of an analytic condition of the variational capacity minimizing function, the iso-capacitary inequality of open balls, and other weak type inequalities. For cases (2) and (3), the embeddings are characterized by the iso-capacitary inequality for fractonal Besov capacity of open sets. △ Less

Submitted 14 February, 2022; v1 submitted 3 January, 2022; originally announced January 2022.

Comments: The case of $p\geq 1$ is added

arXiv:2201.00753 [pdf, ps, other]

Strengthened Fractional Sobolev Type Inequalities in Besov Spaces

Authors: Pengtao Li, Rui Hu, Zhichun Zhai

Abstract: The purpose of this article is twofold. The first is to strengthen fractional Sobolev type inequalities in Besov spaces via the classical Lorentz space. In doing so, we show that the Sobolev inequality in Besov spaces is equivalent to the fractional Hardy inequality and the iso-capacitary type inequality. Secondly, we will strengthen fractional Sobolev type inequalities in Besov spaces via capacit… ▽ More The purpose of this article is twofold. The first is to strengthen fractional Sobolev type inequalities in Besov spaces via the classical Lorentz space. In doing so, we show that the Sobolev inequality in Besov spaces is equivalent to the fractional Hardy inequality and the iso-capacitary type inequality. Secondly, we will strengthen fractional Sobolev type inequalities in Besov spaces via capacitary Lorentz spaces associated with Besov capacities. For this purpose, we first study the embedding of the associated capacitary Lorentz space to the classical Lorentz space. Then, the embedding of the Besov space to the capacitary Lorentz space is established. Meanwhile, we show that these embeddings are closely related to the iso-capacitary type inequalities in terms of a new-introduced fractional $(β, p, q)$-perimeter. Moreover, characterizations of more general Sobolev type inequalities in Besov spaces have also been established. △ Less

Submitted 20 February, 2022; v1 submitted 3 January, 2022; originally announced January 2022.

Comments: 14 pages. Submitted. New results added

arXiv:2112.02387 [pdf, other]

Illuminating Galaxy Evolution at Cosmic Noon with ISCEA: the Infrared Satellite for Cosmic Evolution Astrophysics

Authors: Yun Wang, Lee Armus, Andrew Benson, Emanuele Daddi, Andreas Faisst, Anthony Gonzalez, Casey Papovich, Zoran Ninkov, Massimo Robberto, Randall J. Rose, Thomas, Rose, Claudia Scarlata, S. A. Stanford, Todd Veach, Zhongxu Zhai, Bradford Benson, L. E. Bleem, Michael W. Davis, George Helou, Lynne Hillenbrand

Abstract: ISCEA (Infrared Satellite for Cosmic Evolution Astrophysics) is a small astrophysics mission whose Science Goal is to discover how galaxies evolved in the cosmic web of dark matter at cosmic noon. Its Science Objective is to determine the history of star formation and its quenching in galaxies as a function of local density and stellar mass when the Universe was 3-5 Gyrs old (1.2<z<2.1). ISCEA is… ▽ More ISCEA (Infrared Satellite for Cosmic Evolution Astrophysics) is a small astrophysics mission whose Science Goal is to discover how galaxies evolved in the cosmic web of dark matter at cosmic noon. Its Science Objective is to determine the history of star formation and its quenching in galaxies as a function of local density and stellar mass when the Universe was 3-5 Gyrs old (1.2<z<2.1). ISCEA is designed to test the Science Hypothesis that during the period of cosmic noon, at 1.7 < z < 2.1, environmental quenching is the dominant quenching mechanism for typical galaxies not only in clusters and groups, but also in the extended cosmic web surrounding these structures. ISCEA meets its Science Objective by making a 10% shot noise measurement of star formation rate down to 6 solar masses per year using H-alpha out to a radius > 10 Mpc in each of 50 protocluster (cluster and cosmic web) fields at 1.2 < z < 2.1. ISCEA measures the star formation quenching factor in those fields, and galaxy kinematics with a precision < 50 km/s to deduce the 3D spatial distribution in each field. ISCEA will transform our understanding of galaxy evolution at cosmic noon. ISCEA is a small satellite observatory with a 30cm equivalent diameter aperture telescope with a FoV of 0.32 deg^2, and a multi-object spectrograph with a digital micro-mirror device (DMD) as its programmable slit mask. ISCEA will obtain spectra of 1000 galaxies simultaneously at an effective resolving power of R=1000, with 2.8"x2.8" slits, over the NIR wavelength range of 1.1 to 2.0 microns, a regime not accessible from the ground without large gaps in coverage. ISCEA will achieve a pointing accuracy of <= 2" FWHM over 200s. ISCEA will be launched into a Low Earth Orbit, with a prime mission of 2.5 years. ISCEA's space-qualification of DMDs opens a new window for spectroscopy from space, enabling revolutionary advances in astrophysics. △ Less

Submitted 4 December, 2021; originally announced December 2021.

Comments: 40 pages, 31 figures

arXiv:2110.01829 [pdf, other]

doi 10.3847/1538-4357/ac4973

The High Latitude Spectroscopic Survey on the Nancy Grace Roman Space Telescope

Authors: Yun Wang, Zhongxu Zhai, Anahita Alavi, Elena Massara, Alice Pisani, Andrew Benson, Christopher M. Hirata, Lado Samushia, David H. Weinberg, James Colbert, Olivier Doré, Tim Eifler, Chen Heinrich, Shirley Ho, Elisabeth Krause, Nikhil Padmanabhan, David Spergel, Harry I. Teplitz

Abstract: The Nancy Grace Roman Space Telescope will conduct a High Latitude Spectroscopic Survey (HLSS) over a large volume at high redshift, using the near-IR grism (1.0-1.93 $μ$m, $R=435-865$) and the 0.28 deg$^2$ wide field camera. We present a reference HLSS which maps 2000 deg$^2$ and achieves an emission line flux limit of 10$^{-16}$ erg/s/cm$^2$ at 6.5$σ$, requiring $\sim$0.6 yrs of observing time.… ▽ More The Nancy Grace Roman Space Telescope will conduct a High Latitude Spectroscopic Survey (HLSS) over a large volume at high redshift, using the near-IR grism (1.0-1.93 $μ$m, $R=435-865$) and the 0.28 deg$^2$ wide field camera. We present a reference HLSS which maps 2000 deg$^2$ and achieves an emission line flux limit of 10$^{-16}$ erg/s/cm$^2$ at 6.5$σ$, requiring $\sim$0.6 yrs of observing time. We summarize the flowdown of the Roman science objectives to the science and technical requirements of the HLSS. We construct a mock redshift survey over the full HLSS volume by applying a semi-analytic galaxy formation model to a cosmological N-body simulation, and use this mock survey to create pixel-level simulations of 4 deg$^2$ of HLSS grism spectroscopy. We find that the reference HLSS would measure $\sim$ 10 million H$α$ galaxy redshifts that densely map large scale structure at $z=1-2$ and 2 million [OIII] galaxy redshifts that sparsely map structures at $z=2-3$. We forecast the performance of this survey for measurements of the cosmic expansion history with baryon acoustic oscillations and the growth of large scale structure with redshift space distortions. We also study possible deviations from the reference design, and find that a deep HLSS at $f_{\rm line}>7\times10^{-17}$erg/s/cm$^2$ over 4000 deg$^2$ (requiring $\sim$1.5 yrs of observing time) provides the most compelling stand-alone constraints on dark energy from Roman alone. This provides a useful reference for future optimizations. The reference survey, simulated data sets, and forecasts presented here will inform community decisions on the final scope and design of the Roman HLSS. △ Less

Submitted 5 October, 2021; originally announced October 2021.

Comments: 29 pages, 8 figures, ApJ submitted

Showing 1–50 of 126 results for author: Zhai, Z