-
A foundation model approach to guide antimicrobial peptide design in the era of artificial intelligence driven scientific discovery
Authors:
Jike Wang,
Jianwen Feng,
Yu Kang,
Peichen Pan,
Jingxuan Ge,
Yan Wang,
Mingyang Wang,
Zhenxing Wu,
Xingcai Zhang,
Jiameng Yu,
Xujun Zhang,
Tianyue Wang,
Lirong Wen,
Guangning Yan,
Yafeng Deng,
Hui Shi,
Chang-Yu Hsieh,
Zhihui Jiang,
Tingjun Hou
Abstract:
We propose AMP-Designer, an LLM-based foundation model approach for the rapid design of novel antimicrobial peptides (AMPs) with multiple desired properties. Within 11 days, AMP-Designer enables de novo design of 18 novel candidates with broad-spectrum potency against Gram-negative bacteria. Subsequent in vitro validation experiments demonstrate that almost all in silico recommended candidates exh…
▽ More
We propose AMP-Designer, an LLM-based foundation model approach for the rapid design of novel antimicrobial peptides (AMPs) with multiple desired properties. Within 11 days, AMP-Designer enables de novo design of 18 novel candidates with broad-spectrum potency against Gram-negative bacteria. Subsequent in vitro validation experiments demonstrate that almost all in silico recommended candidates exhibit notable antibacterial activity, yielding a 94.4% positive rate. Two of these candidates exhibit exceptional activity, minimal hemotoxicity, substantial stability in human plasma, and a low propensity of inducing antibiotic resistance as observed in murine lung infection experiments, showcasing their significant efficacy in reducing bacterial load by approximately one hundredfold. The entire process, from in silico design to in vitro and in vivo validation, is completed within a timeframe of 48 days. Moreover, AMP-Designer demonstrates its remarkable capability in designing specific AMPs to target strains with extremely limited labeled datasets. The most outstanding candidate against Propionibacterium acnes suggested by AMP-Designer exhibits an in vitro minimum inhibitory concentration value of 2.0 $μ$g/ml. Through the integration of advanced machine learning methodologies such as contrastive prompt tuning, knowledge distillation, and reinforcement learning within the AMP-Designer framework, the process of designing AMPs demonstrates exceptional efficiency. This efficiency remains conspicuous even in the face of challenges posed by constraints arising from a scarcity of labeled data. These findings highlight the tremendous potential of AMP-Designer as a promising approach in combating the global health threat of antibiotic resistance.
△ Less
Submitted 16 July, 2024;
originally announced July 2024.
-
The STAR Forward Silicon Tracker
Authors:
J. D. Brandenburg,
Y. Chang,
J. Dong,
Y. He,
Y. Hu,
H. Huang,
T. Huang,
H. Li,
M. Nie,
R. Sharma,
X. Sun,
P. Tribedy,
F. Videbæk,
G. Visser,
G. Wilks,
P. Wang,
G. Xie,
G. Yan,
Z. Ye,
L. Yi,
Y. Yang,
S. Zhang,
Z. Zhang
Abstract:
The Forward Silicon Tracker (FST) is a pivotal component of the forward upgrade of the Solenoidal Tracker at RHIC (STAR), designed to discern hadron charge signs with a momentum resolution better than 30\% for $0.2 < p_T < 2$ GeV/c in the $2.5 < η< 4$ pseudorapidity range. Its compact design features three disks along the beam direction, minimized material budget and scattering effects. The FST us…
▽ More
The Forward Silicon Tracker (FST) is a pivotal component of the forward upgrade of the Solenoidal Tracker at RHIC (STAR), designed to discern hadron charge signs with a momentum resolution better than 30\% for $0.2 < p_T < 2$ GeV/c in the $2.5 < η< 4$ pseudorapidity range. Its compact design features three disks along the beam direction, minimized material budget and scattering effects. The FST uses Hamamatsu's p-in-n silicon strip sensors with a double metal layer for efficient signal processing. The flexible hybrid boards, essential for the readout system, are constructed with Kapton and copper layers to optimize signal handling and power distribution. These boards connect silicon strips to analogue pipeline ASIC APV25-S1 chips, which read up to 128 channels each. A cooling system with nonconducting, volatile NOVEC 7200 coolant at 22.2°C mitigates ASIC-generated heat. The FST enhances forward tracking performance at RHIC, showcasing unique design solutions to complex challenges.
△ Less
Submitted 13 July, 2024;
originally announced July 2024.
-
Question-Score Identity Detection (Q-SID): A Statistical Algorithm to Detect Collusion Groups with Error Quantification from Exam Question Scores
Authors:
Guanao Yan,
Jingyi Jessica Li,
Mark D. Biggin
Abstract:
Collusion between students in online exams is a major problem that undermines the integrity of the exam results. Although there exist methods that use exam data to identify pairs of students who have likely copied each other's answers, these methods are restricted to specific formats of multiple-choice exams. Here we present a statistical algorithm, Q-SID, that efficiently detects groups of studen…
▽ More
Collusion between students in online exams is a major problem that undermines the integrity of the exam results. Although there exist methods that use exam data to identify pairs of students who have likely copied each other's answers, these methods are restricted to specific formats of multiple-choice exams. Here we present a statistical algorithm, Q-SID, that efficiently detects groups of students who likely have colluded, i.e., collusion groups, with error quantification. Q-SID uses graded numeric question scores only, so it works for many formats of multiple-choice and non-multiple-choice exams. Q-SID reports two false-positive rates (FPRs) for each collusion group: (1) empirical FPR, whose null data are from 36 strictly proctored exam datasets independent of the user-input exam data and (2) synthetic FPR, whose null data are simulated from a copula-based probabilistic model, which is first fitted to the user-input exam data and then modified to have no collusion. On 34 unproctored exam datasets, including two benchmark datasets with true positives and negatives verified by textural analysis, we demonstrate that Q-SID is a collusion detection algorithm with powerful and robust performance across exam formats, numbers of questions and students, and exam complexity.
△ Less
Submitted 12 July, 2024; v1 submitted 10 July, 2024;
originally announced July 2024.
-
Atomic cluster expansion interatomic potential for defects and thermodynamics of Cu-W system
Authors:
Jiahao Pan,
Huiqun Cheng,
Gaosheng Yan,
Lei Zhang,
Wenshan Yu,
Shengping Shen
Abstract:
The unique properties exhibited in immiscible metals, such as excellent strength, hardness, and radiation-damage tolerance, have stimulated the interest of many researchers. As a typical immiscible metal system, the Cu-W nano-multilayers combine the plasticity of copper and the strength of tungsten, making it a suitable candidate for applications in aerospace, nuclear fusion engineering, and elect…
▽ More
The unique properties exhibited in immiscible metals, such as excellent strength, hardness, and radiation-damage tolerance, have stimulated the interest of many researchers. As a typical immiscible metal system, the Cu-W nano-multilayers combine the plasticity of copper and the strength of tungsten, making it a suitable candidate for applications in aerospace, nuclear fusion engineering, and electronic packaging etc. To understand the atomistic origin of the defects and thermodynamics of the Cu-W immiscible system, we have developed an accurate machine learning interatomic potential (ML-IAP) for Cu-W based on the atomic cluster expansion (ACE) method. The Cu-W ACE potential can faithfully reproduce the fundamental properties of Cu and W predicted by density functional theory (DFT). Moreover, the thermodynamical properties, such as the melting point, coefficient of thermal expansion, diffusion coefficient, and equation of the state curve of the Cu-W solid solution, are calculated and compared against DFT and experiments. Monte Carlo Molecular Dynamics (MC-MD) simulations performed with the Cu-W ACE potential predict the experimentally observed phase separation and uphill diffusion phenomena. Our findings not only provide an accurate ACE potential for describing the Cu-W immiscible system, but also shed light on understanding the atomistic mechanism during the Cu-W nano-multilayers formation process.
△ Less
Submitted 30 June, 2024;
originally announced July 2024.
-
On automorphism groups of polar codes
Authors:
Jicheng Ma,
Guiying Yan
Abstract:
Over the past years, Polar codes have arisen as a highly effective class of linear codes, equipped with a decoding algorithm of low computational complexity. This family of codes share a common algebraic formalism with the well-known Reed-Muller codes, which involves monomial evaluations. As useful algebraic codes, more specifically known as decreasing monomial codes, a lot of decoding work has be…
▽ More
Over the past years, Polar codes have arisen as a highly effective class of linear codes, equipped with a decoding algorithm of low computational complexity. This family of codes share a common algebraic formalism with the well-known Reed-Muller codes, which involves monomial evaluations. As useful algebraic codes, more specifically known as decreasing monomial codes, a lot of decoding work has been done on Reed-Muller codes based on their rich code automorphisms. In 2021, a new permutation group decoder, referred to as the automorphism ensemble (AE) decoder, was introduced. This decoder can be applied to Polar codes and has been shown to produce similar decoding effects. However, identifying the right set of code automorphisms that enhance decoding performance for Polar codes remains a challenging task. This paper aims to characterize the full automorphism group of Polar codes. We will prove a reduction theorem that effectively reduces the problem of determining the full automorphism group of arbitrary random Polar codes to that of a specified class of Polar codes. Besides, we give exact classification of the full automorphism groups of families of Polar codes that are constructed using the Reed-Muller codes.
△ Less
Submitted 14 June, 2024;
originally announced June 2024.
-
Planar Turán number for balanced double stars
Authors:
Xin Xu,
Qiang Zhou,
Tong Li,
Guiying Yan
Abstract:
Planar Turán number, denoted by $ex_{\mathcal{P}}(n,H)$, is the maximum number of edges in an $n$-vertex planar graph which does not contain $H$ as a subgraph. Ghosh, Győri, Paulos and Xiao initiated the topic of the planar Turán number for double stars. For balanced double star, $S_{3,3}$ is the only remaining graph need to be considered. In this paper, we give the exact value of…
▽ More
Planar Turán number, denoted by $ex_{\mathcal{P}}(n,H)$, is the maximum number of edges in an $n$-vertex planar graph which does not contain $H$ as a subgraph. Ghosh, Győri, Paulos and Xiao initiated the topic of the planar Turán number for double stars. For balanced double star, $S_{3,3}$ is the only remaining graph need to be considered. In this paper, we give the exact value of $ex_{\mathcal{P}}(n,S_{3,3})$, forcing the planar Turán number for all balanced double stars completely determined.
△ Less
Submitted 9 June, 2024;
originally announced June 2024.
-
Categorization of 31 computational methods to detect spatially variable genes from spatially resolved transcriptomics data
Authors:
Guanao Yan,
Shuo Harper Hua,
Jingyi Jessica Li
Abstract:
In the analysis of spatially resolved transcriptomics data, detecting spatially variable genes (SVGs) is crucial. Numerous computational methods exist, but varying SVG definitions and methodologies lead to incomparable results. We review 31 state-of-the-art methods, categorizing SVGs into three types: overall, cell-type-specific, and spatial-domain-marker SVGs. Our review explains the intuitions u…
▽ More
In the analysis of spatially resolved transcriptomics data, detecting spatially variable genes (SVGs) is crucial. Numerous computational methods exist, but varying SVG definitions and methodologies lead to incomparable results. We review 31 state-of-the-art methods, categorizing SVGs into three types: overall, cell-type-specific, and spatial-domain-marker SVGs. Our review explains the intuitions underlying these methods, summarizes their applications, and categorizes the hypothesis tests they use in the trade-off between generality and specificity for SVG detection. We discuss challenges in SVG detection and propose future directions for improvement. Our review offers insights for method developers and users, advocating for category-specific benchmarking.
△ Less
Submitted 8 July, 2024; v1 submitted 29 May, 2024;
originally announced May 2024.
-
Syngas conversion to higher alcohols via wood-framed Cu/Co-carbon catalyst
Authors:
Guihua Yan,
Paulina Pršlja,
Gaofeng Chen,
Jiahui Kang,
Yongde Liu,
Miguel A. Caro,
Xi Chen,
Xianhai Zeng,
Bo Peng
Abstract:
Syngas conversion into higher alcohols represents a promising avenue for transforming coal or biomass into liquid fuels. However, the commercialization of this process has been hindered by the high cost, low activity, and inadequate C$_{2+}$OH selectivity of catalysts. Herein, we have developed Cu/Co carbon wood catalysts, offering a cost-effective and stable alternative with exceptional selectivi…
▽ More
Syngas conversion into higher alcohols represents a promising avenue for transforming coal or biomass into liquid fuels. However, the commercialization of this process has been hindered by the high cost, low activity, and inadequate C$_{2+}$OH selectivity of catalysts. Herein, we have developed Cu/Co carbon wood catalysts, offering a cost-effective and stable alternative with exceptional selectivity for catalytic conversion. The formation of Cu/Co nanoparticles was found, influenced by water-1,2-propylene glycol ratios in the solution, resulting in bidisperse nanoparticles. The catalyst exhibited a remarkable CO conversion rate of 74.8% and a selectivity of 58.7% for C$_{2+}$OH, primarily comprising linear primary alcohols. This catalyst demonstrated enduring stability and selectivity under industrial conditions, maintaining its efficacy for up to 350 h of operation. We also employed density functional theory (DFT) to analyze selectivity, particularly focusing on the binding strength of CO, a crucial precursor for subsequent reactions leading to the formation of CH$_3$OH. DFT identified the pathway of CH$_x$ and CO coupling, ultimately yielding C$_2$H$_5$OH. This computational understanding, coupled with high performance of the Cu/Co-carbon wood catalyst, paves ways for the development of catalytically selective materials tailored for higher alcohols production, thereby ushering in new possibility in this field.
△ Less
Submitted 24 May, 2024;
originally announced May 2024.
-
Preservation of Topological Surface States in Millimeter-Scale Transferred Membranes
Authors:
Chi Ian Jess Ip,
Qiang Gao,
Khanhy Du Nguyen,
Chenhui Yan,
Gangbin Yan,
Eli Hoenig,
Thomas S. Marchese,
Minghao Zhang,
Woojoo Lee,
Hossein Rokni,
Ying Shirley Meng,
Chong Liu,
Shuolong Yang
Abstract:
Ultrathin topological insulator membranes are building blocks of exotic quantum matter. However, traditional epitaxy of these materials does not facilitate stacking in arbitrary orders, while mechanical exfoliation from bulk crystals is also challenging due to the non-negligible interlayer coupling therein. Here we liberate millimeter-scale films of topological insulator Bi$_2$Se$_3$, grown by mol…
▽ More
Ultrathin topological insulator membranes are building blocks of exotic quantum matter. However, traditional epitaxy of these materials does not facilitate stacking in arbitrary orders, while mechanical exfoliation from bulk crystals is also challenging due to the non-negligible interlayer coupling therein. Here we liberate millimeter-scale films of topological insulator Bi$_2$Se$_3$, grown by molecular beam epitaxy, down to 3 quintuple layers. We characterize the preservation of the topological surface states and quantum well states in transferred Bi$_{2}$Se$_{3}$ films using angle-resolved photoemission spectroscopy. Leveraging the photon-energy-dependent surface sensitivity, the photoemission spectra taken with $6$ eV and $21.2$ eV photons reveal a transfer-induced migration of the topological surface states from the top to the inner layers. By establishing clear electronic structures of the transferred films and unveiling the wavefunction relocation of the topological surface states, our work paves the physics foundation crucial for the future fabrication of artificially stacked topological materials with single-layer precision.
△ Less
Submitted 21 May, 2024;
originally announced May 2024.
-
Anti-Ramsey Numbers of Expansions of Doubly Edge-critical Graphs in Uniform Hypergraphs
Authors:
Tong Li,
Yucong Tang,
Guiying Yan
Abstract:
For an $r$-graph $H$, the anti-Ramsey number ${\rm ar}(n,r,H)$ is the minimum number $c$ of colors such that for any edge-coloring of the complete $r$-graph on $n$ vertices with at least $c$ colors, there is a copy of $H$ whose edges have distinct colors. A 2-graph $F$ is doubly edge-$p$-critical if the chromatic number $χ(F - e)\geq p$ for every edge $e$ in $F$ and there exist two edges…
▽ More
For an $r$-graph $H$, the anti-Ramsey number ${\rm ar}(n,r,H)$ is the minimum number $c$ of colors such that for any edge-coloring of the complete $r$-graph on $n$ vertices with at least $c$ colors, there is a copy of $H$ whose edges have distinct colors. A 2-graph $F$ is doubly edge-$p$-critical if the chromatic number $χ(F - e)\geq p$ for every edge $e$ in $F$ and there exist two edges $e_1,e_2$ in $F$ such that $χ(F -e_1- e_2)=p-1$. The anti-Ramsey numbers of doubly edge-$p$-critical 2-graphs were determined by Jiang and Pikhurko \cite{Jiang&Pikhurko2009}, which generalized the anti-Ramsey numbers of cliques determined by Erdős, Simonovits and Sós \cite{Erdos&Simonovits&Sos1975}. In general, few exact values of anti-Ramsey numbers of $r$-graphs are known for $r\geq 3$. Given a 2-graph $F$, the expansion $F^{(r)}$ of $F$ is an $r$-graph on $|V(F)|+(r-2)|F|$ vertices obtained from $F$ by adding $r-2$ new vertices to each edge of $F$. In this paper, we determine the exact value of ${\rm ar}(n,r,F^{(r)})$ for any doubly edge-$p$-critical 2-graph $F$ with $p>r\geq 3$ and sufficiently large $n$.
△ Less
Submitted 18 May, 2024;
originally announced May 2024.
-
Anti-Ramsey numbers of loose paths and cycles in uniform hypergraphs
Authors:
Tong Li,
Yucong Tang,
Guanghui Wang,
Guiying Yan
Abstract:
For a fixed family of $r$-uniform hypergraphs $\mathcal{F}$, the anti-Ramsey number of $\mathcal{F}$, denoted by $ ar(n,r,\mathcal{F})$, is the minimum number $c$ of colors such that for any edge-coloring of the complete $r$-uniform hypergraph on $n$ vertices with at least $c$ colors, there is a rainbow copy of some hypergraph in $\mathcal{F}$. Here, a rainbow hypergraph is an edge-colored hypergr…
▽ More
For a fixed family of $r$-uniform hypergraphs $\mathcal{F}$, the anti-Ramsey number of $\mathcal{F}$, denoted by $ ar(n,r,\mathcal{F})$, is the minimum number $c$ of colors such that for any edge-coloring of the complete $r$-uniform hypergraph on $n$ vertices with at least $c$ colors, there is a rainbow copy of some hypergraph in $\mathcal{F}$. Here, a rainbow hypergraph is an edge-colored hypergraph with all edges colored differently. Let $\mathcal{P}_k$ and $\mathcal{C}_k$ be the families of loose paths and loose cycles with $k$ edges in an $r$-uniform hypergraph, respectively. In this paper, we determine the exact values of $ ar(n,r,\mathcal{P}_k)$ and $ ar(n,r,\mathcal{C}_k)$ for all $k\geq 4$ and $r\geq 3$.
△ Less
Submitted 7 May, 2024;
originally announced May 2024.
-
Fundamental Bounds on Unequal Error Protection Codes
Authors:
Liuquan Yao,
Shuai Yuan,
Yuan Li,
Huazi Zhang,
Jun Wang,
Guiying Yan,
Zhiming Ma
Abstract:
Unequal error protection (UEP) codes can facilitate the transmission of messages with different protection levels. In this paper, we study the achievability bounds on UEP by the generalization of Gilbert-Varshamov (GV) bound. For the first time, we show that under certain conditions, UEP enhances the code rate comparing with time-sharing (TS) strategies asymptotically.
Unequal error protection (UEP) codes can facilitate the transmission of messages with different protection levels. In this paper, we study the achievability bounds on UEP by the generalization of Gilbert-Varshamov (GV) bound. For the first time, we show that under certain conditions, UEP enhances the code rate comparing with time-sharing (TS) strategies asymptotically.
△ Less
Submitted 6 May, 2024;
originally announced May 2024.
-
Sensing Spin Wave Excitations by Spin Defects in Few-Layer Thick Hexagonal Boron Nitride
Authors:
Jingcheng Zhou,
Hanyi Lu,
Di Chen,
Mengqi Huang,
Gerald Q. Yan,
Faris Al-matouq,
Jiu Chang,
Dziga Djugba,
Zhigang Jiang,
Hailong Wang,
Chunhui Rita Du
Abstract:
Optically active spin defects in wide band-gap semiconductors serve as a local sensor of multiple degrees of freedom in a variety of "hard" and "soft" condensed matter systems. Taking advantage of the recent progress on quantum sensing using van der Waals (vdW) quantum materials, here we report direct measurements of spin waves excited in magnetic insulator Y3Fe5O12 (YIG) by boron vacancy $V_B^-$…
▽ More
Optically active spin defects in wide band-gap semiconductors serve as a local sensor of multiple degrees of freedom in a variety of "hard" and "soft" condensed matter systems. Taking advantage of the recent progress on quantum sensing using van der Waals (vdW) quantum materials, here we report direct measurements of spin waves excited in magnetic insulator Y3Fe5O12 (YIG) by boron vacancy $V_B^-$ spin defects contained in few-layer thick hexagonal boron nitride nanoflakes. We show that the ferromagnetic resonance and parametric spin excitations can be effectively detected by $V_B^-$ spin defects under various experimental conditions through optically detected magnetic resonance measurements. The off-resonant dipole interaction between YIG magnons and $V_B^-$ spin defects is mediated by multi-magnon scattering processes, which may find relevant applications in a range of emerging quantum sensing, computing, and metrology technologies. Our results also highlight the opportunities offered by quantum spin defects in layered two-dimensional vdW materials for investigating local spin dynamic behaviors in magnetic solid-state matters.
△ Less
Submitted 1 May, 2024;
originally announced May 2024.
-
Provably Robust Conformal Prediction with Improved Efficiency
Authors:
Ge Yan,
Yaniv Romano,
Tsui-Wei Weng
Abstract:
Conformal prediction is a powerful tool to generate uncertainty sets with guaranteed coverage using any predictive model, under the assumption that the training and test data are i.i.d.. Recently, it has been shown that adversarial examples are able to manipulate conformal methods to construct prediction sets with invalid coverage rates, as the i.i.d. assumption is violated. To address this issue,…
▽ More
Conformal prediction is a powerful tool to generate uncertainty sets with guaranteed coverage using any predictive model, under the assumption that the training and test data are i.i.d.. Recently, it has been shown that adversarial examples are able to manipulate conformal methods to construct prediction sets with invalid coverage rates, as the i.i.d. assumption is violated. To address this issue, a recent work, Randomized Smoothed Conformal Prediction (RSCP), was first proposed to certify the robustness of conformal prediction methods to adversarial noise. However, RSCP has two major limitations: (i) its robustness guarantee is flawed when used in practice and (ii) it tends to produce large uncertainty sets. To address these limitations, we first propose a novel framework called RSCP+ to provide provable robustness guarantee in evaluation, which fixes the issues in the original RSCP method. Next, we propose two novel methods, Post-Training Transformation (PTT) and Robust Conformal Training (RCT), to effectively reduce prediction set size with little computation overhead. Experimental results in CIFAR10, CIFAR100, and ImageNet suggest the baseline method only yields trivial predictions including full label set, while our methods could boost the efficiency by up to $4.36\times$, $5.46\times$, and $16.9\times$ respectively and provide practical robustness guarantee. Our codes are available at https://github.com/Trustworthy-ML-Lab/Provably-Robust-Conformal-Prediction.
△ Less
Submitted 30 April, 2024;
originally announced April 2024.
-
Second-Order Identification Capacity of AWGN Channels
Authors:
Zhicheng Liu,
Yuan Li,
Huazi Zhang,
Jun Wang,
Guiying Yan,
Zhiming Ma
Abstract:
In this paper, we establish the second-order randomized identification capacity (RID capacity) of the Additive White Gaussian Noise Channel (AWGNC). On the one hand, we obtain a refined version of Hayashi's theorem to prove the achievability part. On the other, we investigate the relationship between identification and channel resolvability, then we propose a finer quantization method to prove the…
▽ More
In this paper, we establish the second-order randomized identification capacity (RID capacity) of the Additive White Gaussian Noise Channel (AWGNC). On the one hand, we obtain a refined version of Hayashi's theorem to prove the achievability part. On the other, we investigate the relationship between identification and channel resolvability, then we propose a finer quantization method to prove the converse part. Consequently, the second-order RID capacity of the AWGNC has the same form as the second-order transmission capacity. The only difference is that the maximum number of messages in RID scales double exponentially in the blocklength.
△ Less
Submitted 27 June, 2024; v1 submitted 21 April, 2024;
originally announced April 2024.
-
On Reducing the Execution Latency of Superconducting Quantum Processors via Quantum Program Scheduling
Authors:
Wenjie Wu,
Yiquan Wang,
Ge Yan,
Yuming Zhao,
Junchi Yan
Abstract:
Quantum computing has gained considerable attention, especially after the arrival of the Noisy Intermediate-Scale Quantum (NISQ) era. Quantum processors and cloud services have been made world-wide increasingly available. Unfortunately, programs on existing quantum processors are often executed in series, and the workload could be heavy to the processor. Typically, one has to wait for hours or eve…
▽ More
Quantum computing has gained considerable attention, especially after the arrival of the Noisy Intermediate-Scale Quantum (NISQ) era. Quantum processors and cloud services have been made world-wide increasingly available. Unfortunately, programs on existing quantum processors are often executed in series, and the workload could be heavy to the processor. Typically, one has to wait for hours or even longer to obtain the result of a single quantum program on public quantum cloud due to long queue time. In fact, as the scale grows, the qubit utilization rate of the serial execution mode will further diminish, causing the waste of quantum resources. In this paper, to our best knowledge for the first time, the Quantum Program Scheduling Problem (QPSP) is formulated and introduced to improve the utility efficiency of quantum resources. Specifically, a quantum program scheduling method concerning the circuit width, number of measurement shots, and submission time of quantum programs is proposed to reduce the execution latency. We conduct extensive experiments on a simulated Qiskit noise model, as well as on the Xiaohong (from QuantumCTek) superconducting quantum processor. Numerical results show the effectiveness in both QPU time and turnaround time.
△ Less
Submitted 11 April, 2024;
originally announced April 2024.
-
New Partial Orders of Polar Codes for BMSC
Authors:
Liuquan Yao,
Zhichao Liu,
Yuan Li,
Huazi Zhang,
Jun Wang,
Guiying Yan,
Zhiming Ma
Abstract:
In this paper, we define partial orders (POs) of polar codes based on the Bhattacharyya parameter and the bit-error probability, respectively. These POs are applicable to arbitrary binary memoryless symmetric channel (BMSC). Leveraging the extremal inequalities of polarization transformation, we derive new POs for BMSC based on the corresponding POs observed in the Binary Erasure Channel (BEC). %A…
▽ More
In this paper, we define partial orders (POs) of polar codes based on the Bhattacharyya parameter and the bit-error probability, respectively. These POs are applicable to arbitrary binary memoryless symmetric channel (BMSC). Leveraging the extremal inequalities of polarization transformation, we derive new POs for BMSC based on the corresponding POs observed in the Binary Erasure Channel (BEC). %Additionally, we discover more special POs in the Binary Symmetric Channel (BSC). We provide examples that demonstrate the inability of existing POs to deduce these novel POs. Furthermore, we establish upper bounds for the expansion parameter $β$ if the polar codes constructed by $β$-expansion method obey these POs.
△ Less
Submitted 19 April, 2024; v1 submitted 10 April, 2024;
originally announced April 2024.
-
Evolutionary game on any hypergraph
Authors:
Dini Wang,
Peng Yi,
Yiguang Hong,
Jie Chen,
Gang Yan
Abstract:
Cooperation plays a fundamental role in societal and biological domains, and the population structure profoundly shapes the dynamics of evolution. Practically, individuals behave either altruistically or egoistically in multiple groups, such as relatives, friends and colleagues, and feedbacks from these groupwise interactions will contribute to one's cognition and behavior. Due to the intricacy wi…
▽ More
Cooperation plays a fundamental role in societal and biological domains, and the population structure profoundly shapes the dynamics of evolution. Practically, individuals behave either altruistically or egoistically in multiple groups, such as relatives, friends and colleagues, and feedbacks from these groupwise interactions will contribute to one's cognition and behavior. Due to the intricacy within and between groups, exploration of evolutionary dynamics over hypergraphs is relatively limited to date. To uncover this conundrum, we develop a higher-order random walk framework for five distinct updating rules, thus establishing explicit conditions for cooperation emergence on hypergraphs, and finding the overlaps between groups tend to foster cooperative behaviors. Our systematic analysis quantifies how the order and hyperdegree govern evolutionary outcomes. We also discover that whenever following a group wisdom update protocol, choosing a high-fitness group to interact equally within its members, cooperators will significantly prevail throughout the community. These findings underscore a crucial role of higher-order interaction and interdisciplinary collaboration throughout a broad range of living systems, favoring social prosperity.
△ Less
Submitted 4 April, 2024;
originally announced April 2024.
-
On the Performance of Low-complexity Decoders of LDPC and Polar Codes
Authors:
Qingqing Peng,
Dawei Yin,
Dongxu Chang,
Yuan Li,
Huazi Zhang,
Guiying Yan,
Guanghui Wang
Abstract:
Efficient decoding is crucial to high-throughput and low-power wireless communication scenarios. A theoretical analysis of the performance-complexity tradeoff toward low-complexity decoding is required for a better understanding of the fundamental limits in the above-mentioned scenarios. This study aims to explore the performance of decoders with complexity constraints. Specifically, we investigat…
▽ More
Efficient decoding is crucial to high-throughput and low-power wireless communication scenarios. A theoretical analysis of the performance-complexity tradeoff toward low-complexity decoding is required for a better understanding of the fundamental limits in the above-mentioned scenarios. This study aims to explore the performance of decoders with complexity constraints. Specifically, we investigate the performance of LDPC codes with different numbers of belief-propagation iterations and the performance of polar codes with an SSC decoder. We found that the asymptotic error rates of both polar codes and LDPC codes are functions of complexity $T$ and code length $N$, in the form of $2^{-a2^{b\frac{T}{N}}}$, where $a$ and $b$ are constants that depend on channel and coding schemes. Our analysis reveals the different performance-complexity tradeoffs for LDPC and polar codes. The results indicate that if one aims to further enhance the decoding efficiency for LDPC codes, the key lies in how to efficiently pass messages on the factor graph. In terms of decoding efficiency, polar codes asymptotically outperform $(J, K)$-regular LDPC codes with a code rate $R \le 1-\frac{J(J-1)}{2^J+(J-1)}$ in the low-complexity regime $(T \le O(NlogN))$.
△ Less
Submitted 3 April, 2024; v1 submitted 28 March, 2024;
originally announced March 2024.
-
Ultrafast Adaptive Primary Frequency Tuning and Secondary Frequency Identification for S/S WPT system
Authors:
Chang Liu,
Wei Han,
Guangyu Yan,
Bowang Zhang,
Chunlin Li
Abstract:
Magnetic resonance wireless power transfer (WPT) technology is increasingly being adopted across diverse applications. However, its effectiveness can be significantly compromised by parameter shifts within the resonance network, owing to its high system quality factor. Such shifts are inherent and challenging to mitigate during the manufacturing process. In response, this article introduces a rapi…
▽ More
Magnetic resonance wireless power transfer (WPT) technology is increasingly being adopted across diverse applications. However, its effectiveness can be significantly compromised by parameter shifts within the resonance network, owing to its high system quality factor. Such shifts are inherent and challenging to mitigate during the manufacturing process. In response, this article introduces a rapid frequency tuning approach. Leveraging switch-controlled capacitors (SCC) to adjust the resonance network and the primary side's operating frequency, alongside a current zero-crossing detection (ZCD) circuit for voltage-current phase determination, this method circumvents the need for intricate knowledge of WPT system parameters. Moreover, it obviates the necessity for inter-side communication for real-time identification of the secondary side resonance frequency. The swift response of SCC and two-step perturb-and-observe algorithm mitigate output disturbances, thereby expediting the frequency tuning process. Experimental validation on a 200W Series-Series compensated WPT (SS-WPT) system demonstrates that the proposed method achieves frequency recognition accuracy within 0.7kHz in less than 1ms, increasing system efficiency up to 9%.
△ Less
Submitted 26 March, 2024;
originally announced March 2024.
-
Multitask frame-level learning for few-shot sound event detection
Authors:
Liang Zou,
Genwei Yan,
Ruoyu Wang,
Jun Du,
Meng Lei,
Tian Gao,
Xin Fang
Abstract:
This paper focuses on few-shot Sound Event Detection (SED), which aims to automatically recognize and classify sound events with limited samples. However, prevailing methods methods in few-shot SED predominantly rely on segment-level predictions, which often providing detailed, fine-grained predictions, particularly for events of brief duration. Although frame-level prediction strategies have been…
▽ More
This paper focuses on few-shot Sound Event Detection (SED), which aims to automatically recognize and classify sound events with limited samples. However, prevailing methods methods in few-shot SED predominantly rely on segment-level predictions, which often providing detailed, fine-grained predictions, particularly for events of brief duration. Although frame-level prediction strategies have been proposed to overcome these limitations, these strategies commonly face difficulties with prediction truncation caused by background noise. To alleviate this issue, we introduces an innovative multitask frame-level SED framework. In addition, we introduce TimeFilterAug, a linear timing mask for data augmentation, to increase the model's robustness and adaptability to diverse acoustic environments. The proposed method achieves a F-score of 63.8%, securing the 1st rank in the few-shot bioacoustic event detection category of the Detection and Classification of Acoustic Scenes and Events Challenge 2023.
△ Less
Submitted 17 March, 2024;
originally announced March 2024.
-
Gemma: Open Models Based on Gemini Research and Technology
Authors:
Gemma Team,
Thomas Mesnard,
Cassidy Hardin,
Robert Dadashi,
Surya Bhupatiraju,
Shreya Pathak,
Laurent Sifre,
Morgane Rivière,
Mihir Sanjay Kale,
Juliette Love,
Pouya Tafti,
Léonard Hussenot,
Pier Giuseppe Sessa,
Aakanksha Chowdhery,
Adam Roberts,
Aditya Barua,
Alex Botev,
Alex Castro-Ros,
Ambrose Slone,
Amélie Héliou,
Andrea Tacchetti,
Anna Bulanova,
Antonia Paterson,
Beth Tsai,
Bobak Shahriari
, et al. (83 additional authors not shown)
Abstract:
This work introduces Gemma, a family of lightweight, state-of-the art open models built from the research and technology used to create Gemini models. Gemma models demonstrate strong performance across academic benchmarks for language understanding, reasoning, and safety. We release two sizes of models (2 billion and 7 billion parameters), and provide both pretrained and fine-tuned checkpoints. Ge…
▽ More
This work introduces Gemma, a family of lightweight, state-of-the art open models built from the research and technology used to create Gemini models. Gemma models demonstrate strong performance across academic benchmarks for language understanding, reasoning, and safety. We release two sizes of models (2 billion and 7 billion parameters), and provide both pretrained and fine-tuned checkpoints. Gemma outperforms similarly sized open models on 11 out of 18 text-based tasks, and we present comprehensive evaluations of safety and responsibility aspects of the models, alongside a detailed description of model development. We believe the responsible release of LLMs is critical for improving the safety of frontier models, and for enabling the next wave of LLM innovations.
△ Less
Submitted 16 April, 2024; v1 submitted 13 March, 2024;
originally announced March 2024.
-
Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context
Authors:
Gemini Team,
Petko Georgiev,
Ving Ian Lei,
Ryan Burnell,
Libin Bai,
Anmol Gulati,
Garrett Tanzer,
Damien Vincent,
Zhufeng Pan,
Shibo Wang,
Soroosh Mariooryad,
Yifan Ding,
Xinyang Geng,
Fred Alcober,
Roy Frostig,
Mark Omernick,
Lexi Walker,
Cosmin Paduraru,
Christina Sorokin,
Andrea Tacchetti,
Colin Gaffney,
Samira Daruki,
Olcan Sercinoglu,
Zach Gleicher,
Juliette Love
, et al. (1092 additional authors not shown)
Abstract:
In this report, we introduce the Gemini 1.5 family of models, representing the next generation of highly compute-efficient multimodal models capable of recalling and reasoning over fine-grained information from millions of tokens of context, including multiple long documents and hours of video and audio. The family includes two new models: (1) an updated Gemini 1.5 Pro, which exceeds the February…
▽ More
In this report, we introduce the Gemini 1.5 family of models, representing the next generation of highly compute-efficient multimodal models capable of recalling and reasoning over fine-grained information from millions of tokens of context, including multiple long documents and hours of video and audio. The family includes two new models: (1) an updated Gemini 1.5 Pro, which exceeds the February version on the great majority of capabilities and benchmarks; (2) Gemini 1.5 Flash, a more lightweight variant designed for efficiency with minimal regression in quality. Gemini 1.5 models achieve near-perfect recall on long-context retrieval tasks across modalities, improve the state-of-the-art in long-document QA, long-video QA and long-context ASR, and match or surpass Gemini 1.0 Ultra's state-of-the-art performance across a broad set of benchmarks. Studying the limits of Gemini 1.5's long-context ability, we find continued improvement in next-token prediction and near-perfect retrieval (>99%) up to at least 10M tokens, a generational leap over existing models such as Claude 3.0 (200k) and GPT-4 Turbo (128k). Finally, we highlight real-world use cases, such as Gemini 1.5 collaborating with professionals on completing their tasks achieving 26 to 75% time savings across 10 different job categories, as well as surprising new capabilities of large language models at the frontier; when given a grammar manual for Kalamang, a language with fewer than 200 speakers worldwide, the model learns to translate English to Kalamang at a similar level to a person who learned from the same content.
△ Less
Submitted 14 June, 2024; v1 submitted 8 March, 2024;
originally announced March 2024.
-
Power-Flow-Embedded Projection Conic Matrix Completion for Low-Observable Distribution Systems
Authors:
Xuzhuo Wang,
Guoan Yan,
Zhengshuo Li
Abstract:
A low-observable distribution system has insufficient measurements for conventional weighted least square state estimators. Matrix completion state estimators have been suggested, but their computational times could be prohibitive. To resolve this problem, a novel and efficient power-flow-embedded projection conic matrix completion method customized for low-observable distribution systems is propo…
▽ More
A low-observable distribution system has insufficient measurements for conventional weighted least square state estimators. Matrix completion state estimators have been suggested, but their computational times could be prohibitive. To resolve this problem, a novel and efficient power-flow-embedded projection conic matrix completion method customized for low-observable distribution systems is proposed in this letter. This method can yield more accurate state estimations (2-fold improvement) in a much shorter time (5% or less) than other methods. Case studies on different-scale systems have demonstrated the efficacy of the proposed method when applied to low-observable distribution system state estimation problems.
△ Less
Submitted 7 March, 2024;
originally announced March 2024.
-
DNAct: Diffusion Guided Multi-Task 3D Policy Learning
Authors:
Ge Yan,
Yueh-Hua Wu,
Xiaolong Wang
Abstract:
This paper presents DNAct, a language-conditioned multi-task policy framework that integrates neural rendering pre-training and diffusion training to enforce multi-modality learning in action sequence spaces. To learn a generalizable multi-task policy with few demonstrations, the pre-training phase of DNAct leverages neural rendering to distill 2D semantic features from foundation models such as S…
▽ More
This paper presents DNAct, a language-conditioned multi-task policy framework that integrates neural rendering pre-training and diffusion training to enforce multi-modality learning in action sequence spaces. To learn a generalizable multi-task policy with few demonstrations, the pre-training phase of DNAct leverages neural rendering to distill 2D semantic features from foundation models such as Stable Diffusion to a 3D space, which provides a comprehensive semantic understanding regarding the scene. Consequently, it allows various applications to challenging robotic tasks requiring rich 3D semantics and accurate geometry. Furthermore, we introduce a novel approach utilizing diffusion training to learn a vision and language feature that encapsulates the inherent multi-modality in the multi-task demonstrations. By reconstructing the action sequences from different tasks via the diffusion process, the model is capable of distinguishing different modalities and thus improving the robustness and the generalizability of the learned representation. DNAct significantly surpasses SOTA NeRF-based multi-task manipulation approaches with over 30% improvement in success rate. Project website: dnact.github.io.
△ Less
Submitted 8 March, 2024; v1 submitted 6 March, 2024;
originally announced March 2024.
-
Maintaining Adversarial Robustness in Continuous Learning
Authors:
Xiaolei Ru,
Xiaowei Cao,
Zijia Liu,
Jack Murdoch Moore,
Xin-Ya Zhang,
Xia Zhu,
Wenjia Wei,
Gang Yan
Abstract:
Adversarial robustness is essential for security and reliability of machine learning systems. However, the adversarial robustness gained by sophisticated defense algorithms is easily erased as the neural network evolves to learn new tasks. This vulnerability can be addressed by fostering a novel capability for neural networks, termed continual robust learning, which focuses on both the (classifica…
▽ More
Adversarial robustness is essential for security and reliability of machine learning systems. However, the adversarial robustness gained by sophisticated defense algorithms is easily erased as the neural network evolves to learn new tasks. This vulnerability can be addressed by fostering a novel capability for neural networks, termed continual robust learning, which focuses on both the (classification) performance and adversarial robustness on previous tasks during continuous learning. To achieve continuous robust learning, we propose an approach called Double Gradient Projection that projects the gradients for weight updates orthogonally onto two crucial subspaces -- one for stabilizing the smoothed sample gradients and another for stabilizing the final outputs of the neural network. The experimental results on four benchmarks demonstrate that the proposed approach effectively maintains continuous robustness against strong adversarial attacks, outperforming the baselines formed by combining the existing defense strategies and continual learning methods.
△ Less
Submitted 17 February, 2024;
originally announced February 2024.
-
OASim: an Open and Adaptive Simulator based on Neural Rendering for Autonomous Driving
Authors:
Guohang Yan,
Jiahao Pi,
Jianfei Guo,
Zhaotong Luo,
Min Dou,
Nianchen Deng,
Qiusheng Huang,
Daocheng Fu,
Licheng Wen,
Pinlong Cai,
Xing Gao,
Xinyu Cai,
Bo Zhang,
Xuemeng Yang,
Yeqi Bai,
Hongbin Zhou,
Botian Shi
Abstract:
With deep learning and computer vision technology development, autonomous driving provides new solutions to improve traffic safety and efficiency. The importance of building high-quality datasets is self-evident, especially with the rise of end-to-end autonomous driving algorithms in recent years. Data plays a core role in the algorithm closed-loop system. However, collecting real-world data is ex…
▽ More
With deep learning and computer vision technology development, autonomous driving provides new solutions to improve traffic safety and efficiency. The importance of building high-quality datasets is self-evident, especially with the rise of end-to-end autonomous driving algorithms in recent years. Data plays a core role in the algorithm closed-loop system. However, collecting real-world data is expensive, time-consuming, and unsafe. With the development of implicit rendering technology and in-depth research on using generative models to produce data at scale, we propose OASim, an open and adaptive simulator and autonomous driving data generator based on implicit neural rendering. It has the following characteristics: (1) High-quality scene reconstruction through neural implicit surface reconstruction technology. (2) Trajectory editing of the ego vehicle and participating vehicles. (3) Rich vehicle model library that can be freely selected and inserted into the scene. (4) Rich sensors model library where you can select specified sensors to generate data. (5) A highly customizable data generation system can generate data according to user needs. We demonstrate the high quality and fidelity of the generated data through perception performance evaluation on the Carla simulator and real-world data acquisition. Code is available at https://github.com/PJLab-ADG/OASim.
△ Less
Submitted 6 February, 2024;
originally announced February 2024.
-
Improved Quantization Strategies for Managing Heavy-tailed Gradients in Distributed Learning
Authors:
Guangfeng Yan,
Tan Li,
Yuanzhang Xiao,
Hanxu Hou,
Linqi Song
Abstract:
Gradient compression has surfaced as a key technique to address the challenge of communication efficiency in distributed learning. In distributed deep learning, however, it is observed that gradient distributions are heavy-tailed, with outliers significantly influencing the design of compression strategies. Existing parameter quantization methods experience performance degradation when this heavy-…
▽ More
Gradient compression has surfaced as a key technique to address the challenge of communication efficiency in distributed learning. In distributed deep learning, however, it is observed that gradient distributions are heavy-tailed, with outliers significantly influencing the design of compression strategies. Existing parameter quantization methods experience performance degradation when this heavy-tailed feature is ignored. In this paper, we introduce a novel compression scheme specifically engineered for heavy-tailed gradients, which effectively combines gradient truncation with quantization. This scheme is adeptly implemented within a communication-limited distributed Stochastic Gradient Descent (SGD) framework. We consider a general family of heavy-tail gradients that follow a power-law distribution, we aim to minimize the error resulting from quantization, thereby determining optimal values for two critical parameters: the truncation threshold and the quantization density. We provide a theoretical analysis on the convergence error bound under both uniform and non-uniform quantization scenarios. Comparative experiments with other benchmarks demonstrate the effectiveness of our proposed method in managing the heavy-tailed gradients in a distributed learning environment.
△ Less
Submitted 2 February, 2024;
originally announced February 2024.
-
Truncated Non-Uniform Quantization for Distributed SGD
Authors:
Guangfeng Yan,
Tan Li,
Yuanzhang Xiao,
Congduan Li,
Linqi Song
Abstract:
To address the communication bottleneck challenge in distributed learning, our work introduces a novel two-stage quantization strategy designed to enhance the communication efficiency of distributed Stochastic Gradient Descent (SGD). The proposed method initially employs truncation to mitigate the impact of long-tail noise, followed by a non-uniform quantization of the post-truncation gradients ba…
▽ More
To address the communication bottleneck challenge in distributed learning, our work introduces a novel two-stage quantization strategy designed to enhance the communication efficiency of distributed Stochastic Gradient Descent (SGD). The proposed method initially employs truncation to mitigate the impact of long-tail noise, followed by a non-uniform quantization of the post-truncation gradients based on their statistical characteristics. We provide a comprehensive convergence analysis of the quantized distributed SGD, establishing theoretical guarantees for its performance. Furthermore, by minimizing the convergence error, we derive optimal closed-form solutions for the truncation threshold and non-uniform quantization levels under given communication constraints. Both theoretical insights and extensive experimental evaluations demonstrate that our proposed algorithm outperforms existing quantization schemes, striking a superior balance between communication efficiency and convergence performance.
△ Less
Submitted 2 February, 2024;
originally announced February 2024.
-
Towards Quantum-Safe Federated Learning via Homomorphic Encryption: Learning with Gradients
Authors:
Guangfeng Yan,
Shanxiang Lyu,
Hanxu Hou,
Zhiyong Zheng,
Linqi Song
Abstract:
This paper introduces a privacy-preserving distributed learning framework via private-key homomorphic encryption. Thanks to the randomness of the quantization of gradients, our learning with error (LWE) based encryption can eliminate the error terms, thus avoiding the issue of error expansion in conventional LWE-based homomorphic encryption. The proposed system allows a large number of learning pa…
▽ More
This paper introduces a privacy-preserving distributed learning framework via private-key homomorphic encryption. Thanks to the randomness of the quantization of gradients, our learning with error (LWE) based encryption can eliminate the error terms, thus avoiding the issue of error expansion in conventional LWE-based homomorphic encryption. The proposed system allows a large number of learning participants to engage in neural network-based deep learning collaboratively over an honest-but-curious server, while ensuring the cryptographic security of participants' uploaded gradients.
△ Less
Submitted 2 February, 2024;
originally announced February 2024.
-
Outlier-immune Data-driven Linear Power Flow Model Construction via Mixed-Integer Programming
Authors:
Guoan Yan,
Zhengshuo Li
Abstract:
The common approaches to construct a data-driven linear power flow (DD-LPF) model cannot completely eliminate the adverse impacts of outliers in a training dataset. In this letter, a novel outlier-immune DD-LPF model construction method via mixed-integer programming is presented for automatically and optimally identifying outliers to form a more accurate LPF model. Two acceleration solution strate…
▽ More
The common approaches to construct a data-driven linear power flow (DD-LPF) model cannot completely eliminate the adverse impacts of outliers in a training dataset. In this letter, a novel outlier-immune DD-LPF model construction method via mixed-integer programming is presented for automatically and optimally identifying outliers to form a more accurate LPF model. Two acceleration solution strategies are further suggested to reduce the computational time. Case studies demonstrate the superior accuracy and comparable computational time of the proposed method when compared to three common approaches.
△ Less
Submitted 25 December, 2023;
originally announced December 2023.
-
Realistic Rainy Weather Simulation for LiDARs in CARLA Simulator
Authors:
Donglin Yang,
Zhenfeng Liu,
Wentao Jiang,
Guohang Yan,
Xing Gao,
Botian Shi,
Si Liu,
Xinyu Cai
Abstract:
Employing data augmentation methods to enhance perception performance in adverse weather has attracted considerable attention recently. Most of the LiDAR augmentation methods post-process the existing dataset by physics-based models or machine-learning methods. However, due to the limited environmental annotations and the fixed vehicle trajectories in the existing dataset, it is challenging to edi…
▽ More
Employing data augmentation methods to enhance perception performance in adverse weather has attracted considerable attention recently. Most of the LiDAR augmentation methods post-process the existing dataset by physics-based models or machine-learning methods. However, due to the limited environmental annotations and the fixed vehicle trajectories in the existing dataset, it is challenging to edit the scene and expand the diversity of traffic flow and scenario. To this end, we propose a simulator-based physical modeling approach to augment LiDAR data in rainy weather in order to improve the perception performance of LiDAR in this scenario. We complete the modeling task of the rainy weather in the CARLA simulator and establish a pipeline for LiDAR data collection. In particular, we pay special attention to the spray and splash rolled up by the wheels of surrounding vehicles in rain and complete the simulation of this special scenario through the Spray Emitter method we developed. In addition, we examine the influence of different weather conditions on the intensity of the LiDAR echo, develop a prediction network for the intensity of the LiDAR echo, and complete the simulation of 4-feat LiDAR point cloud data. In the experiment, we observe that the model augmented by the synthetic data improves the object detection task's performance in the rainy sequence of the Waymo Open Dataset. Both the code and the dataset will be made publicly available at https://github.com/PJLab-ADG/PCSim#rainypcsim.
△ Less
Submitted 20 December, 2023;
originally announced December 2023.
-
Limit Law for the Maximum Interpoint Distance of High Dimensional Dependent Variables
Authors:
Guowei Yan,
Long Feng
Abstract:
In this paper, we considier the limiting distribution of the maximum interpoint Euclidean distance $M_n=\max _{1 \leq i<j \leq n}\left\|\boldsymbol{X}_i-\boldsymbol{X}_j\right\|$, where $\boldsymbol{X}_1, \boldsymbol{X}_2, \ldots, \boldsymbol{X}_n$ be a random sample coming from a $p$-dimensional population with dependent sub-gaussian components. When the dimension tends to infinity with the sampl…
▽ More
In this paper, we considier the limiting distribution of the maximum interpoint Euclidean distance $M_n=\max _{1 \leq i<j \leq n}\left\|\boldsymbol{X}_i-\boldsymbol{X}_j\right\|$, where $\boldsymbol{X}_1, \boldsymbol{X}_2, \ldots, \boldsymbol{X}_n$ be a random sample coming from a $p$-dimensional population with dependent sub-gaussian components. When the dimension tends to infinity with the sample size, we proves that $M_n^2$ under a suitable normalization asymptotically obeys a Gumbel type distribution. The proofs mainly depend on the Stein-Chen Poisson approximation method and high dimensional Gaussian approximation.
△ Less
Submitted 17 December, 2023;
originally announced December 2023.
-
DePRL: Achieving Linear Convergence Speedup in Personalized Decentralized Learning with Shared Representations
Authors:
Guojun Xiong,
Gang Yan,
Shiqiang Wang,
Jian Li
Abstract:
Decentralized learning has emerged as an alternative method to the popular parameter-server framework which suffers from high communication burden, single-point failure and scalability issues due to the need of a central server. However, most existing works focus on a single shared model for all workers regardless of the data heterogeneity problem, rendering the resulting model performing poorly o…
▽ More
Decentralized learning has emerged as an alternative method to the popular parameter-server framework which suffers from high communication burden, single-point failure and scalability issues due to the need of a central server. However, most existing works focus on a single shared model for all workers regardless of the data heterogeneity problem, rendering the resulting model performing poorly on individual workers. In this work, we propose a novel personalized decentralized learning algorithm named DePRL via shared representations. Our algorithm relies on ideas from representation learning theory to learn a low-dimensional global representation collaboratively among all workers in a fully decentralized manner, and a user-specific low-dimensional local head leading to a personalized solution for each worker. We show that DePRL achieves, for the first time, a provable linear speedup for convergence with general non-linear representations (i.e., the convergence rate is improved linearly with respect to the number of workers). Experimental results support our theoretical findings showing the superiority of our method in data heterogeneous environments.
△ Less
Submitted 17 December, 2023;
originally announced December 2023.
-
Layered Randomized Quantization for Communication-Efficient and Privacy-Preserving Distributed Learning
Authors:
Guangfeng Yan,
Tan Li,
Tian Lan,
Kui Wu,
Linqi Song
Abstract:
Next-generation wireless networks, such as edge intelligence and wireless distributed learning, face two critical challenges: communication efficiency and privacy protection. In this work, our focus is on addressing these issues in a distributed learning framework. We consider a new approach that simultaneously achieves communication efficiency and privacy protection by exploiting the privacy adva…
▽ More
Next-generation wireless networks, such as edge intelligence and wireless distributed learning, face two critical challenges: communication efficiency and privacy protection. In this work, our focus is on addressing these issues in a distributed learning framework. We consider a new approach that simultaneously achieves communication efficiency and privacy protection by exploiting the privacy advantage offered by quantization. Specifically, we use a quantization scheme called \textbf{Gau}ssian \textbf{L}ayered \textbf{R}andomized \textbf{Q}uantization (Gau-LRQ) that compresses the raw model gradients using a layer multishift coupler. By adjusting the parameters of Gau-LRQ, we shape the quantization error to follow the expected Gaussian distribution, thus ensuring client-level differential privacy (CLDP). We demonstrate the effectiveness of our proposed Gau-LRQ in the distributed stochastic gradient descent (SGD) framework and theoretically quantify the trade-offs between communication, privacy, and convergence performance. We further improve the convergence performance by enabling dynamic private budget and quantization bit allocation. We achieve this by using an optimization formula that minimizes convergence error subject to the privacy budget constraint. We evaluate our approach on multiple datasets, including MNIST, CIFAR-10, and CIFAR-100, and show that our proposed method outperforms the baselines in terms of learning performance under various privacy constraints. Moreover, we observe that dynamic privacy allocation yields additional accuracy improvements for the models compared to the fixed scheme.
△ Less
Submitted 12 December, 2023;
originally announced December 2023.
-
FlatProxy: A DPU-centric Service Mesh Architecture for Hyperscale Cloud-native Application
Authors:
Ming Li,
Wenyan Lu,
Hanyue Lin,
Jingya Wu,
Yu Zhang,
Guihai Yan
Abstract:
Service mesh is a fundamental technology for building cloud-native applications, which ensures the stable running of a large number of services by an intermediate layer that governs communication between services. However, service mesh is not well suited for high-performance scenarios. The root cause is that the current service mesh is not suitable for the evolution of cloud-native applications. O…
▽ More
Service mesh is a fundamental technology for building cloud-native applications, which ensures the stable running of a large number of services by an intermediate layer that governs communication between services. However, service mesh is not well suited for high-performance scenarios. The root cause is that the current service mesh is not suitable for the evolution of cloud-native applications. On the one hand, the service mesh built on CPU cannot listen to communication bypassing the CPU. On the other hand, service mesh includes many I/O-intensive and computationally-intensive tasks that can overload CPU cores as traffic grows beyond CPU performance.
Therefore, we propose a data-centric service mesh that migrates the proxy of the service mesh to the entrance of the network. Moreover, we also design the DPU-centric FlatProxy, a data-centric service mesh based on DPU. There are three advantages to the DPU-centric service mesh. Firstly, it takes over all traffic flow in and out of the node, which expands the sense scale of the service mesh from container to node. Secondly, it improves communication performance and reduces host resource usage by offloading some functions and optimizing communication. Thirdly, it minimizes performance and security issues through the physical isolation of business services and cloud infrastructure.
Compared with Envoy, the current mainstream service mesh implementation, FlatProxy reduces latency by 90\% and improves throughput by 4x in Gbps and 8x in qps, and it only occupies a small amount of CPU resources.
△ Less
Submitted 3 December, 2023;
originally announced December 2023.
-
The Green functor structure of $RO(D_{2p})$-graded cohomology of a point
Authors:
Guoqi Yan
Abstract:
We compute the $RO(D_{2p})$-graded cohomology of a point with constant coefficient $\underline{\mathbb{Z}}$ together with its Green functor structure. Here $D_{2p}$ is the dihedral group with $p$ an odd prime. This result extends the additive computation of Kriz-Lu.
We compute the $RO(D_{2p})$-graded cohomology of a point with constant coefficient $\underline{\mathbb{Z}}$ together with its Green functor structure. Here $D_{2p}$ is the dihedral group with $p$ an odd prime. This result extends the additive computation of Kriz-Lu.
△ Less
Submitted 24 October, 2023;
originally announced October 2023.
-
Estimate of Background Baseline and Upper Limit on the Chiral Magnetic Effect in Isobar Collisions at $\sqrt{s_{\text{NN}}}=200$ GeV at the Relativistic Heavy-Ion Collider
Authors:
STAR Collaboration,
M. I. Abdulhamid,
B. E. Aboona,
J. Adam,
J. R. Adams,
G. Agakishiev,
I. Aggarwal,
M. M. Aggarwal,
Z. Ahammed,
A. Aitbaev,
I. Alekseev,
E. Alpatov,
A. Aparin,
S. Aslam,
J. Atchison,
G. S. Averichev,
V. Bairathi,
J. G. Ball Cap,
K. Barish,
P. Bhagat,
A. Bhasin,
S. Bhatta,
S. R. Bhosale,
I. G. Bordyuzhin,
J. D. Brandenburg
, et al. (333 additional authors not shown)
Abstract:
For the search of the chiral magnetic effect (CME), STAR previously presented the results from isobar collisions (${^{96}_{44}\text{Ru}}+{^{96}_{44}\text{Ru}}$, ${^{96}_{40}\text{Zr}}+{^{96}_{40}\text{Zr}}$) obtained through a blind analysis. The ratio of results in Ru+Ru to Zr+Zr collisions for the CME-sensitive charge-dependent azimuthal correlator ($Δγ$), normalized by elliptic anisotropy (…
▽ More
For the search of the chiral magnetic effect (CME), STAR previously presented the results from isobar collisions (${^{96}_{44}\text{Ru}}+{^{96}_{44}\text{Ru}}$, ${^{96}_{40}\text{Zr}}+{^{96}_{40}\text{Zr}}$) obtained through a blind analysis. The ratio of results in Ru+Ru to Zr+Zr collisions for the CME-sensitive charge-dependent azimuthal correlator ($Δγ$), normalized by elliptic anisotropy ($v_{2}$), was observed to be close to but systematically larger than the inverse multiplicity ratio. The background baseline for the isobar ratio, $Y = \frac{(Δγ/v_{2})^{\text{Ru}}}{(Δγ/v_{2})^{\text{Zr}}}$, is naively expected to be $\frac{(1/N)^{\text{Ru}}}{(1/N)^{\text{Zr}}}$; however, genuine two- and three-particle correlations are expected to alter it. We estimate the contributions to $Y$ from those correlations, utilizing both the isobar data and HIJING simulations. After including those contributions, we arrive at a final background baseline for $Y$, which is consistent with the isobar data. We extract an upper limit for the CME fraction in the $Δγ$ measurement of approximately $10\%$ at a $95\%$ confidence level on in isobar collisions at $\sqrt{s_{\text{NN}}} = 200$ GeV, with an expected $15\%$ difference in their squared magnetic fields.
△ Less
Submitted 17 July, 2024; v1 submitted 19 October, 2023;
originally announced October 2023.
-
Observation of the Antimatter Hypernucleus $^4_{\barΛ}\overline{\hbox{H}}$
Authors:
STAR Collaboration,
M. I. Abdulhamid,
B. E. Aboona,
J. Adam,
L. Adamczyk,
J. R. Adams,
I. Aggarwal,
M. M. Aggarwal,
Z. Ahammed,
E. C. Aschenauer,
S. Aslam,
J. Atchison,
V. Bairathi,
J. G. Ball Cap,
K. Barish,
R. Bellwied,
P. Bhagat,
A. Bhasin,
S. Bhatta,
S. R. Bhosale,
J. Bielcik,
J. Bielcikova,
J. D. Brandenburg,
C. Broodo,
X. Z. Cai
, et al. (342 additional authors not shown)
Abstract:
At the origin of the Universe, asymmetry between the amount of created matter and antimatter led to the matter-dominated Universe as we know today. The origins of this asymmetry remain not completely understood yet. High-energy nuclear collisions create conditions similar to the Universe microseconds after the Big Bang, with comparable amounts of matter and antimatter. Much of the created antimatt…
▽ More
At the origin of the Universe, asymmetry between the amount of created matter and antimatter led to the matter-dominated Universe as we know today. The origins of this asymmetry remain not completely understood yet. High-energy nuclear collisions create conditions similar to the Universe microseconds after the Big Bang, with comparable amounts of matter and antimatter. Much of the created antimatter escapes the rapidly expanding fireball without annihilating, making such collisions an effective experimental tool to create heavy antimatter nuclear objects and study their properties, hoping to shed some light on existing questions on the asymmetry between matter and antimatter. Here we report the first observation of the antimatter hypernucleus \hbox{$^4_{\barΛ}\overline{\hbox{H}}$}, composed of a $\barΛ$ , an antiproton and two antineutrons. The discovery was made through its two-body decay after production in ultrarelativistic heavy-ion collisions by the STAR experiment at the Relativistic Heavy Ion Collider. In total, 15.6 candidate \hbox{$^4_{\barΛ}\overline{\hbox{H}}$} antimatter hypernuclei are obtained with an estimated background count of 6.4. The lifetimes of the antihypernuclei \hbox{$^3_{\barΛ}\overline{\hbox{H}}$} and \hbox{$^4_{\barΛ}\overline{\hbox{H}}$} are measured and compared with the lifetimes of their corresponding hypernuclei, testing the symmetry between matter and antimatter. Various production yield ratios among (anti)hypernuclei and (anti)nuclei are also measured and compared with theoretical model predictions, shedding light on their production mechanisms.
△ Less
Submitted 8 June, 2024; v1 submitted 19 October, 2023;
originally announced October 2023.
-
Channel Autocorrelation Estimation for IRS-Aided Wireless Communications Based on Power Measurements
Authors:
Ge Yan,
Lipeng Zhu,
Rui Zhang
Abstract:
Intelligent reflecting surface (IRS) can bring significant performance enhancement for wireless communication systems by reconfiguring wireless channels via passive signal reflection. However, such performance improvement generally relies on the knowledge of channel state information (CSI) for IRS-associated links. Prior IRS channel estimation strategies mainly estimate IRS-cascaded channels based…
▽ More
Intelligent reflecting surface (IRS) can bring significant performance enhancement for wireless communication systems by reconfiguring wireless channels via passive signal reflection. However, such performance improvement generally relies on the knowledge of channel state information (CSI) for IRS-associated links. Prior IRS channel estimation strategies mainly estimate IRS-cascaded channels based on the excessive pilot signals received at the users/base station (BS) with time-varying IRS reflections, which, however, are not compatible with the existing channel training/estimation protocol for cellular networks. To address this issue, we propose in this paper a new channel estimation scheme for IRS-assisted communication systems based on the received signal power measured at the user, which is practically attainable without the need of changing the current protocol. Specifically, due to the lack of signal phase information in power measurements, the autocorrelation matrix of the BS-IRS-user cascaded channel is estimated by solving equivalent matrix-rank-minimization problems. Simulation results are provided to verify the effectiveness of the proposed channel estimation algorithm as well as the IRS passive reflection design based on the estimated channel autocorrelation matrix.
△ Less
Submitted 17 October, 2023;
originally announced October 2023.
-
Open X-Embodiment: Robotic Learning Datasets and RT-X Models
Authors:
Open X-Embodiment Collaboration,
Abby O'Neill,
Abdul Rehman,
Abhinav Gupta,
Abhiram Maddukuri,
Abhishek Gupta,
Abhishek Padalkar,
Abraham Lee,
Acorn Pooley,
Agrim Gupta,
Ajay Mandlekar,
Ajinkya Jain,
Albert Tung,
Alex Bewley,
Alex Herzog,
Alex Irpan,
Alexander Khazatsky,
Anant Rai,
Anchit Gupta,
Andrew Wang,
Andrey Kolobov,
Anikait Singh,
Animesh Garg,
Aniruddha Kembhavi,
Annie Xie
, et al. (267 additional authors not shown)
Abstract:
Large, high-capacity models trained on diverse datasets have shown remarkable successes on efficiently tackling downstream applications. In domains from NLP to Computer Vision, this has led to a consolidation of pretrained models, with general pretrained backbones serving as a starting point for many applications. Can such a consolidation happen in robotics? Conventionally, robotic learning method…
▽ More
Large, high-capacity models trained on diverse datasets have shown remarkable successes on efficiently tackling downstream applications. In domains from NLP to Computer Vision, this has led to a consolidation of pretrained models, with general pretrained backbones serving as a starting point for many applications. Can such a consolidation happen in robotics? Conventionally, robotic learning methods train a separate model for every application, every robot, and even every environment. Can we instead train generalist X-robot policy that can be adapted efficiently to new robots, tasks, and environments? In this paper, we provide datasets in standardized data formats and models to make it possible to explore this possibility in the context of robotic manipulation, alongside experimental results that provide an example of effective X-robot policies. We assemble a dataset from 22 different robots collected through a collaboration between 21 institutions, demonstrating 527 skills (160266 tasks). We show that a high-capacity model trained on this data, which we call RT-X, exhibits positive transfer and improves the capabilities of multiple robots by leveraging experience from other platforms. More details can be found on the project website https://robotics-transformer-x.github.io.
△ Less
Submitted 1 June, 2024; v1 submitted 13 October, 2023;
originally announced October 2023.
-
On the structure of the $RO(G)$-graded homotopy of $H\underline{M}$ for cyclic $p$-groups
Authors:
Igor Sikora,
Guoqi Yan
Abstract:
We study the structure of the $RO(G)$-graded homotopy Mackey functors of any Eilenberg-MacLane spectrum $H\underline{M}$ for $G$ a cyclic $p$-group. When $\underline{R}$ is a Green functor, we define orientation classes $u_V$ for $H\underline{R}$ and deduce a generalized gold relation. We deduce the $a_V,u_V$-isomorphism regions of the $RO(G)$-graded homotopy Mackey functors and prove two inductio…
▽ More
We study the structure of the $RO(G)$-graded homotopy Mackey functors of any Eilenberg-MacLane spectrum $H\underline{M}$ for $G$ a cyclic $p$-group. When $\underline{R}$ is a Green functor, we define orientation classes $u_V$ for $H\underline{R}$ and deduce a generalized gold relation. We deduce the $a_V,u_V$-isomorphism regions of the $RO(G)$-graded homotopy Mackey functors and prove two induction theorems. As applications, we compute the positive cone of $H\underline{\mathbb{A}}$, as well as the positive and negative cones of $H\underline{\mathbb{Z}}$. The latter two cones are essential to the slice spectral sequences of $MU^{((C_{2^n}))}$ and its variants.
△ Less
Submitted 18 October, 2023; v1 submitted 29 September, 2023;
originally announced September 2023.
-
Results on Elastic Cross Sections in Proton-Proton Collisions at $\sqrt{s} = 510$ GeV with the STAR Detector at RHIC
Authors:
STAR Collaboration,
M. I. Abdulhamid,
B. E. Aboona,
J. Adam,
L. Adamczyk,
J. R. Adams,
I. Aggarwal,
M. M. Aggarwal,
Z. Ahammed,
E. C. Aschenauer,
S. Aslam,
J. Atchison,
V. Bairathi,
J. G. Ball Cap,
K. Barish,
R. Bellwied,
P. Bhagat,
A. Bhasin,
S. Bhatta,
S. R. Bhosale,
J. Bielcik,
J. Bielcikova,
J. D. Brandenburg,
C. Broodo,
X. Z. Cai
, et al. (343 additional authors not shown)
Abstract:
We report results on an elastic cross section measurement in proton-proton collisions at a center-of-mass energy $\sqrt{s}=510$ GeV, obtained with the Roman Pot setup of the STAR experiment at the Relativistic Heavy Ion Collider (RHIC). The elastic differential cross section is measured in the four-momentum transfer squared range $0.23 \leq -t \leq 0.67$ GeV$^2$. We find that a constant slope $B$…
▽ More
We report results on an elastic cross section measurement in proton-proton collisions at a center-of-mass energy $\sqrt{s}=510$ GeV, obtained with the Roman Pot setup of the STAR experiment at the Relativistic Heavy Ion Collider (RHIC). The elastic differential cross section is measured in the four-momentum transfer squared range $0.23 \leq -t \leq 0.67$ GeV$^2$. We find that a constant slope $B$ does not fit the data in the aforementioned $t$ range, and we obtain a much better fit using a second-order polynomial for $B(t)$. The $t$ dependence of $B$ is determined using six subintervals of $t$ in the STAR measured $t$ range, and is in good agreement with the phenomenological models. The measured elastic differential cross section $\mathrm{d}σ/\mathrm{dt}$ agrees well with the results obtained at $\sqrt{s} = 546$ GeV for proton--antiproton collisions by the UA4 experiment. We also determine that the integrated elastic cross section within the STAR $t$-range is $σ^\mathrm{fid}_\mathrm{el} = 462.1 \pm 0.9 (\mathrm{stat.}) \pm 1.1 (\mathrm {syst.}) \pm 11.6 (\mathrm {scale})$~$μ\mathrm{b}$.
△ Less
Submitted 6 May, 2024; v1 submitted 28 September, 2023;
originally announced September 2023.
-
Longitudinal and transverse spin transfer to $Λ$ and $\overlineΛ$ hyperons in polarized $p$+$p$ collisions at $\sqrt{s} = 200$ GeV
Authors:
STAR Collaboration,
M. I. Abdulhamid,
B. E. Aboona,
J. Adam,
L. Adamczyk,
J. R. Adams,
I. Aggarwal,
M. M. Aggarwal,
Z. Ahammed,
D. M. Anderson,
E. C. Aschenauer,
S. Aslam,
J. Atchison,
V. Bairathi,
W. Baker,
J. G. Ball Cap,
K. Barish,
R. Bellwied,
P. Bhagat,
A. Bhasin,
S. Bhatta,
J. Bielcik,
J. Bielcikova,
J. D. Brandenburg,
X. Z. Cai
, et al. (357 additional authors not shown)
Abstract:
The longitudinal and transverse spin transfers to $Λ$ ($\overlineΛ$) hyperons in polarized proton-proton collisions are expected to be sensitive to the helicity and transversity distributions, respectively, of (anti-)strange quarks in the proton, and to the corresponding polarized fragmentation functions. We report improved measurements of the longitudinal spin transfer coefficient, $D_{LL}$, and…
▽ More
The longitudinal and transverse spin transfers to $Λ$ ($\overlineΛ$) hyperons in polarized proton-proton collisions are expected to be sensitive to the helicity and transversity distributions, respectively, of (anti-)strange quarks in the proton, and to the corresponding polarized fragmentation functions. We report improved measurements of the longitudinal spin transfer coefficient, $D_{LL}$, and the transverse spin transfer coefficient, $D_{TT}$, to $Λ$ and $\overlineΛ$ in polarized proton-proton collisions at $\sqrt{s}$ = 200 GeV by the STAR experiment at RHIC. The data set includes longitudinally polarized proton-proton collisions with an integrated luminosity of 52 pb$^{-1}$, and transversely polarized proton-proton collisions with a similar integrated luminosity. Both data sets have about twice the statistics of previous results and cover a kinematic range of $|η_{Λ(\overlineΛ)}|$ $<$ 1.2 and transverse momentum $p_{T,{Λ(\overlineΛ)}}$ up to 8 GeV/$c$. We also report the first measurements of the hyperon spin transfer coefficients $D_{LL}$ and $D_{TT}$ as a function of the fractional jet momentum $z$ carried by the hyperon, which can provide more direct constraints on the polarized fragmentation functions.
△ Less
Submitted 7 December, 2023; v1 submitted 25 September, 2023;
originally announced September 2023.
-
Reaction plane correlated triangular flow in Au+Au collisions at $\sqrt{s_{NN}}=3$ GeV
Authors:
STAR Collaboration,
M. I. Abdulhamid,
B. E. Aboona,
J. Adam,
L. Adamczyk,
J. R. Adams,
I. Aggarwal,
M. M. Aggarwal,
Z. Ahammed,
E. C. Aschenauer,
S. Aslam,
J. Atchison,
V. Bairathi,
J. G. Ball Cap,
K. Barish,
R. Bellwied,
P. Bhagat,
A. Bhasin,
S. Bhatta,
S. R. Bhosale,
J. Bielcik,
J. Bielcikova,
J. D. Brandenburg,
C. Broodo,
X. Z. Cai
, et al. (341 additional authors not shown)
Abstract:
We measure triangular flow relative to the reaction plane at 3 GeV center-of-mass energy in Au+Au collisions at the BNL Relativistic Heavy Ion Collider. A significant $v_3$ signal for protons is observed, which increases for higher rapidity, higher transverse momentum, and more peripheral collisions. The triangular flow is essentially rapidity-odd with a slope at mid-rapidity, $dv_3/dy|_{(y=0)}$,…
▽ More
We measure triangular flow relative to the reaction plane at 3 GeV center-of-mass energy in Au+Au collisions at the BNL Relativistic Heavy Ion Collider. A significant $v_3$ signal for protons is observed, which increases for higher rapidity, higher transverse momentum, and more peripheral collisions. The triangular flow is essentially rapidity-odd with a slope at mid-rapidity, $dv_3/dy|_{(y=0)}$, opposite in sign compared to the slope for directed flow. No significant $v_3$ signal is observed for charged pions and kaons. Comparisons with models suggest that a mean field potential is required to describe these results, and that the triangular shape of the participant nucleons is the result of stopping and nuclear geometry.
△ Less
Submitted 19 April, 2024; v1 submitted 21 September, 2023;
originally announced September 2023.
-
Blendshapes GHUM: Real-time Monocular Facial Blendshape Prediction
Authors:
Ivan Grishchenko,
Geng Yan,
Eduard Gabriel Bazavan,
Andrei Zanfir,
Nikolai Chinaev,
Karthik Raveendran,
Matthias Grundmann,
Cristian Sminchisescu
Abstract:
We present Blendshapes GHUM, an on-device ML pipeline that predicts 52 facial blendshape coefficients at 30+ FPS on modern mobile phones, from a single monocular RGB image and enables facial motion capture applications like virtual avatars. Our main contributions are: i) an annotation-free offline method for obtaining blendshape coefficients from real-world human scans, ii) a lightweight real-time…
▽ More
We present Blendshapes GHUM, an on-device ML pipeline that predicts 52 facial blendshape coefficients at 30+ FPS on modern mobile phones, from a single monocular RGB image and enables facial motion capture applications like virtual avatars. Our main contributions are: i) an annotation-free offline method for obtaining blendshape coefficients from real-world human scans, ii) a lightweight real-time model that predicts blendshape coefficients based on facial landmarks.
△ Less
Submitted 11 September, 2023;
originally announced September 2023.
-
GNFactor: Multi-Task Real Robot Learning with Generalizable Neural Feature Fields
Authors:
Yanjie Ze,
Ge Yan,
Yueh-Hua Wu,
Annabella Macaluso,
Yuying Ge,
Jianglong Ye,
Nicklas Hansen,
Li Erran Li,
Xiaolong Wang
Abstract:
It is a long-standing problem in robotics to develop agents capable of executing diverse manipulation tasks from visual observations in unstructured real-world environments. To achieve this goal, the robot needs to have a comprehensive understanding of the 3D structure and semantics of the scene. In this work, we present $\textbf{GNFactor}$, a visual behavior cloning agent for multi-task robotic m…
▽ More
It is a long-standing problem in robotics to develop agents capable of executing diverse manipulation tasks from visual observations in unstructured real-world environments. To achieve this goal, the robot needs to have a comprehensive understanding of the 3D structure and semantics of the scene. In this work, we present $\textbf{GNFactor}$, a visual behavior cloning agent for multi-task robotic manipulation with $\textbf{G}$eneralizable $\textbf{N}$eural feature $\textbf{F}$ields. GNFactor jointly optimizes a generalizable neural field (GNF) as a reconstruction module and a Perceiver Transformer as a decision-making module, leveraging a shared deep 3D voxel representation. To incorporate semantics in 3D, the reconstruction module utilizes a vision-language foundation model ($\textit{e.g.}$, Stable Diffusion) to distill rich semantic information into the deep 3D voxel. We evaluate GNFactor on 3 real robot tasks and perform detailed ablations on 10 RLBench tasks with a limited number of demonstrations. We observe a substantial improvement of GNFactor over current state-of-the-art methods in seen and unseen tasks, demonstrating the strong generalization ability of GNFactor. Our project website is https://yanjieze.com/GNFactor/ .
△ Less
Submitted 1 September, 2023; v1 submitted 31 August, 2023;
originally announced August 2023.
-
Upper Limit on the Chiral Magnetic Effect in Isobar Collisions at the Relativistic Heavy-Ion Collider
Authors:
STAR Collaboration,
M. I. Abdulhamid,
B. E. Aboona,
J. Adam,
J. R. Adams,
G. Agakishiev,
I. Aggarwal,
M. M. Aggarwal,
Z. Ahammed,
A. Aitbaev,
I. Alekseev,
E. Alpatov,
A. Aparin,
S. Aslam,
J. Atchison,
G. S. Averichev,
V. Bairathi,
J. G. Ball Cap,
K. Barish,
P. Bhagat,
A. Bhasin,
S. Bhatta,
S. R. Bhosale,
I. G. Bordyuzhin,
J. D. Brandenburg
, et al. (333 additional authors not shown)
Abstract:
The chiral magnetic effect (CME) is a phenomenon that arises from the QCD anomaly in the presence of an external magnetic field. The experimental search for its evidence has been one of the key goals of the physics program of the Relativistic Heavy-Ion Collider. The STAR collaboration has previously presented the results of a blind analysis of isobar collisions (…
▽ More
The chiral magnetic effect (CME) is a phenomenon that arises from the QCD anomaly in the presence of an external magnetic field. The experimental search for its evidence has been one of the key goals of the physics program of the Relativistic Heavy-Ion Collider. The STAR collaboration has previously presented the results of a blind analysis of isobar collisions (${^{96}_{44}\text{Ru}}+{^{96}_{44}\text{Ru}}$, ${^{96}_{40}\text{Zr}}+{^{96}_{40}\text{Zr}}$) in the search for the CME. The isobar ratio ($Y$) of CME-sensitive observable, charge separation scaled by elliptic anisotropy, is close to but systematically larger than the inverse multiplicity ratio, the naive background baseline. This indicates the potential existence of a CME signal and the presence of remaining nonflow background due to two- and three-particle correlations, which are different between the isobars. In this post-blind analysis, we estimate the contributions from those nonflow correlations as a background baseline to $Y$, utilizing the isobar data as well as Heavy Ion Jet Interaction Generator simulations. This baseline is found consistent with the isobar ratio measurement, and an upper limit of 10% at 95% confidence level is extracted for the CME fraction in the charge separation measurement in isobar collisions at $\sqrt{s_{\rm NN}}=200$ GeV.
△ Less
Submitted 17 July, 2024; v1 submitted 31 August, 2023;
originally announced August 2023.
-
On the Weight Distribution of Weights Less than $2w_{\min}$ in Polar Codes
Authors:
Zicheng Ye,
Yuan Li,
Huazi Zhang,
Jun Wang,
Guiying Yan,
Zhiming Ma
Abstract:
The number of low-weight codewords is critical to the performance of error-correcting codes. In 1970, Kasami and Tokura characterized the codewords of Reed-Muller (RM) codes whose weights are less than $2w_{\min}$, where $w_{\min}$ represents the minimum weight. In this paper, we extend their results to decreasing polar codes. We present the closed-form expressions for the number of codewords in d…
▽ More
The number of low-weight codewords is critical to the performance of error-correcting codes. In 1970, Kasami and Tokura characterized the codewords of Reed-Muller (RM) codes whose weights are less than $2w_{\min}$, where $w_{\min}$ represents the minimum weight. In this paper, we extend their results to decreasing polar codes. We present the closed-form expressions for the number of codewords in decreasing polar codes with weights less than $2w_{\min}$. Moreover, the proposed enumeration algorithm runs in polynomial time with respect to the code length.
△ Less
Submitted 2 May, 2024; v1 submitted 19 August, 2023;
originally announced August 2023.
-
Jet-hadron correlations with respect to the event plane in $\sqrt{s_{\mathrm{NN}}}$ = 200 GeV Au+Au collisions in STAR
Authors:
STAR Collaboration,
M. I. Abdulhamid,
B. E. Aboona,
J. Adam,
L. Adamczyk,
J. R. Adams,
I. Aggarwal,
M. M. Aggarwal,
Z. Ahammed,
E. C. Aschenauer,
S. Aslam,
J. Atchison,
V. Bairathi,
J. G. Ball Cap,
K. Barish,
R. Bellwied,
P. Bhagat,
A. Bhasin,
S. Bhatta,
S. R. Bhosale,
J. Bielcik,
J. Bielcikova,
J. D. Brandenburg,
X. Z. Cai,
H. Caines
, et al. (340 additional authors not shown)
Abstract:
Angular distributions of charged particles relative to jet axes are studied in $\sqrt{s_{\mathrm{NN}}}$ = 200 GeV Au+Au collisions as a function of the jet orientation with respect to the event plane. This differential study tests the expected path-length dependence of energy loss experienced by a hard-scattered parton as it traverses the hot and dense medium formed in heavy-ion collisions. A seco…
▽ More
Angular distributions of charged particles relative to jet axes are studied in $\sqrt{s_{\mathrm{NN}}}$ = 200 GeV Au+Au collisions as a function of the jet orientation with respect to the event plane. This differential study tests the expected path-length dependence of energy loss experienced by a hard-scattered parton as it traverses the hot and dense medium formed in heavy-ion collisions. A second-order event plane is used in the analysis as an experimental estimate of the reaction plane formed by the collision impact parameter and the beam direction. Charged-particle jets with $15 < p_{\rm T, jet} <$ 20 and $20 < p_{\rm T, jet} <$ 40 GeV/$c$ were reconstructed with the anti-$k_{\rm T}$ algorithm with radius parameter setting of (R=0.4) in the 20-50\% centrality bin to maximize the initial-state eccentricity of the interaction region. The reaction plane fit method is implemented to remove the flow-modulated background with better precision than prior methods. Yields and widths of jet-associated charged-hadron distributions are extracted in three angular bins between the jet axis and the event plane. The event-plane (EP) dependence is further quantified by ratios of the associated yields in different EP bins. No dependence on orientation of the jet axis with respect to the event plane is seen within the uncertainties in the kinematic regime studied. This finding is consistent with a similar experimental observation by ALICE in $\sqrt{s_{\mathrm{NN}}}$ = 2.76 TeV Pb+Pb collision data.
△ Less
Submitted 20 March, 2024; v1 submitted 25 July, 2023;
originally announced July 2023.