-
Instances Need More Care: Rewriting Prompts for Instances with LLMs in the Loop Yields Better Zero-Shot Performance
Authors:
Saurabh Srivastava,
Chengyue Huang,
Weiguo Fan,
Ziyu Yao
Abstract:
Large language models (LLMs) have revolutionized zero-shot task performance, mitigating the need for task-specific annotations while enhancing task generalizability. Despite these advancements, current methods using trigger phrases such as "Let's think step by step" remain limited. This study introduces PRomPTed, an approach that optimizes the zero-shot prompts for individual task instances in an innovative "LLMs in the loop" manner. Our comprehensive evaluation across 13 datasets and 10 task types based on GPT-4 reveals that PRomPTed significantly outperforms both the naive zero-shot approaches and a strong baseline (i.e., "Output Refinement") which refines the task output instead of the input prompt. Our experimental results also confirm the generalization of this advantage to the relatively weaker GPT-3.5. Even more intriguingly, we found that leveraging GPT-3.5 to rewrite prompts for the stronger GPT-4 not only matches but occasionally exceeds the efficacy of using GPT-4 as the prompt rewriter. Our research thus offers substantial value in not only enhancing zero-shot LLM performance but also potentially enabling the supervision of LLMs by their weaker counterparts, a capability that has recently attracted much interest. Finally, our additional experiments confirm the generalization of the advantages to open-source LLMs such as Mistral 7B and Mixtral 8x7B.
Submitted 11 June, 2024; v1 submitted 3 October, 2023;
originally announced October 2023.
-
The Origin of the Consistent Planetary Nebula Luminosity Function Bright-end Cutoff
Authors:
Philippe Z. Yao,
Eliot Quataert
Abstract:
The [O III] 5007 Angstrom line is typically the brightest line in planetary nebula (PN) spectra. Observations show that the brightest [O III] 5007 Angstrom PN in a galaxy -- the planetary nebula luminosity function (PNLF) bright-end cutoff -- is surprisingly independent of galaxy type. To understand the origin of this puzzling uniformity, we simulate PNe with a range of cloud and star parameters using the photoionization code CLOUDY. We find that the peak [O III] 5007 Angstrom luminosity depends weakly on both the central stellar effective temperature at high temperature and on the total PN ejecta mass; however, the peak [O III] 5007 Angstrom luminosity depends strongly on the central stellar luminosity and the PN dust-to-gas mass ratio. We explain these scalings physically. They imply that a higher dust-to-gas mass ratio at higher central stellar luminosity can help explain a constant bright-end cutoff in the PNLF across galaxy types. This prediction is testable with a survey of galactic PNe. The surviving remnants of double white dwarf mergers should also produce photoionized nebulae analogous to PNe. These may be preferentially present at the high luminosity end of the [O III] PNLF and could explain the existence of PNe in early-type galaxies that are more luminous in [O III] than expected from single-star evolutionary models. The presence of white dwarf mergers in both young and old stellar populations could contribute to the uniformity of the [O III] PNLF across galaxy types; such nebulae would lack the hydrogen lines otherwise characteristic of PNe.
Submitted 2 October, 2023;
originally announced October 2023.
-
First measurement of $ΛN$ inelastic scattering with $Λ$ from $e^{+} e^{-} \rightarrow J/ψ\to Λ\barΛ$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
O. Afedulidis,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere
, et al. (626 additional authors not shown)
Abstract:
Using an $e^+ e^-$ collision data sample of $(10087 \pm 44)\times10^6 ~J/ψ$ events taken at the center-of-mass energy of $3.097~\rm{GeV}$ by the BESIII detector at the BEPCII collider, the process $Λ+N \rightarrow Σ^+ + X$ is studied for the first time employing a novel method. The $Σ^{+}$ hyperons are produced by the collisions of $Λ$ hyperons from $J/ψ$ decays with nuclei in the material of the BESIII detector. The total cross section of $Λ+ ^{9}{\rm Be} \rightarrow Σ^+ + X$ is measured to be $σ= (37.3 \pm 4.7 \pm 3.5)~{\rm mb}$ at $Λ$ beam momenta within $[1.057, 1.091]~{\rm GeV}/c$, where the uncertainties are statistical and systematic, respectively. This analysis is the first study of $Λ$-nucleon interactions at an $e^+ e^-$ collider, providing information and constraints relevant for the strong-interaction potential, the origin of color confinement, the unified model for baryon-baryon interactions, and the internal structure of neutron stars.
Submitted 1 October, 2023;
originally announced October 2023.
-
A Survey on Image-text Multimodal Models
Authors:
Ruifeng Guo,
Jingxuan Wei,
Linzhuang Sun,
Bihui Yu,
Guiyong Chang,
Dawei Liu,
Sibo Zhang,
Zhengbing Yao,
Mingjun Xu,
Liping Bu
Abstract:
With the significant advancements of Large Language Models (LLMs) in the field of Natural Language Processing (NLP), the development of image-text multimodal models has garnered widespread attention. Current surveys on image-text multimodal models mainly focus on representative models or application domains, but lack a review on how general technical models influence the development of domain-specific models, which is crucial for domain researchers. Based on this, this paper first reviews the technological evolution of image-text multimodal models, from early explorations of feature space to visual language encoding structures, and then to the latest large model architectures. Next, from the perspective of technological evolution, we explain how the development of general image-text multimodal technologies promotes the progress of multimodal technologies in the biomedical field, as well as the importance and complexity of specific datasets in the biomedical domain. Then, centered on the tasks of image-text multimodal models, we analyze their common components and challenges. After that, we summarize the architecture, components, and data of general image-text multimodal models, and introduce the applications and improvements of image-text multimodal models in the biomedical field. Finally, we categorize the challenges faced in the development and application of general models into external factors and intrinsic factors, further refining them into 2 external factors and 5 intrinsic factors, and propose targeted solutions, providing guidance for future research directions. For more details and data, please visit our GitHub page: https://github.com/i2vec/A-survey-on-image-text-multimodal-models.
Submitted 18 June, 2024; v1 submitted 23 September, 2023;
originally announced September 2023.
-
Isoparametric Hypersurfaces in product spaces of space forms
Authors:
Dong Gao,
Hui Ma,
Zeke Yao
Abstract:
We give a complete classification of isoparametric hypersurfaces in a product space $M^2_{κ_1}\times M^2_{κ_2}$ of $2$-dimensional space forms for $κ_i\in \{-1,0,1\}$ with $κ_1\neq κ_2$. In fact we prove that any isoparametric hypersurface in such a space has constant product angle function, which enables us to remove the condition of constant principal curvatures from the classification obtained recently by J. B. M. dos Santos and J. P. dos Santos.
Submitted 26 September, 2023;
originally announced September 2023.
-
Updated measurements of the M1 transition $ψ(3686) \to γη_{c}(2S)$ with $η_{c}(2S) \to K \bar{K} π$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
O. Afedulidis,
X. C. Ai,
R. Aliberti,
A. Amoroso,
M. R. An,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere
, et al. (609 additional authors not shown)
Abstract:
Based on a data sample of $(27.08 \pm 0.14 ) \times 10^8~ψ(3686)$ events collected with the BESIII detector at the BEPCII collider, the M1 transition $ψ(3686) \to γη_{c}(2S)$ with $η_{c}(2S) \to K\bar{K}π$ is studied, where $K\bar{K}π$ is $K^{+} K^{-} π^{0}$ or $K_{S}^{0}K^{\pm}π^{\mp}$. The mass and width of the $η_{c}(2S)$ are measured to be $(3637.8 \pm 0.8 (\rm {stat}) \pm 0.2 (\rm {syst}))$ MeV/$c^{2}$ and $(10.5 \pm 1.7 (\rm {stat}) \pm 3.5 (\rm {syst}))$ MeV, respectively. The product branching fraction $\mathcal{B}\left(ψ(3686) \rightarrow γη_{c}(2 S)\right) \times \mathcal{B}(η_{c}(2 S) \rightarrow K \bar{K} π)$ is determined to be $(0.97 \pm 0.06 (\rm {stat}) \pm 0.09 (\rm {syst})) \times 10^{-5}$. Using $\mathcal{BR}(η_{c}(2S)\to K\bar{K}π)=(1.86^{+0.68}_{-0.49})\%$, we obtain the branching fraction of the radiative transition to be $\mathcal{BR}(ψ(3686) \to γη_{c}(2S)) = (5.2 \pm 0.3 (\rm {stat}) \pm 0.5 (\rm {syst}) ^{+1.9}_{-1.4} (\rm {extr})) \times 10^{-4}$, where the third uncertainty is due to the quoted $\mathcal{BR}(η_{c}(2S) \to K\bar{K}π)$.
Submitted 26 September, 2023;
originally announced September 2023.
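The arithmetic behind the radiative branching fraction quoted in the abstract above can be reproduced with a short sketch. This is only an illustration: the function name `radiative_bf` is hypothetical, and the asymmetric "extr" uncertainty is obtained here by simply shifting the reference branching fraction up and down by its quoted errors, which is an assumption and not necessarily the collaboration's procedure.

```python
def radiative_bf(product, product_stat, product_syst, b_ref, b_ref_up, b_ref_down):
    """Divide a product branching fraction by a reference branching fraction.

    Assumption: the external ("extr") uncertainty is estimated by shifting
    b_ref by its asymmetric errors; this is an illustrative simplification.
    """
    central = product / b_ref
    stat = product_stat / b_ref
    syst = product_syst / b_ref
    # A smaller reference branching fraction gives a larger result, and vice versa.
    extr_up = product / (b_ref - b_ref_down) - central
    extr_down = central - product / (b_ref + b_ref_up)
    return central, stat, syst, extr_up, extr_down


# Values taken from the abstract: product BF (0.97 +- 0.06 +- 0.09)e-5,
# reference BF (1.86 +0.68 -0.49) percent.
central, stat, syst, extr_up, extr_down = radiative_bf(
    0.97e-5, 0.06e-5, 0.09e-5, 1.86e-2, 0.68e-2, 0.49e-2
)
print(f"BF = ({central * 1e4:.1f} +- {stat * 1e4:.1f} +- {syst * 1e4:.1f} "
      f"+{extr_up * 1e4:.1f} -{extr_down * 1e4:.1f}) x 10^-4")
```

Under these assumptions the sketch gives values that round to the quoted central value of $5.2 \times 10^{-4}$ and to the quoted uncertainties.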
-
Investigation of the $ΔI = 1/2$ rule and test of CP violation through the measurement of decay asymmetry parameters in $Ξ^-$ decays
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
X. C. Ai,
R. Aliberti,
A. Amoroso,
M. R. An,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere,
A. Brueggemann
, et al. (604 additional authors not shown)
Abstract:
Using $(10087\pm44)\times 10^{6}$ $J/ψ$ events collected with the BESIII detector, numerous $Ξ^-$ and $Λ$ decay asymmetry parameters are simultaneously determined from the process $J/ψ\to Ξ^- \barΞ^+ \to Λ(pπ^-) π^- \barΛ(\bar{n} π^0) π^+$ and its charge-conjugate channel. The precisions of $α_0$ for $Λ\to nπ^0$ and $\barα_0$ for $\barΛ \to \bar{n}π^0$ compared to world averages are improved by factors of 4 and 1.7, respectively. The ratio of decay asymmetry parameters of $Λ\to nπ^0$ to that of $Λ\to pπ^-$, $\langle α_0 \rangle/ \langle α_{Λ-} \rangle $, is determined to be $ 0.873 \pm 0.012^{+0.011}_{-0.010}$, where the first and the second uncertainties are statistical and systematic, respectively. The ratio is smaller than unity by more than $5σ$, which signifies the existence of the $ΔI = 3/2$ transition in $Λ$ for the first time. Besides, we test for CP violation in $Ξ^- \to Λπ^-$ and in $Λ\to n π^{0}$ with the best precision to date.
Submitted 8 January, 2024; v1 submitted 26 September, 2023;
originally announced September 2023.
-
DeepSpeed-VisualChat: Multi-Round Multi-Image Interleave Chat via Multi-Modal Causal Attention
Authors:
Zhewei Yao,
Xiaoxia Wu,
Conglong Li,
Minjia Zhang,
Heyang Qin,
Olatunji Ruwase,
Ammar Ahmad Awan,
Samyam Rajbhandari,
Yuxiong He
Abstract:
Most of the existing multi-modal models, hindered by their incapacity to adeptly manage interleaved image-and-text inputs in multi-image, multi-round dialogues, face substantial constraints in resource allocation for training and data accessibility, impacting their adaptability and scalability across varied interaction realms. To address this, we present the DeepSpeed-VisualChat framework, designed to optimize Large Language Models (LLMs) by incorporating multi-modal capabilities, with a focus on enhancing the proficiency of Large Vision and Language Models in handling interleaved inputs. Our framework is notable for (1) its open-source support for multi-round and multi-image dialogues, (2) introducing an innovative multi-modal causal attention mechanism, and (3) utilizing data blending techniques on existing datasets to assure seamless interactions in multi-round, multi-image conversations. Compared to existing frameworks, DeepSpeed-VisualChat shows superior scalability up to 70B parameter language model size, representing a significant advancement in multi-modal language models and setting a solid foundation for future explorations.
Submitted 29 November, 2023; v1 submitted 25 September, 2023;
originally announced September 2023.
-
Measurement of the $e^{+}e^{-} \to K_{S}^{0} K_{L}^{0} π^{0}$ cross sections from $\sqrt{s}=$ 2.000 to 3.080 GeV
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
X. C. Ai,
R. Aliberti,
A. Amoroso,
M. R. An,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere,
A. Brueggemann
, et al. (604 additional authors not shown)
Abstract:
Based on $e^{+}e^{-}$ collision data collected at center-of-mass energies from 2.000 to 3.080 GeV by the BESIII detector at the BEPCII collider, a partial wave analysis is performed for the process $e^{+}e^{-}\to K_{S}^{0} K_{L}^{0} π^{0}$. The results allow the Born cross sections of the process $e^{+}e^{-}\to K_{S}^{0} K_{L}^{0} π^{0}$, as well as its subprocesses $e^{+}e^{-}\to K^{*}(892)^{0}\bar{K}^{0}$ and $K^{*}_{2}(1430)^{0}\bar{K}^{0}$ to be measured. The Born cross sections for $e^{+}e^{-}\to K_{S}^{0}K_{L}^{0}π^{0}$ are consistent with previous measurements by BaBar, but with substantially improved precision. The Born cross section lineshape of the process $e^{+}e^{-}\to K^{*}(892)^{0}\bar{K}^{0}$ is consistent with a vector meson state around 2.2 GeV with a significance of 3.2$σ$. A Breit-Wigner fit determines its mass as $M_Y=(2164.7\pm9.1\pm3.1)~{\rm{MeV}}/c^{2}$ and its width as $Γ_{Y}=(32.4\pm21.0\pm1.8)~\rm{MeV}$.
Submitted 26 February, 2024; v1 submitted 25 September, 2023;
originally announced September 2023.
-
Visualizing the Zhang-Rice singlet, molecular orbitals and pair formation in cuprate
Authors:
Shusen Ye,
Jianfa Zhao,
Zhiheng Yao,
Sixuan Chen,
Zehao Dong,
Xintong Li,
Luchuan Shi,
Qingqing Liu,
Changqing Jin,
Yayu Wang
Abstract:
The parent compound of cuprates is a charge-transfer-type Mott insulator with strong hybridization between the Cu $3d_{\mathrm x^2-y^2}$ and O $2p$ orbitals. A key question concerning the pairing mechanism is the behavior of doped holes in the antiferromagnetic (AF) Mott insulator background, which is a prototypical quantum many-body problem. It was proposed that a doped hole on the O site tends to form a singlet, known as the Zhang-Rice singlet (ZRS), with the unpaired Cu spin. But experimentally little is known about the properties of a single hole and the interplay between them that leads to superconductivity. Here we use scanning tunneling microscopy to visualize the electronic states in hole-doped $\mathrm{Ca_2CuO_2Cl_2}$, aiming to establish the atomic-scale local basis for pair formation. A single doped hole is shown to have an in-gap state and a clover-shaped spatial distribution that can be attributed to a localized ZRS. When the dopants are close enough, they develop delocalized molecular orbitals with characteristic stripe- and ladder-shaped patterns, accompanied by the opening of a small gap around the Fermi level ($E_{\mathrm F}$). With increasing doping, the molecular orbitals proliferate in space and gradually form densely packed plaquettes, but the stripe and ladder patterns remain nearly the same. The low-energy electronic states of the molecular orbitals are intimately related to the local pairing properties, and thus play a vitally important role in the emergence of superconductivity. We propose that the Cooper pair is formed by two holes occupying the stripe-like molecular orbital, while the attractive interaction is mediated by the AF spin background.
Submitted 17 September, 2023;
originally announced September 2023.
-
Libriheavy: a 50,000 hours ASR corpus with punctuation casing and context
Authors:
Wei Kang,
Xiaoyu Yang,
Zengwei Yao,
Fangjun Kuang,
Yifan Yang,
Liyong Guo,
Long Lin,
Daniel Povey
Abstract:
In this paper, we introduce Libriheavy, a large-scale ASR corpus consisting of 50,000 hours of read English speech derived from LibriVox. To the best of our knowledge, Libriheavy is the largest freely-available corpus of speech with supervisions. Different from other open-sourced datasets that only provide normalized transcriptions, Libriheavy contains richer information such as punctuation, casing and text context, which brings more flexibility for system building. Specifically, we propose a general and efficient pipeline to locate, align and segment the audio in the previously published Librilight to its corresponding texts. Like Librilight, Libriheavy also has three training subsets (small, medium, large) of sizes 500h, 5000h, and 50000h, respectively. We also extract the dev and test evaluation sets from the aligned audio and guarantee that they share no speakers or books with the training sets. Baseline systems are built on the popular CTC-Attention and transducer models. Additionally, we open-source our dataset creation pipeline, which can also be applied to other audio alignment tasks.
Submitted 14 January, 2024; v1 submitted 14 September, 2023;
originally announced September 2023.
-
PromptASR for contextualized ASR with controllable style
Authors:
Xiaoyu Yang,
Wei Kang,
Zengwei Yao,
Yifan Yang,
Liyong Guo,
Fangjun Kuang,
Long Lin,
Daniel Povey
Abstract:
Prompts are crucial to large language models as they provide context information such as topic or logical relationships. Inspired by this, we propose PromptASR, a framework that integrates prompts in end-to-end automatic speech recognition (E2E ASR) systems to achieve contextualized ASR with controllable style of transcriptions. Specifically, a dedicated text encoder encodes the text prompts and the encodings are injected into the speech encoder by cross-attending the features from two modalities. When using the ground truth text from preceding utterances as content prompt, the proposed system achieves 21.9% and 6.8% relative word error rate reductions on a book reading dataset and an in-house dataset compared to a baseline ASR system. The system can also take word-level biasing lists as prompt to improve recognition accuracy on rare words. An additional style prompt can be given to the text encoder and guide the ASR system to output different styles of transcriptions. The code is available at icefall.
Submitted 24 January, 2024; v1 submitted 13 September, 2023;
originally announced September 2023.
-
Measurements of the absolute branching fractions of $Ω^-$ decays and test of the $ΔI = 1/2$ rule
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
X. C. Ai,
R. Aliberti,
A. Amoroso,
M. R. An,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere,
A. Brueggemann
, et al. (599 additional authors not shown)
Abstract:
Based on a data set of $(27.12\pm0.10)\times 10^8$ $ψ(3686)$ events collected at the BESIII experiment, the absolute branching fractions of the three dominant $Ω^-$ decays are measured to be $\mathcal{B}_{Ω^- \to Ξ^0 π^-} = (25.03\pm0.44\pm0.53)\%$, $\mathcal{B}_{Ω^- \to Ξ^- π^0} = (8.43\pm0.52\pm0.28)\%$, and $\mathcal{B}_{Ω^- \to ΛK^-} = (66.3\pm0.8\pm2.0)\%$, where the first and second uncertainties are statistical and systematic, respectively. The ratio between $\mathcal{B}_{Ω^- \to Ξ^0 π^-}$ and $\mathcal{B}_{Ω^- \to Ξ^- π^0}$ is determined to be $2.97\pm0.19\pm0.11$, which is in good agreement with the PDG value of $2.74\pm0.15$, but greater by more than four standard deviations than the theoretical prediction of 2 obtained from the $ΔI = 1/2$ rule.
Submitted 12 September, 2023;
originally announced September 2023.
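As a rough consistency check on the ratio quoted in the abstract above, the following sketch recomputes it from the two measured branching fractions and estimates its deviation from the $ΔI = 1/2$ prediction of 2. The function name `ratio_and_pull` is hypothetical, and combining the quoted statistical and systematic uncertainties of the ratio in quadrature is an assumption made here for illustration, not a statement of the collaboration's method.

```python
import math


def ratio_and_pull(b_num, b_den, ratio_stat, ratio_syst, prediction):
    """Return the branching-fraction ratio and its deviation from a prediction.

    Assumption: the statistical and systematic uncertainties on the ratio are
    independent and are combined in quadrature.
    """
    ratio = b_num / b_den
    total_unc = math.hypot(ratio_stat, ratio_syst)
    pull = (ratio - prediction) / total_unc
    return ratio, pull


# Values from the abstract: B(Xi0 pi-) = 25.03 percent, B(Xi- pi0) = 8.43 percent,
# ratio uncertainties 0.19 (stat) and 0.11 (syst), predicted ratio 2.
ratio, pull = ratio_and_pull(25.03, 8.43, 0.19, 0.11, 2.0)
print(f"ratio = {ratio:.2f}, deviation = {pull:.1f} standard deviations")
```

Under these assumptions the ratio rounds to the quoted 2.97, and the deviation from 2 exceeds four standard deviations, in line with the abstract.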
-
Observation of $D^{+}\to K_{S}^{0}a_{0}(980)^{+}$ in the amplitude analysis of $D^{+} \to K_{S}^{0}π^+η$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
O. Afedulidis,
X. C. Ai,
R. Aliberti,
A. Amoroso,
M. R. An,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere
, et al. (604 additional authors not shown)
Abstract:
We perform for the first time an amplitude analysis of the decay $D^{+}\to K_{S}^{0}π^+η$ and report the observation of the decay $D^{+}\to K_{S}^{0}a_{0}(980)^{+}$ using 2.93 fb$^{-1}$ of $e^+e^-$ collision data taken at a center-of-mass energy of 3.773 GeV with the BESIII detector. As the only W-annihilation free decay among $D$ to $a_{0}(980)$-pseudoscalar, $D^{+}\to K_{S}^{0}a_{0}(980)^{+}$ is the ideal decay to extract the contributions of the external and internal $W$-emission amplitudes involving $a_{0}(980)$ and study the final-state interactions. The absolute branching fraction of $D^{+}\to K_{S}^{0}π^+η$ is measured to be $(1.27\pm0.04_{\rm stat.}\pm0.03_{\rm syst.})\%$. The product branching fractions of $D^{+}\to K_{S}^{0}a_{0}(980)^{+}$ with $a_{0}(980)^{+}\to π^+η$ and $D^{+}\to π^+ K_0^*(1430)^0$ with $K_0^*(1430)^0\to K_{S}^{0}η$ are measured to be $(1.33\pm0.05_{\rm stat.}\pm0.04_{\rm syst.})\%$ and $(0.14\pm0.03_{\rm stat.}\pm0.01_{\rm syst.})\%$, respectively.
Submitted 29 March, 2024; v1 submitted 11 September, 2023;
originally announced September 2023.
-
Observation of the Singly Cabibbo-Suppressed Decay $Λ_{c}^{+}\to Σ^{-}K^{+}π^{+}$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
O. Afedulidis,
X. C. Ai,
R. Aliberti,
A. Amoroso,
M. R. An,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere
, et al. (605 additional authors not shown)
Abstract:
The singly Cabibbo-suppressed decay $Λ_{c}^{+}\to Σ^{-}K^{+}π^{+}$ is observed for the first time with a statistical significance of $6.4σ$ by using 4.5 fb$^{-1}$ of $e^+e^-$ collision data collected at center-of-mass energies between 4.600 and 4.699 GeV with the BESIII detector at BEPCII. The absolute branching fraction of $Λ_{c}^{+}\to Σ^{-}K^{+}π^{+}$ is measured to be $(3.8\pm1.3_{\rm stat}\pm0.2_{\rm syst})\times 10^{-4}$ in a model-independent approach. This is the first observation of a Cabibbo-suppressed $Λ_{c}^{+}$ decay involving $Σ^-$ in the final state. The ratio of branching fractions between $Λ_{c}^{+}\to Σ^{-}K^{+}π^{+}$ and the Cabibbo-favored decay $Λ_{c}^{+}\to Σ^- π^+π^+$ is calculated to be $(0.4 \pm 0.1)s_{c}^{2}$, where $s_{c} \equiv \sinθ_c = 0.2248$ with $θ_c$ the Cabibbo mixing angle. This ratio significantly deviates from $1.0s_{c}^{2}$ and provides important information for the understanding of nonfactorization contributions in $Λ_{c}^{+}$ decays.
Submitted 8 May, 2024; v1 submitted 11 September, 2023;
originally announced September 2023.
-
Study of Io's sodium jets with the TRAPPIST telescopes
Authors:
Alexander de Becker,
Linus Head,
Bertrand Bonfond,
Emmanuël Jehin,
Jean Manfroid,
Zhonghua Yao,
Binzheng Zhang,
Denis Grodent,
Nicholas Schneider,
Zouhair Benkhaldoun
Abstract:
Io is the most volcanically active body in the Solar System. This volcanic activity results in the ejection of material into Io's atmosphere, which may then escape from the atmosphere to form various structures in the jovian magnetosphere, including the plasma torus and clouds of neutral particles. The physical processes involved in the escape of particles - for example, how the volcanoes of Io provide material to the plasma torus - are not yet fully understood. In particular, it is not clear to what extent the sodium jet, one of the sodium neutral clouds related to Io, is a proxy of processes that populate the various reservoirs of plasma in Jupiter's magnetosphere. Here, we report on observations carried out over 17 nights in 2014-2015, 30 nights in 2021, and 23 nights in 2022-2023 with the TRAPPIST telescopes, in which particular attention was paid to the sodium jets and the quantification of their physical properties (length, brightness). It was found that these properties can vary greatly from one jet to another and independently of the position of Io in its orbit. No clear link was found between the presence of jets and global brightening of the plasma torus and extended sodium nebula, indicating that jets do not contribute straightforwardly to their population. This work also demonstrates the advantage of regular and long-term monitoring to understanding the variability of the sodium jet and presents a large corpus of jet detections against which work in related fields may compare.
Submitted 8 September, 2023;
originally announced September 2023.
-
Measurement of the cross section of $e^+e^-\rightarrowΞ^{-}\barΞ^{+}$ at center-of-mass energies between 3.510 and 4.843 GeV
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
X. C. Ai,
R. Aliberti,
A. Amoroso,
M. R. An,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere,
A. Brueggemann
, et al. (599 additional authors not shown)
Abstract:
Using $e^+e^-$ collision data corresponding to a total integrated luminosity of 12.9 $fb^{-1}$ collected with the BESIII detector at the BEPCII collider, the exclusive Born cross sections and the effective form factors of the reaction $e^+e^-\rightarrowΞ^{-}\barΞ^{+}$ are measured via the single baryon-tag method at 23 center-of-mass energies between 3.510 and 4.843 GeV. Evidence for the decay $ψ(3770)\rightarrowΞ^{-}\barΞ^{+}$ is observed with a significance of 4.5$σ$ by analyzing the measured cross sections together with earlier BESIII results. For the other charmonium(-like) states $ψ(4040)$, $ψ(4160)$, $Y(4230)$, $Y(4360)$, $ψ(4415)$, and $Y(4660)$, no significant signal of their decay to $Ξ^-\bar Ξ^+$ is found. For these states, upper limits of the products of the branching fraction and the electronic partial width at the 90% confidence level are provided.
Submitted 30 November, 2023; v1 submitted 8 September, 2023;
originally announced September 2023.
-
Search for the semileptonic decays $D^+_s \to K_1(1270)^0 e^+ν_e$ and $D^+_s \to b_1(1235)^0 e^+ν_e$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
X. C. Ai,
R. Aliberti,
A. Amoroso,
M. R. An,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere,
A. Brueggemann
, et al. (601 additional authors not shown)
Abstract:
By analyzing 7.33\,fb$^{-1}$ of $e^+e^-$ collision data collected at center-of-mass energies between 4.128 and 4.226 GeV with the BESIII detector, we search for the semileptonic decays $D^+_s \to K_1(1270)^0 e^+ν_e$ and $D^+_s \to b_1(1235)^0 e^+ν_e$ for the first time. No significant signals are observed for either decay mode. The upper limits on the (product) branching fractions are determined to be ${\mathcal B}[D^+_s \to K_1(1270)^0 e^+ν_e] < 4.1\times 10^{-4}$ and ${\mathcal B}[D^+_s \to b_1(1235)^0 e^+ν_e]\cdot {\mathcal B}[b_1(1235)^0\to ωπ^0] < 6.4\times 10^{-4}$ at 90\% confidence level.
Submitted 7 September, 2023;
originally announced September 2023.
-
First Measurement of the Decay Asymmetry in the pure W-boson-exchange Decay $Λ_{c}^{+}\toΞ^{0}K^{+}$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
O. Afedulidis,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere
, et al. (618 additional authors not shown)
Abstract:
Based on $4.4~\text{fb}^{-1}$ of $e^{+}e^{-}$ annihilation data collected at the center-of-mass energies between $4.60$ and $4.70~\text{GeV}$ with the BESIII detector at the BEPCII collider, the pure \textit{W}-boson-exchange decay $Λ_{c}^{+}\toΞ^{0}K^{+}$ is studied with a full angular analysis. The corresponding decay asymmetry is measured for the first time to be $α_{Ξ^{0}K^{+}}=0.01\pm0.16({\rm stat.})\pm0.03({\rm syst.})$. This result reflects the non-interference effect between the $S$- and $P$-wave amplitudes. The phase shift between $S$- and $P$-wave amplitudes has two solutions, which are $δ_{p}-δ_{s}=-1.55\pm0.25({\rm stat.})\pm0.05({\rm syst.})~\text{rad}$ or $1.59\pm0.25({\rm stat.})\pm0.05({\rm syst.})~\text{rad}$.
Submitted 20 January, 2024; v1 submitted 6 September, 2023;
originally announced September 2023.
-
A coupled-channel analysis of the $X(3872)$ lineshape with BESIII data
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
X. C. Ai,
R. Aliberti,
A. Amoroso,
M. R. An,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere,
A. Brueggemann
, et al. (600 additional authors not shown)
Abstract:
We perform a study of the $X(3872)$ lineshape using the data samples of $e^+e^-\toγX(3872)$, $X(3872)\to D^0\bar{D}^0 π^0$ and $π^+π^- J/ψ$ collected with the BESIII detector. The effects of the coupled-channels and the off-shell $D^{*0}$ are included in the parameterization of the lineshape. The lineshape mass parameter is obtained to be $M_{X}=(3871.63\pm 0.13^{+0.06}_{-0.05})$ MeV. Two poles are found on the first and second Riemann sheets corresponding to the $D^{*0}\bar{D}^0$ branch cut. The pole location on the first sheet is much closer to the $D^{*0}\bar{D}^0$ threshold than the other, and is determined to be $7.04\pm0.15^{+0.07}_{-0.08}$ MeV above the $D^0\bar{D}^0π^0$ threshold with an imaginary part $-0.19\pm0.08^{+0.14}_{-0.19}$ MeV.
Submitted 4 September, 2023;
originally announced September 2023.
-
RenAIssance: A Survey into AI Text-to-Image Generation in the Era of Large Model
Authors:
Fengxiang Bie,
Yibo Yang,
Zhongzhu Zhou,
Adam Ghanem,
Minjia Zhang,
Zhewei Yao,
Xiaoxia Wu,
Connor Holmes,
Pareesa Golnari,
David A. Clifton,
Yuxiong He,
Dacheng Tao,
Shuaiwen Leon Song
Abstract:
Text-to-image generation (TTI) refers to the use of models that process text input and generate high-fidelity images based on text descriptions. Text-to-image generation using neural networks can be traced back to the emergence of the Generative Adversarial Network (GAN), followed by the autoregressive Transformer. Diffusion models are one prominent type of generative model used for the generation of images through the systematic introduction of noise over repeated steps. Owing to their impressive results on image synthesis, diffusion models have been cemented as the major image decoder used by text-to-image models and have brought text-to-image generation to the forefront of machine-learning (ML) research. In the era of large models, scaling up model size and integration with large language models have further improved the performance of TTI models, producing results nearly indistinguishable from real-world images and revolutionizing the way we retrieve images. Our explorative study has led us to think that there are further ways of scaling text-to-image models by combining innovative model architectures and prediction enhancement techniques. We have divided this survey into five main sections, in which we detail the frameworks of the major literature in order to delve into the different types of text-to-image generation methods. Following this, we provide a detailed comparison and critique of these methods and offer possible pathways of improvement for future work. Looking ahead, we argue that TTI development could yield impressive productivity improvements for creation, particularly in the context of the AIGC era, and could be extended to more complex tasks such as video generation and 3D generation.
Submitted 1 September, 2023;
originally announced September 2023.
-
Suppressing electron disorder-induced heating of ultracold neutral plasma via optical lattice
Authors:
HaiBo Wang,
Zonglin Yao,
Fuyang Zhou,
Yong Wu,
Jianguo Wang,
Xiangjun Chen
Abstract:
Disorder-induced heating (DIH) prevents ultracold neutral plasma from entering the electron strong-coupling regime. Here we propose a scheme to suppress electronic DIH via an optical lattice. We simulate the evolution dynamics of ultracold neutral plasma constrained by a three-dimensional optical lattice using the classical molecular dynamics method. The results show that, under experimentally achievable conditions, electronic DIH is suppressed by a factor of 1.3, and the Coulomb coupling strength can reach 0.8, which approaches the strong-coupling regime. Suppressing electronic DIH via an optical lattice may pave the way for research on electronic strongly coupled plasma.
Submitted 31 August, 2023;
originally announced August 2023.
-
$\mathbb{Z}_p$-lattices in semistable Galois representations
Authors:
Zijian Yao
Abstract:
We show that the category of logarithmic prismatic F-crystals on $(\mathcal{O}_K, \varpi^{\mathbb{N}})$ is equivalent to the category of $\mathbb{Z}_p$-lattices in semistable $\text{Gal}_K$-representations. We then apply our method to describe such Galois representations using linear algebraic data via various "logarithmic" versions of Breuil--Kisin modules.
Submitted 29 August, 2023;
originally announced August 2023.
-
Observation of a vector charmoniumlike state at 4.7 ${\rm GeV}/c^2$ and search for $Z_{cs}$ in $e^+e^-\to K^+K^-J/ψ$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
X. C. Ai,
R. Aliberti,
A. Amoroso,
M. R. An,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere,
A. Brueggemann
, et al. (599 additional authors not shown)
Abstract:
Using data samples with an integrated luminosity of 5.85~fb$^{-1}$ collected at center-of-mass energies from 4.61 to 4.95 GeV with the BESIII detector operating at the BEPCII storage ring, we measure the cross section for the process $e^+e^-\to K^+K^-J/ψ$. A new resonance with a mass of $M = 4708_{-15}^{+17}\pm21$ MeV/$c^{2}$ and a width of $Γ= 126_{-23}^{+27}\pm30$ MeV is observed in the energy-dependent line shape of the $e^+e^-\to K^+K^-J/ψ$ cross section with a significance over $5σ$. The $K^{+}J/ψ$ system is also investigated to search for charged charmoniumlike states, but no significant $Z_{cs}^+$ states are observed. Upper limits on the Born cross sections for $e^+e^-\to K^{-} Z_{cs}(3985)^{+}/K^{-} Z_{cs}(4000)^{+} + c.c.$ with $Z_{cs}(3985)^{\pm}/Z_{cs}(4000)^{\pm}\to K^{\pm} J/ψ$ are reported at 90\% confidence levels. The ratio of branching fractions $\frac{\mathcal{B}(Z_{cs}(3985)^{+}\to K^+ J/ψ)}{\mathcal{B}(Z_{cs}(3985)^{+}\to (\bar{D}^{0}D_s^{*+} + \bar{D}^{*0}D_s^+))}$ is measured to be less than 0.03 at 90\% confidence level.
Submitted 24 November, 2023; v1 submitted 29 August, 2023;
originally announced August 2023.
-
Search for the light hadron decay $χ_{c1}(3872) \to π^{+}π^{-}η$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
X. C. Ai,
R. Aliberti,
A. Amoroso,
M. R. An,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere,
A. Brueggemann
, et al. (600 additional authors not shown)
Abstract:
With a data sample corresponding to an integrated luminosity of 11.5~fb$^{-1}$
collected with the BESIII detector operating at the BEPCII storage ring, for the first time the light hadron decay $χ_{c1}(3872) \rightarrow π^{+}π^{-}η$
is searched for. While no significant signal is observed, the upper limits at the 90\% confidence level for
$σ[e^{+}e^{-} \rightarrow γχ_{c1}(3872)] \mathcal{B}[χ_{c1}(3872) \rightarrow π^{+}π^{-}η]$ at center-of-mass energies from 4.13 to 4.34 GeV are determined.
By normalizing to the $χ_{c1}(3872)\toπ^+π^- J/ψ$ decay channel, a 90\% confidence level upper limit for the branching fraction ratio
$\mathcal{R}=\mathcal{B}[χ_{c1}(3872) \rightarrowπ^{+}π^{-}η]/\mathcal{B}[χ_{c1}(3872) \rightarrow π^{+}π^{-} J/ψ] < 0.12$ is given.
These measurements provide important inputs for understanding the internal structure of the $χ_{c1}(3872)$ resonance.
Submitted 19 January, 2024; v1 submitted 26 August, 2023;
originally announced August 2023.
-
Improved measurement of the branching fractions for $J/ψ\toγπ^0$, $γη$ and $γη^\prime$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
X. C. Ai,
R. Aliberti,
A. Amoroso,
M. R. An,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere,
A. Brueggemann
, et al. (598 additional authors not shown)
Abstract:
Using a data sample of $(1.0087\pm 0.0044)\times 10^{10}$ $J/ψ$ events collected with the BESIII detector, the decays of $J/ψ\toγπ^{0} (η, η^\prime)\toγγγ$ are studied. Newly measured branching fractions are $\mathcal{B}$$(J/ψ\toγπ^{0})$=$(3.34\pm 0.02\pm 0.09)\times 10^{-5}$, $\mathcal{B}$$(J/ψ\toγη)$=$(1.096\pm 0.001\pm0.019)\times 10^{-3}$ and $\mathcal{B}$$(J/ψ\toγη^\prime)$=$(5.40\pm 0.01\pm0.11)\times 10^{-3}$, where the first uncertainties are statistical and the second are systematic. These results are consistent with the world average values within two standard deviations. The ratio of partial widths $Γ(J/ψ\toγη^\prime)/Γ(J/ψ\toγη)$ is measured to be $4.93 \pm 0.13$. The singlet-octet pseudoscalar mixing angle $θ_P$ is determined to be $θ_P = -(22.11 \pm0.26)^\circ$ or $-(19.34 \pm 0.34)^\circ$ with two different phenomenological models.
Submitted 25 August, 2023;
originally announced August 2023.
-
Contrastive Learning of Temporal Distinctiveness for Survival Analysis in Electronic Health Records
Authors:
Mohsen Nayebi Kerdabadi,
Arya Hadizadeh Moghaddam,
Bin Liu,
Mei Liu,
Zijun Yao
Abstract:
Survival analysis plays a crucial role in many healthcare decisions, where the risk prediction for the events of interest can support an informative outlook for a patient's medical journey. Given the existence of data censoring, an effective way of survival analysis is to enforce the pairwise temporal concordance between censored and observed data, aiming to utilize the time interval before censoring as partially observed time-to-event labels for supervised learning. Although existing studies mostly employed ranking methods to pursue an ordering objective, contrastive methods which learn a discriminative embedding by having data contrast against each other, have not been explored thoroughly for survival analysis. Therefore, in this paper, we propose a novel Ontology-aware Temporality-based Contrastive Survival (OTCSurv) analysis framework that utilizes survival durations from both censored and observed data to define temporal distinctiveness and construct negative sample pairs with adjustable hardness for contrastive learning. Specifically, we first use an ontological encoder and a sequential self-attention encoder to represent the longitudinal EHR data with rich contexts. Second, we design a temporal contrastive loss to capture varying survival durations in a supervised setting through a hardness-aware negative sampling mechanism. Last, we incorporate the contrastive task into the time-to-event predictive task with multiple loss components. We conduct extensive experiments using a large EHR dataset to forecast the risk of hospitalized patients who are in danger of developing acute kidney injury (AKI), a critical and urgent medical condition. The effectiveness and explainability of the proposed model are validated through comprehensive quantitative and qualitative studies.
Submitted 27 September, 2023; v1 submitted 24 August, 2023;
originally announced August 2023.
-
Study of $e^+e^-\toηφ$ at center-of-mass energies from 3.773 to 4.600 GeV
Authors:
M. Ablikim,
M. N. Achasov,
P. Adlarson,
X. C. Ai,
R. Aliberti,
A. Amoroso,
M. R. An,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
J. Bloms,
A. Bortone,
I. Boyko,
R. A. Briere,
A. Brueggemann
, et al. (600 additional authors not shown)
Abstract:
We present a study of the process $e^{+}e^{-}\toηφ$ using data samples collected with the BESIII detector corresponding to an integrated luminosity of 15.03 fb$^{-1}$ at 23 center-of-mass energies from 3.773 to 4.600 GeV. The Born cross sections are measured at each energy and a coherent fit to cross-section lineshape is performed using a Breit-Wigner parametrization to search for charmonium-like vector states. No significant signals of the $Y(4230)$ and $Y(4360)$ resonances are observed.
Submitted 24 October, 2023; v1 submitted 16 August, 2023;
originally announced August 2023.
-
A Comprehensive Study on Knowledge Graph Embedding over Relational Patterns Based on Rule Learning
Authors:
Long Jin,
Zhen Yao,
Mingyang Chen,
Huajun Chen,
Wen Zhang
Abstract:
Knowledge Graph Embedding (KGE) has proven to be an effective approach to solving the Knowledge Graph Completion (KGC) task. Relational patterns, which refer to relations with specific semantics exhibiting graph patterns, are an important factor in the performance of KGE models. Though KGE models' capabilities have been analyzed over different relational patterns in theory, and a rough connection between better relational pattern modeling and better KGC performance has been established, a comprehensive quantitative analysis of KGE models over relational patterns remains absent. It is therefore uncertain how a KGE model's theoretical support for a relational pattern contributes to its performance on triples associated with that pattern. To address this challenge, we evaluate the performance of 7 KGE models over 4 common relational patterns on 2 benchmarks, then conduct an analysis from three aspects (theory, entity frequency, and part-to-whole) and reach some counterintuitive conclusions. Finally, we introduce a training-free method, Score-based Patterns Adaptation (SPA), to enhance KGE models' performance over various relational patterns. This approach is simple yet effective and can be applied to KGE models without additional training. Our experimental results demonstrate that our method generally enhances performance over specific relational patterns. Our source code is available from GitHub at https://github.com/zjukg/Comprehensive-Study-over-Relational-Patterns.
Submitted 15 August, 2023;
originally announced August 2023.
-
Conformal order and Poincaré-Klein mapping underlying electrostatics-driven inhomogeneity in tethered membranes
Authors:
Honghui Sun,
Zhenwei Yao
Abstract:
Understanding the organization of matter under the long-range electrostatic force is a fundamental problem in multiple fields. In this work, based on the electrically charged tethered membrane model, we reveal regular structures underlying the lowest-energy states of inhomogeneously stretched planar lattices by a combination of numerical simulation and analytical geometric analysis. Specifically, we show the conformal order characterized by the preserved bond angle in the lattice deformation, and reveal the Poincaré-Klein mapping underlying the electrostatics-driven inhomogeneity. The discovery of the Poincaré-Klein mapping, which connects the Poincaré disk and the Klein disk for the hyperbolic plane, implies a connection between the long-range electrostatic force and hyperbolic geometry. We also discuss lattices with patterned charges of opposite signs for modulating in-plane inhomogeneity and even creating 3D shapes, which may have a connection to metamaterials design. This work suggests geometric analysis as a promising approach for elucidating the organization of matter under the long-range force.
Submitted 13 August, 2023;
originally announced August 2023.
-
LittleMu: Deploying an Online Virtual Teaching Assistant via Heterogeneous Sources Integration and Chain of Teach Prompts
Authors:
Shangqing Tu,
Zheyuan Zhang,
Jifan Yu,
Chunyang Li,
Siyu Zhang,
Zijun Yao,
Lei Hou,
Juanzi Li
Abstract:
Teaching assistants have played essential roles in the long history of education. However, few MOOC platforms provide human or virtual teaching assistants to support learning for massive numbers of online students, due to the complexity of real-world online education scenarios and the lack of training data. In this paper, we present LittleMu, a virtual MOOC teaching assistant built with minimal labeled training data, to provide question answering and chit-chat services. Consisting of two interactive modules, heterogeneous retrieval and language model prompting, LittleMu first integrates structured, semi-structured, and unstructured knowledge sources to support accurate answers for a wide range of questions. Then, we design delicate demonstrations named "Chain of Teach" prompts to exploit the large-scale pre-trained model to handle complex uncollected questions. Beyond question answering, we develop other educational services such as knowledge-grounded chit-chat. We test the system's performance via both offline evaluation and online deployment. Since May 2020, our LittleMu system has served over 80,000 users with over 300,000 queries from over 500 courses on the XuetangX MOOC platform, continuously contributing to a more convenient and fair education. Our code, services, and dataset will be available at https://github.com/THU-KEG/VTA.
Submitted 11 August, 2023;
originally announced August 2023.
-
Gentopia: A Collaborative Platform for Tool-Augmented LLMs
Authors:
Binfeng Xu,
Xukun Liu,
Hua Shen,
Zeyu Han,
Yuhan Li,
Murong Yue,
Zhiyuan Peng,
Yuchen Liu,
Ziyu Yao,
Dongkuan Xu
Abstract:
Augmented Language Models (ALMs) empower large language models with the ability to use tools, transforming them into intelligent agents for real-world interactions. However, most existing frameworks for ALMs, to varying degrees, are deficient in the following critical features: flexible customization, collaborative democratization, and holistic evaluation. We present gentopia, an ALM framework enabling flexible customization of agents through simple configurations, seamlessly integrating various language models, task formats, prompting modules, and plugins into a unified paradigm. Furthermore, we establish gentpool, a public platform enabling the registration and sharing of user-customized agents. Agents registered in gentpool are composable, so that they can be assembled for agent collaboration, advancing the democratization of artificial intelligence. To ensure high-quality agents, gentbench, an integral component of gentpool, is designed to thoroughly evaluate user-customized agents across diverse aspects such as safety, robustness, and efficiency. We release gentopia on GitHub and will continue to develop it.
Submitted 8 August, 2023;
originally announced August 2023.
-
Measurement of the $e^+e^- \to Λ\barΣ^0 + c.c.$ cross sections at $\sqrt{s}$ from 2.3094 to 3.0800 GeV
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
X. C. Ai,
R. Aliberti,
A. Amoroso,
M. R. An,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
J. Bloms,
A. Bortone,
I. Boyko,
R. A. Briere
, et al. (601 additional authors not shown)
Abstract:
The Born cross sections and effective form factors of the process $e^+e^-\toΛ\barΣ^0 + c.c.$ are measured at 14 center-of-mass energy points from 2.3094 to 3.0800 GeV, based on data corresponding to an integrated luminosity of $(478.5 \pm 4.8)\ \text{pb}^{-1}$ collected with the BESIII detector. A non-zero Born cross section is observed at the center-of-mass energy of 2.3094 GeV with a statistical significance of more than five standard deviations, and the cross sections at other energies are obtained with improved precision compared to earlier measurements from the BaBar Collaboration. The Born cross-section lineshape is described better by a shape with a plateau near the threshold than by a pQCD motivated functional form.
Submitted 7 August, 2023;
originally announced August 2023.
-
PaniniQA: Enhancing Patient Education Through Interactive Question Answering
Authors:
Pengshan Cai,
Zonghai Yao,
Fei Liu,
Dakuo Wang,
Meghan Reilly,
Huixue Zhou,
Lingxi Li,
Yi Cao,
Alok Kapoor,
Adarsha Bajracharya,
Dan Berlowitz,
Hong Yu
Abstract:
Patient portals allow discharged patients to access their personalized discharge instructions in electronic health records (EHRs). However, many patients have difficulty understanding or memorizing their discharge instructions. In this paper, we present PaniniQA, a patient-centric interactive question answering system designed to help patients understand their discharge instructions. PaniniQA first identifies important clinical content from patients' discharge instructions and then formulates patient-specific educational questions. In addition, PaniniQA is equipped with answer verification functionality to provide timely feedback to correct patients' misunderstandings. Our comprehensive automatic and human evaluation results demonstrate that PaniniQA is capable of improving patients' mastery of their medical instructions through effective interactions.
Submitted 20 August, 2023; v1 submitted 6 August, 2023;
originally announced August 2023.
-
DeepSpeed-Chat: Easy, Fast and Affordable RLHF Training of ChatGPT-like Models at All Scales
Authors:
Zhewei Yao,
Reza Yazdani Aminabadi,
Olatunji Ruwase,
Samyam Rajbhandari,
Xiaoxia Wu,
Ammar Ahmad Awan,
Jeff Rasley,
Minjia Zhang,
Conglong Li,
Connor Holmes,
Zhongzhu Zhou,
Michael Wyatt,
Molly Smith,
Lev Kurilenko,
Heyang Qin,
Masahiro Tanaka,
Shuai Che,
Shuaiwen Leon Song,
Yuxiong He
Abstract:
ChatGPT-like models have revolutionized various applications in artificial intelligence, from summarization and coding to translation, matching or even surpassing human performance. However, the current landscape lacks an accessible, efficient, and cost-effective end-to-end RLHF (Reinforcement Learning with Human Feedback) training pipeline for these powerful models, particularly when training at the scale of billions of parameters. This paper introduces DeepSpeed-Chat, a novel system that democratizes RLHF training, making it accessible to the AI community. DeepSpeed-Chat offers three key capabilities: an easy-to-use training and inference experience for ChatGPT-like models, a DeepSpeed-RLHF pipeline that replicates the training pipeline from InstructGPT, and a robust DeepSpeed-RLHF system that combines various optimizations for training and inference in a unified way. The system delivers unparalleled efficiency and scalability, enabling training of models with hundreds of billions of parameters in record time and at a fraction of the cost. With this development, DeepSpeed-Chat paves the way for broader access to advanced RLHF training, even for data scientists with limited resources, thereby fostering innovation and further development in the field of AI.
Submitted 2 August, 2023;
originally announced August 2023.
-
Determination of the $Σ^{+}$ Timelike Electromagnetic Form Factors
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
X. C. Ai,
R. Aliberti,
A. Amoroso,
M. R. An,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere,
A. Brueggemann
, et al. (604 additional authors not shown)
Abstract:
Based on data samples collected with the BESIII detector at the BEPCII collider, the process $e^{+}e^{-} \to Σ^{+}\barΣ^{-}$ is studied at center-of-mass energies $\sqrt{s}$ = 2.3960, 2.6454, and 2.9000 GeV. Using a fully differential angular description of the final state particles, both the relative magnitude and phase information of the $Σ^{+}$ electromagnetic form factors in the timelike region are extracted. The relative phase between the electric and magnetic form factors is determined to be $\sinΔΦ$ = -0.67~$\pm$~0.29~(stat)~$\pm$~0.18~(syst) at $\sqrt{s}$ = 2.3960 GeV, $ΔΦ$ = 55$^{\circ}$~$\pm$~19$^{\circ}$~(stat) $\pm$~14$^{\circ}$~(syst) at $\sqrt{s}$ = 2.6454 GeV, and 78$^{\circ}$~$\pm$~22$^{\circ}$~(stat) $\pm$~9$^{\circ}$~(syst) at $\sqrt{s}$ = 2.9000 GeV. For the first time, the phase of the hyperon electromagnetic form factors is explored in a wide range of four-momentum transfer. The evolution of the phase along with four-momentum transfer is an important input for understanding its asymptotic behavior and the dynamics of baryons.
Submitted 5 March, 2024; v1 submitted 29 July, 2023;
originally announced July 2023.
-
Observation of the decay $J/ψ\to e^+ e^- η(1405)$ with $η(1405) \to π^0 f_0(980)$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
X. C. Ai,
R. Aliberti,
A. Amoroso,
M. R. An,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere,
A. Brueggemann
, et al. (601 additional authors not shown)
Abstract:
Using a data sample of $(10087\pm44)\times 10^6$ $J/ψ$ events collected by the BESIII detector in 2009, 2012, 2018 and 2019, the electromagnetic Dalitz process $J/ψ\to e^+ e^- η(1405)$ is observed via the decay $η(1405) \to π^0 f_0(980)$, $f_0(980) \to π^+ π^-$, with a significance of about $9.6σ$. The branching fraction of this decay is measured to be ${\mathcal B}(J/ψ\to e^+ e^- η(1405) \to e^+ e^- π^0 f_0(980) \to e^+ e^- π^0 π^+ π^-)=(2.02\pm0.24(\rm{stat.})\pm0.09(\rm{syst.}))\times 10^{-7}$. The branching-fraction ratio ${\mathcal B}(J/ψ\to e^+ e^- η(1405))$/${\mathcal B}(J/ψ\to γη(1405))$ is determined to be $(1.35\pm0.19(\rm{stat.})\pm0.06(\rm{syst.}))\times10^{-2}$. Furthermore, an $e^+e^-$ invariant-mass dependent transition form factor of $J/ψ\to e^+ e^-η(1405)$ is presented for the first time. The obtained result provides input for different theoretical models, and is valuable for an improved understanding of the intrinsic structure of the $η(1405)$ meson.
Submitted 27 July, 2023;
originally announced July 2023.
-
Improved measurement of the branching fraction of $D_s^+\toμ^+ν_μ$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
X. C. Ai,
R. Aliberti,
A. Amoroso,
M. R. An,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
J. Bloms,
A. Bortone,
I. Boyko,
R. A. Briere
, et al. (598 additional authors not shown)
Abstract:
Using $e^+e^-$ collision data with an integrated luminosity of $7.33~\mathrm{fb}^{-1}$ collected at center-of-mass energies between 4.128 and 4.226 GeV with the BESIII detector operating at the BEPCII collider, the branching fraction of the leptonic decay $D_s^+\toμ^+ν_μ$ is measured to be $(0.5294\pm0.0108_{\rm stat}\pm0.0085_{\rm syst})$\%. Based on this, the product of the $D_s^+$ decay constant $f_{D_s^+}$ and the magnitude of the $c\to s$ quark mixing matrix element $|V_{cs}|$ is determined to be $f_{D_s^+}|V_{cs}|=241.8\pm2.5_{\rm stat}\pm2.2_{\rm syst}~\mathrm{MeV}$. Using the value of $|V_{cs}|$ given by the global standard model fit, $f_{D_s^+}$ is found to be $248.4\pm2.5_{\rm stat}\pm2.2_{\rm syst}$\,MeV. Alternatively, using the value of $f_{D_s^+}$ from a recent lattice quantum chromodynamics calculation, $|V_{cs}|$ is determined to be $0.968\pm0.010_{\rm stat}\pm0.009_{\rm syst}$.
Submitted 26 July, 2023;
originally announced July 2023.
-
Measurement of $e^{+}e^{-}\toφη'$ cross sections at center-of-mass energies from 3.508 to 4.951 GeV and search for the decay $ψ(3770)\toφη'$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
X. C. Ai,
R. Aliberti,
A. Amoroso,
M. R. An,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere,
A. Brueggemann
, et al. (600 additional authors not shown)
Abstract:
The cross sections of the $e^{+}e^{-}\toφη'$ process at center-of-mass energies from 3.508 to 4.951 GeV are measured with high precision using 26.1 fb$^{-1}$ data collected with the BESIII detector operating at the BEPCII storage ring. The cross sections are of the order of a few picobarn, and decrease as the center-of-mass energy increases as $s^{-n/2}$ with $n=4.35\pm 0.14$. This result is in agreement with the Nambu-Jona-Lasinio model prediction of $n=3.5\pm 0.9$. In addition, the charmless decay $ψ(3770)\toφη'$ is searched for by fitting the measured cross sections, yet no significant signal is observed. The upper limit of ${\cal B}(ψ(3770)\toφη')$ at the 90\% confidence level is determined to be $2.3\times 10^{-5}$.
Submitted 11 September, 2023; v1 submitted 24 July, 2023;
originally announced July 2023.
-
First Observation of a Three-Resonance Structure in $e^+e^-\rightarrow$Nonopen Charm Hadrons
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
X. C. Ai,
R. Aliberti,
A. Amoroso,
M. R. An,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere
, et al. (608 additional authors not shown)
Abstract:
We report the measurement of the inclusive cross sections for $e^+e^-$$\rightarrow$nOCH (where nOCH denotes non-open charm hadrons) with improved precision at center-of-mass (c.m.) energies from 3.645 to 3.871 GeV. We observe three resonances: $\mathcal R(3760)$, $\mathcal R(3780)$, and $\mathcal R(3810)$ with significances of $8.1σ$, $13.7σ$, and $8.8σ$, respectively. The $\mathcal R(3810)$ state is observed for the first time, while the $\mathcal R(3760)$ and $\mathcal R(3780)$ states are observed for the first time in the nOCH cross sections. Two sets of resonance parameters describe the energy-dependent line shape of the cross sections well. In set I [set II], the $\mathcal R(3810)$ state has mass $(3805.7 \pm 1.1 \pm 2.7)$ [$(3805.7 \pm 1.1 \pm 2.7)$] MeV/$c^2$, total width $(11.6 \pm 2.9 \pm 1.9)$ [$(11.5 \pm 2.8 \pm 1.9)$] MeV, and an electronic width multiplied by the nOCH decay branching fraction of $(10.9\pm 3.8\pm 2.5)$ [$(11.0\pm 3.4\pm 2.5)$] eV. In addition, we measure the branching fractions ${\mathcal B}[{\mathcal R}(3760)$$\rightarrow$nOCH$]=(25.2 \pm 16.1 \pm 30.4)\% [(6.4 \pm 4.8 \pm 7.7)\%]$ and ${\mathcal B}[\mathcal R(3780)$$\rightarrow$nOCH$]=(12.3 \pm 6.6 \pm 8.3)\% [(10.4 \pm 4.8 \pm 7.0)\%]$ for the first time. The $\mathcal R(3760)$ state can be interpreted as an open-charm (OC) molecular state, but containing a simple four-quark state component. The $\mathcal R(3810)$ state can be interpreted as a hadrocharmonium state.
Submitted 11 May, 2024; v1 submitted 20 July, 2023;
originally announced July 2023.
-
ZeroQuant-FP: A Leap Forward in LLMs Post-Training W4A8 Quantization Using Floating-Point Formats
Authors:
Xiaoxia Wu,
Zhewei Yao,
Yuxiong He
Abstract:
In the complex domain of large language models (LLMs), striking a balance between computational efficiency and maintaining model quality is a formidable challenge. Navigating the inherent limitations of uniform quantization, particularly when dealing with outliers, and motivated by the launch of NVIDIA's H100 hardware, this study delves into the viability of floating-point (FP) quantization, particularly focusing on FP8 and FP4, as a potential solution. Our comprehensive investigation reveals that for LLMs, FP8 activation consistently outshines its integer (INT8) equivalent, with the performance edge becoming more noticeable in models possessing parameters beyond one billion. For weight quantization, our findings indicate that FP4 exhibits comparable, if not superior, performance to INT4, simplifying deployment on FP-supported hardware like H100. To mitigate the overhead from precision alignment caused by the disparity between weights and activations, we propose two scaling constraints for weight quantization that negligibly impact the performance compared to the standard W4A8 model. We additionally enhance our quantization methods by integrating the Low Rank Compensation (LoRC) strategy, yielding improvements especially in smaller models. The results of our investigation emphasize the immense potential of FP quantization for LLMs, paving the way for high-efficiency deployment in resource-limited settings.
Submitted 20 July, 2023; v1 submitted 19 July, 2023;
originally announced July 2023.
-
Measurement of the branching fractions of the singly Cabibbo-suppressed decays $Λ_{c}^{+}\to pη$ and $Λ_{c}^{+}\to pω$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
X. C. Ai,
R. Aliberti,
A. Amoroso,
M. R. An,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
J. Bloms,
A. Bortone,
I. Boyko,
R. A. Briere
, et al. (608 additional authors not shown)
Abstract:
Based on 4.5 $\mbox{fb$^{-1}$}$ of $e^{+}e^{-}$ collision data collected with the BESIII detector at seven energy points between 4.600 and 4.699 GeV, the branching fractions for $Λ_{c}^{+}\to pη$ and $Λ_{c}^{+}\to pω$ were measured by means of the single-tag method. The branching fractions of $Λ_{c}^{+}\to pη$ and $Λ_{c}^{+}\to pω$ are determined to be $(1.57\pm0.11_{\rm {stat}}\pm0.04_{\rm{syst}})\times10^{-3}$ and $(1.11\pm0.20_{\rm{stat}}\pm0.07_{\rm{syst}})\times10^{-3}$, with statistical significances of greater than 10 $σ$ and 5.7 $σ$, respectively. These results are consistent with the previous measurements by BESIII, LHCb and Belle, and the result of $Λ_{c}^{+}\to pη$ is the most precise to date.
Submitted 17 October, 2023; v1 submitted 18 July, 2023;
originally announced July 2023.
-
Measurement of the Energy-Dependent Electromagnetic Form Factors of a Charmed Baryon
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
X. C. Ai,
R. Aliberti,
A. Amoroso,
M. R. An,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere,
A. Brueggemann
, et al. (598 additional authors not shown)
Abstract:
We study the process $e^{+}e^{-}\toΛ_{c}^{+}\barΛ_c^{-}$ at twelve center-of-mass energies from $4.6119$ to $4.9509~\mathrm{GeV}$ using data samples collected by the BESIII detector at the BEPCII collider. The Born cross sections and effective form factors ($|G_{\mathrm{eff}}|$) are determined with unprecedented precision after combining the single and double-tag methods based on the decay process $Λ_{c}^{+}\to pK^{-}π^{+}$. Flat cross sections around $4.63~\mathrm{GeV}$ are obtained and no indication of the resonant structure $Y(4630)$, as reported by Belle, is found. In addition, no oscillatory behavior is discerned in the $|G_{\mathrm{eff}}|$ energy-dependence of $Λ_{c}^{+}$, in contrast to what is seen for the proton and neutron cases. Analyzing the cross section together with the polar-angle distribution of the $Λ_{c}^{+}$ baryon at each energy point, the moduli of electric and magnetic form factors ($|G_{E}|$ and $|G_{M}|$) are extracted and separated. For the first time, the energy-dependence of the form factor ratio $|G_{E}/G_{M}|$ is observed, which can be well described by an oscillatory function.
Submitted 14 July, 2023;
originally announced July 2023.
-
VisKoP: Visual Knowledge oriented Programming for Interactive Knowledge Base Question Answering
Authors:
Zijun Yao,
Yuanyong Chen,
Xin Lv,
Shulin Cao,
Amy Xin,
Jifan Yu,
Hailong Jin,
Jianjun Xu,
Peng Zhang,
Lei Hou,
Juanzi Li
Abstract:
We present the Visual Knowledge oriented Programming platform (VisKoP), a knowledge base question answering (KBQA) system that integrates humans into the loop to edit and debug knowledge base (KB) queries. VisKoP not only provides a neural program induction module, which converts natural language questions into knowledge oriented program language (KoPL), but also maps KoPL programs into graphical elements. KoPL programs can be edited with simple graphical operators, such as dragging to add knowledge operators and slot filling to designate operator arguments. Moreover, VisKoP provides auto-completion for its knowledge base schema and users can easily debug the KoPL program by checking its intermediate results. To facilitate practical KBQA on a million-entity-level KB, we design a highly efficient KoPL execution engine for the back-end. Experimental results show that VisKoP is highly efficient and user interaction can fix a large portion of wrong KoPL programs to acquire the correct answer. The VisKoP online demo https://demoviskop.xlore.cn (Stable release of this paper) and https://viskop.xlore.cn (Beta release with new features), highly efficient KoPL engine https://pypi.org/project/kopl-engine, and screencast video https://youtu.be/zAbJtxFPTXo are now publicly available.
Submitted 6 July, 2023;
originally announced July 2023.
-
KoRC: Knowledge oriented Reading Comprehension Benchmark for Deep Text Understanding
Authors:
Zijun Yao,
Yantao Liu,
Xin Lv,
Shulin Cao,
Jifan Yu,
Lei Hou,
Juanzi Li
Abstract:
Deep text understanding, which requires the connections between a given document and prior knowledge beyond its text, has been highlighted by many benchmarks in recent years. However, these benchmarks have encountered two major limitations. On the one hand, most of them require human annotation of knowledge, which leads to limited knowledge coverage. On the other hand, they usually use choices or spans in the texts as the answers, which results in a narrow answer space. To overcome these limitations, we build a new challenging benchmark named KoRC in this paper. Compared with previous benchmarks, KoRC has two advantages, i.e., broad knowledge coverage and flexible answer format. Specifically, we utilize massive knowledge bases to guide annotators or large language models (LLMs) to construct knowledgeable questions. Moreover, we use labels in knowledge bases rather than spans or choices as the final answers. We test state-of-the-art models on KoRC and the experimental results show that the strongest baseline only achieves 68.3% and 30.0% F1 measure in the in-distribution and out-of-distribution test sets, respectively. These results indicate that deep text understanding is still an unsolved challenge. The benchmark dataset, leaderboard, and baseline methods are released at https://github.com/THU-KEG/KoRC.
Submitted 6 July, 2023;
originally announced July 2023.
-
Studies of the decay $D^+_s\to K^+K^- μ^+ ν_μ$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
X. C. Ai,
R. Aliberti,
A. Amoroso,
M. R. An,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
J. Bloms,
A. Bortone,
I. Boyko,
R. A. Briere
, et al. (598 additional authors not shown)
Abstract:
The $D^+_s\to K^+K^-μ^+ν_μ$ decay is studied based on 7.33 fb$^{-1}$ of $e^+e^-$ collision data collected with the BESIII detector at center-of-mass energies in the range from 4.128 to 4.226 GeV. The absolute branching fraction is measured as ${\mathcal B}(D^+_s\to φμ^+ν_μ) = (2.25\pm 0.09 \pm 0.07) \times10^{-2}$, the most precise measurement to date. Combining with the world average of ${\mathcal B}(D^+_s\to φe^+ν_e)$, the ratio of the branching fractions obtained is $\frac{{\mathcal B}(D^+_s\to φμ^+ν_μ)}{{\mathcal B}(D^+_s\to φe^+ν_e)} = 0.94\pm0.08$, in agreement with lepton universality. By performing a partial wave analysis, the hadronic form factor ratios at $q^{2}=0$ are extracted, finding $r_{V}=\frac{V(0)}{A_{1}(0)}=1.58\pm0.17\pm0.02$ and $r_{2}=\frac{A_{2}(0)}{A_{1}(0)}=0.71\pm0.14\pm0.02$, where the first uncertainties are statistical and the second are systematic. No significant $S$-wave contribution from $f_0(980)\to K^+K^-$ is found. The upper limit $\mathcal{B}(D_s^+\to f_0(980)μ^{+}ν_μ) \cdot{\mathcal B}(f_0(980)\to K^+K^-) < 5.45 \times 10^{-4}$ is set at 90\% confidence level.
Submitted 18 July, 2023; v1 submitted 6 July, 2023;
originally announced July 2023.
-
Measurement of $e^+e^-\to pK^-\barΛ+c.c.$ cross sections between 4.009 GeV and 4.951 GeV
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
X. C. Ai,
R. Aliberti,
A. Amoroso,
M. R. An,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
J. Bloms,
A. Bortone,
I. Boyko,
R. A. Briere
, et al. (599 additional authors not shown)
Abstract:
Using $e^+e^-$ collision datasets corresponding to a total integrated luminosity of 21.7 fb$^{-1}$ collected with the BESIII detector at the BEPCII collider at center-of-mass energies ranging from 4.009 GeV to 4.951 GeV, the energy-dependent cross sections of $e^+e^-\to pK^-\barΛ+c.c.$ are measured for the first time. By fitting these energy-dependent cross sections, we search for the excited $ψ$ states $ψ(4160)$ and $ψ(4415)$, and the vector charmonium-like states $ψ(4230)$, $ψ(4360)$, and $ψ(4660)$. No evidence for these is observed and the upper limits on the branching fractions of these states decaying into $pK^-\bar Λ+c.c.$ are set at the 90\% confidence level.
Submitted 5 July, 2023;
originally announced July 2023.
-
Search for the semi-muonic charmonium decay $J/ψ\to D^{-}μ^{+}ν_μ+c.c.$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
X. C. Ai,
R. Aliberti,
A. Amoroso,
M. R. An,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere,
A. Brueggemann
, et al. (603 additional authors not shown)
Abstract:
Using $(10087\pm44)\times10^{6}$ $J/ψ$ events collected with the BESIII detector at the BEPCII $e^+e^-$ storage ring at the center-of-mass energy of $\sqrt{s}=3.097~\rm{GeV}$, we present a search for the rare semi-muonic charmonium decay $J/ψ\to D^{-}μ^{+}ν_μ+c.c.$. Since no significant signal is observed, we set an upper limit on the branching fraction of $\mathcal{B}(J/ψ\to D^{-}μ^{+}ν_μ+c.c.)<5.6\times10^{-7}$ at the $90\%$ confidence level. This is the first search for the weak decay of charmonium with a muon in the final state.
Submitted 12 December, 2023; v1 submitted 5 July, 2023;
originally announced July 2023.
-
A PC-Kriging-HDMR integrated with an adaptive sequential sampling strategy for high-dimensional approximate modeling
Authors:
Yili Zhang,
Hanyan Huang,
Mei Xiong,
Zengquan Yao
Abstract:
High-dimensional complex multi-parameter problems are prevalent in engineering, exceeding the capabilities of traditional surrogate models designed for low/medium-dimensional problems. These models face the curse of dimensionality, resulting in decreased modeling accuracy as the design parameter space expands. Furthermore, the lack of a parameter decoupling mechanism hinders the identification of couplings between design variables, particularly in highly nonlinear cases. To address these challenges and enhance prediction accuracy while reducing sample demand, this paper proposes a PC-Kriging-HDMR approximate modeling method within the framework of Cut-HDMR. The method leverages the precision of PC-Kriging and optimizes test point placement through a multi-stage adaptive sequential sampling strategy. This strategy encompasses a first-stage adaptive proportional sampling criterion and a second-stage central-based maximum entropy criterion. Numerical tests and a practical application involving a cantilever beam demonstrate the advantages of the proposed method. Key findings include: (1) The performance of traditional single-surrogate models, such as Kriging, significantly deteriorates in high-dimensional nonlinear problems compared to combined surrogate models under the Cut-HDMR framework (e.g., Kriging-HDMR, PCE-HDMR, SVR-HDMR, MLS-HDMR, and PC-Kriging-HDMR); (2) The number of samples required for PC-Kriging-HDMR modeling increases polynomially rather than exponentially as the parameter space expands, resulting in substantial computational cost reduction; (3) Among existing Cut-HDMR methods, no single approach outperforms the others in all aspects. However, PC-Kriging-HDMR exhibits improved modeling accuracy and efficiency within the desired improvement range compared to PCE-HDMR and Kriging-HDMR, demonstrating robustness.
Submitted 3 July, 2023;
originally announced July 2023.
-
UMASS_BioNLP at MEDIQA-Chat 2023: Can LLMs generate high-quality synthetic note-oriented doctor-patient conversations?
Authors:
Junda Wang,
Zonghai Yao,
Avijit Mitra,
Samuel Osebe,
Zhichao Yang,
Hong Yu
Abstract:
This paper presents the UMASS_BioNLP team's participation in the MEDIQA-Chat 2023 shared task for Task-A and Task-C. We focus especially on Task-C and propose a novel LLM cooperation system, named the doctor-patient loop, to generate high-quality conversation datasets. The experimental results demonstrate that our approaches yield reasonable performance as evaluated by automatic metrics such as ROUGE, medical concept recall, BLEU, and Self-BLEU. Furthermore, we conducted a comparative analysis between our proposed method and ChatGPT and GPT-4. This analysis also investigates the potential of utilizing cooperating LLMs to generate high-quality datasets.
Submitted 29 June, 2023;
originally announced June 2023.