-
OmniCorpus: A Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with Text
Authors:
Qingyun Li,
Zhe Chen,
Weiyun Wang,
Wenhai Wang,
Shenglong Ye,
Zhenjiang Jin,
Guanzhou Chen,
Yinan He,
Zhangwei Gao,
Erfei Cui,
Jiashuo Yu,
Hao Tian,
Jiasheng Zhou,
Chao Xu,
Bin Wang,
Xingjian Wei,
Wei Li,
Wenjian Zhang,
Bo Zhang,
Pinlong Cai,
Licheng Wen,
Xiangchao Yan,
Zhenxiang Li,
Pei Chu,
Yi Wang
, et al. (15 additional authors not shown)
Abstract:
Image-text interleaved data, consisting of multiple images and texts arranged in a natural document format, aligns with the presentation paradigm of internet data and closely resembles human reading habits. Recent studies have shown that such data aids multimodal in-context learning and maintains the capabilities of large language models during multimodal fine-tuning. However, the limited scale an…
▽ More
Image-text interleaved data, consisting of multiple images and texts arranged in a natural document format, aligns with the presentation paradigm of internet data and closely resembles human reading habits. Recent studies have shown that such data aids multimodal in-context learning and maintains the capabilities of large language models during multimodal fine-tuning. However, the limited scale and diversity of current image-text interleaved data restrict the development of multimodal large language models. In this paper, we introduce OmniCorpus, a 10 billion-scale image-text interleaved dataset. Using an efficient data engine, we filter and extract large-scale high-quality documents, which contain 8.6 billion images and 1,696 billion text tokens. Compared to counterparts (e.g., MMC4, OBELICS), our dataset 1) has 15 times larger scales while maintaining good data quality; 2) features more diverse sources, including both English and non-English websites as well as video-centric websites; 3) is more flexible, easily degradable from an image-text interleaved format to pure text corpus and image-text pairs. Through comprehensive analysis and experiments, we validate the quality, usability, and effectiveness of the proposed dataset. We hope this could provide a solid data foundation for future multimodal model research. Code and data are released at https://github.com/OpenGVLab/OmniCorpus.
△ Less
Submitted 12 July, 2024; v1 submitted 12 June, 2024;
originally announced June 2024.
-
A Metric-based Principal Curve Approach for Learning One-dimensional Manifold
Authors:
Elvis Han Cui,
Sisi Shao
Abstract:
Principal curve is a well-known statistical method oriented in manifold learning using concepts from differential geometry. In this paper, we propose a novel metric-based principal curve (MPC) method that learns one-dimensional manifold of spatial data. Synthetic datasets Real applications using MNIST dataset show that our method can learn the one-dimensional manifold well in terms of the shape.
Principal curve is a well-known statistical method oriented in manifold learning using concepts from differential geometry. In this paper, we propose a novel metric-based principal curve (MPC) method that learns one-dimensional manifold of spatial data. Synthetic datasets Real applications using MNIST dataset show that our method can learn the one-dimensional manifold well in terms of the shape.
△ Less
Submitted 20 May, 2024;
originally announced May 2024.
-
How Far Are We to GPT-4V? Closing the Gap to Commercial Multimodal Models with Open-Source Suites
Authors:
Zhe Chen,
Weiyun Wang,
Hao Tian,
Shenglong Ye,
Zhangwei Gao,
Erfei Cui,
Wenwen Tong,
Kongzhi Hu,
Jiapeng Luo,
Zheng Ma,
Ji Ma,
Jiaqi Wang,
Xiaoyi Dong,
Hang Yan,
Hewei Guo,
Conghui He,
Botian Shi,
Zhenjiang Jin,
Chao Xu,
Bin Wang,
Xingjian Wei,
Wei Li,
Wenjian Zhang,
Bo Zhang,
Pinlong Cai
, et al. (10 additional authors not shown)
Abstract:
In this report, we introduce InternVL 1.5, an open-source multimodal large language model (MLLM) to bridge the capability gap between open-source and proprietary commercial models in multimodal understanding. We introduce three simple improvements: (1) Strong Vision Encoder: we explored a continuous learning strategy for the large-scale vision foundation model -- InternViT-6B, boosting its visual…
▽ More
In this report, we introduce InternVL 1.5, an open-source multimodal large language model (MLLM) to bridge the capability gap between open-source and proprietary commercial models in multimodal understanding. We introduce three simple improvements: (1) Strong Vision Encoder: we explored a continuous learning strategy for the large-scale vision foundation model -- InternViT-6B, boosting its visual understanding capabilities, and making it can be transferred and reused in different LLMs. (2) Dynamic High-Resolution: we divide images into tiles ranging from 1 to 40 of 448$\times$448 pixels according to the aspect ratio and resolution of the input images, which supports up to 4K resolution input. (3) High-Quality Bilingual Dataset: we carefully collected a high-quality bilingual dataset that covers common scenes, document images, and annotated them with English and Chinese question-answer pairs, significantly enhancing performance in OCR- and Chinese-related tasks. We evaluate InternVL 1.5 through a series of benchmarks and comparative studies. Compared to both open-source and proprietary models, InternVL 1.5 shows competitive performance, achieving state-of-the-art results in 8 of 18 benchmarks. Code has been released at https://github.com/OpenGVLab/InternVL.
△ Less
Submitted 29 April, 2024; v1 submitted 25 April, 2024;
originally announced April 2024.
-
PDXpower: A Power Analysis Tool for Experimental Design in Pre-clinical Xenograft Studies for Uncensored and Censored Outcomes
Authors:
Shanpeng Li,
Donatello Telesca,
Harley I. Kornblum,
David Nathanson,
Frank Pajonk,
Elvis Han Cui,
Joycelynne Palmer,
Gang Li
Abstract:
In cancer research, leveraging patient-derived xenografts (PDXs) in pre-clinical experiments is a crucial approach for assessing innovative therapeutic strategies. Addressing the inherent variability in treatment response among and within individual PDX lines is essential. However, the current literature lacks a user-friendly statistical power analysis tool capable of concurrently determining the…
▽ More
In cancer research, leveraging patient-derived xenografts (PDXs) in pre-clinical experiments is a crucial approach for assessing innovative therapeutic strategies. Addressing the inherent variability in treatment response among and within individual PDX lines is essential. However, the current literature lacks a user-friendly statistical power analysis tool capable of concurrently determining the required number of PDX lines and animals per line per treatment group in this context. In this paper, we present a simulation-based R package for sample size determination, named `\textbf{PDXpower}', which is publicly available at The Comprehensive R Archive Network \url{https://CRAN.R-project.org/package=PDXpower}. The package is designed to estimate the necessary number of both PDX lines and animals per line per treatment group for the design of a PDX experiment, whether for an uncensored outcome, or a censored time-to-event outcome. Our sample size considerations rely on two widely used analytical frameworks: the mixed effects ANOVA model for uncensored outcomes and Cox's frailty model for censored data outcomes, which effectively account for both inter-PDX variability and intra-PDX correlation in treatment response. Step-by-step illustrations for utilizing the developed package are provided, catering to scenarios with or without preliminary data.
△ Less
Submitted 13 April, 2024;
originally announced April 2024.
-
Teaching MLP More Graph Information: A Three-stage Multitask Knowledge Distillation Framework
Authors:
Junxian Li,
Bin Shi,
Erfei Cui,
Hua Wei,
Qinghua Zheng
Abstract:
We study the challenging problem for inference tasks on large-scale graph datasets of Graph Neural Networks: huge time and memory consumption, and try to overcome it by reducing reliance on graph structure. Even though distilling graph knowledge to student MLP is an excellent idea, it faces two major problems of positional information loss and low generalization. To solve the problems, we propose…
▽ More
We study the challenging problem for inference tasks on large-scale graph datasets of Graph Neural Networks: huge time and memory consumption, and try to overcome it by reducing reliance on graph structure. Even though distilling graph knowledge to student MLP is an excellent idea, it faces two major problems of positional information loss and low generalization. To solve the problems, we propose a new three-stage multitask distillation framework. In detail, we use Positional Encoding to capture positional information. Also, we introduce Neural Heat Kernels responsible for graph data processing in GNN and utilize hidden layer outputs matching for better performance of student MLP's hidden layers. To the best of our knowledge, it is the first work to include hidden layer distillation for student MLP on graphs and to combine graph Positional Encoding with MLP. We test its performance and robustness with several settings and draw the conclusion that our work can outperform well with good stability.
△ Less
Submitted 1 March, 2024;
originally announced March 2024.
-
ControlLLM: Augment Language Models with Tools by Searching on Graphs
Authors:
Zhaoyang Liu,
Zeqiang Lai,
Zhangwei Gao,
Erfei Cui,
Ziheng Li,
Xizhou Zhu,
Lewei Lu,
Qifeng Chen,
Yu Qiao,
Jifeng Dai,
Wenhai Wang
Abstract:
We present ControlLLM, a novel framework that enables large language models (LLMs) to utilize multi-modal tools for solving complex real-world tasks. Despite the remarkable performance of LLMs, they still struggle with tool invocation due to ambiguous user prompts, inaccurate tool selection and parameterization, and inefficient tool scheduling. To overcome these challenges, our framework comprises…
▽ More
We present ControlLLM, a novel framework that enables large language models (LLMs) to utilize multi-modal tools for solving complex real-world tasks. Despite the remarkable performance of LLMs, they still struggle with tool invocation due to ambiguous user prompts, inaccurate tool selection and parameterization, and inefficient tool scheduling. To overcome these challenges, our framework comprises three key components: (1) a \textit{task decomposer} that breaks down a complex task into clear subtasks with well-defined inputs and outputs; (2) a \textit{Thoughts-on-Graph (ToG) paradigm} that searches the optimal solution path on a pre-built tool graph, which specifies the parameter and dependency relations among different tools; and (3) an \textit{execution engine with a rich toolbox} that interprets the solution path and runs the tools efficiently on different computational devices. We evaluate our framework on diverse tasks involving image, audio, and video processing, demonstrating its superior accuracy, efficiency, and versatility compared to existing methods. The code is at https://github.com/OpenGVLab/ControlLLM.
△ Less
Submitted 18 December, 2023; v1 submitted 26 October, 2023;
originally announced October 2023.
-
Strong decays of the $φ(2170)$ as a fully-strange tetraquark state
Authors:
Yi-Wei Jiang,
Wei-Han Tan,
Hua-Xing Chen,
Er-Liang Cui
Abstract:
We study strong decays of the $φ(2170)$, along with its possible partner $X(2436)$, as two fully-strange tetraquark states of $J^{PC} = 1^{--}$. We consider seven decay channels: $φη$, $φη^\prime$, $φf_0(980)$, $φf_1(1420)$, $h_1(1415) η$, $h_1(1415) η^\prime$, and $h_1(1415) f_1(1420)$. Some of these channels are kinematically possible, and we calculate their relative branching ratios through the…
▽ More
We study strong decays of the $φ(2170)$, along with its possible partner $X(2436)$, as two fully-strange tetraquark states of $J^{PC} = 1^{--}$. We consider seven decay channels: $φη$, $φη^\prime$, $φf_0(980)$, $φf_1(1420)$, $h_1(1415) η$, $h_1(1415) η^\prime$, and $h_1(1415) f_1(1420)$. Some of these channels are kinematically possible, and we calculate their relative branching ratios through the Fierz rearrangement. Future experimental measurements on these ratios can be useful in determining the nature of the $φ(2170)$ and $X(2436)$. The $φ(2170)$ has been observed in the $φf_0(980)$, $φη$, and $φη^\prime$ channels, and we propose to further examine it in the $h_1(1415) η$ channel. Evidences of the $X(2436)$ have been observed in the $φf_0(980)$ channel, and we propose to verify whether this structure exists or not in the $φη$, $φη^\prime$, $h_1(1415) η$, and $h_1(1415) η^\prime$ channels.
△ Less
Submitted 30 October, 2023; v1 submitted 25 October, 2023;
originally announced October 2023.
-
Trajectory-aware Principal Manifold Framework for Data Augmentation and Image Generation
Authors:
Elvis Han Cui,
Bingbin Li,
Yanan Li,
Weng Kee Wong,
Donghui Wang
Abstract:
Data augmentation for deep learning benefits model training, image transformation, medical imaging analysis and many other fields. Many existing methods generate new samples from a parametric distribution, like the Gaussian, with little attention to generate samples along the data manifold in either the input or feature space. In this paper, we verify that there are theoretical and practical advan…
▽ More
Data augmentation for deep learning benefits model training, image transformation, medical imaging analysis and many other fields. Many existing methods generate new samples from a parametric distribution, like the Gaussian, with little attention to generate samples along the data manifold in either the input or feature space. In this paper, we verify that there are theoretical and practical advantages of using the principal manifold hidden in the feature space than the Gaussian distribution. We then propose a novel trajectory-aware principal manifold framework to restore the manifold backbone and generate samples along a specific trajectory. On top of the autoencoder architecture, we further introduce an intrinsic dimension regularization term to make the manifold more compact and enable few-shot image generation. Experimental results show that the novel framework is able to extract more compact manifold representation, improve classification accuracy and generate smooth transformation among few samples.
△ Less
Submitted 30 July, 2023;
originally announced October 2023.
-
Metaheuristic Algorithms in Artificial Intelligence with Applications to Bioinformatics, Biostatistics, Ecology and, the Manufacturing Industries
Authors:
Elvis Han Cui,
Zizhao Zhang,
Culsome Junwen Chen,
Weng Kee Wong
Abstract:
Nature-inspired metaheuristic algorithms are important components of artificial intelligence, and are increasingly used across disciplines to tackle various types of challenging optimization problems. We apply a newly proposed nature-inspired metaheuristic algorithm called competitive swarm optimizer with mutated agents (CSO-MA) and demonstrate its flexibility and out-performance relative to its c…
▽ More
Nature-inspired metaheuristic algorithms are important components of artificial intelligence, and are increasingly used across disciplines to tackle various types of challenging optimization problems. We apply a newly proposed nature-inspired metaheuristic algorithm called competitive swarm optimizer with mutated agents (CSO-MA) and demonstrate its flexibility and out-performance relative to its competitors in a variety of optimization problems in the statistical sciences. In particular, we show the algorithm is efficient and can incorporate various cost structures or multiple user-specified nonlinear constraints. Our applications include (i) finding maximum likelihood estimates of parameters in a single cell generalized trend model to study pseudotime in bioinformatics, (ii) estimating parameters in a commonly used Rasch model in education research, (iii) finding M-estimates for a Cox regression in a Markov renewal model and (iv) matrix completion to impute missing values in a two compartment model. In addition we discuss applications to (v) select variables optimally in an ecology problem and (vi) design a car refueling experiment for the auto industry using a logistic model with multiple interacting factors.
△ Less
Submitted 16 October, 2023; v1 submitted 8 August, 2023;
originally announced August 2023.
-
Continuous-time multivariate analysis
Authors:
Biplab Paul,
Philip T. Reiss,
Erjia Cui,
Noemi Foà
Abstract:
The starting point for much of multivariate analysis (MVA) is an $n\times p$ data matrix whose $n$ rows represent observations and whose $p$ columns represent variables. Some multivariate data sets, however, may be best conceptualized not as $n$ discrete $p$-variate observations, but as $p$ curves or functions defined on a common time interval. Here we introduce a framework for extending technique…
▽ More
The starting point for much of multivariate analysis (MVA) is an $n\times p$ data matrix whose $n$ rows represent observations and whose $p$ columns represent variables. Some multivariate data sets, however, may be best conceptualized not as $n$ discrete $p$-variate observations, but as $p$ curves or functions defined on a common time interval. Here we introduce a framework for extending techniques of multivariate analysis to such settings. The proposed continuous-time multivariate analysis (CTMVA) framework rests on the assumption that the curves can be represented as linear combinations of basis functions such as $B$-splines, as in the Ramsay-Silverman representation of functional data; but whereas functional data analysis extends MVA to the case of observations that are curves rather than vectors -- heuristically, $n\times p$ data with $p$ infinite -- we are instead concerned with what happens when $n$ is infinite. We present continuous-time extensions of the classical MVA methods of covariance and correlation estimation, principal component analysis, Fisher's linear discriminant analysis, and $k$-means clustering. We show that CTMVA can improve on the performance of classical MVA, in particular for correlation estimation and clustering, and can be applied in some settings where classical MVA cannot, including variables observed at disparate time points. CTMVA is illustrated with a novel perspective on a well-known Canadian weather data set, and with applications to data sets involving international development, brain signals, and air quality. The proposed methods are implemented in the publicly available R package \texttt{ctmva}.
△ Less
Submitted 12 June, 2024; v1 submitted 18 July, 2023;
originally announced July 2023.
-
Scalable regression calibration approaches to correcting measurement error in multi-level generalized functional linear regression models with heteroscedastic measurement errors
Authors:
Yuanyuan Luan,
Roger S. Zoh,
Erjia Cui,
Xue Lan,
Sneha Jadhav,
Carmen D. Tekwe
Abstract:
Wearable devices permit the continuous monitoring of biological processes, such as blood glucose metabolism, and behavior, such as sleep quality and physical activity. The continuous monitoring often occurs in epochs of 60 seconds over multiple days, resulting in high dimensional longitudinal curves that are best described and analyzed as functional data. From this perspective, the functional data…
▽ More
Wearable devices permit the continuous monitoring of biological processes, such as blood glucose metabolism, and behavior, such as sleep quality and physical activity. The continuous monitoring often occurs in epochs of 60 seconds over multiple days, resulting in high dimensional longitudinal curves that are best described and analyzed as functional data. From this perspective, the functional data are smooth, latent functions obtained at discrete time intervals and prone to homoscedastic white noise. However, the assumption of homoscedastic errors might not be appropriate in this setting because the devices collect the data serially. While researchers have previously addressed measurement error in scalar covariates prone to errors, less work has been done on correcting measurement error in high dimensional longitudinal curves prone to heteroscedastic errors. We present two new methods for correcting measurement error in longitudinal functional curves prone to complex measurement error structures in multi-level generalized functional linear regression models. These methods are based on two-stage scalable regression calibration. We assume that the distribution of the scalar responses and the surrogate measures prone to heteroscedastic errors both belong in the exponential family and that the measurement errors follow Gaussian processes. In simulations and sensitivity analyses, we established some finite sample properties of these methods. In our simulations, both regression calibration methods for correcting measurement error performed better than estimators based on averaging the longitudinal functional data and using observations from a single day. We also applied the methods to assess the relationship between physical activity and type 2 diabetes in community dwelling adults in the United States who participated in the National Health and Nutrition Examination Survey.
△ Less
Submitted 20 April, 2024; v1 submitted 21 May, 2023;
originally announced May 2023.
-
A Roadmap to Asymptotic Properties with Applications to COVID-19 Data
Authors:
Elvis Han Cui
Abstract:
Asymptotic properties of statistical estimators play a significant role both in practice and in theory. However, many asymptotic results in statistics rely heavily on the independent and identically distributed (iid) assumption, which is not realistic when we have fixed designs. In this article, we build a roadmap of general procedures for deriving asymptotic properties under fixed designs and the…
▽ More
Asymptotic properties of statistical estimators play a significant role both in practice and in theory. However, many asymptotic results in statistics rely heavily on the independent and identically distributed (iid) assumption, which is not realistic when we have fixed designs. In this article, we build a roadmap of general procedures for deriving asymptotic properties under fixed designs and the observations need not to be iid. We further provide their applications in many statistical applications. Finally, we apply our results to Poisson regression using a COVID-19 dataset as an illustration to demonstrate the power of these results in practice.
△ Less
Submitted 6 October, 2022;
originally announced November 2022.
-
A Tutorial on Statistical Models Based on Counting Processes
Authors:
Elvis Han Cui
Abstract:
Since the famous paper written by Kaplan and Meier in 1958, survival analysis has become one of the most important fields in statistics. Nowadays it is one of the most important statistical tools in analyzing epidemiological and clinical data including COVID-19 pandemic. This article reviews some of the most celebrated and important results and methods, including consistency, asymptotic normality,…
▽ More
Since the famous paper written by Kaplan and Meier in 1958, survival analysis has become one of the most important fields in statistics. Nowadays it is one of the most important statistical tools in analyzing epidemiological and clinical data including COVID-19 pandemic. This article reviews some of the most celebrated and important results and methods, including consistency, asymptotic normality, bias and variance estimation, in survival analysis and the treatment is parallel to the monograph Statistical Models Based on Counting Processes. Other models and results such as semi-Markov models and the Turnbull's estimator that jump out of the classical counting process martingale framework are also discussed.
△ Less
Submitted 23 October, 2022; v1 submitted 30 September, 2022;
originally announced October 2022.
-
D-optimal Approximate Design for Binary Regression and Quantal Response in Toxicology Studies
Authors:
Elvis Han Cui
Abstract:
We provide a systematic treatment of $D$-optimal design for binary regression and quantal response models in toxicology studies. For the two-parameter case, we provide an analytical equation (WC equation) for computing the $D$-optimal design quickly and when analytical solution is not available, we apply particle swarm optimization to solve for the $D$-optimal design. Examples with various link fu…
▽ More
We provide a systematic treatment of $D$-optimal design for binary regression and quantal response models in toxicology studies. For the two-parameter case, we provide an analytical equation (WC equation) for computing the $D$-optimal design quickly and when analytical solution is not available, we apply particle swarm optimization to solve for the $D$-optimal design. Examples with various link functions are given as well as the sensitivity functions. We extend the two-parameter case to three-parameter case by providing a neat formula for the determinant of the information matrix. We also suggest practitioners to work with the neat formula to derive optimal designs for three-parameter binary regression models.
△ Less
Submitted 27 September, 2022;
originally announced September 2022.
-
A case study of glucose levels during sleep using fast function on scalar regression inference
Authors:
Renat Sergazinov,
Andrew Leroux,
Erjia Cui,
Ciprian Crainiceanu,
R. Nisha Aurora,
Naresh M. Punjabi,
Irina Gaynanova
Abstract:
Continuous glucose monitors (CGMs) are increasingly used to measure blood glucose levels and provide information about the treatment and management of diabetes. Our motivating study contains CGM data during sleep for 174 study participants with type II diabetes mellitus measured at a 5-minute frequency for an average of 10 nights. We aim to quantify the effects of diabetes medications and sleep ap…
▽ More
Continuous glucose monitors (CGMs) are increasingly used to measure blood glucose levels and provide information about the treatment and management of diabetes. Our motivating study contains CGM data during sleep for 174 study participants with type II diabetes mellitus measured at a 5-minute frequency for an average of 10 nights. We aim to quantify the effects of diabetes medications and sleep apnea severity on glucose levels. Statistically, this is an inference question about the association between scalar covariates and functional responses. However, many characteristics of the data make analyses difficult, including (1) non-stationary within-day patterns; (2) substantial between-day heterogeneity, non-Gaussianity, and outliers; 3) large dimensionality due to the number of study participants, sleep periods, and time points. We evaluate and compare two methods: fast univariate inference (FUI) and functional additive mixed models (FAMM). We introduce a new approach for calculating p-values for testing a global null effect of covariates using FUI, and provide practical guidelines for speeding up FAMM computations, making it feasible for our data. While FUI and FAMM are philosophically different, they lead to similar point estimators in our study. In contrast to FAMM, FUI is fast, accounts for within-day correlations, and enables the construction of joint confidence intervals. Our analyses reveal that: (1) biguanide medication and sleep apnea severity significantly affect glucose trajectories during sleep, and (2) the estimated effects are time-invariant.
△ Less
Submitted 17 May, 2022;
originally announced May 2022.
-
Identifying the $Ξ_b(6100)$ as the $P$-wave bottom baryon of $J^P=3/2^-$
Authors:
Hui-Min Yang,
Hua-Xing Chen,
Er-Liang Cui,
Qiang Mao
Abstract:
We study the $Ξ_b(6100)$ using the methods of QCD sum rules and light-cone sum rules within the framework of heavy quark effective theory. Our results suggest that the $Ξ_b(6100)$ can be well interpreted as the $P$-wave bottom baryon of $J^P=3/2^-$, belonging to the $SU(3)$ flavor $\mathbf{\bar 3}_F$ representation. It has a partner state of $J^P=1/2^-$, labelled as $Ξ_b(1/2^-)$, whose mass and wi…
▽ More
We study the $Ξ_b(6100)$ using the methods of QCD sum rules and light-cone sum rules within the framework of heavy quark effective theory. Our results suggest that the $Ξ_b(6100)$ can be well interpreted as the $P$-wave bottom baryon of $J^P=3/2^-$, belonging to the $SU(3)$ flavor $\mathbf{\bar 3}_F$ representation. It has a partner state of $J^P=1/2^-$, labelled as $Ξ_b(1/2^-)$, whose mass and width are predicted to be $m_{Ξ_b(1/2^-)}=6.08^{+0.13}_{-0.11}$~GeV and $Γ_{Ξ_b(1/2^-)}=4^{+29}_{-~4}$~MeV, with the mass splitting $ΔM=m_{Ξ_b(6100)}-m_{Ξ_b(1/2^-)}=9\pm3$~MeV. We propose to search for it in the $Ξ_c({1/2}^-)\to Ξ_b^{\prime}π$ decay channel. Our results also suggest that the $Λ_b(5912)$ and $Λ_b(5920)$ are their partner states with $J^P=1/2^-$ and $3/2^-$ respectively, and moreover, the $Λ_c(2595)$, $Λ_c(2625)$, $Ξ_c(2790)$, and $Ξ_c(2815)$ are their charmed partner states.
△ Less
Submitted 12 August, 2022; v1 submitted 15 May, 2022;
originally announced May 2022.
-
Dual Path Structural Contrastive Embeddings for Learning Novel Objects
Authors:
Bingbin Li,
Elvis Han Cui,
Yanan Li,
Donghui Wang,
Weng Kee Wong
Abstract:
Learning novel classes from a very few labeled samples has attracted increasing attention in machine learning areas. Recent research on either meta-learning based or transfer-learning based paradigm demonstrates that gaining information on a good feature space can be an effective solution to achieve favorable performance on few-shot tasks. In this paper, we propose a simple but effective paradigm…
▽ More
Learning novel classes from a very few labeled samples has attracted increasing attention in machine learning areas. Recent research on either meta-learning based or transfer-learning based paradigm demonstrates that gaining information on a good feature space can be an effective solution to achieve favorable performance on few-shot tasks. In this paper, we propose a simple but effective paradigm that decouples the tasks of learning feature representations and classifiers and only learns the feature embedding architecture from base classes via the typical transfer-learning training strategy. To maintain both the generalization ability across base and novel classes and discrimination ability within each class, we propose a dual path feature learning scheme that effectively combines structural similarity with contrastive feature construction. In this way, both inner-class alignment and inter-class uniformity can be well balanced, and result in improved performance. Experiments on three popular benchmarks show that when incorporated with a simple prototype based classifier, our method can still achieve promising results for both standard and generalized few-shot problems in either an inductive or transductive inference setting.
△ Less
Submitted 4 January, 2022; v1 submitted 22 December, 2021;
originally announced December 2021.
-
GEM: A General Evaluation Benchmark for Multimodal Tasks
Authors:
Lin Su,
Nan Duan,
Edward Cui,
Lei Ji,
Chenfei Wu,
Huaishao Luo,
Yongfei Liu,
Ming Zhong,
Taroon Bharti,
Arun Sacheti
Abstract:
In this paper, we present GEM as a General Evaluation benchmark for Multimodal tasks. Different from existing datasets such as GLUE, SuperGLUE, XGLUE and XTREME that mainly focus on natural language tasks, GEM is a large-scale vision-language benchmark, which consists of GEM-I for image-language tasks and GEM-V for video-language tasks. Comparing with existing multimodal datasets such as MSCOCO an…
▽ More
In this paper, we present GEM as a General Evaluation benchmark for Multimodal tasks. Different from existing datasets such as GLUE, SuperGLUE, XGLUE and XTREME that mainly focus on natural language tasks, GEM is a large-scale vision-language benchmark, which consists of GEM-I for image-language tasks and GEM-V for video-language tasks. Comparing with existing multimodal datasets such as MSCOCO and Flicker30K for image-language tasks, YouCook2 and MSR-VTT for video-language tasks, GEM is not only the largest vision-language dataset covering image-language tasks and video-language tasks at the same time, but also labeled in multiple languages. We also provide two baseline models for this benchmark. We will release the dataset, code and baseline models, aiming to advance the development of multilingual multimodal research.
△ Less
Submitted 17 June, 2021;
originally announced June 2021.
-
Particle swarm optimization in constrained maximum likelihood estimation a case study
Authors:
Elvis Cui,
Dongyuan Song,
Weng Kee Wong
Abstract:
The aim of paper is to apply two types of particle swarm optimization, global best andlocal best PSO to a constrained maximum likelihood estimation problem in pseudotime anal-ysis, a sub-field in bioinformatics. The results have shown that particle swarm optimizationis extremely useful and efficient when the optimization problem is non-differentiable and non-convex so that analytical solution can…
▽ More
The aim of paper is to apply two types of particle swarm optimization, global best andlocal best PSO to a constrained maximum likelihood estimation problem in pseudotime anal-ysis, a sub-field in bioinformatics. The results have shown that particle swarm optimizationis extremely useful and efficient when the optimization problem is non-differentiable and non-convex so that analytical solution can not be derived and gradient-based methods can not beapplied.
△ Less
Submitted 9 April, 2021;
originally announced April 2021.
-
Light tetraquark states with the exotic quantum number $J^{PC} = 3^{-+}$
Authors:
Niu Su,
Rui-Rui Dong,
Hua-Xing Chen,
Wei Chen,
Er-Liang Cui
Abstract:
We apply the method of QCD sum rules to study the $s q \bar s \bar q$ tetraquark states with the exotic quantum number $J^{PC} = 3^{-+}$, and extract mass of the lowest-lying state to be $2.33^{+0.19}_{-0.16}$ GeV. To construct the relevant tetraquark currents we need to explicitly add the covariant derivative operator. Our systematical analysis on their relevant interpolating currents indicates t…
▽ More
We apply the method of QCD sum rules to study the $s q \bar s \bar q$ tetraquark states with the exotic quantum number $J^{PC} = 3^{-+}$, and extract mass of the lowest-lying state to be $2.33^{+0.19}_{-0.16}$ GeV. To construct the relevant tetraquark currents we need to explicitly add the covariant derivative operator. Our systematical analysis on their relevant interpolating currents indicates that: a) this state well decays into the $P$-wave $��φ/ωφ$ channel but not into the $ρf_2(1525)/ωf_2(1525)/φf_2(1270)$ channels, and b) it well decays into the $K^*(892) \bar K_2^*(1430)$ channel but not into the $P$-wave $K^*(892) \bar K^*(892)$ channel.
△ Less
Submitted 15 February, 2021; v1 submitted 1 October, 2020;
originally announced October 2020.
-
M3P: Learning Universal Representations via Multitask Multilingual Multimodal Pre-training
Authors:
Minheng Ni,
Haoyang Huang,
Lin Su,
Edward Cui,
Taroon Bharti,
Lijuan Wang,
Jianfeng Gao,
Dongdong Zhang,
Nan Duan
Abstract:
We present M3P, a Multitask Multilingual Multimodal Pre-trained model that combines multilingual pre-training and multimodal pre-training into a unified framework via multitask pre-training. Our goal is to learn universal representations that can map objects occurred in different modalities or texts expressed in different languages into a common semantic space. In addition, to explicitly encourage…
▽ More
We present M3P, a Multitask Multilingual Multimodal Pre-trained model that combines multilingual pre-training and multimodal pre-training into a unified framework via multitask pre-training. Our goal is to learn universal representations that can map objects occurred in different modalities or texts expressed in different languages into a common semantic space. In addition, to explicitly encourage fine-grained alignment between images and non-English languages, we also propose Multimodal Code-switched Training (MCT) to combine monolingual pre-training and multimodal pre-training via a code-switch strategy. Experiments are performed on the multilingual image retrieval task across two benchmark datasets, including MSCOCO and Multi30K. M3P can achieve comparable results for English and new state-of-the-art results for non-English languages.
△ Less
Submitted 31 March, 2021; v1 submitted 3 June, 2020;
originally announced June 2020.
-
XGLUE: A New Benchmark Dataset for Cross-lingual Pre-training, Understanding and Generation
Authors:
Yaobo Liang,
Nan Duan,
Yeyun Gong,
Ning Wu,
Fenfei Guo,
Weizhen Qi,
Ming Gong,
Linjun Shou,
Daxin Jiang,
Guihong Cao,
Xiaodong Fan,
Ruofei Zhang,
Rahul Agrawal,
Edward Cui,
Sining Wei,
Taroon Bharti,
Ying Qiao,
Jiun-Hung Chen,
Winnie Wu,
Shuguang Liu,
Fan Yang,
Daniel Campos,
Rangan Majumder,
Ming Zhou
Abstract:
In this paper, we introduce XGLUE, a new benchmark dataset that can be used to train large-scale cross-lingual pre-trained models using multilingual and bilingual corpora and evaluate their performance across a diverse set of cross-lingual tasks. Comparing to GLUE(Wang et al., 2019), which is labeled in English for natural language understanding tasks only, XGLUE has two main advantages: (1) it pr…
▽ More
In this paper, we introduce XGLUE, a new benchmark dataset that can be used to train large-scale cross-lingual pre-trained models using multilingual and bilingual corpora and evaluate their performance across a diverse set of cross-lingual tasks. Comparing to GLUE(Wang et al., 2019), which is labeled in English for natural language understanding tasks only, XGLUE has two main advantages: (1) it provides 11 diversified tasks that cover both natural language understanding and generation scenarios; (2) for each task, it provides labeled data in multiple languages. We extend a recent cross-lingual pre-trained model Unicoder(Huang et al., 2019) to cover both understanding and generation tasks, which is evaluated on XGLUE as a strong baseline. We also evaluate the base versions (12-layer) of Multilingual BERT, XLM and XLM-R for comparison.
△ Less
Submitted 22 May, 2020; v1 submitted 3 April, 2020;
originally announced April 2020.
-
QCD sum rule studies on the $s s \bar s \bar s$ tetraquark states of $J^{PC} = 0^{-+}$
Authors:
Rui-Rui Dong,
Niu Su,
Hua-Xing Chen,
Er-Liang Cui,
Zhi-Yong Zhou
Abstract:
We apply the method of QCD sum rules to study the $s s \bar s \bar s$ tetraquark states of $J^{PC} = 0^{-+}$. We construct all the relevant $s s \bar s \bar s$ tetraquark currents, and find that there are only two independent ones. We use them to further construct two weakly-correlated mixed currents. One of them leads to reliable QCD sum rule results and the mass is extracted to be…
▽ More
We apply the method of QCD sum rules to study the $s s \bar s \bar s$ tetraquark states of $J^{PC} = 0^{-+}$. We construct all the relevant $s s \bar s \bar s$ tetraquark currents, and find that there are only two independent ones. We use them to further construct two weakly-correlated mixed currents. One of them leads to reliable QCD sum rule results and the mass is extracted to be $2.51^{+0.15}_{-0.12}$ GeV, suggesting that the $X(2370)$ or the $X(2500)$ can be explained as the $ss\bar s\bar s$ tetraquark state of $J^{PC} = 0^{-+}$. To verify this interpretation, we propose to further study the $ππ/K \bar K$ invariant mass spectra of the $J/ψ\to γππη^\prime/γK \bar K η^\prime$ decays in BESIII to examine whether there exists the $f_0(980)$ resonance.
△ Less
Submitted 18 August, 2020; v1 submitted 17 March, 2020;
originally announced March 2020.
-
XGPT: Cross-modal Generative Pre-Training for Image Captioning
Authors:
Qiaolin Xia,
Haoyang Huang,
Nan Duan,
Dongdong Zhang,
Lei Ji,
Zhifang Sui,
Edward Cui,
Taroon Bharti,
Xin Liu,
Ming Zhou
Abstract:
While many BERT-based cross-modal pre-trained models produce excellent results on downstream understanding tasks like image-text retrieval and VQA, they cannot be applied to generation tasks directly. In this paper, we propose XGPT, a new method of Cross-modal Generative Pre-Training for Image Captioning that is designed to pre-train text-to-image caption generators through three novel generation…
▽ More
While many BERT-based cross-modal pre-trained models produce excellent results on downstream understanding tasks like image-text retrieval and VQA, they cannot be applied to generation tasks directly. In this paper, we propose XGPT, a new method of Cross-modal Generative Pre-Training for Image Captioning that is designed to pre-train text-to-image caption generators through three novel generation tasks, including Image-conditioned Masked Language Modeling (IMLM), Image-conditioned Denoising Autoencoding (IDA), and Text-conditioned Image Feature Generation (TIFG). As a result, the pre-trained XGPT can be fine-tuned without any task-specific architecture modifications to create state-of-the-art models for image captioning. Experiments show that XGPT obtains new state-of-the-art results on the benchmark datasets, including COCO Captions and Flickr30k Captions. We also use XGPT to generate new image captions as data augmentation for the image retrieval task and achieve significant improvement on all recall metrics.
△ Less
Submitted 4 March, 2020; v1 submitted 3 March, 2020;
originally announced March 2020.
-
ImageBERT: Cross-modal Pre-training with Large-scale Weak-supervised Image-Text Data
Authors:
Di Qi,
Lin Su,
Jia Song,
Edward Cui,
Taroon Bharti,
Arun Sacheti
Abstract:
In this paper, we introduce a new vision-language pre-trained model -- ImageBERT -- for image-text joint embedding. Our model is a Transformer-based model, which takes different modalities as input and models the relationship between them. The model is pre-trained on four tasks simultaneously: Masked Language Modeling (MLM), Masked Object Classification (MOC), Masked Region Feature Regression (MRF…
▽ More
In this paper, we introduce a new vision-language pre-trained model -- ImageBERT -- for image-text joint embedding. Our model is a Transformer-based model, which takes different modalities as input and models the relationship between them. The model is pre-trained on four tasks simultaneously: Masked Language Modeling (MLM), Masked Object Classification (MOC), Masked Region Feature Regression (MRFR), and Image Text Matching (ITM). To further enhance the pre-training quality, we have collected a Large-scale weAk-supervised Image-Text (LAIT) dataset from Web. We first pre-train the model on this dataset, then conduct a second stage pre-training on Conceptual Captions and SBU Captions. Our experiments show that multi-stage pre-training strategy outperforms single-stage pre-training. We also fine-tune and evaluate our pre-trained ImageBERT model on image retrieval and text retrieval tasks, and achieve new state-of-the-art results on both MSCOCO and Flickr30k datasets.
△ Less
Submitted 23 January, 2020; v1 submitted 22 January, 2020;
originally announced January 2020.
-
Excited $Ω_b$ baryons and fine structure of strong interaction
Authors:
Hua-Xing Chen,
Er-Liang Cui,
Atsushi Hosaka,
Qiang Mao,
Hui-Min Yang
Abstract:
The heavy baryon system bounded by the strong interaction has a rich internal structure, so its mass spectra can have the fine structure similar to the line spectra of atom bounded by the electromagnetic interaction. We systematically study the internal structure of $P$-wave $Ω_b$ baryons and calculate their $D$-wave decay properties. The present study, together with our previous studies on their…
▽ More
The heavy baryon system bounded by the strong interaction has a rich internal structure, so its mass spectra can have the fine structure similar to the line spectra of atom bounded by the electromagnetic interaction. We systematically study the internal structure of $P$-wave $Ω_b$ baryons and calculate their $D$-wave decay properties. The present study, together with our previous studies on their mass spectra and $S$-wave decay properties, suggest that all the four excited $Ω_b$ baryons recently discovered by LHCb can be well explained as $P$-wave $Ω_b$ baryons, and their beautiful fine structure is directly related to the rich internal structure of $P$-wave $Ω_b$ baryons.
△ Less
Submitted 10 April, 2020; v1 submitted 7 January, 2020;
originally announced January 2020.
-
Projection Pursuit with Applications to scRNA Sequencing Data
Authors:
Elvis Han Cui,
Heather Zhou
Abstract:
In this paper, we explore the limitations of PCA as a dimension reduction technique and study its extension, projection pursuit (PP), which is a broad class of linear dimension reduction methods. We first discuss the relevant concepts and theorems and then apply PCA and PP (with negative standardized Shannon's entropy as the projection index) on single cell RNA sequencing data.
In this paper, we explore the limitations of PCA as a dimension reduction technique and study its extension, projection pursuit (PP), which is a broad class of linear dimension reduction methods. We first discuss the relevant concepts and theorems and then apply PCA and PP (with negative standardized Shannon's entropy as the projection index) on single cell RNA sequencing data.
△ Less
Submitted 13 October, 2022; v1 submitted 16 December, 2019;
originally announced December 2019.
-
Decay properties of $P$-wave heavy baryons accompanied by vector mesons within light-cone sum rules
Authors:
Hui-Min Yang,
Hua-Xing Chen,
Er-Liang Cui,
Atsushi Hosaka,
Qiang Mao
Abstract:
We use the method of light-cone sum rules to study decay properties of $P$-wave bottom baryons belonging to the $SU(3)$ flavor $\mathbf{6}_F$ representation. In Ref.~\cite{Cui:2019dzj} we have studied their mass spectrum and pionic decays, and found that the $Σ_{b}(6097)$ and $Ξ_{b}(6227)$ can be well interpreted as $P$-wave bottom baryons of $J^P = 3/2^-$. In this paper we further study their dec…
▽ More
We use the method of light-cone sum rules to study decay properties of $P$-wave bottom baryons belonging to the $SU(3)$ flavor $\mathbf{6}_F$ representation. In Ref.~\cite{Cui:2019dzj} we have studied their mass spectrum and pionic decays, and found that the $Σ_{b}(6097)$ and $Ξ_{b}(6227)$ can be well interpreted as $P$-wave bottom baryons of $J^P = 3/2^-$. In this paper we further study their decays into ground-state bottom baryons and vector mesons. We propose to search for a new state $Ξ_b({5/2}^-)$, that is the $J^P = 5/2^-$ partner state of the $Ξ_{b}(6227)$, in the $Ξ_b({5/2}^-) \to Ξ_b^{*}ρ\to Ξ_b^{*}ππ$ decay process. Its mass is $12 \pm 5$~MeV larger than that of the $Ξ_{b}(6227)$.
△ Less
Submitted 16 March, 2020; v1 submitted 30 September, 2019;
originally announced September 2019.
-
Identifying the $Ξ_{b}(6227)$ and $Σ_{b}(6097)$ as $P$-wave bottom baryons of $J^P = 3/2^-$
Authors:
Er-Liang Cui,
Hui-Min Yang,
Hua-Xing Chen,
Atsushi Hosaka
Abstract:
We use the method of QCD sum rules within the framework of heavy quark effective theory to study the mass spectrum of the $Σ_{b}(6097)^{\pm}$ and $Ξ_{b}(6227)^{-}$, and use the method of light-cone sum rules still within the heavy quark effective theory to study their decay properties. Our results suggest that they can be well interpreted as $P$-wave bottom baryons with the spin-parity…
▽ More
We use the method of QCD sum rules within the framework of heavy quark effective theory to study the mass spectrum of the $Σ_{b}(6097)^{\pm}$ and $Ξ_{b}(6227)^{-}$, and use the method of light-cone sum rules still within the heavy quark effective theory to study their decay properties. Our results suggest that they can be well interpreted as $P$-wave bottom baryons with the spin-parity $J^P = 3/2^-$. They belong to the baryon doublet $[\mathbf{6}_F, 2, 1, λ]$, where the total and spin angular momenta of the light degree of freedom are $j_l = 2$ and $s_l = 1$, and the orbital angular momentum is between the bottom quark and the two-light-quark system ($λ$-type). This doublet contains six bottom baryons, and we predict masses (mass differences) and decay widths of the other four states to be $M_{Ω_b(3/2^-)} = 6.46 \pm 0.12 {~\rm GeV}$, $Γ_{Ω_b(3/2^-)} = 58{^{+65}_{-33}} {~\rm MeV}$, $M_{Σ_b(5/2^-)}-M_{Σ_b(3/2^-)}= 13 \pm 5 {~\rm MeV}$, $M_{Ξ_b^{\prime}(5/2^-)}-M_{Ξ_b^{\prime}(3/2^-)} = 12 \pm 5 {~\rm MeV}$, and $M_{Ω_b(5/2^-)}-M_{Ω_b(3/2^-)} = 11 \pm 5 {~\rm MeV}$. We propose to search for them in further LHCb experiments.
△ Less
Submitted 7 June, 2019; v1 submitted 25 March, 2019;
originally announced March 2019.
-
QCD sum rule studies on the $s s \bar s \bar s$ tetraquark states with $J^{PC} = 1^{+-}$
Authors:
Er-Liang Cui,
Hui-Min Yang,
Hua-Xing Chen,
Wei Chen,
Cheng-Ping Shen
Abstract:
We apply the method of QCD sum rules to study the structure $X$ newly observed by the BESIII Collaboration in the $φη^\prime$ mass spectrum in 2.0-2.1 GeV region in the $J/ψ\rightarrow φηη^\prime$ decay. We construct all the $s s \bar s \bar s$ tetraquark currents with $J^{PC} = 1^{+-}$, and use them to perform QCD sum rule analyses. One current leads to reliable QCD sum rule results and the mass…
▽ More
We apply the method of QCD sum rules to study the structure $X$ newly observed by the BESIII Collaboration in the $φη^\prime$ mass spectrum in 2.0-2.1 GeV region in the $J/ψ\rightarrow φηη^\prime$ decay. We construct all the $s s \bar s \bar s$ tetraquark currents with $J^{PC} = 1^{+-}$, and use them to perform QCD sum rule analyses. One current leads to reliable QCD sum rule results and the mass is extracted to be $2.00^{+0.10}_{-0.09}$ GeV, suggesting that the structure $X$ can be interpreted as an $s s \bar s \bar s$ tetraquark state with $J^{PC} = 1^{+-}$. The $Y(2175)$ can be interpreted as its $s s \bar s \bar s$ partner having $J^{PC} = 1^{--}$, and we propose to search for the other two partners, the $s s \bar s \bar s$ tetraquark states with $J^{PC} = 1^{++}$ and $1^{-+}$, in the $η^\prime f_0(980)$, $η^\prime K \bar K$, and $η^\prime K \bar K^*$ mass spectra.
△ Less
Submitted 7 January, 2019;
originally announced January 2019.
-
On the distribution of the hitting time for the N-urn Ehrenfest model
Authors:
Cheng Xin,
Minzhi Zhao,
Qiang Yao,
Erjia Cui
Abstract:
In this paper, we consider the N-urn Ehrenfest model. By utilizing an auxiliary continuous-time Markov chain, we obtain the explicit formula for the Laplace transform of the hitting time from a single state to a set A of states where A satisfies some symmetric properties. After obtaining the Laplace transform, we are able to compute the high-order moments(especially, variance) for the hitting time…
▽ More
In this paper, we consider the N-urn Ehrenfest model. By utilizing an auxiliary continuous-time Markov chain, we obtain the explicit formula for the Laplace transform of the hitting time from a single state to a set A of states where A satisfies some symmetric properties. After obtaining the Laplace transform, we are able to compute the high-order moments(especially, variance) for the hitting time.
△ Less
Submitted 21 November, 2018; v1 submitted 9 October, 2018;
originally announced October 2018.
-
A suggested search for doubly charmed baryons of $J^P=3/2^+$ via their electromagnetic transitions
Authors:
Er-Liang Cui,
Hua-Xing Chen,
Wei Chen,
Xiang Liu,
Shi-Lin Zhu
Abstract:
We use the method of light-cone sum rules to study the electromagnetic transition of the $Ξ^{*++}_{cc}$ into $Ξ^{++}_{cc}γ$, whose decay width is estimated to be $13.7~{^{+17.7}_{-~7.9}}$ keV. This value is large enough for the $Ξ^{*++}_{cc}$ to be observed in the $Ξ^{++}_{cc}γ$ channel, and we propose to continually search for it in future LHCb and BelleII experiments.
We use the method of light-cone sum rules to study the electromagnetic transition of the $Ξ^{*++}_{cc}$ into $Ξ^{++}_{cc}γ$, whose decay width is estimated to be $13.7~{^{+17.7}_{-~7.9}}$ keV. This value is large enough for the $Ξ^{*++}_{cc}$ to be observed in the $Ξ^{++}_{cc}γ$ channel, and we propose to continually search for it in future LHCb and BelleII experiments.
△ Less
Submitted 31 January, 2018; v1 submitted 10 December, 2017;
originally announced December 2017.
-
Understanding the internal structures of the $X(4140)$, $X(4274)$, $X(4500)$ and $X(4700)$
Authors:
Hua-Xing Chen,
Er-Liang Cui,
Wei Chen,
Xiang Liu,
Shi-Lin Zhu
Abstract:
We investigate the newly observed $X(4500)$ and $X(4700)$ based on the diquark-antidiquark configuration within the framework of QCD sum rules. Both of them may be interpreted as the $D$-wave $cs\bar{c}\bar{s}$ tetraquark states of $J^P = 0^+$, but with opposite color structures, which is remarkably similar to the result obtained in Ref.~\cite{Chen:2010ze} that the $X(4140)$ and $X(4274)$ can be b…
▽ More
We investigate the newly observed $X(4500)$ and $X(4700)$ based on the diquark-antidiquark configuration within the framework of QCD sum rules. Both of them may be interpreted as the $D$-wave $cs\bar{c}\bar{s}$ tetraquark states of $J^P = 0^+$, but with opposite color structures, which is remarkably similar to the result obtained in Ref.~\cite{Chen:2010ze} that the $X(4140)$ and $X(4274)$ can be both interpreted as the $S$-wave $cs\bar{c}\bar{s}$ tetraquark states of $J^P = 1^+$, also with opposite color structures. However, the extracted masses and these suggested assignments to these $X$ states do depend on these running quark masses where $m_s (2 \mbox{ GeV}) = 95 \pm 5$ MeV and $m_c (m_c) = 1.23 \pm 0.09$ GeV. As a byproduct, the masses of the hidden-bottom partner states of the $X(4500)$ and $X(4700)$ are extracted to be both around 10.64 GeV, which can be searched for in the $Υφ$ invariant mass distribution.
△ Less
Submitted 16 March, 2017; v1 submitted 10 June, 2016;
originally announced June 2016.
-
QCD sum rule study of hidden-charm pentaquarks
Authors:
Hua-Xing Chen,
Er-Liang Cui,
Wei Chen,
T. G. Steele,
Xiang Liu,
Shi-Lin Zhu
Abstract:
We study the mass spectra of hidden-charm pentaquarks having spin $J = {1\over2}/{3\over2}/{5\over2}$ and quark contents $uud c \bar c$. We systematically construct all the relevant local hidden-charm pentaquark currents, and select some of them to perform QCD sum rule analyses. We find that the $P_c(4380)$ and $P_c(4450)$ can be identified as hidden-charm pentaquark states composed of an anti-cha…
▽ More
We study the mass spectra of hidden-charm pentaquarks having spin $J = {1\over2}/{3\over2}/{5\over2}$ and quark contents $uud c \bar c$. We systematically construct all the relevant local hidden-charm pentaquark currents, and select some of them to perform QCD sum rule analyses. We find that the $P_c(4380)$ and $P_c(4450)$ can be identified as hidden-charm pentaquark states composed of an anti-charmed meson and a charmed baryon. Besides them, we also find a) the lowest-lying hidden-charm pentaquark state of $J^P = 1/2^-$ has the mass $4.33^{+0.17}_{-0.13}$ GeV, while the one of $J^P = 1/2^+$ is significantly higher, that is around $4.7-4.9$ GeV; b) the lowest-lying hidden-charm pentaquark state of $J^P = 3/2^-$ has the mass $4.37^{+0.18}_{-0.13}$ GeV, consistent with the $P_c(4380)$ of $J^P = 3/2^-$, while the one of $J^P = 3/2^+$ is also significantly higher, that is above $4.6$ GeV; c) the hidden-charm pentaquark state of $J^P = 5/2^-$ has a mass around $4.5-4.6$ GeV, slightly larger than the $P_c(4450)$ of $J^P = 5/2^+$.
△ Less
Submitted 25 October, 2016; v1 submitted 7 February, 2016;
originally announced February 2016.
-
$a_1(1420)$ resonance as a tetraquark state and its isospin partner
Authors:
Hua-Xing Chen,
Er-Liang Cui,
Wei Chen,
T. G. Steele,
Xiang Liu,
Shi-Lin Zhu
Abstract:
We systematically construct tetraquark currents of $I^GJ^{PC}=1^-1^{++}$ and classify them into types $\mathbf{A}$ (antisymmetric), $\mathbf{S}$ (symmetric) and $\mathbf{M}$ (mixed), based on flavor symmetries of diquarks and antidiquarks composing the tetra quark currents. We use tetraquark currents of type $\mathbf{M}$ to perform QCD sum rule analyses, and find a tetraquark current $η^M_{5μ}$ wi…
▽ More
We systematically construct tetraquark currents of $I^GJ^{PC}=1^-1^{++}$ and classify them into types $\mathbf{A}$ (antisymmetric), $\mathbf{S}$ (symmetric) and $\mathbf{M}$ (mixed), based on flavor symmetries of diquarks and antidiquarks composing the tetra quark currents. We use tetraquark currents of type $\mathbf{M}$ to perform QCD sum rule analyses, and find a tetraquark current $η^M_{5μ}$ with quark contents $q s\bar q \bar s$($q=u$ or $d$) leading to a mass of $1.44 \pm 0.08$ GeV consistent with the $a_1(1420)$ state recently observed by the COMPASS collaboration. Our results support tetraquark explanations for both $a_1(1420)$ and $f_1(1420)$, assuming that they are isospin partners. We also study their possible decay patterns. As tetraquark candidates, the possible decay modes of $a_1(1420)$ are $S$-wave $a_1(1420) \rightarrow K^*(892)K$ and $P$-wave $a_1(1420)\rightarrow f_0(980) π$ while the possible decay patterns of $f_1(1420)$ are $S$-wave $f_1(1420) \rightarrow K^*(892)K$ and $P$-wave $f_1(1420) \rightarrow a_0(980) π$. We speculate that $a_1(1420)$ is partly responsible for the large isospin violation in the $η(1405)\to f_0(980)π_0$ decay mode which is reported by BESIII collaboration in the $J/ψ\toγ3π$ process.
△ Less
Submitted 21 May, 2015; v1 submitted 9 March, 2015;
originally announced March 2015.
-
The D-wave heavy-light mesons from QCD sum rules
Authors:
Dan Zhou,
Er-Liang Cui,
Hua-Xing Chen,
Li-Sheng Geng,
Xiang Liu,
Shi-Lin Zhu
Abstract:
We study the D-wave c_bar s heavy meson doublets (1^-,2^-) and (2^-,3^-) using the method of QCD sum rule in the framework of heavy quark effective theory. Choosing the same threshold values omega_c around 2.7 Gev, we calculate the masses of the 1^- and 3^- states. They are m_{D*_{s1}} = 2.81 \pm 0.10 GeV and m_{D*_{s3}} = 2.85 \pm 0.08 GeV, consistent with the newly observed D*_{s1}(2860) and D*_…
▽ More
We study the D-wave c_bar s heavy meson doublets (1^-,2^-) and (2^-,3^-) using the method of QCD sum rule in the framework of heavy quark effective theory. Choosing the same threshold values omega_c around 2.7 Gev, we calculate the masses of the 1^- and 3^- states. They are m_{D*_{s1}} = 2.81 \pm 0.10 GeV and m_{D*_{s3}} = 2.85 \pm 0.08 GeV, consistent with the newly observed D*_{s1}(2860) and D*_{s3}(2860) states by LHCb. The masses of their 2^- partners are calculated to be 2.82 \pm 0.10 and 2.81 \pm 0.08 GeV. The mass splittings within the same doublet are calculated to be m_{D_{s2}} - m_{D*_{s1}} = 0.016 \pm 0.007 GeV and m_{D*_{s3}} - m_{D'_{s2}} = 0.039 \pm 0.014 GeV.
△ Less
Submitted 27 December, 2014; v1 submitted 7 October, 2014;
originally announced October 2014.
-
QCD sum rule Study of the $d^*(2380)$
Authors:
Hua-Xing Chen,
Er-Liang Cui,
Wei Chen,
T. G. Steele,
Shi-Lin Zhu
Abstract:
We systematically construct $I(J^P)=0(3^+)$ six-quark local interpolating currents without derivative operators. We discuss the best choice of operator, and select three $Δ$-$Δ$ like operators to perform QCD sum rule analyses to calculate the mass of the $d^*(2380)$. The mass extracted from this analysis is $M_{d^*} = 2.4\pm0.2$ GeV, consistent with the $d^*(2380)$ mass observed by the WASA detect…
▽ More
We systematically construct $I(J^P)=0(3^+)$ six-quark local interpolating currents without derivative operators. We discuss the best choice of operator, and select three $Δ$-$Δ$ like operators to perform QCD sum rule analyses to calculate the mass of the $d^*(2380)$. The mass extracted from this analysis is $M_{d^*} = 2.4\pm0.2$ GeV, consistent with the $d^*(2380)$ mass observed by the WASA detector at COSY. We also obtain a sum-rule lower mass bound $M_{d^*} > 2.25$ GeV. We also consider the effect of mixing of singlet dibaryon fields with the same quantum numbers, and perform the QCD sum rule analysis of the mixed interpolating current and extract the mass of the $d^*(2380)$ and its lower mass bound. With optimized mixing parameters, we find that the mixed current does not change the numerical result significantly.
△ Less
Submitted 9 March, 2015; v1 submitted 1 October, 2014;
originally announced October 2014.
-
K-pi interaction in finite volume and the K* resonance
Authors:
Dan Zhou,
Er-Liang Cui,
Hua-Xing Chen,
Li-Sheng Geng,
Li-Hua Zhu
Abstract:
We evaluate energy levels of the K-pi system in the K* channel in finite volume using chiral unitary theory. We use these energy levels to obtain K-pi phase shifts, and then obtain the K* mass and its decay width. We investigate their dependence on the pion mass and compare this with Lattice QCD calculations. We also compare our method with the standard Luscher approach, and solve the inverse prob…
▽ More
We evaluate energy levels of the K-pi system in the K* channel in finite volume using chiral unitary theory. We use these energy levels to obtain K-pi phase shifts, and then obtain the K* mass and its decay width. We investigate their dependence on the pion mass and compare this with Lattice QCD calculations. We also compare our method with the standard Luscher approach, and solve the inverse problem to obtain the K-pi phase shifts from these "synthetic" lattice data.
△ Less
Submitted 23 April, 2015; v1 submitted 30 August, 2014;
originally announced September 2014.