-
Analysis of the full Spitzer microlensing sample I: Dark remnant candidates and Gaia predictions
Authors:
Krzysztof A. Rybicki,
Yossi Shvartzvald,
Jennifer C. Yee,
Sebastiano Calchi Novati,
Eran O. Ofek,
Ian A. Bond,
Charles Beichman,
Geoff Bryden,
Sean Carey,
Calen Henderson,
Wei Zhu,
Michael M. Fausnaugh,
Benjamin Wibking,
Andrzej Udalski,
Radek Poleski,
Przemek Mróz,
Michal K. Szymański,
Igor Soszyński,
Paweł Pietrukowicz,
Szymon Kozłowski,
Jan Skowron,
Krzysztof Ulaczyk,
Patryk Iwanek,
Marcin Wrona,
Yoon-Hyun Ryu
, et al. (48 additional authors not shown)
Abstract:
In the pursuit of understanding the population of stellar remnants within the Milky Way, we analyze the sample of $\sim 950$ microlensing events observed by the Spitzer Space Telescope between 2014 and 2019. In this study we focus on a sub-sample of nine microlensing events, selected based on their long timescales, small microlensing parallaxes and joint observations by the Gaia mission, to increa…
▽ More
In the pursuit of understanding the population of stellar remnants within the Milky Way, we analyze the sample of $\sim 950$ microlensing events observed by the Spitzer Space Telescope between 2014 and 2019. In this study we focus on a sub-sample of nine microlensing events, selected based on their long timescales, small microlensing parallaxes and joint observations by the Gaia mission, to increase the probability that the chosen lenses are massive and the mass is measurable. Among the selected events we identify lensing black holes and neutron star candidates, with potential confirmation through forthcoming release of the Gaia time-series astrometry in 2026. Utilizing Bayesian analysis and Galactic models, along with the Gaia Data Release 3 proper motion data, four good candidates for dark remnants were identified: OGLE-2016-BLG-0293, OGLE-2018-BLG-0483, OGLE-2018-BLG-0662, and OGLE-2015-BLG-0149, with lens masses of $2.98^{+1.75}_{-1.28}~M_{\odot}$, $4.65^{+3.12}_{-2.08}~M_{\odot}$, $3.15^{+0.66}_{-0.64}~M_{\odot}$ and $1.4^{+0.75}_{-0.55}~M_{\odot}$, respectively. Notably, the first two candidates are expected to exhibit astrometric microlensing signals detectable by Gaia, offering the prospect of validating the lens masses. The methodologies developed in this work will be applied to the full Spitzer microlensing sample, populating and analyzing the time-scale ($t_{\rm E}$) vs. parallax ($π_{\rm E}$) diagram to derive constraints on the population of lenses in general and massive remnants in particular.
△ Less
Submitted 18 July, 2024;
originally announced July 2024.
-
Identifying the Source of Generation for Large Language Models
Authors:
Bumjin Park,
Jaesik Choi
Abstract:
Large language models (LLMs) memorize text from several sources of documents. In pretraining, LLM trains to maximize the likelihood of text but neither receives the source of the text nor memorizes the source. Accordingly, LLM can not provide document information on the generated content, and users do not obtain any hint of reliability, which is crucial for factuality or privacy infringement. This…
▽ More
Large language models (LLMs) memorize text from several sources of documents. In pretraining, LLM trains to maximize the likelihood of text but neither receives the source of the text nor memorizes the source. Accordingly, LLM can not provide document information on the generated content, and users do not obtain any hint of reliability, which is crucial for factuality or privacy infringement. This work introduces token-level source identification in the decoding step, which maps the token representation to the reference document. We propose a bi-gram source identifier, a multi-layer perceptron with two successive token representations as input for better generalization. We conduct extensive experiments on Wikipedia and PG19 datasets with several LLMs, layer locations, and identifier sizes. The overall results show a possibility of token-level source identifiers for tracing the document, a crucial problem for the safe use of LLMs.
△ Less
Submitted 5 July, 2024;
originally announced July 2024.
-
Geometric Remove-and-Retrain (GOAR): Coordinate-Invariant eXplainable AI Assessment
Authors:
Yong-Hyun Park,
Junghoon Seo,
Bomseok Park,
Seongsu Lee,
Junghyo Jo
Abstract:
Identifying the relevant input features that have a critical influence on the output results is indispensable for the development of explainable artificial intelligence (XAI). Remove-and-Retrain (ROAR) is a widely accepted approach for assessing the importance of individual pixels by measuring changes in accuracy following their removal and subsequent retraining of the modified dataset. However, w…
▽ More
Identifying the relevant input features that have a critical influence on the output results is indispensable for the development of explainable artificial intelligence (XAI). Remove-and-Retrain (ROAR) is a widely accepted approach for assessing the importance of individual pixels by measuring changes in accuracy following their removal and subsequent retraining of the modified dataset. However, we uncover notable limitations in pixel-perturbation strategies. When viewed from a geometric perspective, we discover that these metrics fail to discriminate between differences among feature attribution methods, thereby compromising the reliability of the evaluation. To address this challenge, we introduce an alternative feature-perturbation approach named Geometric Remove-and-Retrain (GOAR). Through a series of experiments with both synthetic and real datasets, we substantiate that GOAR transcends the limitations of pixel-centric metrics.
△ Less
Submitted 17 July, 2024;
originally announced July 2024.
-
Centrality dependence of Lévy-stable two-pion Bose-Einstein correlations in $\sqrt{s_{_{NN}}}=200$ GeV Au$+$Au collisions
Authors:
PHENIX Collaboration,
N. J. Abdulameer,
U. Acharya,
A. Adare,
C. Aidala,
N. N. Ajitanand,
Y. Akiba,
R. Akimoto,
H. Al-Ta'ani,
J. Alexander,
A. Angerami,
K. Aoki,
N. Apadula,
Y. Aramaki,
H. Asano,
E. C. Aschenauer,
E. T. Atomssa,
T. C. Awes,
B. Azmoun,
V. Babintsev,
M. Bai,
B. Bannier,
K. N. Barish,
B. Bassalleck,
S. Bathe
, et al. (377 additional authors not shown)
Abstract:
The PHENIX experiment measured the centrality dependence of two-pion Bose-Einstein correlation functions in $\sqrt{s_{_{NN}}}=200$~GeV Au$+$Au collisions at the Relativistic Heavy Ion Collider at Brookhaven National Laboratory. The data are well represented by Lévy-stable source distributions. The extracted source parameters are the correlation-strength parameter $λ$, the Lévy index of stability…
▽ More
The PHENIX experiment measured the centrality dependence of two-pion Bose-Einstein correlation functions in $\sqrt{s_{_{NN}}}=200$~GeV Au$+$Au collisions at the Relativistic Heavy Ion Collider at Brookhaven National Laboratory. The data are well represented by Lévy-stable source distributions. The extracted source parameters are the correlation-strength parameter $λ$, the Lévy index of stability $α$, and the Lévy-scale parameter $R$ as a function of transverse mass $m_T$ and centrality. The $λ(m_T)$ parameter is constant at larger values of $m_T$, but decreases as $m_T$ decreases. The Lévy scale parameter $R(m_T)$ decreases with $m_T$ and exhibits proportionality to the length scale of the nuclear overlap region. The Lévy exponent $α(m_T)$ is independent of $m_T$ within uncertainties in each investigated centrality bin, but shows a clear centrality dependence. At all centralities, the Lévy exponent $α$ is significantly different from that of Gaussian ($α=2$) or Cauchy ($α=1$) source distributions. Comparisons to the predictions of Monte-Carlo simulations of resonance-decay chains show that in all but the most peripheral centrality class (50%-60%), the obtained results are inconsistent with the measurements, unless a significant reduction of the in-medium mass of the $η'$ meson is included. In each centrality class, the best value of the in-medium $η'$ mass is compared to the mass of the $η$ meson, as well as to several theoretical predictions that consider restoration of $U_A(1)$ symmetry in hot hadronic matter.
△ Less
Submitted 11 July, 2024;
originally announced July 2024.
-
Multi-Object Spectroscopy of Galaxy Clusters at $z \sim 0.95$ in Ultra Deep Survey Field with Different Star-formation Properties and Large-scale Environments
Authors:
Seong-Kook Lee,
Myungshin Im,
Bomi Park,
Minhee Hyun,
Insu Paek,
Dohyeong Kim
Abstract:
While galaxy clusters are dominated by quiescent galaxies at local, they show a wide range in quiescent galaxy fraction (QF) at higher redshifts. Here, we present the discovery of two galaxy clusters at $z \sim 0.95$ with contrasting QFs despite having similar masses (log ($M_{200}/M_{\odot}$)$ \sim 14$) and spectra and redshifts of 29 galaxies in these clusters and 76 galaxies in the surrounding…
▽ More
While galaxy clusters are dominated by quiescent galaxies at local, they show a wide range in quiescent galaxy fraction (QF) at higher redshifts. Here, we present the discovery of two galaxy clusters at $z \sim 0.95$ with contrasting QFs despite having similar masses (log ($M_{200}/M_{\odot}$)$ \sim 14$) and spectra and redshifts of 29 galaxies in these clusters and 76 galaxies in the surrounding area. The clusters are found in the Ultra Deep Survey (UDS) field and confirmed through multi-object spectroscopic (MOS) observation using the Inamori Magellan Areal Camera and Spectrograph (IMACS) on the Magellan telescope. The two clusters exhibit QFs of $0.094^{+0.11}_{-0.032}$ and $0.38^{+0.14}_{-0.11}$, respectively. Analysis of large-scale structures (LSSs) surrounding these clusters finds that properties of these clusters are consistent with the anti-correlation trend between the QF and the extent of surrounding LSS, found in Lee et al. (2019), which can be interpreted as a result from the replenishment of young, star-forming galaxies keeps the QF low when galaxy clusters are accompanied by rich surrounding environments.
△ Less
Submitted 8 July, 2024;
originally announced July 2024.
-
Discovering one molecule out of a million: inverse design of molecular hole transporting semiconductors tailored for perovskite solar cells
Authors:
Jianchang Wu,
Luca Torresi,
ManMan Hu,
Patrick Reiser,
Jiyun Zhang,
Juan S. Rocha-Ortiz,
Luyao Wang,
Zhiqiang Xie,
Kaicheng Zhang,
Byung-wook Park,
Anastasia Barabash,
Yicheng Zhao,
Junsheng Luo,
Yunuo Wang,
Larry Lüer,
Lin-Long Deng,
Jens A. Hauch,
Sang Il Seok,
Pascal Friederich,
Christoph J. Brabec
Abstract:
The inverse design of tailored organic molecules for specific optoelectronic devices of high complexity holds an enormous potential, but has not yet been realized1,2. The complexity and literally infinite diversity of conjugated molecular structures present both, an unprecedented opportunity for technological breakthroughs as well as an unseen optimization challenge. Current models rely on big dat…
▽ More
The inverse design of tailored organic molecules for specific optoelectronic devices of high complexity holds an enormous potential, but has not yet been realized1,2. The complexity and literally infinite diversity of conjugated molecular structures present both, an unprecedented opportunity for technological breakthroughs as well as an unseen optimization challenge. Current models rely on big data which do not exist for specialized research films. However, a hybrid computational and high throughput experimental screening workflow allowed us to train predictive models with as little as 149 molecules. We demonstrate a unique closed-loop workflow combining high throughput synthesis and Bayesian optimization that discovers new hole transporting materials with tailored properties for solar cell applications. A series of high-performance molecules were identified from minimal suggestions, achieving up to 26.23% (certified 25.88%) power conversion efficiency in perovskite solar cells. Our work paves the way for rapid, informed discovery in vast molecular libraries, revolutionizing material selection for complex devices. We believe that our approach can be generalized to other emerging fields and indeed accelerate the development of optoelectronic semiconductor devices in general.
△ Less
Submitted 30 June, 2024;
originally announced July 2024.
-
Memorizing Documents with Guidance in Large Language Models
Authors:
Bumjin Park,
Jaesik Choi
Abstract:
Training data plays a pivotal role in AI models. Large language models (LLMs) are trained with massive amounts of documents, and their parameters hold document-related contents. Recently, several studies identified content-specific locations in LLMs by examining the parameters. Instead of the post hoc interpretation, we propose another approach. We propose document-wise memory architecture to trac…
▽ More
Training data plays a pivotal role in AI models. Large language models (LLMs) are trained with massive amounts of documents, and their parameters hold document-related contents. Recently, several studies identified content-specific locations in LLMs by examining the parameters. Instead of the post hoc interpretation, we propose another approach. We propose document-wise memory architecture to track document memories in training. The proposed architecture maps document representations to memory entries, which softly mask memories in the forward process of LLMs. Additionally, we propose document guidance loss, which increases the likelihood of text with document memories and reduces the likelihood of the text with the memories of other documents. Experimental results on Wikitext-103-v1 with Pythia-1B show that the proposed methods provide different memory entries for documents and high recall of document-related content in generation with trained document-wise memories.
△ Less
Submitted 22 June, 2024;
originally announced June 2024.
-
Random walk in slowly changing environments
Authors:
Bryan Park,
Souvik Ray
Abstract:
A Random Walk in Changing Environment (RWCE) is a weighted random walk on a locally finite, connected graph $G$ with random, time-dependent edge-weights. This includes self-interacting random walks, where the edge-weights depend on the history of the process. In general, even the basic question of recurrence or transience for RWCEs is difficult, especially when the underlying graph contains cycles…
▽ More
A Random Walk in Changing Environment (RWCE) is a weighted random walk on a locally finite, connected graph $G$ with random, time-dependent edge-weights. This includes self-interacting random walks, where the edge-weights depend on the history of the process. In general, even the basic question of recurrence or transience for RWCEs is difficult, especially when the underlying graph contains cycles. In this note, we derive a condition for recurrence or transience that is too restrictive for classical RWCEs but instead works for any graph $G.$ Namely, we show that any bounded RWCE on $G$ with "slowly" changing edge-weights inherits the recurrence or transience of the initial weighted graph.
△ Less
Submitted 21 June, 2024;
originally announced June 2024.
-
TroL: Traversal of Layers for Large Language and Vision Models
Authors:
Byung-Kwan Lee,
Sangyun Chung,
Chae Won Kim,
Beomchan Park,
Yong Man Ro
Abstract:
Large language and vision models (LLVMs) have been driven by the generalization power of large language models (LLMs) and the advent of visual instruction tuning. Along with scaling them up directly, these models enable LLVMs to showcase powerful vision language (VL) performances by covering diverse tasks via natural language instructions. However, existing open-source LLVMs that perform comparabl…
▽ More
Large language and vision models (LLVMs) have been driven by the generalization power of large language models (LLMs) and the advent of visual instruction tuning. Along with scaling them up directly, these models enable LLVMs to showcase powerful vision language (VL) performances by covering diverse tasks via natural language instructions. However, existing open-source LLVMs that perform comparably to closed-source LLVMs such as GPT-4V are often considered too large (e.g., 26B, 34B, and 110B parameters), having a larger number of layers. These large models demand costly, high-end resources for both training and inference. To address this issue, we present a new efficient LLVM family with 1.8B, 3.8B, and 7B LLM model sizes, Traversal of Layers (TroL), which enables the reuse of layers in a token-wise manner. This layer traversing technique simulates the effect of looking back and retracing the answering stream while increasing the number of forward propagation layers without physically adding more layers. We demonstrate that TroL employs a simple layer traversing approach yet efficiently outperforms the open-source LLVMs with larger model sizes and rivals the performances of the closed-source LLVMs with substantial sizes.
△ Less
Submitted 19 June, 2024; v1 submitted 17 June, 2024;
originally announced June 2024.
-
Four microlensing giant planets detected through signals produced by minor-image perturbations
Authors:
Cheongho Han,
Ian A. Bond,
Chung-Uk Lee,
Andrew Gould,
Michael D. Albrow,
Sun-Ju Chung,
Kyu-Ha Hwang,
Youn Kil Jung,
Yoon-Hyun Ryu,
Yossi Shvartzvald,
In-Gu Shin,
Jennifer C. Yee,
Hongjing Yang,
Weicheng Zang,
Sang-Mok Cha,
Doeon Kim,
Dong-Jin Kim,
Seung-Lee Kim,
Dong-Joo Lee,
Yongseok Lee,
Byeong-Gon Park,
Richard W. Pogge,
Fumio Abe,
Ken Bando,
Richard Barry
, et al. (41 additional authors not shown)
Abstract:
We investigated the nature of the anomalies appearing in four microlensing events KMT-2020-BLG-0757, KMT-2022-BLG-0732, KMT-2022-BLG-1787, and KMT-2022-BLG-1852. The light curves of these events commonly exhibit initial bumps followed by subsequent troughs that extend across a substantial portion of the light curves. We performed thorough modeling of the anomalies to elucidate their characteristic…
▽ More
We investigated the nature of the anomalies appearing in four microlensing events KMT-2020-BLG-0757, KMT-2022-BLG-0732, KMT-2022-BLG-1787, and KMT-2022-BLG-1852. The light curves of these events commonly exhibit initial bumps followed by subsequent troughs that extend across a substantial portion of the light curves. We performed thorough modeling of the anomalies to elucidate their characteristics. Despite their prolonged durations, which differ from the usual brief anomalies observed in typical planetary events, our analysis revealed that each anomaly in these events originated from a planetary companion located within the Einstein ring of the primary star. It was found that the initial bump arouse when the source star crossed one of the planetary caustics, while the subsequent trough feature occurred as the source traversed the region of minor image perturbations lying between the pair of planetary caustics. The estimated masses of the host and planet, their mass ratios, and the distance to the discovered planetary systems are $(M_{\rm host}/M_\odot, M_{\rm planet}/M_{\rm J}, q/10^{-3}, \dl/{\rm kpc}) = (0.58^{+0.33}_{-0.30}, 10.71^{+6.17}_{-5.61}, 17.61\pm 2.25,6.67^{+0.93}_{-1.30})$ for KMT-2020-BLG-0757, $(0.53^{+0.31}_{-0.31}, 1.12^{+0.65}_{-0.65}, 2.01 \pm 0.07, 6.66^{+1.19}_{-1.84})$ for KMT-2022-BLG-0732, $(0.42^{+0.32}_{-0.23}, 6.64^{+4.98}_{-3.64}, 15.07\pm 0.86, 7.55^{+0.89}_{-1.30})$ for KMT-2022-BLG-1787, and $(0.32^{+0.34}_{-0.19}, 4.98^{+5.42}_{-2.94}, 8.74\pm 0.49, 6.27^{+0.90}_{-1.15})$ for KMT-2022-BLG-1852. These parameters indicate that all the planets are giants with masses exceeding the mass of Jupiter in our solar system and the hosts are low-mass stars with masses substantially less massive than the Sun.
△ Less
Submitted 15 June, 2024;
originally announced June 2024.
-
Audio-conditioned phonemic and prosodic annotation for building text-to-speech models from unlabeled speech data
Authors:
Yuma Shirahata,
Byeongseon Park,
Ryuichi Yamamoto,
Kentaro Tachibana
Abstract:
This paper proposes an audio-conditioned phonemic and prosodic annotation model for building text-to-speech (TTS) datasets from unlabeled speech samples. For creating a TTS dataset that consists of label-speech paired data, the proposed annotation model leverages an automatic speech recognition (ASR) model to obtain phonemic and prosodic labels from unlabeled speech samples. By fine-tuning a large…
▽ More
This paper proposes an audio-conditioned phonemic and prosodic annotation model for building text-to-speech (TTS) datasets from unlabeled speech samples. For creating a TTS dataset that consists of label-speech paired data, the proposed annotation model leverages an automatic speech recognition (ASR) model to obtain phonemic and prosodic labels from unlabeled speech samples. By fine-tuning a large-scale pre-trained ASR model, we can construct the annotation model using a limited amount of label-speech paired data within an existing TTS dataset. To alleviate the shortage of label-speech paired data for training the annotation model, we generate pseudo label-speech paired data using text-only corpora and an auxiliary TTS model. This TTS model is also trained with the existing TTS dataset. Experimental results show that the TTS model trained with the dataset created by the proposed annotation method can synthesize speech as naturally as the one trained with a fully-labeled dataset.
△ Less
Submitted 12 June, 2024;
originally announced June 2024.
-
Exotic definite four-manifolds with non-cyclic fundamental group
Authors:
Robert Harris,
Patrick Naylor,
B. Doug Park
Abstract:
We construct infinitely many pairwise non-diffeomorphic smooth structures on a definite $4$-manifold with non-cyclic fundamental group $\mathbb{Z}/2\times \mathbb{Z}/2$.
We construct infinitely many pairwise non-diffeomorphic smooth structures on a definite $4$-manifold with non-cyclic fundamental group $\mathbb{Z}/2\times \mathbb{Z}/2$.
△ Less
Submitted 10 June, 2024;
originally announced June 2024.
-
Exclusion of the Cosmological Triangle in Reactor-Based Search for Axion-Like Particles
Authors:
Byung Ju Park,
Jae Jin Choi,
Eunju Jeon,
Jinyu Kim,
Kyungwon Kim,
Sung Hyun Kim,
Sun Kee Kim,
Yeongduk Kim,
Young Ju Ko,
Byoung-Cheol Koh,
Chang Hyon Ha,
Seo Hyun Lee,
In Soo Lee,
Hyunseok Lee,
Hyun Su Lee,
Jaison Lee,
Yoomin Oh,
Doojin Kim
Abstract:
We report new constraints on axion-like particle (ALP) using data corresponding to a sodium iodine target exposure of 3063 kg$\cdot$days from the neutrino elastic scattering observation with NaI (NEON) experiment. A 16.7 kg of thallium-doped sodium iodide target was located 23.7 meters from a 2.8 GW thermal power nuclear reactor. We searched for ALPs produced by high-flux photons by comparing the…
▽ More
We report new constraints on axion-like particle (ALP) using data corresponding to a sodium iodine target exposure of 3063 kg$\cdot$days from the neutrino elastic scattering observation with NaI (NEON) experiment. A 16.7 kg of thallium-doped sodium iodide target was located 23.7 meters from a 2.8 GW thermal power nuclear reactor. We searched for ALPs produced by high-flux photons by comparing the energy spectra of data collected during reactor-on (1596 kg$\cdot$days exposure) and reactor-off (1467 kg$\cdot$days exposure) periods. No signal consistent with ALP interaction was identified, allowing us to set exclusion limits at the 95% confidence level. Our limits cover previously unexplored regions for both photon couplings (${g_{aγ}}$) and electron couplings (${g_{ae}}$) for axion masses around 1 MeV/c$^2$. Notably, the NEON data excludes the unconstrained region identified by laboratory-based searches for photon couplings within the "cosmological triangle" for the first time. The observed 95\% confidence level limits reach as low as ${g_{aγ}}$ of 4.33$\times$ 10$^{-8}$ GeV$^{-1}$ and ${g_{ae}}$ of 1.10$\times$ 10$^{-9}$ for axion masses of 1.7 MeV/c$^2$ and 1.0 MeV/c$^2$, respectively.
△ Less
Submitted 11 June, 2024; v1 submitted 10 June, 2024;
originally announced June 2024.
-
Stochastic Optimal Control for Diffusion Bridges in Function Spaces
Authors:
Byoungwoo Park,
Jungwon Choi,
Sungbin Lim,
Juho Lee
Abstract:
Recent advancements in diffusion models and diffusion bridges primarily focus on finite-dimensional spaces, yet many real-world problems necessitate operations in infinite-dimensional function spaces for more natural and interpretable formulations. In this paper, we present a theory of stochastic optimal control (SOC) tailored to infinite-dimensional spaces, aiming to extend diffusion-based algori…
▽ More
Recent advancements in diffusion models and diffusion bridges primarily focus on finite-dimensional spaces, yet many real-world problems necessitate operations in infinite-dimensional function spaces for more natural and interpretable formulations. In this paper, we present a theory of stochastic optimal control (SOC) tailored to infinite-dimensional spaces, aiming to extend diffusion-based algorithms to function spaces. Specifically, we demonstrate how Doob's $h$-transform, the fundamental tool for constructing diffusion bridges, can be derived from the SOC perspective and expanded to infinite dimensions. This expansion presents a challenge, as infinite-dimensional spaces typically lack closed-form densities. Leveraging our theory, we establish that solving the optimal control problem with a specific objective function choice is equivalent to learning diffusion-based generative models. We propose two applications: (1) learning bridges between two infinite-dimensional distributions and (2) generative models for sampling from an infinite-dimensional distribution. Our approach proves effective for diverse problems involving continuous function space representations, such as resolution-free images, time-series data, and probability density functions.
△ Less
Submitted 2 June, 2024; v1 submitted 31 May, 2024;
originally announced May 2024.
-
Diffusion Model Patching via Mixture-of-Prompts
Authors:
Seokil Ham,
Sangmin Woo,
Jin-Young Kim,
Hyojun Go,
Byeongjun Park,
Changick Kim
Abstract:
We present Diffusion Model Patching (DMP), a simple method to boost the performance of pre-trained diffusion models that have already reached convergence, with a negligible increase in parameters. DMP inserts a small, learnable set of prompts into the model's input space while keeping the original model frozen. The effectiveness of DMP is not merely due to the addition of parameters but stems from…
▽ More
We present Diffusion Model Patching (DMP), a simple method to boost the performance of pre-trained diffusion models that have already reached convergence, with a negligible increase in parameters. DMP inserts a small, learnable set of prompts into the model's input space while keeping the original model frozen. The effectiveness of DMP is not merely due to the addition of parameters but stems from its dynamic gating mechanism, which selects and combines a subset of learnable prompts at every step of the generative process (e.g., reverse denoising steps). This strategy, which we term "mixture-of-prompts", enables the model to draw on the distinct expertise of each prompt, essentially "patching" the model's functionality at every step with minimal yet specialized parameters. Uniquely, DMP enhances the model by further training on the same dataset on which it was originally trained, even in a scenario where significant improvements are typically not expected due to model convergence. Experiments show that DMP significantly enhances the converged FID of DiT-L/2 on FFHQ 256x256 by 10.38%, achieved with only a 1.43% parameter increase and 50K additional training iterations.
△ Less
Submitted 30 May, 2024; v1 submitted 28 May, 2024;
originally announced May 2024.
-
KMT-2023-BLG-2669: Ninth Free-floating Planet Candidate with $θ_{\rm E}$ measurements
Authors:
Youn Kil Jung,
Kyu-Ha Hwang,
Hongjing Yang,
Andrew Gould,
Jennifer C. Yee,
Cheongho Han,
Michael D. Albrow,
Sun-Ju Chung,
Yoon-Hyun Ryu,
In-Gu Shin,
Yossi Shvartzvald,
Weicheng Zang,
Sang-Mok Cha,
Dong-Jin Kim,
Seung-Lee Kim,
Chung-Uk Lee,
Dong-Joo Lee,
Yongseok Lee,
Byeong-Gon Park,
Richard W. Pogge
Abstract:
We report a free-floating planet (FFP) candidate identified from the analysis of the microlensing event KMT-2023-BLG-2669. The lensing light curve is characterized by a short duration $(\lesssim 3\,{\rm days})$ and a small amplitude $(\lesssim 0.7\,{\rm mag})$. From the analysis, we find the Einstein timescale of $t_{\rm E} \backsimeq 0.33\,{\rm days}$ and the Einstein radius of…
▽ More
We report a free-floating planet (FFP) candidate identified from the analysis of the microlensing event KMT-2023-BLG-2669. The lensing light curve is characterized by a short duration $(\lesssim 3\,{\rm days})$ and a small amplitude $(\lesssim 0.7\,{\rm mag})$. From the analysis, we find the Einstein timescale of $t_{\rm E} \backsimeq 0.33\,{\rm days}$ and the Einstein radius of $θ_{\rm E} \backsimeq 4.41\,μ{\rm as}$. These measurements enable us to infer the lens mass as $M = 8\,M_{\oplus} (π_{\rm rel} / 0.1\,{\rm mas})^{-1}$, where $π_{\rm rel}$ is the relative lens-source parallax. The inference implies that the lens is a sub-Neptune- to Saturn-mass object depending on its unknown distance. This is the ninth isolated planetary-mass microlens with $θ_{\rm E} < 10\,μ{\rm as}$, which (as shown by \citealt{gould22}) is a useful threshold for a FFP candidate. We conduct extensive searches for possible signals of a host star in the light curve, but find no strong evidence for the host. We discuss the possibility of using late-time high-resolution imaging to probe for possible hosts.
△ Less
Submitted 27 May, 2024;
originally announced May 2024.
-
Picturing Ambiguity: A Visual Twist on the Winograd Schema Challenge
Authors:
Brendan Park,
Madeline Janecek,
Naser Ezzati-Jivan,
Yifeng Li,
Ali Emami
Abstract:
Large Language Models (LLMs) have demonstrated remarkable success in tasks like the Winograd Schema Challenge (WSC), showcasing advanced textual common-sense reasoning. However, applying this reasoning to multimodal domains, where understanding text and images together is essential, remains a substantial challenge. To address this, we introduce WinoVis, a novel dataset specifically designed to pro…
▽ More
Large Language Models (LLMs) have demonstrated remarkable success in tasks like the Winograd Schema Challenge (WSC), showcasing advanced textual common-sense reasoning. However, applying this reasoning to multimodal domains, where understanding text and images together is essential, remains a substantial challenge. To address this, we introduce WinoVis, a novel dataset specifically designed to probe text-to-image models on pronoun disambiguation within multimodal contexts. Utilizing GPT-4 for prompt generation and Diffusion Attentive Attribution Maps (DAAM) for heatmap analysis, we propose a novel evaluation framework that isolates the models' ability in pronoun disambiguation from other visual processing challenges. Evaluation of successive model versions reveals that, despite incremental advancements, Stable Diffusion 2.0 achieves a precision of 56.7% on WinoVis, only marginally surpassing random guessing. Further error analysis identifies important areas for future research aimed at advancing text-to-image models in their ability to interpret and interact with the complex visual world.
△ Less
Submitted 3 June, 2024; v1 submitted 25 May, 2024;
originally announced May 2024.
-
Meteor: Mamba-based Traversal of Rationale for Large Language and Vision Models
Authors:
Byung-Kwan Lee,
Chae Won Kim,
Beomchan Park,
Yong Man Ro
Abstract:
The rapid development of large language and vision models (LLVMs) has been driven by advances in visual instruction tuning. Recently, open-source LLVMs have curated high-quality visual instruction tuning datasets and utilized additional vision encoders or multiple computer vision models in order to narrow the performance gap with powerful closed-source LLVMs. These advancements are attributed to m…
▽ More
The rapid development of large language and vision models (LLVMs) has been driven by advances in visual instruction tuning. Recently, open-source LLVMs have curated high-quality visual instruction tuning datasets and utilized additional vision encoders or multiple computer vision models in order to narrow the performance gap with powerful closed-source LLVMs. These advancements are attributed to multifaceted information required for diverse capabilities, including fundamental image understanding, real-world knowledge about common-sense and non-object concepts (e.g., charts, diagrams, symbols, signs, and math problems), and step-by-step procedures for solving complex questions. Drawing from the multifaceted information, we present a new efficient LLVM, Mamba-based traversal of rationales (Meteor), which leverages multifaceted rationale to enhance understanding and answering capabilities. To embed lengthy rationales containing abundant information, we employ the Mamba architecture, capable of processing sequential data with linear time complexity. We introduce a new concept of traversal of rationale that facilitates efficient embedding of rationale. Subsequently, the backbone multimodal language model (MLM) is trained to generate answers with the aid of rationale. Through these steps, Meteor achieves significant improvements in vision language performances across multiple evaluation benchmarks requiring diverse capabilities, without scaling up the model size or employing additional vision encoders and computer vision models.
△ Less
Submitted 27 May, 2024; v1 submitted 24 May, 2024;
originally announced May 2024.
-
Evaluation of Connected Vehicle Identification-Aware Mixed Traffic Freeway Cooperative Merging
Authors:
Haoji Liu,
Fatemeh Jahedinia,
Zeyu Mu,
B. Brian Park
Abstract:
Cooperative on-ramp merging control for connected automated vehicles (CAVs) has been extensively investigated. However, they did neglect the connected vehicle identification process, which is a must for CAV cooperations. In this paper, we introduced a connected vehicle identification system (VIS) into the on-ramp merging control process for the first time and proposed an evaluation framework to as…
▽ More
Cooperative on-ramp merging control for connected automated vehicles (CAVs) has been extensively investigated. However, they did neglect the connected vehicle identification process, which is a must for CAV cooperations. In this paper, we introduced a connected vehicle identification system (VIS) into the on-ramp merging control process for the first time and proposed an evaluation framework to assess the impacts of VIS on on-ramp merging performance. First, the mixed-traffic cooperative merging problem was formulated. Then, a real-world merging trajectory dataset was processed to generate dangerous merging scenarios. Aiming at resolving the potential collision risks in mixed traffic where CAVs and traditional human-driven vehicles (THVs) coexist, we proposed on-ramp merging strategies for CAVs in different mixed traffic situations considering the connected vehicle identification process. The performances were evaluated via simulations. Results indicated that while safety was assured for all cases with CAVs, the cases with VIS had delayed initiation of cooperation, limiting the range of cooperative merging and leading to increased fuel consumption and acceleration variations.
△ Less
Submitted 20 May, 2024;
originally announced May 2024.
-
KMT-2023-BLG-1866Lb: Microlensing super-Earth around an M dwarf host
Authors:
Cheongho Han,
Ian A. Bond,
Andrzej Udalski,
Chung-Uk Lee,
Andrew Gould,
Michael D. Albrow,
Sun-Ju Chung,
Kyu-Ha Hwang,
Youn Kil Jung,
Yoon-Hyun Ryu,
Yossi Shvartzvald,
In-Gu Shin,
Jennifer C. Yee,
Hongjing Yang,
Weicheng Zang,
Sang-Mok Cha,
Doeon Kim,
Dong-Jin Kim,
Seung-Lee Kim,
Dong-Joo Lee,
Yongseok Lee,
Byeong-Gon Park,
Richard W. Pogge,
Fumio Abe,
Ken Bando
, et al. (42 additional authors not shown)
Abstract:
We investigate the nature of the short-term anomaly that appears in the lensing light curve of KMT-2023-BLG-1866. The anomaly was only partly covered due to its short duration, less than a day, coupled with cloudy weather conditions and restricted nighttime duration. Considering intricacy of interpreting partially covered signals, we thoroughly explore all potential degenerate solutions. Through t…
▽ More
We investigate the nature of the short-term anomaly that appears in the lensing light curve of KMT-2023-BLG-1866. The anomaly was only partly covered due to its short duration, less than a day, coupled with cloudy weather conditions and restricted nighttime duration. Considering intricacy of interpreting partially covered signals, we thoroughly explore all potential degenerate solutions. Through this process, we identify three planetary scenarios that equally well account for the observed anomaly. These scenarios are characterized by the specific planetary parameters: $(s, q)_{\rm inner} = [0.9740 \pm 0.0083, (2.46 \pm 1.07) \times 10^{-5}]$, $(s, q)_{\rm intermediate} = [0.9779 \pm 0.0017, (1.56 \pm 0.25)\times 10^{-5}]$, and $(s, q)_{\rm outer} = [0.9894 \pm 0.0107, (2.31 \pm 1.29)\times 10^{-5}]$, where $s$ and $q$ denote the projected separation (scaled to the Einstein radius) and mass ratio between the planet and its host, respectively. We identify that the ambiguity between the inner and outer solutions stems from the inner-outer degeneracy, while the similarity between the intermediate solution and the others is due to an accidental degeneracy caused by incomplete anomaly coverage. Through Bayesian analysis utilizing the constraints derived from measured lensing observables and blending flux, our estimation indicates that the lens system comprises a very low-mass planet orbiting an early M-type star situated approximately (6.2 -- 6.5)~kpc from Earth in terms of median posterior values for the different solutions. The median mass of the planet host is in the range of (0.48 -- 0.51)~$M_\odot$, and that of the planet's mass spans a range of (2.6 -- 4.0)~$M_{\rm E}$, varying across different solutions. The detection of KMT-2023-BLG-1866Lb signifies the extension of the lensing surveys to very low-mass planets that have been difficult to be detected from earlier surveys.
△ Less
Submitted 13 May, 2024;
originally announced May 2024.
-
Grouping predictors via network-wide metrics
Authors:
Brandon Woosuk Park,
Anand N. Vidyashankar,
Tucker S. McElroy
Abstract:
When multitudes of features can plausibly be associated with a response, both privacy considerations and model parsimony suggest grouping them to increase the predictive power of a regression model. Specifically, the identification of groups of predictors significantly associated with the response variable eases further downstream analysis and decision-making. This paper proposes a new data analys…
▽ More
When multitudes of features can plausibly be associated with a response, both privacy considerations and model parsimony suggest grouping them to increase the predictive power of a regression model. Specifically, the identification of groups of predictors significantly associated with the response variable eases further downstream analysis and decision-making. This paper proposes a new data analysis methodology that utilizes the high-dimensional predictor space to construct an implicit network with weighted edges %and weights on the edges to identify significant associations between the response and the predictors. Using a population model for groups of predictors defined via network-wide metrics, a new supervised grouping algorithm is proposed to determine the correct group, with probability tending to one as the sample size diverges to infinity. For this reason, we establish several theoretical properties of the estimates of network-wide metrics. A novel model-assisted bootstrap procedure that substantially decreases computational complexity is developed, facilitating the assessment of uncertainty in the estimates of network-wide metrics. The proposed methods account for several challenges that arise in the high-dimensional data setting, including (i) a large number of predictors, (ii) uncertainty regarding the true statistical model, and (iii) model selection variability. The performance of the proposed methods is demonstrated through numerical experiments, data from sports analytics, and breast cancer data.
△ Less
Submitted 4 May, 2024;
originally announced May 2024.
-
Sharp Maximal function estimates for Multilinear pseudo-differential operators of type (0,0)
Authors:
Bae Jun Park,
Naohito Tomita
Abstract:
In this paper, we study sharp maximal function estimates for multilinear pseudo-differential operators. Our target is operators of type (0, 0) for which a differentiation does not make any decay of the associated symbol. Analogous results for operators of type (ρ, ρ), 0 < ρ< 1, appeared in an earlier work of the authors, but a different approach is given for ρ= 0
In this paper, we study sharp maximal function estimates for multilinear pseudo-differential operators. Our target is operators of type (0, 0) for which a differentiation does not make any decay of the associated symbol. Analogous results for operators of type (ρ, ρ), 0 < ρ< 1, appeared in an earlier work of the authors, but a different approach is given for ρ= 0
△ Less
Submitted 3 May, 2024;
originally announced May 2024.
-
A manufacturable platform for photonic quantum computing
Authors:
Koen Alexander,
Andrea Bahgat,
Avishai Benyamini,
Dylan Black,
Damien Bonneau,
Stanley Burgos,
Ben Burridge,
Geoff Campbell,
Gabriel Catalano,
Alex Ceballos,
Chia-Ming Chang,
CJ Chung,
Fariba Danesh,
Tom Dauer,
Michael Davis,
Eric Dudley,
Ping Er-Xuan,
Josep Fargas,
Alessandro Farsi,
Colleen Fenrich,
Jonathan Frazer,
Masaya Fukami,
Yogeeswaran Ganesan,
Gary Gibson,
Mercedes Gimeno-Segovia
, et al. (70 additional authors not shown)
Abstract:
Whilst holding great promise for low noise, ease of operation and networking, useful photonic quantum computing has been precluded by the need for beyond-state-of-the-art components, manufactured by the millions. Here we introduce a manufacturable platform for quantum computing with photons. We benchmark a set of monolithically-integrated silicon photonics-based modules to generate, manipulate, ne…
▽ More
Whilst holding great promise for low noise, ease of operation and networking, useful photonic quantum computing has been precluded by the need for beyond-state-of-the-art components, manufactured by the millions. Here we introduce a manufacturable platform for quantum computing with photons. We benchmark a set of monolithically-integrated silicon photonics-based modules to generate, manipulate, network, and detect photonic qubits, demonstrating dual-rail photonic qubits with $99.98\% \pm 0.01\%$ state preparation and measurement fidelity, Hong-Ou-Mandel quantum interference between independent photon sources with $99.50\%\pm0.25\%$ visibility, two-qubit fusion with $99.22\%\pm0.12\%$ fidelity, and a chip-to-chip qubit interconnect with $99.72\%\pm0.04\%$ fidelity, not accounting for loss. In addition, we preview a selection of next generation technologies, demonstrating low-loss silicon nitride waveguides and components, fabrication-tolerant photon sources, high-efficiency photon-number-resolving detectors, low-loss chip-to-fiber coupling, and barium titanate electro-optic phase shifters.
△ Less
Submitted 26 April, 2024;
originally announced April 2024.
-
OGLE-2018-BLG-0971, MOA-2023-BLG-065, and OGLE-2023-BLG-0136: Microlensing events with prominent orbital effects
Authors:
Cheongho Han,
Andrzej Udalski,
Ian A. Bond,
Chung-Uk Lee,
Andrew Gould,
Michael D. Albrow,
Sun-Ju Chung,
Kyu-Ha Hwang,
Youn Kil Jung,
Hyoun-Woo Kim,
Yoon-Hyun Ryu,
Yossi Shvartzvald,
In-Gu Shin,
Jennifer C. Yee,
Hongjing Yang,
Weicheng Zang,
Sang-Mok Cha,
Doeon Kim,
Dong-Jin Kim,
Seung-Lee Kim,
Dong-Joo Lee,
Yongseok Lee,
Byeong-Gon Park,
Richard W. Pogge,
Przemek Mróz
, et al. (38 additional authors not shown)
Abstract:
We undertake a project to reexamine microlensing data gathered from high-cadence surveys. The aim of the project is to reinvestigate lensing events with light curves exhibiting intricate anomaly features associated with caustics, yet lacking prior proposed models to explain these features. Through detailed reanalyses considering higher-order effects, we identify that accounting for orbital motions…
▽ More
We undertake a project to reexamine microlensing data gathered from high-cadence surveys. The aim of the project is to reinvestigate lensing events with light curves exhibiting intricate anomaly features associated with caustics, yet lacking prior proposed models to explain these features. Through detailed reanalyses considering higher-order effects, we identify that accounting for orbital motions of lenses is vital in accurately explaining the anomaly features observed in the light curves of the lensing events OGLE-2018-BLG-0971, MOA-2023-BLG-065, and OGLE-2023-BLG-0136. We estimate the masses and distances to the lenses by conducting Bayesian analyses using the lensing parameters of the newly found lensing solutions. From these analyses, we identify that the lenses of the events OGLE-2018-BLG-0971 and MOA-2023-BLG-065 are binaries composed of M dwarfs, while the lens of OGLE-2023-BLG-0136 is likely to be a binary composed of an early K-dwarf primary and a late M-dwarf companion. For all lensing events, the probability of the lens residing in the bulge is considerably higher than that of it being located in the disk.
△ Less
Submitted 8 April, 2024;
originally announced April 2024.
-
Upgrade of NaI(Tl) crystal encapsulation for the NEON experiment
Authors:
J. J. Choi,
E. J. Jeon,
J. Y. Kim,
K. W. Kim,
S. H. Kim,
S. K. Kim,
Y. D. Kim,
Y. J. Ko,
B. C. Koh,
C. Ha,
B. J. Park,
S. H. Lee,
I. S. Lee,
H. Lee,
H. S. Lee,
J. Lee,
Y. M. Oh
Abstract:
The Neutrino Elastic-scattering Observation with NaI(Tl) experiment (NEON) aims to detect coherent elastic neutrino-nucleus scattering~(\cenns) in a NaI(Tl) crystal using reactor anti-electron neutrinos at the Hanbit nuclear power plant complex. A total of 13.3 kg of NaI(Tl) crystals were initially installed in December 2020 at the tendon gallery, 23.7$\pm$0.3\,m away from the reactor core, which…
▽ More
The Neutrino Elastic-scattering Observation with NaI(Tl) experiment (NEON) aims to detect coherent elastic neutrino-nucleus scattering~(\cenns) in a NaI(Tl) crystal using reactor anti-electron neutrinos at the Hanbit nuclear power plant complex. A total of 13.3 kg of NaI(Tl) crystals were initially installed in December 2020 at the tendon gallery, 23.7$\pm$0.3\,m away from the reactor core, which operates at a thermal power of 2.8\,GW. Initial engineering operation was performed from May 2021 to March 2022 and observed unexpected photomultiplier-induced noise and a decreased light yield that were caused by leakage of liquid scintillator into the detector due to weakness of detector encapsulation. We upgraded the detector encapsulation design to prevent the leakage of the liquid scintillator. Meanwhile two small-sized detectors were replaced with larger ones resulting in a total mass of 16.7\,kg. With this new design implementation, the detector system has been operating stably since April 2022 for over a year without detector gain drop. In this paper, we present an improved crystal encapsulation design and stability of the NEON experiment.
△ Less
Submitted 28 June, 2024; v1 submitted 2 April, 2024;
originally announced April 2024.
-
HyperCLOVA X Technical Report
Authors:
Kang Min Yoo,
Jaegeun Han,
Sookyo In,
Heewon Jeon,
Jisu Jeong,
Jaewook Kang,
Hyunwook Kim,
Kyung-Min Kim,
Munhyong Kim,
Sungju Kim,
Donghyun Kwak,
Hanock Kwak,
Se Jung Kwon,
Bado Lee,
Dongsoo Lee,
Gichang Lee,
Jooho Lee,
Baeseong Park,
Seongjin Shin,
Joonsang Yu,
Seolki Baek,
Sumin Byeon,
Eungsup Cho,
Dooseok Choe,
Jeesung Han
, et al. (371 additional authors not shown)
Abstract:
We introduce HyperCLOVA X, a family of large language models (LLMs) tailored to the Korean language and culture, along with competitive capabilities in English, math, and coding. HyperCLOVA X was trained on a balanced mix of Korean, English, and code data, followed by instruction-tuning with high-quality human-annotated datasets while abiding by strict safety guidelines reflecting our commitment t…
▽ More
We introduce HyperCLOVA X, a family of large language models (LLMs) tailored to the Korean language and culture, along with competitive capabilities in English, math, and coding. HyperCLOVA X was trained on a balanced mix of Korean, English, and code data, followed by instruction-tuning with high-quality human-annotated datasets while abiding by strict safety guidelines reflecting our commitment to responsible AI. The model is evaluated across various benchmarks, including comprehensive reasoning, knowledge, commonsense, factuality, coding, math, chatting, instruction-following, and harmlessness, in both Korean and English. HyperCLOVA X exhibits strong reasoning capabilities in Korean backed by a deep understanding of the language and cultural nuances. Further analysis of the inherent bilingual nature and its extension to multilingualism highlights the model's cross-lingual proficiency and strong generalization ability to untargeted languages, including machine translation between several language pairs and cross-lingual inference tasks. We believe that HyperCLOVA X can provide helpful guidance for regions or countries in developing their sovereign LLMs.
△ Less
Submitted 13 April, 2024; v1 submitted 2 April, 2024;
originally announced April 2024.
-
A Deep Redshift Survey of the Perseus Cluster: Spatial Distribution and Kinematics of Galaxies
Authors:
Wooseok Kang,
Ho Seong Hwang,
Hyunmi Song,
Changbom Park,
Narae Hwang,
Byeong-Gon Park
Abstract:
We study the global kinematics of the Perseus galaxy cluster (Abell 426) at redshift z = 0.017 using a large sample of galaxies from our new MMT/Hectospec spectroscopic observation for this cluster. The sample includes 1447 galaxies with measured redshifts within 60' from the cluster center (1148 from this MMT/Hectospec program and 299 from the literature). The resulting spectroscopic completeness…
▽ More
We study the global kinematics of the Perseus galaxy cluster (Abell 426) at redshift z = 0.017 using a large sample of galaxies from our new MMT/Hectospec spectroscopic observation for this cluster. The sample includes 1447 galaxies with measured redshifts within 60' from the cluster center (1148 from this MMT/Hectospec program and 299 from the literature). The resulting spectroscopic completeness is 67% at r-band apparent magnitude $r_{\rm{Petro, 0}}\leq 18.0$ within 60' from the cluster center. To identify cluster member galaxies in this sample, we develop a new open-source Python package, CausticSNUpy. This code implements the algorithm of the caustic technique and yields 418 member galaxies within 60' of the cluster. We study the cluster using this sample of member galaxies. The cluster shows no significant signal of global rotation. A statistical test shows that the cluster does not have a noticeable substructure within 30'. We find two central regions where the X-ray emitting intracluster medium and galaxies show significant velocity differences ($>7σ$). On a large scale, however, the overall morphology and kinematics between the intracluster medium and galaxies agree well. Our results suggest that the Perseus cluster is a relaxed system and has not experienced a recent merger.
△ Less
Submitted 28 March, 2024;
originally announced March 2024.
-
Switch Diffusion Transformer: Synergizing Denoising Tasks with Sparse Mixture-of-Experts
Authors:
Byeongjun Park,
Hyojun Go,
Jin-Young Kim,
Sangmin Woo,
Seokil Ham,
Changick Kim
Abstract:
Diffusion models have achieved remarkable success across a range of generative tasks. Recent efforts to enhance diffusion model architectures have reimagined them as a form of multi-task learning, where each task corresponds to a denoising task at a specific noise level. While these efforts have focused on parameter isolation and task routing, they fall short of capturing detailed inter-task relat…
▽ More
Diffusion models have achieved remarkable success across a range of generative tasks. Recent efforts to enhance diffusion model architectures have reimagined them as a form of multi-task learning, where each task corresponds to a denoising task at a specific noise level. While these efforts have focused on parameter isolation and task routing, they fall short of capturing detailed inter-task relationships and risk losing semantic information, respectively. In response, we introduce Switch Diffusion Transformer (Switch-DiT), which establishes inter-task relationships between conflicting tasks without compromising semantic information. To achieve this, we employ a sparse mixture-of-experts within each transformer block to utilize semantic information and facilitate handling conflicts in tasks through parameter isolation. Additionally, we propose a diffusion prior loss, encouraging similar tasks to share their denoising paths while isolating conflicting ones. Through these, each transformer block contains a shared expert across all tasks, where the common and task-specific denoising paths enable the diffusion model to construct its beneficial way of synergizing denoising tasks. Extensive experiments validate the effectiveness of our approach in improving both image quality and convergence rate, and further analysis demonstrates that Switch-DiT constructs tailored denoising paths across various generation scenarios.
△ Less
Submitted 10 July, 2024; v1 submitted 14 March, 2024;
originally announced March 2024.
-
MoAI: Mixture of All Intelligence for Large Language and Vision Models
Authors:
Byung-Kwan Lee,
Beomchan Park,
Chae Won Kim,
Yong Man Ro
Abstract:
The rise of large language models (LLMs) and instruction tuning has led to the current trend of instruction-tuned large language and vision models (LLVMs). This trend involves either meticulously curating numerous instruction tuning datasets tailored to specific objectives or enlarging LLVMs to manage vast amounts of vision language (VL) data. However, current LLVMs have disregarded the detailed a…
▽ More
The rise of large language models (LLMs) and instruction tuning has led to the current trend of instruction-tuned large language and vision models (LLVMs). This trend involves either meticulously curating numerous instruction tuning datasets tailored to specific objectives or enlarging LLVMs to manage vast amounts of vision language (VL) data. However, current LLVMs have disregarded the detailed and comprehensive real-world scene understanding available from specialized computer vision (CV) models in visual perception tasks such as segmentation, detection, scene graph generation (SGG), and optical character recognition (OCR). Instead, the existing LLVMs rely mainly on the large capacity and emergent capabilities of their LLM backbones. Therefore, we present a new LLVM, Mixture of All Intelligence (MoAI), which leverages auxiliary visual information obtained from the outputs of external segmentation, detection, SGG, and OCR models. MoAI operates through two newly introduced modules: MoAI-Compressor and MoAI-Mixer. After verbalizing the outputs of the external CV models, the MoAI-Compressor aligns and condenses them to efficiently use relevant auxiliary visual information for VL tasks. MoAI-Mixer then blends three types of intelligence (1) visual features, (2) auxiliary features from the external CV models, and (3) language features by utilizing the concept of Mixture of Experts. Through this integration, MoAI significantly outperforms both open-source and closed-source LLVMs in numerous zero-shot VL tasks, particularly those related to real-world scene understanding such as object existence, positions, relations, and OCR without enlarging the model size or curating extra visual instruction tuning datasets.
△ Less
Submitted 17 July, 2024; v1 submitted 12 March, 2024;
originally announced March 2024.
-
Fine-Grained Pillar Feature Encoding Via Spatio-Temporal Virtual Grid for 3D Object Detection
Authors:
Konyul Park,
Yecheol Kim,
Junho Koh,
Byungwoo Park,
Jun Won Choi
Abstract:
Developing high-performance, real-time architectures for LiDAR-based 3D object detectors is essential for the successful commercialization of autonomous vehicles. Pillar-based methods stand out as a practical choice for onboard deployment due to their computational efficiency. However, despite their efficiency, these methods can sometimes underperform compared to alternative point encoding techniq…
▽ More
Developing high-performance, real-time architectures for LiDAR-based 3D object detectors is essential for the successful commercialization of autonomous vehicles. Pillar-based methods stand out as a practical choice for onboard deployment due to their computational efficiency. However, despite their efficiency, these methods can sometimes underperform compared to alternative point encoding techniques such as Voxel-encoding or PointNet++. We argue that current pillar-based methods have not sufficiently captured the fine-grained distributions of LiDAR points within each pillar structure. Consequently, there exists considerable room for improvement in pillar feature encoding. In this paper, we introduce a novel pillar encoding architecture referred to as Fine-Grained Pillar Feature Encoding (FG-PFE). FG-PFE utilizes Spatio-Temporal Virtual (STV) grids to capture the distribution of point clouds within each pillar across vertical, temporal, and horizontal dimensions. Through STV grids, points within each pillar are individually encoded using Vertical PFE (V-PFE), Temporal PFE (T-PFE), and Horizontal PFE (H-PFE). These encoded features are then aggregated through an Attentive Pillar Aggregation method. Our experiments conducted on the nuScenes dataset demonstrate that FG-PFE achieves significant performance improvements over baseline models such as PointPillar, CenterPoint-Pillar, and PillarNet, with only a minor increase in computational overhead.
△ Less
Submitted 11 March, 2024;
originally announced March 2024.
-
Probing the mixing between sterile and tau neutrinos in the SHiP experiment
Authors:
Ki-Young Choi,
Sung Hyun Kim,
Yeong Gyun Kim,
Kang Young Lee,
Kyong Sei Lee,
Byung Do Park,
Jong Yoon Sohn,
Seong Moon Yoo,
Chun Sil Yoon
Abstract:
We study the expected sensitivity to the mixing between sterile and tau neutrinos directly from the tau neutrino disappearance in the high-energy fixed target experiment. Here, the beam energy is large enough to produce tau neutrinos at the target with large luminosity. During their propagation to the detector, tau neutrinos may oscillate into sterile neutrinos. By examining the energy spectrum of…
▽ More
We study the expected sensitivity to the mixing between sterile and tau neutrinos directly from the tau neutrino disappearance in the high-energy fixed target experiment. Here, the beam energy is large enough to produce tau neutrinos at the target with large luminosity. During their propagation to the detector, tau neutrinos may oscillate into sterile neutrinos. By examining the energy spectrum of the observed tau neutrino events, we can probe the mixing between sterile and tau neutrinos directly. In this paper, we consider Scattering and Neutrino Detector (SND) at SHiP experiment as a showcase, which uses 400 GeV protons from SPS at CERN, and expect to observe 7,300 tau and anti-tau neutrinos from the $2\times 10^{20}$ POT for 5 years operation. Assuming the uncertainty of 10\%, we find the sensitivity $|U_{τ4}|^2 \sim 0.08$\, (90\% CL) for $Δm_{41}^2 \sim 500\ \mathrm{eV}^2$ with 10\% background to the signal. We also consider a far SND at the end of the SHiP Hidden Sector Decay Spectrometer (HSDS), in which case the sensitivity would be enhanced to $|U_{τ4}|^2 \sim 0.02$. Away from this mass, the sensitivity becomes lower than $|U_{τ4}|^2 \sim 0.15$ for $Δm_{41}^2 \lesssim 100\ \mathrm{eV}^2$ or $Δm_{41}^2\gtrsim 10^4 \mathrm{eV}^2$.
△ Less
Submitted 26 June, 2024; v1 submitted 6 March, 2024;
originally announced March 2024.
-
DropBP: Accelerating Fine-Tuning of Large Language Models by Dropping Backward Propagation
Authors:
Sunghyeon Woo,
Baeseong Park,
Byeongwook Kim,
Minjung Jo,
Sejung Kwon,
Dongsuk Jeon,
Dongsoo Lee
Abstract:
Training deep neural networks typically involves substantial computational costs during both forward and backward propagation. The conventional layer dropping techniques drop certain layers during training for reducing the computations burden. However, dropping layers during forward propagation adversely affects the training process by degrading accuracy. In this paper, we propose Dropping Backwar…
▽ More
Training deep neural networks typically involves substantial computational costs during both forward and backward propagation. The conventional layer dropping techniques drop certain layers during training for reducing the computations burden. However, dropping layers during forward propagation adversely affects the training process by degrading accuracy. In this paper, we propose Dropping Backward Propagation (DropBP), a novel approach designed to reduce computational costs while maintaining accuracy. DropBP randomly drops layers during the backward propagation, which does not deviate forward propagation. Moreover, DropBP calculates the sensitivity of each layer to assign appropriate drop rate, thereby stabilizing the training process. DropBP is designed to enhance the efficiency of the training process with backpropagation, thereby enabling the acceleration of both full fine-tuning and parameter-efficient fine-tuning using backpropagation. Specifically, utilizing DropBP in QLoRA reduces training time by 44%, increases the convergence speed to the identical loss level by 1.5$\times$, and enables training with a 6.2$\times$ larger sequence length on a single NVIDIA-A100 80GiB GPU in LLaMA2-70B. The code is available at https://github.com/WooSunghyeon/dropbp.
△ Less
Submitted 27 February, 2024;
originally announced February 2024.
-
Waveform Simulation for Scintillation Characteristics of NaI(Tl) Crystal
Authors:
J. J. Choi,
C. Ha,
E. J. Jeon,
K. W. Kim,
S. K. Kim,
Y. D. Kim,
Y. J. Ko,
B. C. Koh,
H. S. Lee,
S. H. Lee,
S. M. Lee,
B. J. Park,
G. H. Yu
Abstract:
The lowering of the energy threshold in the NaI detector is crucial not only for comprehensive validation of DAMA/LIBRA but also for exploring new possibilities in the search for low-mass dark matter and observing coherent elastic scattering between neutrino and nucleus. Alongside hardware enhancements, extensive efforts have focused on refining event selection to discern noise, achieved through p…
▽ More
The lowering of the energy threshold in the NaI detector is crucial not only for comprehensive validation of DAMA/LIBRA but also for exploring new possibilities in the search for low-mass dark matter and observing coherent elastic scattering between neutrino and nucleus. Alongside hardware enhancements, extensive efforts have focused on refining event selection to discern noise, achieved through parameter development and the application of machine learning. Acquiring pure, unbiased datasets is crucial in this endeavor, for which a waveform simulation was developed. The simulation data were compared with the experimental data using several pulse shape discrimination parameters to test its performance in describing the experimental data. Additionally, we present the outcomes of multi-variable machine learning trained with simulation data as a scintillation signal sample. The distributions of outcomes for experimental and simulation data show a good agreement. As an application of the waveform simulation, we validate the trigger efficiency alongside estimations derived from the minimally biased measurement data.
△ Less
Submitted 17 June, 2024; v1 submitted 26 February, 2024;
originally announced February 2024.
-
Boundedness criteria for bilinear Fourier multipliers via shifted square function estimates
Authors:
Georgios Dosidis,
Bae Jun Park,
Lenka Slavikova
Abstract:
We prove a sharp criterion for the boundedness of bilinear Fourier multiplier operators associated with symbols obtained by summing all dyadic dilations of a given bounded function $m_0$ compactly supported away from the origin. Our result admits the best possible behavior with respect to a modulation of the function $m_0$ and is intimately connected with optimal bounds for the family of shifted s…
▽ More
We prove a sharp criterion for the boundedness of bilinear Fourier multiplier operators associated with symbols obtained by summing all dyadic dilations of a given bounded function $m_0$ compactly supported away from the origin. Our result admits the best possible behavior with respect to a modulation of the function $m_0$ and is intimately connected with optimal bounds for the family of shifted square functions. As an application, we obtain estimates for bilinear singular integral operators with rough homogeneous kernels whose restriction to the unit sphere belongs to the Orlicz space $L(\log L)^α$. This improves an earlier result of the first and third authors, where such estimates were established for rough kernels belonging to the space $L^q$, $q>1$, on the unit sphere.
△ Less
Submitted 24 February, 2024;
originally announced February 2024.
-
CoLLaVO: Crayon Large Language and Vision mOdel
Authors:
Byung-Kwan Lee,
Beomchan Park,
Chae Won Kim,
Yong Man Ro
Abstract:
The remarkable success of Large Language Models (LLMs) and instruction tuning drives the evolution of Vision Language Models (VLMs) towards a versatile general-purpose model. Yet, it remains unexplored whether current VLMs genuinely possess quality object-level image understanding capabilities determined from 'what objects are in the image?' or 'which object corresponds to a specified bounding box…
▽ More
The remarkable success of Large Language Models (LLMs) and instruction tuning drives the evolution of Vision Language Models (VLMs) towards a versatile general-purpose model. Yet, it remains unexplored whether current VLMs genuinely possess quality object-level image understanding capabilities determined from 'what objects are in the image?' or 'which object corresponds to a specified bounding box?'. Our findings reveal that the image understanding capabilities of current VLMs are strongly correlated with their zero-shot performance on vision language (VL) tasks. This suggests that prioritizing basic image understanding is crucial for VLMs to excel at VL tasks. To enhance object-level image understanding, we propose Crayon Large Language and Vision mOdel (CoLLaVO), which incorporates instruction tuning with Crayon Prompt as a new visual prompt tuning scheme based on panoptic color maps. Furthermore, we present a learning strategy of Dual QLoRA to preserve object-level image understanding without forgetting it during visual instruction tuning, thereby achieving a significant leap in numerous VL benchmarks in a zero-shot setting.
△ Less
Submitted 2 June, 2024; v1 submitted 17 February, 2024;
originally announced February 2024.
-
Gradients of brain organization: Smooth sailing from methods development to user community
Authors:
Jessica Royer,
Casey Paquola,
Sofie L. Valk,
Matthias Kirschner,
Seok-Jun Hong,
Bo-yong Park,
Richard A. I. Bethlehem,
Robert Leech,
B. T. Thomas Yeo,
Elizabeth Jefferies,
Jonathan Smallwood,
Daniel Margulies,
Boris C. Bernhardt
Abstract:
Multimodal neuroimaging grants a powerful in vivo window into the structure and function of the human brain. Recent methodological and conceptual advances have enabled investigations of the interplay between large-scale spatial trends, or gradients, in brain structure and function, offering a framework to unify principles of brain organization across multiple scales. Strong community enthusiasm fo…
▽ More
Multimodal neuroimaging grants a powerful in vivo window into the structure and function of the human brain. Recent methodological and conceptual advances have enabled investigations of the interplay between large-scale spatial trends, or gradients, in brain structure and function, offering a framework to unify principles of brain organization across multiple scales. Strong community enthusiasm for these techniques has been instrumental in their widespread adoption and implementation to answer key questions in neuroscience. Following a brief review of current literature on this framework, this perspective paper will highlight how pragmatic steps aiming to make gradient methods more accessible to the community propelled these techniques to the forefront of neuroscientific inquiry. More specifically, we will emphasize how interest for gradient methods was catalyzed by data sharing, open-source software development, as well as the organization of dedicated workshops led by a diverse team of early career researchers. To this end, we argue that the growing excitement for brain gradients is the result of coordinated and consistent efforts to build an inclusive community and can serve as a case in point for future innovations and conceptual advances in neuroinformatics. We close this perspective paper by discussing challenges for the continuous refinement of neuroscientific theory, methodological innovation, and real-world translation to maintain our collective progress towards integrated models of brain organization.
△ Less
Submitted 16 February, 2024;
originally announced February 2024.
-
Monochromatic $k$-connection of graphs
Authors:
Qingqiong Cai,
Shinya Fujita,
Henry Liu,
Boram Park
Abstract:
An edge-coloured path is monochromatic if all of its edges have the same colour. For a $k$-connected graph $G$, the monochromatic $k$-connection number of $G$, denoted by $mc_k(G)$, is the maximum number of colours in an edge-colouring of $G$ such that, any two vertices are connected by $k$ internally vertex-disjoint monochromatic paths. In this paper, we shall study the parameter $mc_k(G)$. We ob…
▽ More
An edge-coloured path is monochromatic if all of its edges have the same colour. For a $k$-connected graph $G$, the monochromatic $k$-connection number of $G$, denoted by $mc_k(G)$, is the maximum number of colours in an edge-colouring of $G$ such that, any two vertices are connected by $k$ internally vertex-disjoint monochromatic paths. In this paper, we shall study the parameter $mc_k(G)$. We obtain bounds for $mc_k(G)$, for general graphs $G$. We also compute $mc_k(G)$ exactly when $k$ is small, and $G$ is a graph on $n$ vertices, with a spanning $k$-connected subgraph having the minimum possible number of edges, namely $\lceil\frac{kn}{2}\rceil$. We prove a similar result when $G$ is a bipartite graph.
△ Less
Submitted 14 February, 2024;
originally announced February 2024.
-
OGLE-2023-BLG-0836L: The sixth microlensing planet in a binary stellar system
Authors:
Cheongho Han,
Andrzej Udalski,
Youn Kil Jung,
Andrew Gould,
Doeon Kim,
Michael D. Albrow,
Sun-Ju Chung,
Kyu-Ha Hwang,
Chung-Uk Lee,
Yoon-Hyun Ryu,
Yossi Shvartzvald,
In-Gu Shin,
Jennifer C. Yee,
Hongjing Yang,
Weicheng Zang,
Sang-Mok Cha,
Dong-Jin Kim,
Seung-Lee Kim,
Dong-Joo Lee,
Yongseok Lee,
Byeong-Gon Park,
Richard W. Pogge,
Przemek Mróz,
Mateusz J. Mróz,
Michał K. Szymański
, et al. (10 additional authors not shown)
Abstract:
Light curves of microlensing events occasionally deviate from the smooth and symmetric form of a single-lens single-source event. While most of these anomalous events can be accounted for by employing a binary-lens single-source (2L1S) or a single-lens binary-source (1L2S) framework, it is established that a small fraction of events remain unexplained by either of these interpretations. We carry o…
▽ More
Light curves of microlensing events occasionally deviate from the smooth and symmetric form of a single-lens single-source event. While most of these anomalous events can be accounted for by employing a binary-lens single-source (2L1S) or a single-lens binary-source (1L2S) framework, it is established that a small fraction of events remain unexplained by either of these interpretations. We carry out a project in which data collected by high-cadence microlensing surveys were reinvestigated with the aim of uncovering the nature of anomalous lensing events with no proposed 2L1S or 1L2S models. From the project, we find that the anomaly appearing in the lensing event OGLE-2023-BLG-0836 cannot be explained by the usual interpretations and conduct a comprehensive analysis of the event. From thorough modeling of the light curve under sophisticated lens-system configurations, we have arrived at the conclusion that a triple-mass lens system is imperative to account for the anomaly features observed in the lensing light curve. From the Bayesian analysis using the measured observables of the event time scale and angular Einstein radius, we determine that the least massive component of the lens has a planetary mass of $4.36^{+2.35}_{-2.18}~M_{\rm J}$. This planet orbits within a stellar binary system composed of two stars with masses $0.71^{+0.38}_{-0.36}~M_\odot$ and $0.56^{+0.30}_{-0.28}~M_\odot$. This lensing event signifies the sixth occurrence of a planetary microlensing system in which a planet belongs to a stellar binary system.
△ Less
Submitted 17 February, 2024; v1 submitted 12 February, 2024;
originally announced February 2024.
-
Vector-valued estimates for shifted operators
Authors:
Bae Jun Park
Abstract:
Shifted variants of (dyadic) Hardy-Littlewood maximal function and Stein's square function have played a significant role in the study of many important operators such as Calderon commutators, (bilinear) Hilbert transforms, multilinear multipliers, and multilinear rough singular integrals. Estimates for such shifted operators have a certain logarithmic growth in terms of the shift factor, but the…
▽ More
Shifted variants of (dyadic) Hardy-Littlewood maximal function and Stein's square function have played a significant role in the study of many important operators such as Calderon commutators, (bilinear) Hilbert transforms, multilinear multipliers, and multilinear rough singular integrals. Estimates for such shifted operators have a certain logarithmic growth in terms of the shift factor, but the optimality of the logarithmic growth has not yet been fully resolved. In this article, we provide sharp vector-valued shifted maximal inequality for generalized Peetre's maximal function, from which improved estimates for the above shifted operators follow with optimal logarithmic growths in a new way. We also obtain a vector-valued maximal inequality for the shifted (dyadic) Hardy-Littlewood maximal operator.
△ Less
Submitted 31 January, 2024;
originally announced January 2024.
-
Strong odd coloring of sparse graphs
Authors:
Hyemin Kwon,
Boram Park
Abstract:
An odd coloring of a graph $G$ is a proper coloring of $G$ such that for every non-isolated vertex $v$, there is a color appearing an odd number of times in $N_G(v)$. Odd coloring of graphs was studied intensively in recent few years. In this paper, we introduce the notion of a strong odd coloring, as not only a strengthened version of odd coloring, but also a relaxation of square coloring. A stro…
▽ More
An odd coloring of a graph $G$ is a proper coloring of $G$ such that for every non-isolated vertex $v$, there is a color appearing an odd number of times in $N_G(v)$. Odd coloring of graphs was studied intensively in recent few years. In this paper, we introduce the notion of a strong odd coloring, as not only a strengthened version of odd coloring, but also a relaxation of square coloring. A strong odd coloring of a graph $G$ is a proper coloring of $G$ such that for every non-isolated vertex $v$, if a color appears in $N_G(v)$, then it appears an odd number of times in $N_G(v)$. We denote by $χ_{so}(G)$ the smallest integer $k$ such that $G$ admits a strong odd coloring with $k$ colors. We prove that if $G$ is a graph with $mad(G)\le\frac{20}{7}$, then $χ_{so}(G)\le Δ(G)+4$, and the bound is tight. We also prove that if $G$ is a graph with $mad(G)\le\frac{30}{11}$ and $Δ(G)\ge 4$, then $χ_{so}(G)\le Δ(G)+3$.
△ Less
Submitted 22 January, 2024; v1 submitted 21 January, 2024;
originally announced January 2024.
-
MOA-2022-BLG-563Lb, KMT-2023-BLG-0469Lb, and KMT-2023-BLG-0735Lb: Three sub-Jovian-mass microlensing planets
Authors:
Cheongho Han,
Youn Kil Jung,
Ian A. Bond,
Andrew Gould,
Michael D. Albrow,
Sun-Ju Chung,
Kyu-Ha Hwang,
Chung-Uk Lee,
Yoon-Hyun Ryu,
In-Gu Shin,
Yossi Shvartzvald,
Hongjing Yang,
Jennifer C. Yee,
Weicheng Zang,
Sang-Mok Cha,
Doeon Kim,
Dong-Jin Kim,
Seung-Lee Kim,
Dong-Joo Lee,
Yongseok Lee,
Byeong-Gon Park,
Richard W. Pogge,
Fumio Abe,
Richard Barry,
David P. Bennett
, et al. (23 additional authors not shown)
Abstract:
We analyze the anomalies appearing in the light curves of the three microlensing events MOA-2022-BLG-563, KMT-2023-BLG-0469, and KMT-2023-BLG-0735. The anomalies exhibit common short-term dip features that appear near the peak. From the detailed analyses of the light curves, we find that the anomalies were produced by planets accompanied by the lenses of the events. For all three events, the estim…
▽ More
We analyze the anomalies appearing in the light curves of the three microlensing events MOA-2022-BLG-563, KMT-2023-BLG-0469, and KMT-2023-BLG-0735. The anomalies exhibit common short-term dip features that appear near the peak. From the detailed analyses of the light curves, we find that the anomalies were produced by planets accompanied by the lenses of the events. For all three events, the estimated mass ratios between the planet and host are on the order of $10^{-4}$: $q\sim 8 \times 10^{-4}$ for MOA-2022-BLG-563L, $q\sim 2.5\times 10^{-4}$ for KMT-2023-BLG-0469L, and $q\sim 1.9\times 10^{-4}$ for KMT-2023-BLG-0735L. The interpretations of the anomalies are subject to a common inner-outer degeneracy, which causes ambiguity when estimating the projected planet-host separation. We estimated the planet mass, $M_{\rm p}$, host mass, $M_{\rm h}$, and distance, $D_{\rm L}$, to the planetary system by conducting Bayesian analyses using the observables of the events. The estimated physical parameters of the planetary systems are $(M_{\rm h}/M_\odot, M_{\rm p}/M_{\rm J}, D_{\rm L}/{\rm kpc}) = (0.48^{+0.36}_{-0.30}, 0.40^{+0.31}_{-0.25}, 6.53^{+1.12}_{-1.57})$ for MOA-2022-BLG-563L, $(0.47^{+0.35}_{-0.26}, 0.124^{+0.092}_{-0.067}, 7.07^{+1.03}_{-1.19})$ for KMT-2023-BLG-0469L, and $(0.62^{+0.34}_{-0.35}, 0.125^{+0.068}_{-0.070}, 6.26^{+1.27}_{-1.67})$ for KMT-2023-BLG-0735L. According to the estimated parameters, all planets are cold planets with projected separations that are greater than the snow lines of the planetary systems, they have masses that lie between the masses of Uranus and Jupiter of the Solar System, and the hosts of the planets are main-sequence stars that are less massive than the Sun.
△ Less
Submitted 20 January, 2024;
originally announced January 2024.
-
KMT-2023-BLG-0416, KMT-2023-BLG-1454, KMT-2023-BLG-1642: Microlensing planets identified from partially covered signals
Authors:
Cheongho Han,
Andrzej Udalski,
Chung-Uk Lee,
Weicheng Zang,
Michael D. Albrow,
Sun-Ju Chung,
Andrew Gould,
Kyu-Ha Hwang,
Youn Kil Jung,
Yoon-Hyun Ryu,
Yossi Shvartzvald,
In-Gu Shin,
Jennifer C. Yee,
Hongjing Yang,
Sang-Mok Cha,
Doeon Kim,
Dong-Jin Kim,
Seung-Lee Kim,
Dong-Joo Lee,
Yongseok Lee,
Byeong-Gon Park,
Richard W. Pogge,
Przemek Mróz,
Michał K. Szymański,
Jan Skowron
, et al. (10 additional authors not shown)
Abstract:
We investigate the 2023 season data from high-cadence microlensing surveys with the aim of detecting partially covered short-term signals and revealing their underlying astrophysical origins. Through this analysis, we ascertain that the signals observed in the lensing events KMT-2023-BLG-0416, KMT-2023-BLG-1454, and KMT-2023-BLG-1642 are of planetary origin. Considering the potential degeneracy ca…
▽ More
We investigate the 2023 season data from high-cadence microlensing surveys with the aim of detecting partially covered short-term signals and revealing their underlying astrophysical origins. Through this analysis, we ascertain that the signals observed in the lensing events KMT-2023-BLG-0416, KMT-2023-BLG-1454, and KMT-2023-BLG-1642 are of planetary origin. Considering the potential degeneracy caused by the partial coverage of signals, we thoroughly investigate the lensing-parameter plane. In the case of KMT-2023-BLG-0416, we have identified two solution sets, one with a planet-to-host mass ratio of $q\sim 10^{-2}$ and the other with $q\sim 6\times 10^{-5}$, within each of which there are two local solutions emerging due to the inner-outer degeneracy. For KMT-2023-BLG-1454, we discern four local solutions featuring mass ratios of $q\sim (1.7-4.3)\times 10^{-3}$. When it comes to KMT-2023-BLG-1642, we identified two locals with $q\sim (6-10)\times 10^{-3}$ resulting from the inner-outer degeneracy. We estimate the physical lens parameters by conducting Bayesian analyses based on the event time scale and Einstein radius. For KMT-2023-BLG-0416L, the host mass is $\sim 0.6~M_\odot$, and the planet mass is $\sim (6.1-6.7)~M_{\rm J}$ according to one set of solutions and $\sim 0.04~M_{\rm J}$ according to the other set of solutions. KMT-2023-BLG-1454Lb has a mass roughly half that of Jupiter, while KMT-2023-BLG-1646Lb has a mass in the range of between 1.1 to 1.3 times that of Jupiter, classifying them both as giant planets orbiting mid M-dwarf host stars with masses ranging from 0.13 to 0.17 solar masses.
△ Less
Submitted 16 January, 2024;
originally announced January 2024.
-
Nonproportionality of NaI(Tl) Scintillation Detector for Dark Matter Search Experiments
Authors:
S. M. Lee,
G. Adhikari,
N. Carlin,
J. Y. Cho,
J. J. Choi,
S. Choi,
A. C. Ezeribe,
L. E. Fran. a,
C. Ha,
I. S. Hahn,
S. J. Hollick,
E. J. Jeon,
H. W. Joo,
W. G. Kang,
M. Kauer,
B. H. Kim,
H. J. Kim,
J. Kim,
K. W. Kim,
S. H. Kim,
S. K. Kim,
S. W. Kim,
W. K. Kim,
Y. D. Kim,
Y. H. Kim
, et al. (37 additional authors not shown)
Abstract:
We present a comprehensive study of the nonproportionality of NaI(Tl) scintillation detectors within the context of dark matter search experiments. Our investigation, which integrates COSINE-100 data with supplementary $γ$ spectroscopy, measures light yields across diverse energy levels from full-energy $γ$ peaks produced by the decays of various isotopes. These $γ$ peaks of interest were produced…
▽ More
We present a comprehensive study of the nonproportionality of NaI(Tl) scintillation detectors within the context of dark matter search experiments. Our investigation, which integrates COSINE-100 data with supplementary $γ$ spectroscopy, measures light yields across diverse energy levels from full-energy $γ$ peaks produced by the decays of various isotopes. These $γ$ peaks of interest were produced by decays supported by both long and short-lived isotopes. Analyzing peaks from decays supported only by short-lived isotopes presented a unique challenge due to their limited statistics and overlapping energies, which was overcome by long-term data collection and a time-dependent analysis. A key achievement is the direct measurement of the 0.87 keV light yield, resulting from the cascade following electron capture decay of $^{22}$Na from internal contamination. This measurement, previously accessible only indirectly, deepens our understanding of NaI(Tl) scintillator behavior in the region of interest for dark matter searches. This study holds substantial implications for background modeling and the interpretation of dark matter signals in NaI(Tl) experiments.
△ Less
Submitted 10 May, 2024; v1 submitted 14 January, 2024;
originally announced January 2024.
-
Systematic KMTNet Planetary Anomaly Search. XI. Complete Sample of 2016 Sub-Prime Field Planets
Authors:
In-Gu Shin,
Jennifer C. Yee,
Weicheng Zang,
Cheongho Han,
Hongjing Yang,
Andrew Gould,
Chung-Uk Lee,
Andrzej Udalski,
Takahiro Sumi,
Michael D. Albrow,
Sun-Ju Chung,
Kyu-Ha Hwang,
Youn Kil Jung,
Yoon-Hyun Ryu,
Yossi Shvartzvald,
Sang-Mok Cha,
Dong-Jin Kim,
Hyoun-Woo Kim,
Seung-Lee Kim,
Dong-Joo Lee,
Yongseok Lee,
Byeong-Gon Park,
Richard W. Pogge,
Przemek Mróz,
Michał K. Szymański
, et al. (41 additional authors not shown)
Abstract:
Following Shin et al. (2023b), which is a part of the Systematic KMTNet Planetary Anomaly Search series (i.e., a search for planets in the 2016 KMTNet prime fields), we conduct a systematic search of the 2016 KMTNet sub-prime fields using a semi-machine-based algorithm to identify hidden anomalous events missed by the conventional by-eye search. We find four new planets and seven planet candidates…
▽ More
Following Shin et al. (2023b), which is a part of the Systematic KMTNet Planetary Anomaly Search series (i.e., a search for planets in the 2016 KMTNet prime fields), we conduct a systematic search of the 2016 KMTNet sub-prime fields using a semi-machine-based algorithm to identify hidden anomalous events missed by the conventional by-eye search. We find four new planets and seven planet candidates that were buried in the KMTNet archive. The new planets are OGLE-2016-BLG-1598Lb, OGLE-2016-BLG-1800Lb, MOA-2016-BLG-526Lb, and KMT-2016-BLG-2321Lb, which show typical properties of microlensing planets, i.e., giant planets orbit M dwarf host stars beyond their snow lines. For the planet candidates, we find planet/binary or 2L1S/1L2S degeneracies, which are an obstacle to firmly claiming planet detections. By combining the results of Shin et al. (2023b) and this work, we find a total of nine hidden planets, which is about half the number of planets discovered by eye in 2016. With this work, we have met the goal of the systematic search series for 2016, which is to build a complete microlensing planet sample. We also show that our systematic searches significantly contribute to completing the planet sample, especially for planet/host mass ratios smaller than $10^{-3}$, which were incomplete in previous by-eye searches of the KMTNet archive.
△ Less
Submitted 8 January, 2024;
originally announced January 2024.
-
HarmonyView: Harmonizing Consistency and Diversity in One-Image-to-3D
Authors:
Sangmin Woo,
Byeongjun Park,
Hyojun Go,
Jin-Young Kim,
Changick Kim
Abstract:
Recent progress in single-image 3D generation highlights the importance of multi-view coherency, leveraging 3D priors from large-scale diffusion models pretrained on Internet-scale images. However, the aspect of novel-view diversity remains underexplored within the research landscape due to the ambiguity in converting a 2D image into 3D content, where numerous potential shapes can emerge. Here, we…
▽ More
Recent progress in single-image 3D generation highlights the importance of multi-view coherency, leveraging 3D priors from large-scale diffusion models pretrained on Internet-scale images. However, the aspect of novel-view diversity remains underexplored within the research landscape due to the ambiguity in converting a 2D image into 3D content, where numerous potential shapes can emerge. Here, we aim to address this research gap by simultaneously addressing both consistency and diversity. Yet, striking a balance between these two aspects poses a considerable challenge due to their inherent trade-offs. This work introduces HarmonyView, a simple yet effective diffusion sampling technique adept at decomposing two intricate aspects in single-image 3D generation: consistency and diversity. This approach paves the way for a more nuanced exploration of the two critical dimensions within the sampling process. Moreover, we propose a new evaluation metric based on CLIP image and text encoders to comprehensively assess the diversity of the generated views, which closely aligns with human evaluators' judgments. In experiments, HarmonyView achieves a harmonious balance, demonstrating a win-win scenario in both consistency and diversity.
△ Less
Submitted 26 December, 2023;
originally announced December 2023.
-
OGLE-2017-BLG-0448Lb: A Low Mass-Ratio Wide-Orbit Microlensing Planet?
Authors:
Ruocheng Zhai,
Radosław Poleski,
Weicheng Zang,
Youn Kil Jung,
Andrzej Udalski,
Renkun Kuang,
Michael D. Albrow,
Sun-Ju Chung,
Andrew Gould,
Cheongho Han,
Kyu-Ha Hwang,
Yoon-Hyun Ryu,
In-Gu Shin,
Yossi Shvartzvald,
Hongjing Yang,
Jennifer C. Yee,
Sang-Mok Cha,
Dong-Jin Kim,
Hyoun-Woo Kim,
Seung-Lee Kim,
Chung-Uk Lee,
Dong-Joo Lee,
Yongseok Lee,
Byeong-Gon Park,
Richard W. Pogge
, et al. (16 additional authors not shown)
Abstract:
The gravitational microlensing technique is most sensitive to planets in a Jupiter-like orbit and has detected more than 200 planets. However, only a few wide-orbit ($s > 2$) microlensing planets have been discovered, where $s$ is the planet-to-host separation normalized to the angular Einstein ring radius, $θ_{\rm E}$. Here we present the discovery and analysis of a strong candidate wide-orbit mi…
▽ More
The gravitational microlensing technique is most sensitive to planets in a Jupiter-like orbit and has detected more than 200 planets. However, only a few wide-orbit ($s > 2$) microlensing planets have been discovered, where $s$ is the planet-to-host separation normalized to the angular Einstein ring radius, $θ_{\rm E}$. Here we present the discovery and analysis of a strong candidate wide-orbit microlensing planet in the event, OGLE-2017-BLG-0448. The whole light curve exhibits long-term residuals to the static binary-lens single-source model, so we investigate the residuals by adding the microlensing parallax, microlensing xallarap, an additional lens, or an additional source. For the first time, we observe a complex degeneracy between all four effects. The wide-orbit models with $s \sim 2.5$ and a planet-to-host mass-ratio of $q \sim 10^{-4}$ are significantly preferred, but we cannot rule out the close models with $s \sim 0.35$ and $q \sim 10^{-3}$. A Bayesian analysis based on a Galactic model indicates that, despite the complicated degeneracy, the surviving wide-orbit models all contain a super-Earth-mass to Neptune-mass planet at a projected planet-host separation of $\sim 6$ au and the surviving close-orbit models all consist of a Jovian-mass planet at $\sim 1$ au. The host star is probably an M or K dwarf. We discuss the implications of this dimension-degeneracy disaster on microlensing light-curve analysis and its potential impact on statistical studies.
△ Less
Submitted 13 December, 2023;
originally announced December 2023.
-
OGLE-2019-BLG-1180Lb: Discovery of a Wide-orbit Jupiter-mass Planet around a Late-type Star
Authors:
Sun-Ju Chung,
Andrzej Udalski,
Jennifer C. Yee,
Andrew Gould,
Michael D. Albrow,
Youn Kil Jung,
Kyu-Ha Hwang,
Cheongho Han,
Yoon-Hyun Ryu,
In-Gu Shin,
Yossi Shvartzvald,
Hongjing Yang,
Weicheng Zang,
Sang-Mok Cha,
Dong-Jin Kim,
Seung-Lee Kim,
Chung-Uk Lee,
Dong-Joo Lee,
Yongseok Lee,
Byeong-Gon Park,
Richard W. Pogge,
Radek Poleski,
Przemek Mróz,
Jan Skowron,
Michał K. Szymański
, et al. (8 additional authors not shown)
Abstract:
We report on the discovery and analysis of the planetary microlensing event OGLE-2019-BLG-1180 with a planet-to-star mass ratio $q \sim 0.003$. The event OGLE-2019-BLG-1180 has unambiguous cusp-passing and caustic-crossing anomalies, which were caused by a wide planetary caustic with $s \simeq 2$, where $s$ is the star-planet separation in units of the angular Einstein radius $θ_{E}$. Thanks to we…
▽ More
We report on the discovery and analysis of the planetary microlensing event OGLE-2019-BLG-1180 with a planet-to-star mass ratio $q \sim 0.003$. The event OGLE-2019-BLG-1180 has unambiguous cusp-passing and caustic-crossing anomalies, which were caused by a wide planetary caustic with $s \simeq 2$, where $s$ is the star-planet separation in units of the angular Einstein radius $θ_{E}$. Thanks to well-covered anomalies by the Korea Micorolensing Telescope Network (KMTNet), we measure both the angular Einstein radius and the microlens parallax in spite of a relatively short event timescale of $t_{E} = 28$ days. However, because of a weak constraint on the parallax, we conduct a Bayesian analysis to estimate the physical lens parameters. We find that the lens system is a super-Jupiter-mass planet of $M_{p} = 1.75^{+0.54}_{-0.51} M_{J}$ orbiting a late-type star of $M_{h}=0.55^{+0.27}_{-0.26} M_\odot$ at a distance of $D_{L} = 6.1^{+0.9}_{-1.3}$ kpc. The projected star-planet separation is $a_{\perp} = 5.19^{+0.90}_{-1.23}$ au, which means that the planet orbits at about four times the snow line of the host star. Considering the relative lens-source proper motion of $μ_{rel} = 6$ mas/yr, the lens will be separated from the source by 60 mas in 2029. At that time one can measure the lens flux from adaptive optics imaging of Kec or a next-generation 30 m class telescope. OGLE-2019-BLG-1180Lb represents a growing population of wide-orbit planets detected by KMTNet, so we also present a general investigation into prospects for further expanding the sample of such planets.
△ Less
Submitted 2 December, 2023;
originally announced December 2023.
-
Multi-Rate Variable-Length CSI Compression for FDD Massive MIMO
Authors:
Bumsu Park,
Heedong Do,
Namyoon Lee
Abstract:
For frequency-division-duplexing (FDD) systems, channel state information (CSI) should be fed back from the user terminal to the base station. This feedback overhead becomes problematic as the number of antennas grows. To alleviate this issue, we propose a flexible CSI compression method using variational autoencoder (VAE) with an entropy bottleneck structure, which can support multi-rate and vari…
▽ More
For frequency-division-duplexing (FDD) systems, channel state information (CSI) should be fed back from the user terminal to the base station. This feedback overhead becomes problematic as the number of antennas grows. To alleviate this issue, we propose a flexible CSI compression method using variational autoencoder (VAE) with an entropy bottleneck structure, which can support multi-rate and variable-length operation. Numerical study confirms that the proposed method outperforms the existing CSI compression techniques in terms of normalized mean squared error.
△ Less
Submitted 29 November, 2023;
originally announced November 2023.
-
A graph-theoretic remark on Stieltjes moment sequences
Authors:
Bryan Park
Abstract:
For any integer $k\geq 1,$ define $L_k: \mathbb{R}^\mathbb{N}\to \mathbb{R}^\mathbb{N}$ by $(a_n)_{n\in\mathbb{N}}\mapsto (a'_n)_{n\in\mathbb{N}}$ where $a'_n=\det(a_{n+i+j})_{i,j=0}^{k-1}$. Previously, Zhu showed that $L_k$ preserves the Stieltjes moment (SM) property of sequences (Proc. Am. Math. Soc., 2019). The proof used the characterization of SM sequences in terms of positive semidefinite H…
▽ More
For any integer $k\geq 1,$ define $L_k: \mathbb{R}^\mathbb{N}\to \mathbb{R}^\mathbb{N}$ by $(a_n)_{n\in\mathbb{N}}\mapsto (a'_n)_{n\in\mathbb{N}}$ where $a'_n=\det(a_{n+i+j})_{i,j=0}^{k-1}$. Previously, Zhu showed that $L_k$ preserves the Stieltjes moment (SM) property of sequences (Proc. Am. Math. Soc., 2019). The proof used the characterization of SM sequences in terms of positive semidefinite Hankel matrices. In this note, we give another proof by viewing SM sequences as weighted enumerations of closed walks on $\mathbb{N}$. Our proof is essentially a double-counting argument that views a $k$-tuple of non-crossing Dyck paths as a single closed walk on some bipartite subgraph of $\mathbb{N}^k.$
△ Less
Submitted 5 June, 2024; v1 submitted 24 November, 2023;
originally announced November 2023.
-
Triple-sinusoid hedgehog lattice in a centrosymmetric Kondo metal
Authors:
Soohyeon Shin,
Jin-Hong Park,
Romain Sibille,
Harim Jang,
Tae Beom Park,
Suyoung Kim,
Tian Shang,
Marisa Medarde,
Eric D. Bauer,
Oksana Zaharko,
Michel Kenzelmann,
Tuson Park
Abstract:
Superposed symmetry-equivalent magnetic ordering wave vectors can lead to topologically non-trivial spin textures, such as magnetic skyrmions and hedgehogs, and give rise to novel quantum phenomena due to fictitious magnetic fields associated with a non-zero Berry curvature of these spin textures. To date, all known spin textures are constructed through the superposition of multiple spiral orders,…
▽ More
Superposed symmetry-equivalent magnetic ordering wave vectors can lead to topologically non-trivial spin textures, such as magnetic skyrmions and hedgehogs, and give rise to novel quantum phenomena due to fictitious magnetic fields associated with a non-zero Berry curvature of these spin textures. To date, all known spin textures are constructed through the superposition of multiple spiral orders, where spins vary in directions with constant amplitude. Recent theoretical studies have suggested that multiple sinusoidal orders, where collinear spins vary in amplitude, can construct distinct topological spin textures regarding chirality properties. However, such textures have yet to be experimentally realised. In this work, we report the observation of a zero-field magnetic hedgehog lattice from a superposition of triple sinusoidal wave vectors in the magnetically frustrated Kondo lattice CePtAl4Ge2. Notably, we also observe the emergence of anomalous electrical and thermodynamic behaviours near the field-induced transition from the zero-field topological hedgehog lattice to a non-topological sinusoidal state. These observations highlight the role of Kondo coupling in stabilising the zero-field hedgehog state in the Kondo lattice and warrant an expedited search for other topological magnetic structures coupled with Kondo coupling.
△ Less
Submitted 22 November, 2023;
originally announced November 2023.