-
Dimensionality Engineering of Magnetic Anisotropy from Anomalous Hall Effect in Synthetic SrRuO3 Crystals
Authors:
Seung Gyo Jeong,
Seong Won Cho,
Sehwan Song,
Jin Young Oh,
Do Gyeom Jeong,
Gyeongtak Han,
Hu Young Jeong,
Ahmed Yousef Mohamed,
Woo-suk Noh,
Sungkyun Park,
Jong Seok Lee,
Suyoun Lee,
Young-Min Kim,
Deok-Yong Cho,
Woo Seok Choi
Abstract:
Magnetic anisotropy in atomically thin correlated heterostructures is essential for exploring quantum magnetic phases for next-generation spintronics. Whereas previous studies have mostly focused on van der Waals systems, here, we investigate the impact of dimensionality of epitaxially-grown correlated oxides down to the monolayer limit on structural, magnetic, and orbital anisotropies. By designi…
▽ More
Magnetic anisotropy in atomically thin correlated heterostructures is essential for exploring quantum magnetic phases for next-generation spintronics. Whereas previous studies have mostly focused on van der Waals systems, here, we investigate the impact of dimensionality of epitaxially-grown correlated oxides down to the monolayer limit on structural, magnetic, and orbital anisotropies. By designing oxide superlattices with a correlated ferromagnetic SrRuO3 and nonmagnetic SrTiO3 layers, we observed modulated ferromagnetic behavior with the change of the SrRuO3 thickness. Especially, for three-unit-cell-thick layers, we observe a significant 1,500% improvement of coercive field in the anomalous Hall effect, which cannot be solely attributed to the dimensional crossover in ferromagnetism. The atomic-scale heterostructures further reveal the systematic modulation of anisotropy for the lattice structure and orbital hybridization, explaining the enhanced magnetic anisotropy. Our findings provide valuable insights into engineering the anisotropic hybridization of synthetic magnetic crystals, offering a tunable spin order for various applications.
△ Less
Submitted 3 July, 2024;
originally announced July 2024.
-
Learnability of Parameter-Bounded Bayes Nets
Authors:
Arnab Bhattacharyya,
Davin Choo,
Sutanu Gayen,
Dimitrios Myrisiotis
Abstract:
Bayes nets are extensively used in practice to efficiently represent joint probability distributions over a set of random variables and capture dependency relations. In a seminal paper, Chickering et al. (JMLR 2004) showed that given a distribution $P$, that is defined as the marginal distribution of a Bayes net, it is $\mathsf{NP}$-hard to decide whether there is a parameter-bounded Bayes net tha…
▽ More
Bayes nets are extensively used in practice to efficiently represent joint probability distributions over a set of random variables and capture dependency relations. In a seminal paper, Chickering et al. (JMLR 2004) showed that given a distribution $P$, that is defined as the marginal distribution of a Bayes net, it is $\mathsf{NP}$-hard to decide whether there is a parameter-bounded Bayes net that represents $P$. They called this problem LEARN. In this work, we extend the $\mathsf{NP}$-hardness result of LEARN and prove the $\mathsf{NP}$-hardness of a promise search variant of LEARN, whereby the Bayes net in question is guaranteed to exist and one is asked to find such a Bayes net. We complement our hardness result with a positive result about the sample complexity that is sufficient to recover a parameter-bounded Bayes net that is close (in TV distance) to a given distribution $P$, that is represented by some parameter-bounded Bayes net, generalizing a degree-bounded sample complexity result of Brustle et al. (EC 2020).
△ Less
Submitted 30 June, 2024;
originally announced July 2024.
-
Origin of Distinct Insulating Domains in the Layered Charge Density Wave Material 1T-TaS2
Authors:
Hyungryul Yang,
Byeongin Lee,
Junho Bang,
Sunghun Kim,
Dirk Wulferding,
Sung-Hoon Lee,
Doohee Cho
Abstract:
Vertical charge order shapes the electronic properties in layered charge density wave (CDW) materials. Various stacking orders inevitably create nanoscale domains with distinct electronic structures inaccessible to bulk probes. Here, the stacking characteristics of bulk 1$T$-TaS$2$ are analyzed using scanning tunneling spectroscopy (STS) and density functional theory (DFT) calculations. It is obse…
▽ More
Vertical charge order shapes the electronic properties in layered charge density wave (CDW) materials. Various stacking orders inevitably create nanoscale domains with distinct electronic structures inaccessible to bulk probes. Here, the stacking characteristics of bulk 1$T$-TaS$2$ are analyzed using scanning tunneling spectroscopy (STS) and density functional theory (DFT) calculations. It is observed that Mott-insulating domains undergo a transition to band-insulating domains restoring vertical dimerization of the CDWs. Furthermore, STS measurements covering a wide terrace reveal two distinct band insulating domains differentiated by band edge broadening. These DFT calculations reveal that the Mott insulating layers preferably reside on the subsurface, forming broader band edges in the neighboring band insulating layers. Ultimately, buried Mott insulating layers believed to harbor the quantum spin liquid phase are identified. These results resolve persistent issues regarding vertical charge order in 1$T$-TaS$2$, providing a new perspective for investigating emergent quantum phenomena in layered CDW materials.
△ Less
Submitted 12 June, 2024;
originally announced June 2024.
-
Charge ordered phases in the hole-doped triangular Mott insulator 4Hb-TaS2
Authors:
Junho Bang,
Byeongin Lee,
Hyungryul Yang,
Sunghun Kim,
Dirk Wulferding,
Doohee Cho
Abstract:
4Hb-TaS2 has been proposed to possess unconventional superconductivity with broken time reveral symmetry due to distinctive layered structure, featuring a heterojunction between a 2D triangular Mott insulator and a charge density wave metal. However, since a frustrated spin state in the correlated insulating layer is susceptible to charge ordering with carrier doping, it is required to investigate…
▽ More
4Hb-TaS2 has been proposed to possess unconventional superconductivity with broken time reveral symmetry due to distinctive layered structure, featuring a heterojunction between a 2D triangular Mott insulator and a charge density wave metal. However, since a frustrated spin state in the correlated insulating layer is susceptible to charge ordering with carrier doping, it is required to investigate the charge distribution driven by inter-layer charge transfer to understand its superconductivity. Here, we use scanning tunneling microscopy and spectroscopy (STM/S) to investigate the charge ordered phases of 1T-TaS2 layers within 4Hb-TaS2, explicitly focusing on the non-half-filled regime. Our STS results show an energy gap which exhibits an out-of-phase relation with the charge density. We ascribe the competition between on-site and nonlocal Coulomb repulsion as the driving force for the charge-ordered insulating phase of a doped triangular Mott insulator. In addition, we discuss the role of the insulating layer in the enhanced superconductivity of 4Hb-TaS2.
△ Less
Submitted 17 June, 2024; v1 submitted 12 June, 2024;
originally announced June 2024.
-
EmoSphere-TTS: Emotional Style and Intensity Modeling via Spherical Emotion Vector for Controllable Emotional Text-to-Speech
Authors:
Deok-Hyeon Cho,
Hyung-Seok Oh,
Seung-Bin Kim,
Sang-Hoon Lee,
Seong-Whan Lee
Abstract:
Despite rapid advances in the field of emotional text-to-speech (TTS), recent studies primarily focus on mimicking the average style of a particular emotion. As a result, the ability to manipulate speech emotion remains constrained to several predefined labels, compromising the ability to reflect the nuanced variations of emotion. In this paper, we propose EmoSphere-TTS, which synthesizes expressi…
▽ More
Despite rapid advances in the field of emotional text-to-speech (TTS), recent studies primarily focus on mimicking the average style of a particular emotion. As a result, the ability to manipulate speech emotion remains constrained to several predefined labels, compromising the ability to reflect the nuanced variations of emotion. In this paper, we propose EmoSphere-TTS, which synthesizes expressive emotional speech by using a spherical emotion vector to control the emotional style and intensity of the synthetic speech. Without any human annotation, we use the arousal, valence, and dominance pseudo-labels to model the complex nature of emotion via a Cartesian-spherical transformation. Furthermore, we propose a dual conditional adversarial network to improve the quality of generated speech by reflecting the multi-aspect characteristics. The experimental results demonstrate the model ability to control emotional style and intensity with high-quality expressive speech.
△ Less
Submitted 11 June, 2024;
originally announced June 2024.
-
Online bipartite matching with imperfect advice
Authors:
Davin Choo,
Themis Gouleakis,
Chun Kai Ling,
Arnab Bhattacharyya
Abstract:
We study the problem of online unweighted bipartite matching with $n$ offline vertices and $n$ online vertices where one wishes to be competitive against the optimal offline algorithm. While the classic RANKING algorithm of Karp et al. [1990] provably attains competitive ratio of $1-1/e > 1/2$, we show that no learning-augmented method can be both 1-consistent and strictly better than $1/2$-robust…
▽ More
We study the problem of online unweighted bipartite matching with $n$ offline vertices and $n$ online vertices where one wishes to be competitive against the optimal offline algorithm. While the classic RANKING algorithm of Karp et al. [1990] provably attains competitive ratio of $1-1/e > 1/2$, we show that no learning-augmented method can be both 1-consistent and strictly better than $1/2$-robust under the adversarial arrival model. Meanwhile, under the random arrival model, we show how one can utilize methods from distribution testing to design an algorithm that takes in external advice about the online vertices and provably achieves competitive ratio interpolating between any ratio attainable by advice-free methods and the optimal ratio of 1, depending on the advice quality.
△ Less
Submitted 23 May, 2024; v1 submitted 15 May, 2024;
originally announced May 2024.
-
HyperCLOVA X Technical Report
Authors:
Kang Min Yoo,
Jaegeun Han,
Sookyo In,
Heewon Jeon,
Jisu Jeong,
Jaewook Kang,
Hyunwook Kim,
Kyung-Min Kim,
Munhyong Kim,
Sungju Kim,
Donghyun Kwak,
Hanock Kwak,
Se Jung Kwon,
Bado Lee,
Dongsoo Lee,
Gichang Lee,
Jooho Lee,
Baeseong Park,
Seongjin Shin,
Joonsang Yu,
Seolki Baek,
Sumin Byeon,
Eungsup Cho,
Dooseok Choe,
Jeesung Han
, et al. (371 additional authors not shown)
Abstract:
We introduce HyperCLOVA X, a family of large language models (LLMs) tailored to the Korean language and culture, along with competitive capabilities in English, math, and coding. HyperCLOVA X was trained on a balanced mix of Korean, English, and code data, followed by instruction-tuning with high-quality human-annotated datasets while abiding by strict safety guidelines reflecting our commitment t…
▽ More
We introduce HyperCLOVA X, a family of large language models (LLMs) tailored to the Korean language and culture, along with competitive capabilities in English, math, and coding. HyperCLOVA X was trained on a balanced mix of Korean, English, and code data, followed by instruction-tuning with high-quality human-annotated datasets while abiding by strict safety guidelines reflecting our commitment to responsible AI. The model is evaluated across various benchmarks, including comprehensive reasoning, knowledge, commonsense, factuality, coding, math, chatting, instruction-following, and harmlessness, in both Korean and English. HyperCLOVA X exhibits strong reasoning capabilities in Korean backed by a deep understanding of the language and cultural nuances. Further analysis of the inherent bilingual nature and its extension to multilingualism highlights the model's cross-lingual proficiency and strong generalization ability to untargeted languages, including machine translation between several language pairs and cross-lingual inference tasks. We believe that HyperCLOVA X can provide helpful guidance for regions or countries in developing their sovereign LLMs.
△ Less
Submitted 13 April, 2024; v1 submitted 2 April, 2024;
originally announced April 2024.
-
Analytic asymptotic formulas for effective parameters of planar elastic composites
Authors:
Daehee Cho,
Doosung Choi,
Mikyoung Lim
Abstract:
We investigate the effective elastic properties of periodic dilute two-phase composites consisting of an homogeneous isotropic matrix and a periodic array of rigid inclusions. We assume the rigid inclusion in a unit cell is a simply connected, bounded domain so that there exists an exterior conformal mapping corresponding the inclusion. Recently, an analytical series solution method for the elasti…
▽ More
We investigate the effective elastic properties of periodic dilute two-phase composites consisting of an homogeneous isotropic matrix and a periodic array of rigid inclusions. We assume the rigid inclusion in a unit cell is a simply connected, bounded domain so that there exists an exterior conformal mapping corresponding the inclusion. Recently, an analytical series solution method for the elastic problem with a rigid inclusion was developed based on the layer potential technique and the geometric function theory \cite{Mattei:2021:EAS}. In this paper, by using the series solution method, we derive expression formulas for the elastic moment tensors--the coefficients of the multipole expansion associated with an elastic inclusion--of an inclusion of arbitrary shape. These formulas for the elastic moment tensors lead us to analytic asymptotic formulas for the effective parameters of the periodic elastic composites with rigid inclusions in terms of the associated exterior conformal mapping.
△ Less
Submitted 23 March, 2024;
originally announced March 2024.
-
Geometric series solution for the plane elastostatic problem in the presence of a cavity
Authors:
Daehee Cho,
Doosung Choi,
Mikyoung Lim
Abstract:
This paper presents an analytic series solution method for the elastic inclusion problem in a two-dimensional unbounded isotropic medium with a cavity. Generalizing the work of Mattei and Lim \cite{Mattei:2021:EAS}, this study develops an analytic series solution method for the elastic inclusion problem to encompass a cavity problem. The central mathematical challenge tackled in this research is t…
▽ More
This paper presents an analytic series solution method for the elastic inclusion problem in a two-dimensional unbounded isotropic medium with a cavity. Generalizing the work of Mattei and Lim \cite{Mattei:2021:EAS}, this study develops an analytic series solution method for the elastic inclusion problem to encompass a cavity problem. The central mathematical challenge tackled in this research is to deal with the conormal derivative condition. By using the complex-variable formulation for the conormal derivative, we effectively deal with the boundary condition and derive an explicit series solution for the plane elastostatic problem with a cavity of arbitrary shape subject to arbitrary far-field loading. The solution is expressed as a series expansion in terms of the given far-field loading and the exterior conformal mapping associated with the cavity.
△ Less
Submitted 23 March, 2024;
originally announced March 2024.
-
Analytic shape recovery of an elastic inclusion from elastic moment tensors
Authors:
Daehee Cho,
Mikyoung Lim
Abstract:
In this paper, we present an analytic non-iterative approach for recovering a planar isotropic elastic inclusion embedded in an unbounded medium from the elastic moment tensors (EMTs), which are coefficients for the multipole expansion of field perturbation caused by the inclusion. EMTs contain information about the inclusion's material and geometric properties and, as is well known, the inclusion…
▽ More
In this paper, we present an analytic non-iterative approach for recovering a planar isotropic elastic inclusion embedded in an unbounded medium from the elastic moment tensors (EMTs), which are coefficients for the multipole expansion of field perturbation caused by the inclusion. EMTs contain information about the inclusion's material and geometric properties and, as is well known, the inclusion can be approximated by a disk from leading-order EMTs. We define the complex contracted EMTs as the linear combinations of EMTs where the expansion coefficients are given from complex-valued background polynomial solutions. By using the layer potential technique for the Lamé system and the theory of conformal mapping, we derive explicit asymptotic formulas in terms of the complex contracted EMTs for the shape of the inclusion, treating the inclusion as a perturbed disk. These formulas lead us to an analytic non-iterative algorithm for elastic inclusion reconstruction using EMTs. We perform numerical experiments to demonstrate the validity and limitations of our proposed method.
△ Less
Submitted 3 March, 2024;
originally announced March 2024.
-
Envy-Free House Allocation with Minimum Subsidy
Authors:
Davin Choo,
Yan Hao Ling,
Warut Suksompong,
Nicholas Teh,
Jian Zhang
Abstract:
House allocation refers to the problem where $m$ houses are to be allocated to $n$ agents so that each agent receives one house. Since an envy-free house allocation does not always exist, we consider finding such an allocation in the presence of subsidy. We show that computing an envy-free allocation with minimum subsidy is NP-hard in general, but can be done efficiently if $m$ differs from $n$ by…
▽ More
House allocation refers to the problem where $m$ houses are to be allocated to $n$ agents so that each agent receives one house. Since an envy-free house allocation does not always exist, we consider finding such an allocation in the presence of subsidy. We show that computing an envy-free allocation with minimum subsidy is NP-hard in general, but can be done efficiently if $m$ differs from $n$ by an additive constant or if the agents have identical utilities.
△ Less
Submitted 2 March, 2024;
originally announced March 2024.
-
Causal Discovery under Off-Target Interventions
Authors:
Davin Choo,
Kirankumar Shiragur,
Caroline Uhler
Abstract:
Causal graph discovery is a significant problem with applications across various disciplines. However, with observational data alone, the underlying causal graph can only be recovered up to its Markov equivalence class, and further assumptions or interventions are necessary to narrow down the true graph. This work addresses the causal discovery problem under the setting of stochastic interventions…
▽ More
Causal graph discovery is a significant problem with applications across various disciplines. However, with observational data alone, the underlying causal graph can only be recovered up to its Markov equivalence class, and further assumptions or interventions are necessary to narrow down the true graph. This work addresses the causal discovery problem under the setting of stochastic interventions with the natural goal of minimizing the number of interventions performed. We propose the following stochastic intervention model which subsumes existing adaptive noiseless interventions in the literature while capturing scenarios such as fat-hand interventions and CRISPR gene knockouts: any intervention attempt results in an actual intervention on a random subset of vertices, drawn from a distribution dependent on attempted action. Under this model, we study the two fundamental problems in causal discovery of verification and search and provide approximation algorithms with polylogarithmic competitive ratios and provide some preliminary experimental results.
△ Less
Submitted 13 February, 2024;
originally announced February 2024.
-
DurFlex-EVC: Duration-Flexible Emotional Voice Conversion with Parallel Generation
Authors:
Hyung-Seok Oh,
Sang-Hoon Lee,
Deok-Hyeon Cho,
Seong-Whan Lee
Abstract:
Emotional voice conversion (EVC) seeks to modify the emotional tone of a speaker's voice while preserving the original linguistic content and the speaker's unique vocal characteristics. Recent advancements in EVC have involved the simultaneous modeling of pitch and duration, utilizing the potential of sequence-to-sequence (seq2seq) models. To enhance reliability and efficiency in conversion, this…
▽ More
Emotional voice conversion (EVC) seeks to modify the emotional tone of a speaker's voice while preserving the original linguistic content and the speaker's unique vocal characteristics. Recent advancements in EVC have involved the simultaneous modeling of pitch and duration, utilizing the potential of sequence-to-sequence (seq2seq) models. To enhance reliability and efficiency in conversion, this study shifts focus towards parallel speech generation. We introduce Duration-Flexible EVC (DurFlex-EVC), which integrates a style autoencoder and unit aligner. Traditional models, while incorporating self-supervised learning (SSL) representations that contain both linguistic and paralinguistic information, have neglected this dual nature, leading to reduced controllability. Addressing this issue, we implement cross-attention to synchronize these representations with various emotions. Additionally, a style autoencoder is developed for the disentanglement and manipulation of style elements. The efficacy of our approach is validated through both subjective and objective evaluations, establishing its superiority over existing models in the field.
△ Less
Submitted 7 March, 2024; v1 submitted 15 January, 2024;
originally announced January 2024.
-
An unconventional platform for two-dimensional Kagome flat bands on semiconductor surfaces
Authors:
Jae Hyuck Lee,
GwanWoo Kim,
Inkyung Song,
Yejin Kim,
Yeonjae Lee,
Sung Jong Yoo,
Deok-Yong Cho,
Jun-Won Rhim,
Jongkeun Jung,
Gunn Kim,
Changyoung Kim
Abstract:
In condensed matter physics, the Kagome lattice and its inherent flat bands have attracted considerable attention for their potential to host a variety of exotic physical phenomena. Despite extensive efforts to fabricate thin films of Kagome materials aimed at modulating the flat bands through electrostatic gating or strain manipulation, progress has been limited. Here, we report the observation o…
▽ More
In condensed matter physics, the Kagome lattice and its inherent flat bands have attracted considerable attention for their potential to host a variety of exotic physical phenomena. Despite extensive efforts to fabricate thin films of Kagome materials aimed at modulating the flat bands through electrostatic gating or strain manipulation, progress has been limited. Here, we report the observation of a novel $d$-orbital hybridized Kagome-derived flat band in Ag/Si(111) $\sqrt{3}\times\sqrt{3}$ as revealed by angle-resolved photoemission spectroscopy. Our findings indicate that silver atoms on a silicon substrate form a Kagome-like structure, where a delicate balance in the hopping parameters of the in-plane $d$-orbitals leads to destructive interference, resulting in a flat band. These results not only introduce a new platform for Kagome physics but also illuminate the potential for integrating metal-semiconductor interfaces into Kagome-related research, thereby opening a new avenue for exploring ideal two-dimensional Kagome systems.
△ Less
Submitted 30 December, 2023;
originally announced January 2024.
-
Melting of unidirectional charge density waves across twin domain boundaries in GdTe$_{3}$
Authors:
Sanghun Lee,
Eunseo Kim,
Junho Bang,
Jongho Park,
Changyoung Kim,
Dirk Wulferding,
Doohee Cho
Abstract:
Solids undergoing a transition from order to disorder experience the proliferation of topological defects. The melting process generates transient quantum states. However, their dynamical nature with femtosecond lifetime hinders exploration with atomic precision. Here, we suggest an alternative approach to the dynamical melting process by focusing on the interface created by competing degenerate q…
▽ More
Solids undergoing a transition from order to disorder experience the proliferation of topological defects. The melting process generates transient quantum states. However, their dynamical nature with femtosecond lifetime hinders exploration with atomic precision. Here, we suggest an alternative approach to the dynamical melting process by focusing on the interface created by competing degenerate quantum states. We use a scanning tunneling microscope (STM) to visualize the unidirectional charge density wave (CDW) and its spatial progression ("static melting") across a twin domain boundary (TDB) in the layered material GdTe$_{3}$. Combining STM with a spatial lock-in technique, we reveal that the order parameter amplitude attenuates with the formation of dislocations and thus two different unidirectional CDWs coexist near the TDB, reducing the CDW anisotropy. Notably, we discover a correlation between this anisotropy and the CDW gap. Our study provides valuable insight into the behavior of topological defects and transient quantum states.
△ Less
Submitted 14 December, 2023;
originally announced December 2023.
-
Deterministic Guidance Diffusion Model for Probabilistic Weather Forecasting
Authors:
Donggeun Yoon,
Minseok Seo,
Doyi Kim,
Yeji Choi,
Donghyeon Cho
Abstract:
Weather forecasting requires not only accuracy but also the ability to perform probabilistic prediction. However, deterministic weather forecasting methods do not support probabilistic predictions, and conversely, probabilistic models tend to be less accurate. To address these challenges, in this paper, we introduce the \textbf{\textit{D}}eterministic \textbf{\textit{G}}uidance \textbf{\textit{D}}…
▽ More
Weather forecasting requires not only accuracy but also the ability to perform probabilistic prediction. However, deterministic weather forecasting methods do not support probabilistic predictions, and conversely, probabilistic models tend to be less accurate. To address these challenges, in this paper, we introduce the \textbf{\textit{D}}eterministic \textbf{\textit{G}}uidance \textbf{\textit{D}}iffusion \textbf{\textit{M}}odel (DGDM) for probabilistic weather forecasting, integrating benefits of both deterministic and probabilistic approaches. During the forward process, both the deterministic and probabilistic models are trained end-to-end. In the reverse process, weather forecasting leverages the predicted result from the deterministic model, using as an intermediate starting point for the probabilistic model. By fusing deterministic models with probabilistic models in this manner, DGDM is capable of providing accurate forecasts while also offering probabilistic predictions. To evaluate DGDM, we assess it on the global weather forecasting dataset (WeatherBench) and the common video frame prediction benchmark (Moving MNIST). We also introduce and evaluate the Pacific Northwest Windstorm (PNW)-Typhoon weather satellite dataset to verify the effectiveness of DGDM in high-resolution regional forecasting. As a result of our experiments, DGDM achieves state-of-the-art results not only in global forecasting but also in regional forecasting. The code is available at: \url{https://github.com/DongGeun-Yoon/DGDM}.
△ Less
Submitted 5 December, 2023;
originally announced December 2023.
-
Photo-induced charge carrier dynamics in a semiconductor-based ion trap investigated via motion-sensitive qubit transitions
Authors:
Woojun Lee,
Daun Chung,
Honggi Jeon,
Beomgeun Cho,
KwangYeul Choi,
SeungWoo Yoo,
Changhyun Jung,
Junho Jeong,
Changsoon Kim,
Dong-Il "Dan'' Cho,
Taehyun Kim
Abstract:
Ion trap systems built upon microfabricated chips have emerged as a promising platform for quantum computing to achieve reproducible and scalable structures. However, photo-induced charging of materials in such chips can generate undesired stray electric fields that disrupt the quantum state of the ion, limiting high-fidelity quantum control essential for practical quantum computing. While crude u…
▽ More
Ion trap systems built upon microfabricated chips have emerged as a promising platform for quantum computing to achieve reproducible and scalable structures. However, photo-induced charging of materials in such chips can generate undesired stray electric fields that disrupt the quantum state of the ion, limiting high-fidelity quantum control essential for practical quantum computing. While crude understanding of the phenomena has been gained heuristically over the past years, explanations for the microscopic mechanism of photo-generated charge carrier dynamics remains largely elusive. Here, we present a photo-induced charging model for semiconductors, whose verification is enabled by a systematic interaction between trapped ions and photo-induced stray fields from exposed silicon surfaces in our chip. We use motion-sensitive qubit transitions to directly characterize the stray field and analyze its effect on the quantum dynamics of the trapped ion. In contrast to incoherent errors arising from the thermal motion of the ion, coherent errors are induced by the stray field, whose effect is significantly imprinted during the quantum control of the ion. These errors are investigated in depth and methods to mitigate them are discussed. Finally, we extend the implications of our study to other photo-induced charging mechanisms prevalent in ion traps.
△ Less
Submitted 29 November, 2023;
originally announced December 2023.
-
Suppression of Antiferromagnetic Order by Strain in Honeycomb Cobaltate: Implication for Quantum Spin Liquid
Authors:
Gye-Hyeon Kim,
Miju Park,
Uksam Choi,
Baekjune Kang,
Uihyeon Seo,
GwangCheol Ji,
Seunghyeon Noh,
Deok-Yong Cho,
Jung-Woo Yoo,
Jong Mok Ok,
Changhee Sohn
Abstract:
Recently, layered honeycomb cobaltates have been predicted as a new promising system for realizing the Kitaev quantum spin liquid, a many-body quantum entangled ground state characterized by fractional excitations. However, these cobaltates, similar to other candidate materials, exhibit classical antiferromagnetic ordering at low temperatures, which impedes the formation of the expected quantum st…
▽ More
Recently, layered honeycomb cobaltates have been predicted as a new promising system for realizing the Kitaev quantum spin liquid, a many-body quantum entangled ground state characterized by fractional excitations. However, these cobaltates, similar to other candidate materials, exhibit classical antiferromagnetic ordering at low temperatures, which impedes the formation of the expected quantum state. Here, we demonstrate that the control of the trigonal crystal field of Co ions is crucial to suppress classical antiferromagnetic ordering and to locate its ground state in closer vicinity to quantum spin liquid in layered honeycomb cobaltates. By utilizing heterostructure engineering on Cu3Co2SbO6 thin films, we adjust the trigonal distortion of CoO6 octahedra and the associated trigonal crystal field. The original Néel temperature of 16 K in bulk Cu3Co2SbO6 decreases (increases) to 7.8 K (22.7 K) in strained Cu3Co2SbO6 films by decreasing (increasing) the magnitude of the trigonal crystal fields. Our experimental finding substantiates the potential of layered honeycomb cobaltate heterostructures and strain engineering to accomplish the extremely elusive quantum phase of matter.
△ Less
Submitted 20 December, 2023; v1 submitted 16 November, 2023;
originally announced November 2023.
-
Diversify & Conquer: Outcome-directed Curriculum RL via Out-of-Distribution Disagreement
Authors:
Daesol Cho,
Seungjae Lee,
H. Jin Kim
Abstract:
Reinforcement learning (RL) often faces the challenges of uninformed search problems where the agent should explore without access to the domain knowledge such as characteristics of the environment or external rewards. To tackle these challenges, this work proposes a new approach for curriculum RL called Diversify for Disagreement & Conquer (D2C). Unlike previous curriculum learning methods, D2C r…
▽ More
Reinforcement learning (RL) often faces the challenges of uninformed search problems where the agent should explore without access to the domain knowledge such as characteristics of the environment or external rewards. To tackle these challenges, this work proposes a new approach for curriculum RL called Diversify for Disagreement & Conquer (D2C). Unlike previous curriculum learning methods, D2C requires only a few examples of desired outcomes and works in any environment, regardless of its geometry or the distribution of the desired outcome examples. The proposed method performs diversification of the goal-conditional classifiers to identify similarities between visited and desired outcome states and ensures that the classifiers disagree on states from out-of-distribution, which enables quantifying the unexplored region and designing an arbitrary goal-conditioned intrinsic reward signal in a simple and intuitive way. The proposed method then employs bipartite matching to define a curriculum learning objective that produces a sequence of well-adjusted intermediate goals, which enable the agent to automatically explore and conquer the unexplored region. We present experimental results demonstrating that D2C outperforms prior curriculum RL methods in both quantitative and qualitative aspects, even with the arbitrarily distributed desired outcome examples.
△ Less
Submitted 30 October, 2023;
originally announced October 2023.
-
CQM: Curriculum Reinforcement Learning with a Quantized World Model
Authors:
Seungjae Lee,
Daesol Cho,
Jonghae Park,
H. Jin Kim
Abstract:
Recent curriculum Reinforcement Learning (RL) has shown notable progress in solving complex tasks by proposing sequences of surrogate tasks. However, the previous approaches often face challenges when they generate curriculum goals in a high-dimensional space. Thus, they usually rely on manually specified goal spaces. To alleviate this limitation and improve the scalability of the curriculum, we p…
▽ More
Recent curriculum Reinforcement Learning (RL) has shown notable progress in solving complex tasks by proposing sequences of surrogate tasks. However, the previous approaches often face challenges when they generate curriculum goals in a high-dimensional space. Thus, they usually rely on manually specified goal spaces. To alleviate this limitation and improve the scalability of the curriculum, we propose a novel curriculum method that automatically defines the semantic goal space which contains vital information for the curriculum process, and suggests curriculum goals over it. To define the semantic goal space, our method discretizes continuous observations via vector quantized-variational autoencoders (VQ-VAE) and restores the temporal relations between the discretized observations by a graph. Concurrently, ours suggests uncertainty and temporal distance-aware curriculum goals that converges to the final goals over the automatically composed goal space. We demonstrate that the proposed method allows efficient explorations in an uninformed environment with raw goal examples only. Also, ours outperforms the state-of-the-art curriculum RL methods on data efficiency and performance, in various goal-reaching tasks even with ego-centric visual inputs.
△ Less
Submitted 26 October, 2023;
originally announced October 2023.
-
Learning bounded-degree polytrees with known skeleton
Authors:
Davin Choo,
Joy Qiping Yang,
Arnab Bhattacharyya,
Clément L. Canonne
Abstract:
We establish finite-sample guarantees for efficient proper learning of bounded-degree polytrees, a rich class of high-dimensional probability distributions and a subclass of Bayesian networks, a widely-studied type of graphical model. Recently, Bhattacharyya et al. (2021) obtained finite-sample guarantees for recovering tree-structured Bayesian networks, i.e., 1-polytrees. We extend their results…
▽ More
We establish finite-sample guarantees for efficient proper learning of bounded-degree polytrees, a rich class of high-dimensional probability distributions and a subclass of Bayesian networks, a widely-studied type of graphical model. Recently, Bhattacharyya et al. (2021) obtained finite-sample guarantees for recovering tree-structured Bayesian networks, i.e., 1-polytrees. We extend their results by providing an efficient algorithm which learns $d$-polytrees in polynomial time and sample complexity for any bounded $d$ when the underlying undirected graph (skeleton) is known. We complement our algorithm with an information-theoretic sample complexity lower bound, showing that the dependence on the dimension and target accuracy parameters are nearly tight.
△ Less
Submitted 21 January, 2024; v1 submitted 10 October, 2023;
originally announced October 2023.
-
Micromotion compensation of trapped ions by qubit transition and direct scanning of dc voltages
Authors:
Woojun Lee,
Daun Chung,
Jiyong Kang,
Honggi Jeon,
Changhyun Jung,
Dong-Il "Dan" Cho,
Taehyun Kim
Abstract:
Excess micromotion is detrimental to accurate qubit control of trapped ions, thus measuring and minimizing it is crucial. In this paper, we present a simple approach for measuring and suppressing excess micromotion of trapped ions by leveraging the existing laser-driven qubit transition scheme combined with direct scanning of dc voltages. The compensation voltage is deduced by analyzing the Bessel…
▽ More
Excess micromotion is detrimental to accurate qubit control of trapped ions, thus measuring and minimizing it is crucial. In this paper, we present a simple approach for measuring and suppressing excess micromotion of trapped ions by leveraging the existing laser-driven qubit transition scheme combined with direct scanning of dc voltages. The compensation voltage is deduced by analyzing the Bessel expansion of a scanned qubit transition rate. The method provides a fair level of sensitivity for practical quantum computing applications, while demanding minimal deviation of trap condition. By accomplishing compensation of excess micromotion in the qubit momentum-excitation direction, the scheme offers an additional avenue for excess micromotion compensation, complementing existing compensation schemes.
△ Less
Submitted 2 December, 2023; v1 submitted 9 June, 2023;
originally announced June 2023.
-
Adaptivity Complexity for Causal Graph Discovery
Authors:
Davin Choo,
Kirankumar Shiragur
Abstract:
Causal discovery from interventional data is an important problem, where the task is to design an interventional strategy that learns the hidden ground truth causal graph $G(V,E)$ on $|V| = n$ nodes while minimizing the number of performed interventions. Most prior interventional strategies broadly fall into two categories: non-adaptive and adaptive. Non-adaptive strategies decide on a single fixe…
▽ More
Causal discovery from interventional data is an important problem, where the task is to design an interventional strategy that learns the hidden ground truth causal graph $G(V,E)$ on $|V| = n$ nodes while minimizing the number of performed interventions. Most prior interventional strategies broadly fall into two categories: non-adaptive and adaptive. Non-adaptive strategies decide on a single fixed set of interventions to be performed while adaptive strategies can decide on which nodes to intervene on sequentially based on past interventions. While adaptive algorithms may use exponentially fewer interventions than their non-adaptive counterparts, there are practical concerns that constrain the amount of adaptivity allowed. Motivated by this trade-off, we study the problem of $r$-adaptivity, where the algorithm designer recovers the causal graph under a total of $r$ sequential rounds whilst trying to minimize the total number of interventions. For this problem, we provide a $r$-adaptive algorithm that achieves $O(\min\{r,\log n\} \cdot n^{1/\min\{r,\log n\}})$ approximation with respect to the verification number, a well-known lower bound for adaptive algorithms. Furthermore, for every $r$, we show that our approximation is tight. Our definition of $r$-adaptivity interpolates nicely between the non-adaptive ($r=1$) and fully adaptive ($r=n$) settings where our approximation simplifies to $O(n)$ and $O(\log n)$ respectively, matching the best-known approximation guarantees for both extremes. Our results also extend naturally to the bounded size interventions.
△ Less
Submitted 9 June, 2023;
originally announced June 2023.
-
Active causal structure learning with advice
Authors:
Davin Choo,
Themis Gouleakis,
Arnab Bhattacharyya
Abstract:
We introduce the problem of active causal structure learning with advice. In the typical well-studied setting, the learning algorithm is given the essential graph for the observational distribution and is asked to recover the underlying causal directed acyclic graph (DAG) $G^*$ while minimizing the number of interventions made. In our setting, we are additionally given side information about…
▽ More
We introduce the problem of active causal structure learning with advice. In the typical well-studied setting, the learning algorithm is given the essential graph for the observational distribution and is asked to recover the underlying causal directed acyclic graph (DAG) $G^*$ while minimizing the number of interventions made. In our setting, we are additionally given side information about $G^*$ as advice, e.g. a DAG $G$ purported to be $G^*$. We ask whether the learning algorithm can benefit from the advice when it is close to being correct, while still having worst-case guarantees even when the advice is arbitrarily bad. Our work is in the same space as the growing body of research on algorithms with predictions. When the advice is a DAG $G$, we design an adaptive search algorithm to recover $G^*$ whose intervention cost is at most $O(\max\{1, \log ψ\})$ times the cost for verifying $G^*$; here, $ψ$ is a distance measure between $G$ and $G^*$ that is upper bounded by the number of variables $n$, and is exactly 0 when $G=G^*$. Our approximation factor matches the state-of-the-art for the advice-less setting.
△ Less
Submitted 31 May, 2023;
originally announced May 2023.
-
Demonstration-free Autonomous Reinforcement Learning via Implicit and Bidirectional Curriculum
Authors:
Jigang Kim,
Daesol Cho,
H. Jin Kim
Abstract:
While reinforcement learning (RL) has achieved great success in acquiring complex skills solely from environmental interactions, it assumes that resets to the initial state are readily available at the end of each episode. Such an assumption hinders the autonomous learning of embodied agents due to the time-consuming and cumbersome workarounds for resetting in the physical world. Hence, there has…
▽ More
While reinforcement learning (RL) has achieved great success in acquiring complex skills solely from environmental interactions, it assumes that resets to the initial state are readily available at the end of each episode. Such an assumption hinders the autonomous learning of embodied agents due to the time-consuming and cumbersome workarounds for resetting in the physical world. Hence, there has been a growing interest in autonomous RL (ARL) methods that are capable of learning from non-episodic interactions. However, existing works on ARL are limited by their reliance on prior data and are unable to learn in environments where task-relevant interactions are sparse. In contrast, we propose a demonstration-free ARL algorithm via Implicit and Bi-directional Curriculum (IBC). With an auxiliary agent that is conditionally activated upon learning progress and a bidirectional goal curriculum based on optimal transport, our method outperforms previous methods, even the ones that leverage demonstrations.
△ Less
Submitted 8 June, 2023; v1 submitted 17 May, 2023;
originally announced May 2023.
-
Knowledge Graph Completion Models are Few-shot Learners: An Empirical Study of Relation Labeling in E-commerce with LLMs
Authors:
Jiao Chen,
Luyi Ma,
Xiaohan Li,
Nikhil Thakurdesai,
Jianpeng Xu,
Jason H. D. Cho,
Kaushiki Nag,
Evren Korpeoglu,
Sushant Kumar,
Kannan Achan
Abstract:
Knowledge Graphs (KGs) play a crucial role in enhancing e-commerce system performance by providing structured information about entities and their relationships, such as complementary or substitutable relations between products or product types, which can be utilized in recommender systems. However, relation labeling in KGs remains a challenging task due to the dynamic nature of e-commerce domains…
▽ More
Knowledge Graphs (KGs) play a crucial role in enhancing e-commerce system performance by providing structured information about entities and their relationships, such as complementary or substitutable relations between products or product types, which can be utilized in recommender systems. However, relation labeling in KGs remains a challenging task due to the dynamic nature of e-commerce domains and the associated cost of human labor. Recently, breakthroughs in Large Language Models (LLMs) have shown surprising results in numerous natural language processing tasks. In this paper, we conduct an empirical study of LLMs for relation labeling in e-commerce KGs, investigating their powerful learning capabilities in natural language and effectiveness in predicting relations between product types with limited labeled data. We evaluate various LLMs, including PaLM and GPT-3.5, on benchmark datasets, demonstrating their ability to achieve competitive performance compared to humans on relation labeling tasks using just 1 to 5 labeled examples per relation. Additionally, we experiment with different prompt engineering techniques to examine their impact on model performance. Our results show that LLMs significantly outperform existing KG completion models in relation labeling for e-commerce KGs and exhibit performance strong enough to replace human labeling.
△ Less
Submitted 16 May, 2023;
originally announced May 2023.
-
The Sharp Power Law of Local Search on Expanders
Authors:
Simina Brânzei,
Davin Choo,
Nicholas Recker
Abstract:
Local search is a powerful heuristic in optimization and computer science, the complexity of which was studied in the white box and black box models. In the black box model, we are given a graph $G = (V,E)$ and oracle access to a function $f : V \to \mathbb{R}$. The local search problem is to find a vertex $v$ that is a local minimum, i.e. with $f(v) \leq f(u)$ for all $(u,v) \in E$, using as few…
▽ More
Local search is a powerful heuristic in optimization and computer science, the complexity of which was studied in the white box and black box models. In the black box model, we are given a graph $G = (V,E)$ and oracle access to a function $f : V \to \mathbb{R}$. The local search problem is to find a vertex $v$ that is a local minimum, i.e. with $f(v) \leq f(u)$ for all $(u,v) \in E$, using as few queries as possible. The query complexity is well understood on the grid and the hypercube, but much less is known beyond.
We show the query complexity of local search on $d$-regular expanders with constant degree is $Ω\left(\frac{\sqrt{n}}{\log{n}}\right)$, where $n$ is the number of vertices. This matches within a logarithmic factor the upper bound of $O(\sqrt{n})$ for constant degree graphs from Aldous (1983), implying that steepest descent with a warm start is an essentially optimal algorithm for expanders. The best lower bound known from prior work was $Ω\left(\frac{\sqrt[8]{n}}{\log{n}}\right)$, shown by Santha and Szegedy (2004) for quantum and randomized algorithms.
We obtain this result by considering a broader framework of graph features such as vertex congestion and separation number. We show that for each graph, the randomized query complexity of local search is $Ω\left(\frac{n^{1.5}}{g}\right)$, where $g$ is the vertex congestion of the graph; and $Ω\left(\sqrt[4]{\frac{s}Δ}\right)$, where $s$ is the separation number and $Δ$ is the maximum degree. For separation number the previous bound was $Ω\left(\sqrt[8]{\frac{s}Δ} /\log{n}\right)$, given by Santha and Szegedy for quantum and randomized algorithms.
We also show a variant of the relational adversary method from Aaronson (2006), which is asymptotically at least as strong as the version in Aaronson (2006) for all randomized algorithms and strictly stronger for some problems.
△ Less
Submitted 15 August, 2023; v1 submitted 14 May, 2023;
originally announced May 2023.
-
New metrics and search algorithms for weighted causal DAGs
Authors:
Davin Choo,
Kirankumar Shiragur
Abstract:
Recovering causal relationships from data is an important problem. Using observational data, one can typically only recover causal graphs up to a Markov equivalence class and additional assumptions or interventional data are needed for complete recovery. In this work, under some standard assumptions, we study causal graph discovery via adaptive interventions with node-dependent interventional cost…
▽ More
Recovering causal relationships from data is an important problem. Using observational data, one can typically only recover causal graphs up to a Markov equivalence class and additional assumptions or interventional data are needed for complete recovery. In this work, under some standard assumptions, we study causal graph discovery via adaptive interventions with node-dependent interventional costs. For this setting, we show that no algorithm can achieve an approximation guarantee that is asymptotically better than linear in the number of vertices with respect to the verification number; a well-established benchmark for adaptive search algorithms. Motivated by this negative result, we define a new benchmark that captures the worst-case interventional cost for any search algorithm. Furthermore, with respect to this new benchmark, we provide adaptive search algorithms that achieve logarithmic approximations under various settings: atomic, bounded size interventions and generalized cost objectives.
△ Less
Submitted 29 May, 2023; v1 submitted 7 May, 2023;
originally announced May 2023.
-
Localization using Multi-Focal Spatial Attention for Masked Face Recognition
Authors:
Yooshin Cho,
Hanbyel Cho,
Hyeong Gwon Hong,
Jaesung Ahn,
Dongmin Cho,
JungWoo Chang,
Junmo Kim
Abstract:
Since the beginning of world-wide COVID-19 pandemic, facial masks have been recommended to limit the spread of the disease. However, these masks hide certain facial attributes. Hence, it has become difficult for existing face recognition systems to perform identity verification on masked faces. In this context, it is necessary to develop masked Face Recognition (MFR) for contactless biometric reco…
▽ More
Since the beginning of world-wide COVID-19 pandemic, facial masks have been recommended to limit the spread of the disease. However, these masks hide certain facial attributes. Hence, it has become difficult for existing face recognition systems to perform identity verification on masked faces. In this context, it is necessary to develop masked Face Recognition (MFR) for contactless biometric recognition systems. Thus, in this paper, we propose Complementary Attention Learning and Multi-Focal Spatial Attention that precisely removes masked region by training complementary spatial attention to focus on two distinct regions: masked regions and backgrounds. In our method, standard spatial attention and networks focus on unmasked regions, and extract mask-invariant features while minimizing the loss of the conventional Face Recognition (FR) performance. For conventional FR, we evaluate the performance on the IJB-C, Age-DB, CALFW, and CPLFW datasets. We evaluate the MFR performance on the ICCV2021-MFR/Insightface track, and demonstrate the improved performance on the both MFR and FR datasets. Additionally, we empirically verify that spatial attention of proposed method is more precisely activated in unmasked regions.
△ Less
Submitted 7 September, 2023; v1 submitted 3 May, 2023;
originally announced May 2023.
-
Soundini: Sound-Guided Diffusion for Natural Video Editing
Authors:
Seung Hyun Lee,
Sieun Kim,
Innfarn Yoo,
Feng Yang,
Donghyeon Cho,
Youngseo Kim,
Huiwen Chang,
Jinkyu Kim,
Sangpil Kim
Abstract:
We propose a method for adding sound-guided visual effects to specific regions of videos with a zero-shot setting. Animating the appearance of the visual effect is challenging because each frame of the edited video should have visual changes while maintaining temporal consistency. Moreover, existing video editing solutions focus on temporal consistency across frames, ignoring the visual style vari…
▽ More
We propose a method for adding sound-guided visual effects to specific regions of videos with a zero-shot setting. Animating the appearance of the visual effect is challenging because each frame of the edited video should have visual changes while maintaining temporal consistency. Moreover, existing video editing solutions focus on temporal consistency across frames, ignoring the visual style variations over time, e.g., thunderstorm, wave, fire crackling. To overcome this limitation, we utilize temporal sound features for the dynamic style. Specifically, we guide denoising diffusion probabilistic models with an audio latent representation in the audio-visual latent space. To the best of our knowledge, our work is the first to explore sound-guided natural video editing from various sound sources with sound-specialized properties, such as intensity, timbre, and volume. Additionally, we design optical flow-based guidance to generate temporally consistent video frames, capturing the pixel-wise relationship between adjacent frames. Experimental results show that our method outperforms existing video editing techniques, producing more realistic visual effects that reflect the properties of sound. Please visit our page: https://kuai-lab.github.io/soundini-gallery/.
△ Less
Submitted 13 April, 2023;
originally announced April 2023.
-
Use of vector polarizability to manipulate alkali-metal atoms
Authors:
D. Cho
Abstract:
We review a few ideas and experiments that our laboratory at Korea University has proposed and carried out to use vector polarizability βto manipulate alkali-metal atoms. βcomes from spin-orbit coupling, and it produces an ac Stark shift that resembles a Zeeman shift. When a circularly polarized laser field is properly detuned between the D1 and D2 transitions, an ac Stark shift of a ground-state…
▽ More
We review a few ideas and experiments that our laboratory at Korea University has proposed and carried out to use vector polarizability βto manipulate alkali-metal atoms. βcomes from spin-orbit coupling, and it produces an ac Stark shift that resembles a Zeeman shift. When a circularly polarized laser field is properly detuned between the D1 and D2 transitions, an ac Stark shift of a ground-state atom takes the form of a pure Zeeman shift. We call it the "analogous Zeeman effect", and experimentally demonstrated an optical Stern-Gerlach effect and an optical trap that behaves exactly like a magnetic trap. By tuning polarization of a trapping beam, and thereby controlling a shift proportional to β, we demonstrated elimination of an inhomogeneous broadening of a ground hyperfine transition in an optical trap. We call it "magic polarization". We also showed significant narrowing of a Raman sideband transition at a special well depth. A Raman sideband in an optical trap is broadened owing to anharmonicity of the trap potential, and the broadening can be eliminated by a beta-induced differential ac Stark shift at what we call a "magic well depth". Finally, we proposed and experimentally demonstrated a cooling scheme that incorporated the idea of velocity-selective coherent population trapping to Raman sideband cooling to enhance cooling efficiency of the latter outside of the Lamb-Dicke regime. We call it "motion-selective coherent population trapping", and βis responsible for the selectivity. We include a program file that calculates both scalar and vector polarizabilities of a given alkali-metal atom when the wavelength of an applied field is specified. It also calculates depth of a potential well and photon-scattering rate of a trapped atom in a specific ground state when power, minimum spot size, and polarization of a trap beam are given.
△ Less
Submitted 22 March, 2023;
originally announced March 2023.
-
Charge density wave surface reconstruction in a van der Waals layered material
Authors:
Sung-Hoon Lee,
Doohee Cho
Abstract:
Surface reconstruction plays a vital role in determining the surface electronic structure and chemistry of semiconductors and metal oxides. However, it has been commonly believed that surface reconstruction does not occur in van der Waals layered materials, as they do not undergo significant bond breaking during surface formation. In this study, we present evidence that charge density wave (CDW) o…
▽ More
Surface reconstruction plays a vital role in determining the surface electronic structure and chemistry of semiconductors and metal oxides. However, it has been commonly believed that surface reconstruction does not occur in van der Waals layered materials, as they do not undergo significant bond breaking during surface formation. In this study, we present evidence that charge density wave (CDW) order in these materials can, in fact, cause CDW surface reconstruction through interlayer coupling. Using density functional theory calculations on the 1T-TaS2 surface, we reveal that CDW reconstruction, involving concerted small atomic displacements in the subsurface layer, results in a significant modification of the surface electronic structure, transforming it from a Mott insulator to a band insulator. This new form of surface reconstruction explains several previously unexplained observations on the 1T-TaS2 surface and has important implications for interpreting surface phenomena in CDW-ordered layered materials.
△ Less
Submitted 25 August, 2023; v1 submitted 18 March, 2023;
originally announced March 2023.
-
Outcome-directed Reinforcement Learning by Uncertainty & Temporal Distance-Aware Curriculum Goal Generation
Authors:
Daesol Cho,
Seungjae Lee,
H. Jin Kim
Abstract:
Current reinforcement learning (RL) often suffers when solving a challenging exploration problem where the desired outcomes or high rewards are rarely observed. Even though curriculum RL, a framework that solves complex tasks by proposing a sequence of surrogate tasks, shows reasonable results, most of the previous works still have difficulty in proposing curriculum due to the absence of a mechani…
▽ More
Current reinforcement learning (RL) often suffers when solving a challenging exploration problem where the desired outcomes or high rewards are rarely observed. Even though curriculum RL, a framework that solves complex tasks by proposing a sequence of surrogate tasks, shows reasonable results, most of the previous works still have difficulty in proposing curriculum due to the absence of a mechanism for obtaining calibrated guidance to the desired outcome state without any prior domain knowledge. To alleviate it, we propose an uncertainty & temporal distance-aware curriculum goal generation method for the outcome-directed RL via solving a bipartite matching problem. It could not only provide precisely calibrated guidance of the curriculum to the desired outcome states but also bring much better sample efficiency and geometry-agnostic curriculum goal proposal capability compared to previous curriculum RL methods. We demonstrate that our algorithm significantly outperforms these prior methods in a variety of challenging navigation tasks and robotic manipulation tasks in a quantitative and qualitative way.
△ Less
Submitted 20 February, 2023; v1 submitted 27 January, 2023;
originally announced January 2023.
-
Subset verification and search algorithms for causal DAGs
Authors:
Davin Choo,
Kirankumar Shiragur
Abstract:
Learning causal relationships between variables is a fundamental task in causal inference and directed acyclic graphs (DAGs) are a popular choice to represent the causal relationships. As one can recover a causal graph only up to its Markov equivalence class from observations, interventions are often used for the recovery task. Interventions are costly in general and it is important to design algo…
▽ More
Learning causal relationships between variables is a fundamental task in causal inference and directed acyclic graphs (DAGs) are a popular choice to represent the causal relationships. As one can recover a causal graph only up to its Markov equivalence class from observations, interventions are often used for the recovery task. Interventions are costly in general and it is important to design algorithms that minimize the number of interventions performed. In this work, we study the problem of identifying the smallest set of interventions required to learn the causal relationships between a subset of edges (target edges). Under the assumptions of faithfulness, causal sufficiency, and ideal interventions, we study this problem in two settings: when the underlying ground truth causal graph is known (subset verification) and when it is unknown (subset search). For the subset verification problem, we provide an efficient algorithm to compute a minimum sized interventional set; we further extend these results to bounded size non-atomic interventions and node-dependent interventional costs. For the subset search problem, in the worst case, we show that no algorithm (even with adaptivity or randomization) can achieve an approximation ratio that is asymptotically better than the vertex cover of the target edges when compared with the subset verification number. This result is surprising as there exists a logarithmic approximation algorithm for the search problem when we wish to recover the whole causal graph. To obtain our results, we prove several interesting structural properties of interventional causal graphs that we believe have applications beyond the subset verification/search problems studied here.
△ Less
Submitted 13 February, 2024; v1 submitted 9 January, 2023;
originally announced January 2023.
-
Atomic and Electronic Structures of Correlated SrRuO3/SrTiO3 Superlattices
Authors:
Seung Gyo Jeong,
Ahmed Yousef Mohamed,
Deok-Yong Cho,
Woo Seok Choi
Abstract:
Atomic-scale precision epitaxy of perovskite oxide superlattices provides unique opportunities for controlling the correlated electronic structures, activating effective control knobs for intriguing functionalities including electromagnetic, thermoelectric, and electrocatalytic behaviors. In this study, we investigated the close interplay between the atomic and electronic structures of correlated…
▽ More
Atomic-scale precision epitaxy of perovskite oxide superlattices provides unique opportunities for controlling the correlated electronic structures, activating effective control knobs for intriguing functionalities including electromagnetic, thermoelectric, and electrocatalytic behaviors. In this study, we investigated the close interplay between the atomic and electronic structures of correlated superlattices synthesized by atomic-scale precision epitaxy. In particular, we employ superlattices composed of correlated magnetic SrRuO3 (SRO) and quantum paraelectric SrTiO3 (STO) layers. In those superlattices, RuO6 octahedral distortion is systematically controlled from 167 to 175 degrees depending on the thickness of the STO layers, also affecting the TiO6 octahedral distortion within the STO layer. Customized octahedral distortion within SRO/STO superlattices in turn modifies the electronic structures of both the Ti and Ru compounds, observed by X-ray absorption spectroscopy. Our results identify the close correlation between atomic lattice and electronic structures enabled by the facile controllability of atomic-scale epitaxy, which would be useful for designing future correlated oxide devices.
△ Less
Submitted 3 January, 2023;
originally announced January 2023.
-
"I Want to Figure Things Out": Supporting Exploration in Navigation for People with Visual Impairments
Authors:
Gaurav Jain,
Yuanyang Teng,
Dong Heon Cho,
Yunhao Xing,
Maryam Aziz,
Brian A. Smith
Abstract:
Navigation assistance systems (NASs) aim to help visually impaired people (VIPs) navigate unfamiliar environments. Most of today's NASs support VIPs via turn-by-turn navigation, but a growing body of work highlights the importance of exploration as well. It is unclear, however, how NASs should be designed to help VIPs explore unfamiliar environments. In this paper, we perform a qualitative study t…
▽ More
Navigation assistance systems (NASs) aim to help visually impaired people (VIPs) navigate unfamiliar environments. Most of today's NASs support VIPs via turn-by-turn navigation, but a growing body of work highlights the importance of exploration as well. It is unclear, however, how NASs should be designed to help VIPs explore unfamiliar environments. In this paper, we perform a qualitative study to understand VIPs' information needs and challenges with respect to exploring unfamiliar environments, with the aim of informing the design of NASs that support exploration. Our findings reveal the types of spatial information that VIPs need as well as factors that affect VIPs' information preferences. We also discover specific challenges that VIPs face that future NASs can address such as orientation and mobility education and collaborating effectively with others. We present design implications for NASs that support exploration, and we identify specific research opportunities and discuss open socio-technical challenges for making such NASs possible. We conclude by reflecting on our study procedure to inform future approaches in research on ethical considerations that may be adopted while interacting with the broader VIP community.
△ Less
Submitted 29 November, 2022;
originally announced November 2022.
-
Learning and Testing Latent-Tree Ising Models Efficiently
Authors:
Davin Choo,
Yuval Dagan,
Constantinos Daskalakis,
Anthimos Vardis Kandiros
Abstract:
We provide time- and sample-efficient algorithms for learning and testing latent-tree Ising models, i.e. Ising models that may only be observed at their leaf nodes. On the learning side, we obtain efficient algorithms for learning a tree-structured Ising model whose leaf node distribution is close in Total Variation Distance, improving on the results of prior work. On the testing side, we provide…
▽ More
We provide time- and sample-efficient algorithms for learning and testing latent-tree Ising models, i.e. Ising models that may only be observed at their leaf nodes. On the learning side, we obtain efficient algorithms for learning a tree-structured Ising model whose leaf node distribution is close in Total Variation Distance, improving on the results of prior work. On the testing side, we provide an efficient algorithm with fewer samples for testing whether two latent-tree Ising models have leaf-node distributions that are close or far in Total Variation distance. We obtain our algorithms by showing novel localization results for the total variation distance between the leaf-node distributions of tree-structured Ising models, in terms of their marginals on pairs of leaves.
△ Less
Submitted 10 July, 2023; v1 submitted 23 November, 2022;
originally announced November 2022.
-
Fast Computer Model Calibration using Annealed and Transformed Variational Inference
Authors:
Dongkyu Derek Cho,
Won Chang,
Jaewoo Park
Abstract:
Computer models play a crucial role in numerous scientific and engineering domains. To ensure the accuracy of simulations, it is essential to properly calibrate the input parameters of these models through statistical inference. While Bayesian inference is the standard approach for this task, employing Markov Chain Monte Carlo methods often encounters computational hurdles due to the costly evalua…
▽ More
Computer models play a crucial role in numerous scientific and engineering domains. To ensure the accuracy of simulations, it is essential to properly calibrate the input parameters of these models through statistical inference. While Bayesian inference is the standard approach for this task, employing Markov Chain Monte Carlo methods often encounters computational hurdles due to the costly evaluation of likelihood functions and slow mixing rates. Although variational inference (VI) can be a fast alternative to traditional Bayesian approaches, VI has limited applicability due to boundary issues and local optima problems. To address these challenges, we propose flexible VI methods based on deep generative models that do not require parametric assumptions on the variational distribution. We embed a surjective transformation in our framework to avoid posterior truncation at the boundary. Additionally, we provide theoretical conditions that guarantee the success of the algorithm. Furthermore, our temperature annealing scheme can prevent being trapped in local optima through a series of intermediate posteriors. We apply our method to infectious disease models and a geophysical model, illustrating that the proposed method can provide fast and accurate inference compared to its competitors.
△ Less
Submitted 5 March, 2024; v1 submitted 22 November, 2022;
originally announced November 2022.
-
IFQA: Interpretable Face Quality Assessment
Authors:
Byungho Jo,
Donghyeon Cho,
In Kyu Park,
Sungeun Hong
Abstract:
Existing face restoration models have relied on general assessment metrics that do not consider the characteristics of facial regions. Recent works have therefore assessed their methods using human studies, which is not scalable and involves significant effort. This paper proposes a novel face-centric metric based on an adversarial framework where a generator simulates face restoration and a discr…
▽ More
Existing face restoration models have relied on general assessment metrics that do not consider the characteristics of facial regions. Recent works have therefore assessed their methods using human studies, which is not scalable and involves significant effort. This paper proposes a novel face-centric metric based on an adversarial framework where a generator simulates face restoration and a discriminator assesses image quality. Specifically, our per-pixel discriminator enables interpretable evaluation that cannot be provided by traditional metrics. Moreover, our metric emphasizes facial primary regions considering that even minor changes to the eyes, nose, and mouth significantly affect human cognition. Our face-oriented metric consistently surpasses existing general or facial image quality assessment metrics by impressive margins. We demonstrate the generalizability of the proposed strategy in various architectural designs and challenging scenarios. Interestingly, we find that our IFQA can lead to performance improvement as an objective function.
△ Less
Submitted 16 November, 2022; v1 submitted 13 November, 2022;
originally announced November 2022.
-
Honeycomb oxide heterostructure: a new platform for Kitaev quantum spin liquid
Authors:
Baekjune Kang,
Miju Park,
Sehwan Song,
Seunghyun Noh,
Daeseong Choe,
Minsik Kong,
Minjae Kim,
Choongwon Seo,
Eun Kyo Ko,
Gangsan Yi,
Jung-woo Yoo,
Sungkyun Park,
Jong Mok Ok,
Changhee Sohn
Abstract:
Kitaev quantum spin liquid, massively quantum entangled states, is so scarce in nature that searching for new candidate systems remains a great challenge. Honeycomb heterostructure could be a promising route to realize and utilize such an exotic quantum phase by providing additional controllability of Hamiltonian and device compatibility, respectively. Here, we provide epitaxial honeycomb oxide th…
▽ More
Kitaev quantum spin liquid, massively quantum entangled states, is so scarce in nature that searching for new candidate systems remains a great challenge. Honeycomb heterostructure could be a promising route to realize and utilize such an exotic quantum phase by providing additional controllability of Hamiltonian and device compatibility, respectively. Here, we provide epitaxial honeycomb oxide thin film Na3Co2SbO6, a candidate of Kitaev quantum spin liquid proposed recently. We found a spin glass and antiferromagnetic ground states depending on Na stoichiometry, signifying not only the importance of Na vacancy control but also strong frustration in Na3Co2SbO6. Despite its classical ground state, the field-dependent magnetic susceptibility shows remarkable scaling collapse with a single critical exponent, which can be interpreted as evidence of quantum criticality. Its electronic ground state and derived spin Hamiltonian from spectroscopies are consistent with the predicted Kitaev model. Our work provides a unique route to the realization and utilization of Kitaev quantum spin liquid.
△ Less
Submitted 8 February, 2023; v1 submitted 10 November, 2022;
originally announced November 2022.
-
Lightweight Alpha Matting Network Using Distillation-Based Channel Pruning
Authors:
Donggeun Yoon,
Jinsun Park,
Donghyeon Cho
Abstract:
Recently, alpha matting has received a lot of attention because of its usefulness in mobile applications such as selfies. Therefore, there has been a demand for a lightweight alpha matting model due to the limited computational resources of commercial portable devices. To this end, we suggest a distillation-based channel pruning method for the alpha matting networks. In the pruning step, we remove…
▽ More
Recently, alpha matting has received a lot of attention because of its usefulness in mobile applications such as selfies. Therefore, there has been a demand for a lightweight alpha matting model due to the limited computational resources of commercial portable devices. To this end, we suggest a distillation-based channel pruning method for the alpha matting networks. In the pruning step, we remove channels of a student network having fewer impacts on mimicking the knowledge of a teacher network. Then, the pruned lightweight student network is trained by the same distillation loss. A lightweight alpha matting model from the proposed method outperforms existing lightweight methods. To show superiority of our algorithm, we provide various quantitative and qualitative experiments with in-depth analyses. Furthermore, we demonstrate the versatility of the proposed distillation-based channel pruning method by applying it to semantic segmentation.
△ Less
Submitted 14 October, 2022;
originally announced October 2022.
-
S2P: State-conditioned Image Synthesis for Data Augmentation in Offline Reinforcement Learning
Authors:
Daesol Cho,
Dongseok Shim,
H. Jin Kim
Abstract:
Offline reinforcement learning (Offline RL) suffers from the innate distributional shift as it cannot interact with the physical environment during training. To alleviate such limitation, state-based offline RL leverages a learned dynamics model from the logged experience and augments the predicted state transition to extend the data distribution. For exploiting such benefit also on the image-base…
▽ More
Offline reinforcement learning (Offline RL) suffers from the innate distributional shift as it cannot interact with the physical environment during training. To alleviate such limitation, state-based offline RL leverages a learned dynamics model from the logged experience and augments the predicted state transition to extend the data distribution. For exploiting such benefit also on the image-based RL, we firstly propose a generative model, S2P (State2Pixel), which synthesizes the raw pixel of the agent from its corresponding state. It enables bridging the gap between the state and the image domain in RL algorithms, and virtually exploring unseen image distribution via model-based transition in the state space. Through experiments, we confirm that our S2P-based image synthesis not only improves the image-based offline RL performance but also shows powerful generalization capability on unseen tasks.
△ Less
Submitted 30 September, 2022;
originally announced September 2022.
-
Weakly-Supervised Stitching Network for Real-World Panoramic Image Generation
Authors:
Dae-Young Song,
Geonsoo Lee,
HeeKyung Lee,
Gi-Mun Um,
Donghyeon Cho
Abstract:
Recently, there has been growing attention on an end-to-end deep learning-based stitching model. However, the most challenging point in deep learning-based stitching is to obtain pairs of input images with a narrow field of view and ground truth images with a wide field of view captured from real-world scenes. To overcome this difficulty, we develop a weakly-supervised learning mechanism to train…
▽ More
Recently, there has been growing attention on an end-to-end deep learning-based stitching model. However, the most challenging point in deep learning-based stitching is to obtain pairs of input images with a narrow field of view and ground truth images with a wide field of view captured from real-world scenes. To overcome this difficulty, we develop a weakly-supervised learning mechanism to train the stitching model without requiring genuine ground truth images. In addition, we propose a stitching model that takes multiple real-world fisheye images as inputs and creates a 360 output image in an equirectangular projection format. In particular, our model consists of color consistency corrections, warping, and blending, and is trained by perceptual and SSIM losses. The effectiveness of the proposed algorithm is verified on two real-world stitching datasets.
△ Less
Submitted 13 September, 2022;
originally announced September 2022.
-
Apiary: A DBMS-Integrated Transactional Function-as-a-Service Framework
Authors:
Peter Kraft,
Qian Li,
Kostis Kaffes,
Athinagoras Skiadopoulos,
Deeptaanshu Kumar,
Danny Cho,
Jason Li,
Robert Redmond,
Nathan Weckwerth,
Brian Xia,
Peter Bailis,
Michael Cafarella,
Goetz Graefe,
Jeremy Kepner,
Christos Kozyrakis,
Michael Stonebraker,
Lalith Suresh,
Xiangyao Yu,
Matei Zaharia
Abstract:
Developers increasingly use function-as-a-service (FaaS) platforms for data-centric applications that perform low-latency and transactional operations on data, such as for microservices or web serving. Unfortunately, existing FaaS platforms support these applications poorly because they physically and logically separate application logic, executed in cloud functions, from data management, done in…
▽ More
Developers increasingly use function-as-a-service (FaaS) platforms for data-centric applications that perform low-latency and transactional operations on data, such as for microservices or web serving. Unfortunately, existing FaaS platforms support these applications poorly because they physically and logically separate application logic, executed in cloud functions, from data management, done in interactive transactions accessing remote storage. Physical separation harms performance while logical separation complicates efficiently providing transactional guarantees and fault tolerance.
This paper introduces Apiary, a novel DBMS-integrated FaaS platform for deploying and composing fault-tolerant transactional functions. Apiary physically co-locates and logically integrates function execution and data management by wrapping a distributed DBMS engine and using it as a unified runtime for function execution, data management, and operational logging, thus providing similar or stronger transactional guarantees as comparable systems while greatly improving performance and observability. To allow developers to write complex stateful programs, we leverage this integration to enable efficient and fault-tolerant function composition, building a frontend for orchestrating workflows of functions with the guarantees that each workflow runs to completion and each function in a workflow executes exactly once. We evaluate Apiary against research and production FaaS platforms and show it outperforms them by 2--68x on microservice workloads by reducing communication overhead.
△ Less
Submitted 30 June, 2023; v1 submitted 27 August, 2022;
originally announced August 2022.
-
Verification and search algorithms for causal DAGs
Authors:
Davin Choo,
Kirankumar Shiragur,
Arnab Bhattacharyya
Abstract:
We study two problems related to recovering causal graphs from interventional data: (i) $\textit{verification}$, where the task is to check if a purported causal graph is correct, and (ii) $\textit{search}$, where the task is to recover the correct causal graph. For both, we wish to minimize the number of interventions performed. For the first problem, we give a characterization of a minimal sized…
▽ More
We study two problems related to recovering causal graphs from interventional data: (i) $\textit{verification}$, where the task is to check if a purported causal graph is correct, and (ii) $\textit{search}$, where the task is to recover the correct causal graph. For both, we wish to minimize the number of interventions performed. For the first problem, we give a characterization of a minimal sized set of atomic interventions that is necessary and sufficient to check the correctness of a claimed causal graph. Our characterization uses the notion of $\textit{covered edges}$, which enables us to obtain simple proofs and also easily reason about earlier known results. We also generalize our results to the settings of bounded size interventions and node-dependent interventional costs. For all the above settings, we provide the first known provable algorithms for efficiently computing (near)-optimal verifying sets on general graphs. For the second problem, we give a simple adaptive algorithm based on graph separators that produces an atomic intervention set which fully orients any essential graph while using $\mathcal{O}(\log n)$ times the optimal number of interventions needed to $\textit{verify}$ (verifying size) the underlying DAG on $n$ vertices. This approximation is tight as $\textit{any}$ search algorithm on an essential line graph has worst case approximation ratio of $Ω(\log n)$ with respect to the verifying size. With bounded size interventions, each of size $\leq k$, our algorithm gives an $\mathcal{O}(\log n \cdot \log k)$ factor approximation. Our result is the first known algorithm that gives a non-trivial approximation guarantee to the verifying size on general unweighted graphs and with bounded size interventions.
△ Less
Submitted 9 October, 2022; v1 submitted 30 June, 2022;
originally announced June 2022.
-
CROM: Continuous Reduced-Order Modeling of PDEs Using Implicit Neural Representations
Authors:
Peter Yichen Chen,
Jinxu Xiang,
Dong Heon Cho,
Yue Chang,
G A Pershing,
Henrique Teles Maia,
Maurizio M. Chiaramonte,
Kevin Carlberg,
Eitan Grinspun
Abstract:
The long runtime of high-fidelity partial differential equation (PDE) solvers makes them unsuitable for time-critical applications. We propose to accelerate PDE solvers using reduced-order modeling (ROM). Whereas prior ROM approaches reduce the dimensionality of discretized vector fields, our continuous reduced-order modeling (CROM) approach builds a low-dimensional embedding of the continuous vec…
▽ More
The long runtime of high-fidelity partial differential equation (PDE) solvers makes them unsuitable for time-critical applications. We propose to accelerate PDE solvers using reduced-order modeling (ROM). Whereas prior ROM approaches reduce the dimensionality of discretized vector fields, our continuous reduced-order modeling (CROM) approach builds a low-dimensional embedding of the continuous vector fields themselves, not their discretization. We represent this reduced manifold using continuously differentiable neural fields, which may train on any and all available numerical solutions of the continuous system, even when they are obtained using diverse methods or discretizations. We validate our approach on an extensive range of PDEs with training data from voxel grids, meshes, and point clouds. Compared to prior discretization-dependent ROM methods, such as linear subspace proper orthogonal decomposition (POD) and nonlinear manifold neural-network-based autoencoders, CROM features higher accuracy, lower memory consumption, dynamically adaptive resolutions, and applicability to any discretization. For equal latent space dimension, CROM exhibits 79$\times$ and 49$\times$ better accuracy, and 39$\times$ and 132$\times$ smaller memory footprint, than POD and autoencoder methods, respectively. Experiments demonstrate 109$\times$ and 89$\times$ wall-clock speedups over unreduced models on CPUs and GPUs, respectively. Videos and codes are available on the project page: https://crom-pde.github.io
△ Less
Submitted 3 March, 2023; v1 submitted 6 June, 2022;
originally announced June 2022.
-
Single-electron charge transfer into putative Majorana and trivial modes in individual vortices
Authors:
Jian-Feng Ge,
Koen M. Bastiaans,
Damianos Chatzopoulos,
Doohee Cho,
Willem O. Tromp,
Tjerk Benschop,
Jiasen Niu,
Genda Gu,
Milan P. Allan
Abstract:
Majorana bound states are putative collective excitations in solids that exhibit the self-conjugate property of Majorana fermions - they are their own antiparticles. In iron-based superconductors, zero-energy states in vortices have been reported as potential Majorana bound states, but the evidence remains controversial. Here, we use scanning tunneling noise spectroscopy to study the tunneling pro…
▽ More
Majorana bound states are putative collective excitations in solids that exhibit the self-conjugate property of Majorana fermions - they are their own antiparticles. In iron-based superconductors, zero-energy states in vortices have been reported as potential Majorana bound states, but the evidence remains controversial. Here, we use scanning tunneling noise spectroscopy to study the tunneling process into vortex bound states in the conventional superconductor NbSe2, and in the putative Majorana platform FeTe0.55Se0.45. We find that tunneling into vortex bound states in both cases exhibits charge transfer of a single electron charge. Our data for the zero-energy bound states in FeTe0.55Se0.45 exclude the possibility of Yu-Shiba-Rusinov states and are consistent with both Majorana bound states and trivial vortex bound states. Our results open an avenue for investigating the exotic states in vortex cores and for future Majorana devices, although further theoretical investigations involving charge dynamics and superconducting tips are necessary.
△ Less
Submitted 25 May, 2023; v1 submitted 20 May, 2022;
originally announced May 2022.
-
Motion-selective coherent population trapping by Raman sideband cooling along two paths in a $Λ$ configuration
Authors:
Sooyoung Park,
Meung Ho Seo,
Ryun Ah Kim,
D. Cho
Abstract:
We report our experiment on sideband cooling with two Raman transitions in a $Λ$ configuration that allows selective coherent population trapping (CPT) of the motional ground state. The cooling method is applied to $^{87}$Rb atoms in a circularly-polarized one-dimensional optical lattice. Owing to the vector polarizability, the vibration frequency of a trapped atom depends on its Zeeman quantum nu…
▽ More
We report our experiment on sideband cooling with two Raman transitions in a $Λ$ configuration that allows selective coherent population trapping (CPT) of the motional ground state. The cooling method is applied to $^{87}$Rb atoms in a circularly-polarized one-dimensional optical lattice. Owing to the vector polarizability, the vibration frequency of a trapped atom depends on its Zeeman quantum number, and CPT resonance for a pair of bound states in the $Λ$ configuration depends on their vibrational quantum numbers. We call this scheme motion-selective coherent population trapping (MSCPT) and it is a trapped-atom analogue to the velocity-selective CPT developed for free He atoms. We observe a pronounced dip in temperature near a detuning for the Raman beams to satisfy the CPT resonance condition for the motional ground state. Although the lowest temperature we obtain is ten times the recoil limit owing to the large Lamb-Dicke parameter of 2.3 in our apparatus, the experiment demonstrates that MSCPT enhances the effectiveness of Raman sideband cooling and enlarges the range of its application. Discussions on design parameters optimized for MSCPT on $^{87}$Rb atoms and opportunities provided by diatomic polar molecules, whose Stark shift shows strong dependence on the rotational quantum number, are included.
△ Less
Submitted 10 May, 2022;
originally announced May 2022.
-
Motion-selective coherent population trapping for subrecoil cooling of optically trapped atoms outside the Lamb-Dicke regime
Authors:
Hyun Gyung Lee,
Sooyoung Park,
Meung Ho Seo,
D. Cho
Abstract:
We propose a scheme that combines velocity-selective coherent population trapping (CPT) and Raman sideband cooling (RSC) for subrecoil cooling of optically trapped atoms outside the Lamb-Dicke regime. This scheme is based on an inverted $\mathsf{Y}$ configuration in an alkali-metal atom. It consists of a $Λ$ formed by two Raman transitions between the ground hyperfine levels and the $D$ transition…
▽ More
We propose a scheme that combines velocity-selective coherent population trapping (CPT) and Raman sideband cooling (RSC) for subrecoil cooling of optically trapped atoms outside the Lamb-Dicke regime. This scheme is based on an inverted $\mathsf{Y}$ configuration in an alkali-metal atom. It consists of a $Λ$ formed by two Raman transitions between the ground hyperfine levels and the $D$ transition, allowing RSC along two paths and formation of a CPT dark state. Using state-dependent difference in vibration frequency of the atom in a circularly polarized trap, we can tune the $Λ$ to make only the motional ground state a CPT dark state. We call this scheme motion-selective coherent population trapping (MSCPT). We write the master equations for RSC and MSCPT and solve them numerically for a $^{87}$Rb atom in a one-dimensional optical lattice when the Lamb-Dicke parameter is 1. Although MSCPT reaches the steady state slowly compared with RSC, the former consistently produces colder atoms than the latter. The numerical results also show that subrecoil cooling by MSCPT outside the Lamb-Dicke regime is possible under a favorable, yet experimentally feasible, condition. We explain this performance quantitatively by calculating the relative darkness of each motional state. Finally, we discuss on application of the MSCPT scheme to an optically trapped diatomic polar molecule whose Stark shift and vibration frequency exhibit large variations depending on the rotational quantum number.
△ Less
Submitted 26 May, 2022; v1 submitted 2 May, 2022;
originally announced May 2022.
-
Unsupervised Reinforcement Learning for Transferable Manipulation Skill Discovery
Authors:
Daesol Cho,
Jigang Kim,
H. Jin Kim
Abstract:
Current reinforcement learning (RL) in robotics often experiences difficulty in generalizing to new downstream tasks due to the innate task-specific training paradigm. To alleviate it, unsupervised RL, a framework that pre-trains the agent in a task-agnostic manner without access to the task-specific reward, leverages active exploration for distilling diverse experience into essential skills or re…
▽ More
Current reinforcement learning (RL) in robotics often experiences difficulty in generalizing to new downstream tasks due to the innate task-specific training paradigm. To alleviate it, unsupervised RL, a framework that pre-trains the agent in a task-agnostic manner without access to the task-specific reward, leverages active exploration for distilling diverse experience into essential skills or reusable knowledge. For exploiting such benefits also in robotic manipulation, we propose an unsupervised method for transferable manipulation skill discovery that ties structured exploration toward interacting behavior and transferable skill learning. It not only enables the agent to learn interaction behavior, the key aspect of the robotic manipulation learning, without access to the environment reward, but also to generalize to arbitrary downstream manipulation tasks with the learned task-agnostic skills. Through comparative experiments, we show that our approach achieves the most diverse interacting behavior and significantly improves sample efficiency in downstream tasks including the extension to multi-object, multitask problems.
△ Less
Submitted 29 April, 2022;
originally announced April 2022.