-
A low-eccentricity migration pathway for a 13-h-period Earth analogue in a four-planet system
Authors:
Luisa Maria Serrano,
Davide Gandolfi,
Alexander J. Mustill,
Oscar Barragán,
Judith Korth,
Fei Dai,
Seth Redfield,
Malcolm Fridlund,
Kristine W. F. Lam,
Matías R. Díaz,
Sascha Grziwa,
Karen A. Collins,
John H. Livingston,
William D. Cochran,
Coel Hellier,
Salvatore E. Bellomo,
Trifon Trifonov,
Florian Rodler,
Javier Alarcon,
Jon M. Jenkins,
David W. Latham,
George Ricker,
Sara Seager,
Roland Vanderspeck,
Joshua N. Winn
, et al. (25 additional authors not shown)
Abstract:
It is commonly accepted that exoplanets with orbital periods shorter than 1 day, also known as ultra-short period (USP) planets, formed further out within their natal protoplanetary disk, before migrating to their current-day orbits via dynamical interactions. One of the most accepted theories suggests a violent scenario involving high-eccentricity migration followed by tidal circularization. Here…
▽ More
It is commonly accepted that exoplanets with orbital periods shorter than 1 day, also known as ultra-short period (USP) planets, formed further out within their natal protoplanetary disk, before migrating to their current-day orbits via dynamical interactions. One of the most accepted theories suggests a violent scenario involving high-eccentricity migration followed by tidal circularization. Here, we present the discovery of a four planet system orbiting the bright (V=10.5) K6 dwarf star TOI-500. The innermost planet is a transiting, Earth-sized USP planet with an orbital period of $\sim$ 13 hours, a mass of 1.42 $\pm$ 0.18 M$_{\oplus}$, a radius of $1.166^{0.061}_{-0.058}$ R$_{\oplus}$, and a mean density of 4.89$^{+1.03}_{-0.88}$ gcm$^{-3}$. Via Doppler spectroscopy, we discovered that the system hosts three outer planets on nearly circular orbits with periods of 6.6, 26.2, and 61.3d and minimum masses of 5.03 $\pm$ 0.41 M$_{\oplus}$, 33.12 $\pm$ 0.88 M$_{\oplus}$ and 15.05$^{+1.12}_{-1.11}$ M$_{\oplus}$, respectively. The presence of both a USP planet and a low-mass object on a 6.6-day orbit indicates that the architecture of this system can be explained via a scenario in which the planets started on low-eccentricity orbits, then moved inwards through a quasi-static secular migration. Our numerical simulations show that this migration channel can bring TOI-500 b to its current location in 2 Gyrs, starting from an initial orbit of 0.02au. TOI-500 is the first four planet system known to host a USP Earth analog whose current architecture can be explained via a non-violent migration scenario.
△ Less
Submitted 28 April, 2022;
originally announced April 2022.
-
The TESS-Keck Survey. XI. Mass Measurements for Four Transiting sub-Neptunes orbiting K dwarf TOI-1246
Authors:
Emma V. Turtelboom,
Lauren M. Weiss,
Courtney D. Dressing,
Grzegorz Nowak,
Enric Pallé,
Corey Beard,
Sarah Blunt,
Casey Brinkman,
Ashley Chontos,
Zachary R. Claytor,
Fei Dai,
Paul A. Dalba,
Steven Giacalone,
Erica Gonzales,
Caleb K. Harada,
Michelle L. Hill,
Rae Holcomb,
Judith Korth,
Jack Lubin,
Thomas Masseron,
Mason MacDougall,
Andrew W. Mayo,
Teo Močnik,
Joseph M. Akana Murphy,
Alex S. Polanski
, et al. (56 additional authors not shown)
Abstract:
Multi-planet systems are valuable arenas for investigating exoplanet architectures and comparing planetary siblings. TOI-1246 is one such system, with a moderately bright K dwarf ($\rm{V=11.6,~K=9.9}$) and four transiting sub-Neptunes identified by TESS with orbital periods of $4.31~\rm{d},~5.90~\rm{d},~18.66~\rm{d}$, and $~37.92~\rm{d}$. We collected 130 radial velocity observations with Keck/HIR…
▽ More
Multi-planet systems are valuable arenas for investigating exoplanet architectures and comparing planetary siblings. TOI-1246 is one such system, with a moderately bright K dwarf ($\rm{V=11.6,~K=9.9}$) and four transiting sub-Neptunes identified by TESS with orbital periods of $4.31~\rm{d},~5.90~\rm{d},~18.66~\rm{d}$, and $~37.92~\rm{d}$. We collected 130 radial velocity observations with Keck/HIRES and TNG/HARPS-N to measure planet masses. We refit the 14 sectors of TESS photometry to refine planet radii ($\rm{2.97 \pm 0.06~R_\oplus},\rm{2.47 \pm 0.08~R_\oplus}, \rm{3.46 \pm 0.09~R_\oplus}$, $\rm{3.72 \pm 0.16~R_\oplus}$), and confirm the four planets. We find that TOI-1246 e is substantially more massive than the three inner planets ($\rm{8.1 \pm 1.1 M_\oplus}$, $\rm{8.8 \pm 1.2 M_\oplus}$, $\rm{5.3 \pm 1.7 M_\oplus}$, $\rm{14.8 \pm 2.3 M_\oplus}$). The two outer planets, TOI-1246 d and TOI-1246 e, lie near to the 2:1 resonance ($\rm{P_{e}/P_{d}=2.03}$) and exhibit transit timing variations. TOI-1246 is one of the brightest four-planet systems, making it amenable for continued observations. It is one of only six systems with measured masses and radii for all four transiting planets. The planet densities range from $\rm{0.70 \pm 0.24}$ to $3.21 \pm 0.44 \rm{g/cm^3}$, implying a range of bulk and atmospheric compositions. We also report a fifth planet candidate found in the RV data with a minimum mass of 25.6 $\pm$ 3.6 $\rm{M_\oplus}$. This planet candidate is exterior to TOI-1246 e with a candidate period of 93.8 d, and we discuss the implications if it is confirmed to be planetary in nature.
△ Less
Submitted 25 April, 2022;
originally announced April 2022.
-
FastDiff: A Fast Conditional Diffusion Model for High-Quality Speech Synthesis
Authors:
Rongjie Huang,
Max W. Y. Lam,
Jun Wang,
Dan Su,
Dong Yu,
Yi Ren,
Zhou Zhao
Abstract:
Denoising diffusion probabilistic models (DDPMs) have recently achieved leading performances in many generative tasks. However, the inherited iterative sampling process costs hindered their applications to speech synthesis. This paper proposes FastDiff, a fast conditional diffusion model for high-quality speech synthesis. FastDiff employs a stack of time-aware location-variable convolutions of div…
▽ More
Denoising diffusion probabilistic models (DDPMs) have recently achieved leading performances in many generative tasks. However, the inherited iterative sampling process costs hindered their applications to speech synthesis. This paper proposes FastDiff, a fast conditional diffusion model for high-quality speech synthesis. FastDiff employs a stack of time-aware location-variable convolutions of diverse receptive field patterns to efficiently model long-term time dependencies with adaptive conditions. A noise schedule predictor is also adopted to reduce the sampling steps without sacrificing the generation quality. Based on FastDiff, we design an end-to-end text-to-speech synthesizer, FastDiff-TTS, which generates high-fidelity speech waveforms without any intermediate feature (e.g., Mel-spectrogram). Our evaluation of FastDiff demonstrates the state-of-the-art results with higher-quality (MOS 4.28) speech samples. Also, FastDiff enables a sampling speed of 58x faster than real-time on a V100 GPU, making diffusion models practically applicable to speech synthesis deployment for the first time. We further show that FastDiff generalized well to the mel-spectrogram inversion of unseen speakers, and FastDiff-TTS outperformed other competing methods in end-to-end text-to-speech synthesis. Audio samples are available at \url{https://FastDiff.github.io/}.
△ Less
Submitted 21 April, 2022;
originally announced April 2022.
-
UniGDD: A Unified Generative Framework for Goal-Oriented Document-Grounded Dialogue
Authors:
Chang Gao,
Wenxuan Zhang,
Wai Lam
Abstract:
The goal-oriented document-grounded dialogue aims at responding to the user query based on the dialogue context and supporting document. Existing studies tackle this problem by decomposing it into two sub-tasks: knowledge identification and response generation. However, such pipeline methods would unavoidably suffer from the error propagation issue. This paper proposes to unify these two sub-tasks…
▽ More
The goal-oriented document-grounded dialogue aims at responding to the user query based on the dialogue context and supporting document. Existing studies tackle this problem by decomposing it into two sub-tasks: knowledge identification and response generation. However, such pipeline methods would unavoidably suffer from the error propagation issue. This paper proposes to unify these two sub-tasks via sequentially generating the grounding knowledge and the response. We further develop a prompt-connected multi-task learning strategy to model the characteristics and connections of different tasks and introduce linear temperature scheduling to reduce the negative effect of irrelevant document information. Experimental results demonstrate the effectiveness of our framework.
△ Less
Submitted 16 April, 2022;
originally announced April 2022.
-
A Unified Multi-task Learning Framework for Multi-goal Conversational Recommender Systems
Authors:
Yang Deng,
Wenxuan Zhang,
Weiwen Xu,
Wenqiang Lei,
Tat-Seng Chua,
Wai Lam
Abstract:
Recent years witnessed several advances in developing multi-goal conversational recommender systems (MG-CRS) that can proactively attract users' interests and naturally lead user-engaged dialogues with multiple conversational goals and diverse topics. Four tasks are often involved in MG-CRS, including Goal Planning, Topic Prediction, Item Recommendation, and Response Generation. Most existing stud…
▽ More
Recent years witnessed several advances in developing multi-goal conversational recommender systems (MG-CRS) that can proactively attract users' interests and naturally lead user-engaged dialogues with multiple conversational goals and diverse topics. Four tasks are often involved in MG-CRS, including Goal Planning, Topic Prediction, Item Recommendation, and Response Generation. Most existing studies address only some of these tasks. To handle the whole problem of MG-CRS, modularized frameworks are adopted where each task is tackled independently without considering their interdependencies. In this work, we propose a novel Unified MultI-goal conversational recommeNDer system, namely UniMIND. In specific, we unify these four tasks with different formulations into the same sequence-to-sequence (Seq2Seq) paradigm. Prompt-based learning strategies are investigated to endow the unified model with the capability of multi-task learning. Finally, the overall learning and inference procedure consists of three stages, including multi-task learning, prompt-based tuning, and inference. Experimental results on two MG-CRS benchmarks (DuRecDial and TG-ReDial) show that UniMIND achieves state-of-the-art performance on all tasks with a unified model. Extensive analyses and discussions are provided for shedding some new perspectives for MG-CRS.
△ Less
Submitted 14 April, 2022;
originally announced April 2022.
-
The optimized point-coupling interaction for the relativistic energy density functional of Hartree-Bogoliubov approach quantifying the nuclear bulk properties
Authors:
Zi Xin Liu,
Yi Hua Lam,
Ning Lu,
Peter Ring
Abstract:
We propose a newly optimized nonlinear point-coupling parameterized interaction, PC-L3R, for the relativistic Hartree-Bogoliubov framework with a further optimized separable pairing force by fitting to observables, i.e., the binding energies of 91 spherical nuclei, charge radii of 63 nuclei, and 12 sets of mean pairing gaps consisting of 54 nuclei in total. The separable pairing force strengths of…
▽ More
We propose a newly optimized nonlinear point-coupling parameterized interaction, PC-L3R, for the relativistic Hartree-Bogoliubov framework with a further optimized separable pairing force by fitting to observables, i.e., the binding energies of 91 spherical nuclei, charge radii of 63 nuclei, and 12 sets of mean pairing gaps consisting of 54 nuclei in total. The separable pairing force strengths of proton and neutron are optimized together with the point-coupling constants, and are justified in satisfactory reproducing the empirical pairing gaps. The comparison of experimental binding energies compiled in AME2020 for 91 nuclei with the ones generated from the present and other commonly used point-coupling interactions indicates that the implementation of PC-L3R in relativistic Hartree-Bogoliubov yields the lowest root-mean-square deviation. The charge radii satisfactory agree with experiment. Meanwhile, PC-L3R is capable of estimating the saturation properties of the symmetric nuclear matter and of appropriately predicting the isospin and mass dependence of binding energy. The experimental odd-even staggering of single nucleon separation energies is well reproduced. The comparison of the estimated binding energies for 7,373 nuclei based on the PC-L3R and other point-coupling interactions is also presented.
△ Less
Submitted 8 May, 2023; v1 submitted 11 April, 2022;
originally announced April 2022.
-
Parameter-Efficient Tuning by Manipulating Hidden States of Pretrained Language Models For Classification Tasks
Authors:
Haoran Yang,
Piji Li,
Wai Lam
Abstract:
Parameter-efficient tuning aims to distill knowledge for downstream tasks by optimizing a few introduced parameters while freezing the pretrained language models (PLMs). Continuous prompt tuning which prepends a few trainable vectors to the embeddings of input is one of these methods and has drawn much attention due to its effectiveness and efficiency. This family of methods can be illustrated as…
▽ More
Parameter-efficient tuning aims to distill knowledge for downstream tasks by optimizing a few introduced parameters while freezing the pretrained language models (PLMs). Continuous prompt tuning which prepends a few trainable vectors to the embeddings of input is one of these methods and has drawn much attention due to its effectiveness and efficiency. This family of methods can be illustrated as exerting nonlinear transformations of hidden states inside PLMs. However, a natural question is ignored: can the hidden states be directly used for classification without changing them? In this paper, we aim to answer this question by proposing a simple tuning method which only introduces three trainable vectors. Firstly, we integrate all layers hidden states using the introduced vectors. And then, we input the integrated hidden state(s) to a task-specific linear classifier to predict categories. This scheme is similar to the way ELMo utilises hidden states except that they feed the hidden states to LSTM-based models. Although our proposed tuning scheme is simple, it achieves comparable performance with prompt tuning methods like P-tuning and P-tuning v2, verifying that original hidden states do contain useful information for classification tasks. Moreover, our method has an advantage over prompt tuning in terms of time and the number of parameters.
△ Less
Submitted 13 April, 2022; v1 submitted 10 April, 2022;
originally announced April 2022.
-
BDDM: Bilateral Denoising Diffusion Models for Fast and High-Quality Speech Synthesis
Authors:
Max W. Y. Lam,
Jun Wang,
Dan Su,
Dong Yu
Abstract:
Diffusion probabilistic models (DPMs) and their extensions have emerged as competitive generative models yet confront challenges of efficient sampling. We propose a new bilateral denoising diffusion model (BDDM) that parameterizes both the forward and reverse processes with a schedule network and a score network, which can train with a novel bilateral modeling objective. We show that the new surro…
▽ More
Diffusion probabilistic models (DPMs) and their extensions have emerged as competitive generative models yet confront challenges of efficient sampling. We propose a new bilateral denoising diffusion model (BDDM) that parameterizes both the forward and reverse processes with a schedule network and a score network, which can train with a novel bilateral modeling objective. We show that the new surrogate objective can achieve a lower bound of the log marginal likelihood tighter than a conventional surrogate. We also find that BDDM allows inheriting pre-trained score network parameters from any DPMs and consequently enables speedy and stable learning of the schedule network and optimization of a noise schedule for sampling. Our experiments demonstrate that BDDMs can generate high-fidelity audio samples with as few as three sampling steps. Moreover, compared to other state-of-the-art diffusion-based neural vocoders, BDDMs produce comparable or higher quality samples indistinguishable from human speech, notably with only seven sampling steps (143x faster than WaveGrad and 28.6x faster than DiffWave). We release our code at https://github.com/tencent-ailab/bddm.
△ Less
Submitted 25 March, 2022;
originally announced March 2022.
-
TOI-1670 b and c: An Inner Sub-Neptune with an Outer Warm Jupiter Unlikely to have Originated from High-Eccentricity Migration
Authors:
Quang H. Tran,
Brendan P. Bowler,
Michael Endl,
William D. Cochran,
Phillip J. MacQueen,
Davide Gandolfi,
Carina M. Persson,
Malcolm Fridlund,
Enric Palle,
Grzegorz Nowak,
Hans J. Deeg,
Rafael Luque,
John H. Livingston,
Petr Kabáth,
Marek Skarka,
Ján Šubjak,
Steve B. Howell,
Simon H. Albrecht,
Karen A. Collins,
Massimiliano Esposito,
Vincent Van Eylen,
Sascha Grziwa,
Elisa Goffo,
Chelsea X. Huang,
Jon M. Jenkins
, et al. (16 additional authors not shown)
Abstract:
We report the discovery of two transiting planets around the bright ($V=9.9$ mag) main sequence F7 star TOI-1670 by the Transiting Exoplanet Survey Satellite. TOI-1670 b is a sub-Neptune ($R_\mathrm{b} = 2.06_{-0.15}^{+0.19}$ $R_\oplus$) on a 10.9-day orbit and TOI-1670 c is a warm Jupiter ($R_\mathrm{c} = 0.987_{-0.025}^{+0.025}$ $R_\mathrm{Jup}$) on a 40.7-day orbit. Using radial velocity observ…
▽ More
We report the discovery of two transiting planets around the bright ($V=9.9$ mag) main sequence F7 star TOI-1670 by the Transiting Exoplanet Survey Satellite. TOI-1670 b is a sub-Neptune ($R_\mathrm{b} = 2.06_{-0.15}^{+0.19}$ $R_\oplus$) on a 10.9-day orbit and TOI-1670 c is a warm Jupiter ($R_\mathrm{c} = 0.987_{-0.025}^{+0.025}$ $R_\mathrm{Jup}$) on a 40.7-day orbit. Using radial velocity observations gathered with the Tull coudé Spectrograph on the Harlan J. Smith telescope and HARPS-N on the Telescopio Nazionale Galileo, we find a planet mass of $M_\mathrm{c} = 0.63_{-0.08}^{+0.09}$ $M_\mathrm{Jup}$ for the outer warm Jupiter, implying a mean density of $ρ_c = 0.81_{-0.11}^{+0.13}$ g cm$^{-3}$. The inner sub-Neptune is undetected in our radial velocity data ($M_\mathrm{b} < 0.13$ $M_\mathrm{Jup}$ at the 99% confidence level). Multi-planet systems like TOI-1670 hosting an outer warm Jupiter on a nearly circular orbit ($e_\mathrm{c} = 0.09_{-0.04}^{+0.05}$) and one or more inner coplanar planets are more consistent with "gentle" formation mechanisms such as disk migration or $in$ $situ$ formation rather than high-eccentricity migration. Of the 11 known systems with a warm Jupiter and a smaller inner companion, 8 (73%) are near a low-order mean-motion resonance, which can be a signature of migration. TOI-1670 joins two other systems (27% of this subsample) with period commensurabilities greater than 3, a common feature of $in$ $situ$ formation or halted inward migration. TOI-1670 and the handful of similar systems support a diversity of formation pathways for warm Jupiters.
△ Less
Submitted 8 March, 2022;
originally announced March 2022.
-
Delaunay decompositions minimizing energy of weighted toroidal graphs
Authors:
Wai Yeung Lam
Abstract:
Given a weighted toroidal graph, each realization to a Euclidean torus is associated with the Dirichlet energy. By minimizing the energy over all possible Euclidean structures and over all realizations within a fixed homotopy class, one obtains a harmonic map into an optimal Euclidean torus. We show that only with this optimal Euclidean structure, the harmonic map and the edge weights are induced…
▽ More
Given a weighted toroidal graph, each realization to a Euclidean torus is associated with the Dirichlet energy. By minimizing the energy over all possible Euclidean structures and over all realizations within a fixed homotopy class, one obtains a harmonic map into an optimal Euclidean torus. We show that only with this optimal Euclidean structure, the harmonic map and the edge weights are induced from a weighted Delaunay decomposition.
△ Less
Submitted 7 March, 2022;
originally announced March 2022.
-
A Survey on Aspect-Based Sentiment Analysis: Tasks, Methods, and Challenges
Authors:
Wenxuan Zhang,
Xin Li,
Yang Deng,
Lidong Bing,
Wai Lam
Abstract:
As an important fine-grained sentiment analysis problem, aspect-based sentiment analysis (ABSA), aiming to analyze and understand people's opinions at the aspect level, has been attracting considerable interest in the last decade. To handle ABSA in different scenarios, various tasks are introduced for analyzing different sentiment elements and their relations, including the aspect term, aspect cat…
▽ More
As an important fine-grained sentiment analysis problem, aspect-based sentiment analysis (ABSA), aiming to analyze and understand people's opinions at the aspect level, has been attracting considerable interest in the last decade. To handle ABSA in different scenarios, various tasks are introduced for analyzing different sentiment elements and their relations, including the aspect term, aspect category, opinion term, and sentiment polarity. Unlike early ABSA works focusing on a single sentiment element, many compound ABSA tasks involving multiple elements have been studied in recent years for capturing more complete aspect-level sentiment information. However, a systematic review of various ABSA tasks and their corresponding solutions is still lacking, which we aim to fill in this survey. More specifically, we provide a new taxonomy for ABSA which organizes existing studies from the axes of concerned sentiment elements, with an emphasis on recent advances of compound ABSA tasks. From the perspective of solutions, we summarize the utilization of pre-trained language models for ABSA, which improved the performance of ABSA to a new stage. Besides, techniques for building more practical ABSA systems in cross-domain/lingual scenarios are discussed. Finally, we review some emerging topics and discuss some open challenges to outlook potential future directions of ABSA.
△ Less
Submitted 6 November, 2022; v1 submitted 2 March, 2022;
originally announced March 2022.
-
A Radial Velocity Study of the Planetary System of Pi Mensae: Improved Planet Parameters for PI Mensae c and a Third Planet on a 125-d Orbit
Authors:
Artie P. Hatzes,
Davide Gandolfi,
Judith Korth,
Florian Rodler,
Silvia Sabotta,
Massimiliano Esposito,
Oscar Barragan,
Vincent Van Eylen John H. Livingston,
Luisa Maria Serrano,
Rafael Luque,
Alexis M. S. Smith,
Seth Redfield,
Carina M. Persson,
Martin Paetzold,
Enric Palle,
Grzegorz Nowak,
Hannah L. M. Osborne,
Norio Narita,
Savita Mathur,
Kristine W. F. Lam,
Petr Kabath,
Marshall C. Johnson,
Eike W. Guenther,
Sascha Grziwa,
Elisa Goffo
, et al. (11 additional authors not shown)
Abstract:
Pi Men hosts a transiting planet detected by the TESS space mission and an outer planet in a 5.7-yr orbit discovered by RV surveys. We studied this system using new radial velocity (RV) measurements taken with the HARPS spectrograph on ESO's 3.6-m telescope as well as archival data. We constrain the stellar RV semi-amplitude due to the transiting planet, Pi Men c, as K_c = 1.21 +/- 0.12 m/s result…
▽ More
Pi Men hosts a transiting planet detected by the TESS space mission and an outer planet in a 5.7-yr orbit discovered by RV surveys. We studied this system using new radial velocity (RV) measurements taken with the HARPS spectrograph on ESO's 3.6-m telescope as well as archival data. We constrain the stellar RV semi-amplitude due to the transiting planet, Pi Men c, as K_c = 1.21 +/- 0.12 m/s resulting in a planet mass of M_c = 3.63 +/- 0.38 M_Earth. A planet radius of R_c= 2.145 +/- 0.015 R_Earth yields a bulk density of rho = 2.03 +/- 0.22 g/cm^{-3}. The precisely determined density of this planet and the brightness of the host star make Pi Men c an excellent laboratory for internal structure and atmospheric characterization studies. Our HARPS RV measurements also reveal compelling evidence for a third body, PI Men d, with a minimum mass M sin i = 13.38 +/- 1.35 M_Earth orbiting with a period of P_d = 125 d on an eccentric orbit (e = 0.22). A simple dynamical analysis indicates that the orbit of Pi Men d is stable on timescales of at least 20 Myrs. Given the mutual inclination between the outer gaseous giant and the inner rocky planet and the presence of a third body at 125 d, Pi Men is an important planetary system for dynamical and formation studies.
△ Less
Submitted 3 March, 2022; v1 submitted 2 March, 2022;
originally announced March 2022.
-
Improving Lexical Embeddings for Robust Question Answering
Authors:
Weiwen Xu,
Bowei Zou,
Wai Lam,
Ai Ti Aw
Abstract:
Recent techniques in Question Answering (QA) have gained remarkable performance improvement with some QA models even surpassed human performance. However, the ability of these models in truly understanding the language still remains dubious and the models are revealing limitations when facing adversarial examples. To strengthen the robustness of QA models and their generalization ability, we propo…
▽ More
Recent techniques in Question Answering (QA) have gained remarkable performance improvement with some QA models even surpassed human performance. However, the ability of these models in truly understanding the language still remains dubious and the models are revealing limitations when facing adversarial examples. To strengthen the robustness of QA models and their generalization ability, we propose a representation Enhancement via Semantic and Context constraints (ESC) approach to improve the robustness of lexical embeddings. Specifically, we insert perturbations with semantic constraints and train enhanced contextual representations via a context-constraint loss to better distinguish the context clues for the correct answer. Experimental results show that our approach gains significant robustness improvement on four adversarial test sets.
△ Less
Submitted 28 February, 2022;
originally announced February 2022.
-
Domain Adversarial Spatial-Temporal Network: A Transferable Framework for Short-term Traffic Forecasting across Cities
Authors:
Yihong Tang,
Ao Qu,
Andy H. F. Chow,
William H. K. Lam,
S. C. Wong,
Wei Ma
Abstract:
Accurate real-time traffic forecast is critical for intelligent transportation systems (ITS) and it serves as the cornerstone of various smart mobility applications. Though this research area is dominated by deep learning, recent studies indicate that the accuracy improvement by developing new model structures is becoming marginal. Instead, we envision that the improvement can be achieved by trans…
▽ More
Accurate real-time traffic forecast is critical for intelligent transportation systems (ITS) and it serves as the cornerstone of various smart mobility applications. Though this research area is dominated by deep learning, recent studies indicate that the accuracy improvement by developing new model structures is becoming marginal. Instead, we envision that the improvement can be achieved by transferring the "forecasting-related knowledge" across cities with different data distributions and network topologies. To this end, this paper aims to propose a novel transferable traffic forecasting framework: Domain Adversarial Spatial-Temporal Network (DASTNet). DASTNet is pre-trained on multiple source networks and fine-tuned with the target network's traffic data. Specifically, we leverage the graph representation learning and adversarial domain adaptation techniques to learn the domain-invariant node embeddings, which are further incorporated to model the temporal traffic data. To the best of our knowledge, we are the first to employ adversarial multi-domain adaptation for network-wide traffic forecasting problems. DASTNet consistently outperforms all state-of-the-art baseline methods on three benchmark datasets. The trained DASTNet is applied to Hong Kong's new traffic detectors, and accurate traffic predictions can be delivered immediately (within one day) when the detector is available. Overall, this study suggests an alternative to enhance the traffic forecasting methods and provides practical implications for cities lacking historical traffic data.
△ Less
Submitted 19 August, 2022; v1 submitted 7 February, 2022;
originally announced February 2022.
-
User Satisfaction Estimation with Sequential Dialogue Act Modeling in Goal-oriented Conversational Systems
Authors:
Yang Deng,
Wenxuan Zhang,
Wai Lam,
Hong Cheng,
Helen Meng
Abstract:
User Satisfaction Estimation (USE) is an important yet challenging task in goal-oriented conversational systems. Whether the user is satisfied with the system largely depends on the fulfillment of the user's needs, which can be implicitly reflected by users' dialogue acts. However, existing studies often neglect the sequential transitions of dialogue act or rely heavily on annotated dialogue act l…
▽ More
User Satisfaction Estimation (USE) is an important yet challenging task in goal-oriented conversational systems. Whether the user is satisfied with the system largely depends on the fulfillment of the user's needs, which can be implicitly reflected by users' dialogue acts. However, existing studies often neglect the sequential transitions of dialogue act or rely heavily on annotated dialogue act labels when utilizing dialogue acts to facilitate USE. In this paper, we propose a novel framework, namely USDA, to incorporate the sequential dynamics of dialogue acts for predicting user satisfaction, by jointly learning User Satisfaction Estimation and Dialogue Act Recognition tasks. In specific, we first employ a Hierarchical Transformer to encode the whole dialogue context, with two task-adaptive pre-training strategies to be a second-phase in-domain pre-training for enhancing the dialogue modeling ability. In terms of the availability of dialogue act labels, we further develop two variants of USDA to capture the dialogue act information in either supervised or unsupervised manners. Finally, USDA leverages the sequential transitions of both content and act features in the dialogue to predict the user satisfaction. Experimental results on four benchmark goal-oriented dialogue datasets across different applications show that the proposed method substantially and consistently outperforms existing methods on USE, and validate the important role of dialogue act sequences in USE.
△ Less
Submitted 6 February, 2022;
originally announced February 2022.
-
Measuring the complexity of micro and nanostructured surfaces
Authors:
A. Arapis,
V. Constantoudis,
D. Kontziampasis,
A. Milionis,
C. W. E. Lam,
A. Tripathy,
D. Poulikakos,
E. Gogolides
Abstract:
Nanostructured surfaces usually exhibit complicated morphologies that cannot be described in terms of Euclidean geometry. Simultaneously, they do not constitute fully random noise fields to be characterized by simple stochastics and probability theory. In most cases, nanomorphologies consist of complicated mixtures of order and randomness, which should be described quantitatively if one aims to co…
▽ More
Nanostructured surfaces usually exhibit complicated morphologies that cannot be described in terms of Euclidean geometry. Simultaneously, they do not constitute fully random noise fields to be characterized by simple stochastics and probability theory. In most cases, nanomorphologies consist of complicated mixtures of order and randomness, which should be described quantitatively if one aims to control their fabrication and properties. In this work, inspired by recent developments in complexity theory, we propose a method to measure nanomorphology complexity that is based on the deviation from the average symmetry of surfaces. We present the methodology for its calculation and the validation of its performance, using a series of synthetic surfaces where the proposed complexity measure obtains a maximum value at the most heterogeneous morphologies between the fully ordered and fully random cases. Additionally, we measure the complexity of experimental micro and nanostructured surfaces (polymeric and metallic), and demonstrate the usefulness of the proposed method in quantifying the impact of processing conditions on their morphologies. Finally, we hint on the relationship between the complexity measure and the functional properties of surfaces.
△ Less
Submitted 2 February, 2022;
originally announced February 2022.
-
TOI-1268b: the youngest, hot, Saturn-mass transiting exoplanet
Authors:
J. Šubjak,
M. Endl,
P. Chaturvedi,
R. Karjalainen,
W. D. Cochran,
M. Esposito,
D. Gandolfi,
K. W. F. Lam,
K. Stassun,
J. Žák,
N. Lodieu,
H. M. J. Boffin,
P. J. MacQueen,
A. Hatzes,
E. W. Guenther,
I. Georgieva,
S. Grziwa,
H. Schmerling,
M. Skarka,
M. Blažek,
M. Karjalainen,
M. Špoková,
H. Isaacson,
A. W. Howard,
C. J. Burke
, et al. (19 additional authors not shown)
Abstract:
We report the discovery of TOI-1268b, a transiting Saturn-mass planet from the TESS space mission. With an age of less than one Gyr, derived from various age indicators, TOI-1268b is the youngest Saturn-mass planet known to date and contributes to the small sample of well characterised young planets. It has an orbital period of $P\,=\,8.1577080\pm0.0000044$ days, and transits an early K dwarf star…
▽ More
We report the discovery of TOI-1268b, a transiting Saturn-mass planet from the TESS space mission. With an age of less than one Gyr, derived from various age indicators, TOI-1268b is the youngest Saturn-mass planet known to date and contributes to the small sample of well characterised young planets. It has an orbital period of $P\,=\,8.1577080\pm0.0000044$ days, and transits an early K dwarf star with a mass of $M_\star$ = $ 0.96 \pm 0.04$ $M_{\odot}$, a radius of $R_\star$ = $ 0.92 \pm 0.06$ $R_{\odot}$, an effective temperature of $T_\mathrm{eff}\,=\,5300\pm100$ K, and a metallicity of $0.36\pm0.06$ dex. By combining TESS photometry with high-resolution spectra acquired with the Tull spectrograph at McDonald observatory, and the high-resolution spectrographs at Tautenburg and Ondrejov observatories, we measured a planetary mass of $M_\mathrm{p}\,=\,96.4 \pm 8.3\,M_{\oplus}$ and a radius of $R_\mathrm{p}\,=\,9.1 \pm 0.6\,R_{\oplus}$. TOI-1268 is an ideal system to study the role of star-planet tidal interactions for non-inflated Saturn-mass planets. We used system parameters derived in this paper to constrain the planet tidal quality factor to the range of $10^{4.5-5.3}$. When compared with the sample of other non-inflated Saturn-mass planets, TOI-1268b is one of the best candidates for transmission spectroscopy studies.
△ Less
Submitted 23 February, 2022; v1 submitted 31 January, 2022;
originally announced January 2022.
-
Towards Personalized Answer Generation in E-Commerce via Multi-Perspective Preference Modeling
Authors:
Yang Deng,
Yaliang Li,
Wenxuan Zhang,
Bolin Ding,
Wai Lam
Abstract:
Recently, Product Question Answering (PQA) on E-Commerce platforms has attracted increasing attention as it can act as an intelligent online shopping assistant and improve the customer shopping experience. Its key function, automatic answer generation for product-related questions, has been studied by aiming to generate content-preserving while question-related answers. However, an important chara…
▽ More
Recently, Product Question Answering (PQA) on E-Commerce platforms has attracted increasing attention as it can act as an intelligent online shopping assistant and improve the customer shopping experience. Its key function, automatic answer generation for product-related questions, has been studied by aiming to generate content-preserving while question-related answers. However, an important characteristic of PQA, i.e., personalization, is neglected by existing methods. It is insufficient to provide the same "completely summarized" answer to all customers, since many customers are more willing to see personalized answers with customized information only for themselves, by taking into consideration their own preferences towards product aspects or information needs. To tackle this challenge, we propose a novel Personalized Answer GEneration method (PAGE) with multi-perspective preference modeling, which explores historical user-generated contents to model user preference for generating personalized answers in PQA. Specifically, we first retrieve question-related user history as external knowledge to model knowledge-level user preference. Then we leverage Gaussian Softmax distribution model to capture latent aspect-level user preference. Finally, we develop a persona-aware pointer network to generate personalized answers in terms of both content and style by utilizing personal user preference and dynamic user vocabulary. Experimental results on real-world E-Commerce QA datasets demonstrate that the proposed method outperforms existing methods by generating informative and customized answers, and show that answer generation in E-Commerce can benefit from personalization.
△ Less
Submitted 27 December, 2021;
originally announced December 2021.
-
Automatic Meta-Path Discovery for Effective Graph-Based Recommendation
Authors:
Wentao Ning,
Reynold Cheng,
Jiajun Shen,
Nur Al Hasan Haldar,
Ben Kao,
Xiao Yan,
Nan Huo,
Wai Kit Lam,
Tian Li,
Bo Tang
Abstract:
Heterogeneous Information Networks (HINs) are labeled graphs that depict relationships among different types of entities (e.g., users, movies and directors). For HINs, meta-path-based recommenders (MPRs) utilize meta-paths (i.e., abstract paths consisting of node and link types) to predict user preference, and have attracted a lot of attention due to their explainability and performance. We observ…
▽ More
Heterogeneous Information Networks (HINs) are labeled graphs that depict relationships among different types of entities (e.g., users, movies and directors). For HINs, meta-path-based recommenders (MPRs) utilize meta-paths (i.e., abstract paths consisting of node and link types) to predict user preference, and have attracted a lot of attention due to their explainability and performance. We observe that the performance of MPRs is highly sensitive to the meta-paths they use, but existing works manually select the meta-paths from many possible ones. Thus, to discover effective meta-paths automatically, we propose the Reinforcement learning-based Meta-path Selection (RMS) framework. Specifically, we define a vector encoding for meta-paths and design a policy network to extend meta-paths. The policy network is trained based on the results of downstream recommendation tasks and an early stopping approximation strategy is proposed to speed up training. RMS is a general model, and it can work with all existing MPRs. We also propose a new MPR called RMS-HRec, which uses an attention mechanism to aggregate information from the meta-paths. We conduct extensive experiments on real datasets. Compared with the manually selected meta-paths, the meta-paths identified by RMS consistently improve recommendation quality. Moreover, RMS-HRec outperforms state-of-the-art recommender systems by an average of 7% in hit ratio. The codes and datasets are available on https://github.com/Stevenn9981/RMS-HRec.
△ Less
Submitted 7 September, 2022; v1 submitted 23 December, 2021;
originally announced December 2021.
-
GJ 367b: A dense ultra-short period sub-Earth planet transiting a nearby red dwarf star
Authors:
Kristine W. F. Lam,
Szilárd Csizmadia,
Nicola Astudillo-Defru,
Xavier Bonfils,
Davide Gandolfi,
Sebastiano Padovan,
Massimiliano Esposito,
Coel Hellier,
Teruyuki Hirano,
John Livingston,
Felipe Murgas,
Alexis M. S. Smith,
Karen A. Collins,
Savita Mathur,
Rafael A. Garcia,
Steve B. Howell,
Nuno C. Santos,
Fei Dai,
George R. Ricker,
Roland Vanderspek,
David W. Latham,
Sara Seager,
Joshua N. Winn,
Jon M. Jenkins,
Simon Albrecht
, et al. (53 additional authors not shown)
Abstract:
Ultra-short-period (USP) exoplanets have orbital periods shorter than one day. Precise masses and radii of USPs could provide constraints on their unknown formation and evolution processes. We report the detection and characterization of the USP planet GJ 367b using high precision photometry and radial velocity observations. GJ 367b orbits a bright (V-band magnitude = 10.2), nearby, red (M-type) d…
▽ More
Ultra-short-period (USP) exoplanets have orbital periods shorter than one day. Precise masses and radii of USPs could provide constraints on their unknown formation and evolution processes. We report the detection and characterization of the USP planet GJ 367b using high precision photometry and radial velocity observations. GJ 367b orbits a bright (V-band magnitude = 10.2), nearby, red (M-type) dwarf star every 7.7 hours. GJ 367b has a radius of $0.718 \pm 0.054$ Earth-radii, a mass of $0.546 \pm 0.078$ Earth-masses, making it a sub-Earth. The corresponding bulk density is $8.106 \pm 2.165$ g cm$^-3$, close to that of iron. An interior structure model predicts the planet has an iron core radius fraction of $86 \pm 5\%$, similar to Mercury's interior.
△ Less
Submitted 2 December, 2021;
originally announced December 2021.
-
K2-99 revisited: a non-inflated warm Jupiter, and a temperate giant planet on a 522-d orbit around a subgiant
Authors:
A. M. S. Smith,
S. N. Breton,
Sz. Csizmadia,
F. Dai,
D. Gandolfi,
R. A. García,
A. W. Howard,
H. Isaacson,
J. Korth,
K. W. F. Lam,
S. Mathur,
G. Nowak,
F. Pérez Hernández,
C. M. Persson,
S. H. Albrecht,
O. Barragán,
J. Cabrera,
W. D. Cochran,
H. J. Deeg,
M. Fridlund,
I. Y. Georgieva,
E. Goffo,
E. W. Guenther,
A. P. Hatzes,
P. Kabath
, et al. (7 additional authors not shown)
Abstract:
We report new photometric and spectroscopic observations of the K2-99 planetary system. Asteroseismic analysis of the short-cadence light curve from K2's Campaign 17 allows us to refine the stellar properties. We find K2-99 to be significantly smaller than previously thought, with $R_{\star} = 2.55\pm0.02$ $\mathrm{R_\odot}$. The new light curve also contains four transits of K2-99b, which we use…
▽ More
We report new photometric and spectroscopic observations of the K2-99 planetary system. Asteroseismic analysis of the short-cadence light curve from K2's Campaign 17 allows us to refine the stellar properties. We find K2-99 to be significantly smaller than previously thought, with $R_{\star} = 2.55\pm0.02$ $\mathrm{R_\odot}$. The new light curve also contains four transits of K2-99b, which we use to improve our knowledge of the planetary properties. We find the planet to be a non-inflated warm Jupiter, with $R_\mathrm{b} = 1.06 \pm 0.01$ $\mathrm{R_{Jup}}$. Sixty new radial velocity measurements from HARPS, HARPS-N, and HIRES enable the determination of the orbital parameters of K2-99c, which were previously poorly constrained. We find that this outer planet has a minimum mass $M_\mathrm{c} \sin i_\mathrm{c} = 8.4\pm0.2$ $\mathrm{M_{Jup}}$, and an eccentric orbit ($e_\mathrm{c} = 0.210 \pm 0.009$) with a period of $522.2\pm1.4$ d. Upcoming TESS observations in 2022 have a good chance of detecting the transit of this planet, if the mutual inclination between the two planetary orbits is small.
△ Less
Submitted 29 November, 2021;
originally announced November 2021.
-
Partner Personas Generation for Diverse Dialogue Generation
Authors:
Hongyuan Lu,
Wai Lam,
Hong Cheng,
Helen M. Meng
Abstract:
Incorporating personas information allows diverse and engaging responses in dialogue response generation. Unfortunately, prior works have primarily focused on self personas and have overlooked the value of partner personas. Moreover, in practical applications, the availability of ground truth partner personas is often not the case. This paper attempts to tackle these issues by offering a novel fra…
▽ More
Incorporating personas information allows diverse and engaging responses in dialogue response generation. Unfortunately, prior works have primarily focused on self personas and have overlooked the value of partner personas. Moreover, in practical applications, the availability of ground truth partner personas is often not the case. This paper attempts to tackle these issues by offering a novel framework that leverages automatic partner personas generation to enhance the succeeding dialogue generation. We incorporate reinforcement learning with a dedicatedly designed critic network for reward judgement. Experimental results from both automatic and human evaluation demonstrate a) Our framework is capable of generating relevant, informative and coherent partner personas, even compared to the ground truth partner personas. b) Generated partner personas enhance the succeeding response generation, thus surpassing our baselines and comparison model when partner personas are missing during the inference stage. c) Our framework generates responses that are more informative and engaging than our baseline conditioned on the ground truth partner personas during inference. d) Our dedicatedly designed critic network reinforces our framework effectively. Finally, our framework gives better explainability and reduces the demands for external databases for partner personas.
△ Less
Submitted 27 November, 2021;
originally announced November 2021.
-
Sentiment Analysis of Fashion Related Posts in Social Media
Authors:
Yifei Yuan,
Wai Lam
Abstract:
The role of social media in fashion industry has been blooming as the years have continued on. In this work, we investigate sentiment analysis for fashion related posts in social media platforms. There are two main challenges of this task. On the first place, information of different modalities must be jointly considered to make the final predictions. On the second place, some unique fashion relat…
▽ More
The role of social media in fashion industry has been blooming as the years have continued on. In this work, we investigate sentiment analysis for fashion related posts in social media platforms. There are two main challenges of this task. On the first place, information of different modalities must be jointly considered to make the final predictions. On the second place, some unique fashion related attributes should be taken into account. While most existing works focus on traditional multimodal sentiment analysis, they always fail to exploit the fashion related attributes in this task. We propose a novel framework that jointly leverages the image vision, post text, as well as fashion attribute modality to determine the sentiment category. One characteristic of our model is that it extracts fashion attributes and integrates them with the image vision information for effective representation. Furthermore, it exploits the mutual relationship between the fashion attributes and the post texts via a mutual attention mechanism. Since there is no existing dataset suitable for this task, we prepare a large-scale sentiment analysis dataset of over 12k fashion related social media posts. Extensive experiments are conducted to demonstrate the effectiveness of our model.
△ Less
Submitted 15 November, 2021;
originally announced November 2021.
-
Turning Traffic Monitoring Cameras into Intelligent Sensors for Traffic Density Estimation
Authors:
Zijian Hu,
William H. K. Lam,
S. C. Wong,
Andy H. F. Chow,
Wei Ma
Abstract:
Accurate traffic state information plays a pivotal role in the Intelligent Transportation Systems (ITS), and it is an essential input to various smart mobility applications such as signal coordination and traffic flow prediction. The current practice to obtain the traffic state information is through specialized sensors such as loop detectors and speed cameras. In most metropolitan areas, traffic…
▽ More
Accurate traffic state information plays a pivotal role in the Intelligent Transportation Systems (ITS), and it is an essential input to various smart mobility applications such as signal coordination and traffic flow prediction. The current practice to obtain the traffic state information is through specialized sensors such as loop detectors and speed cameras. In most metropolitan areas, traffic monitoring cameras have been installed to monitor the traffic conditions on arterial roads and expressways, and the collected videos or images are mainly used for visual inspection by traffic engineers. Unfortunately, the data collected from traffic monitoring cameras are affected by the 4L characteristics: Low frame rate, Low resolution, Lack of annotated data, and Located in complex road environments. Therefore, despite the great potentials of the traffic monitoring cameras, the 4L characteristics hinder them from providing useful traffic state information (e.g., speed, flow, density). This paper focuses on the traffic density estimation problem as it is widely applicable to various traffic surveillance systems. To the best of our knowledge, there is a lack of the holistic framework for addressing the 4L characteristics and extracting the traffic density information from traffic monitoring camera data. In view of this, this paper proposes a framework for estimating traffic density using uncalibrated traffic monitoring cameras with 4L characteristics. The proposed framework consists of two major components: camera calibration and vehicle detection. The camera calibration method estimates the actual length between pixels in the images and videos, and the vehicle counts are extracted from the deep-learning-based vehicle detection method. Combining the two components, high-granular traffic density can be estimated. To validate the proposed framework, two case studies were conducted in Hong Kong and Sacramento. The results show that the Mean Absolute Error (MAE) in camera calibration is less than 0.2 meters out of 6 meters, and the accuracy of vehicle detection under various conditions is approximately 90%. Overall, the MAE for the estimated density is 9.04 veh/km/lane in Hong Kong and 1.30 veh/km/lane in Sacramento. The research outcomes can be used to calibrate the speed-density fundamental diagrams, and the proposed framework can provide accurate and real-time traffic information without installing additional sensors.
△ Less
Submitted 29 October, 2021;
originally announced November 2021.
-
Kaleidoscope Eyes: Microstructure and Optical Performance of Chiton Ocelli
Authors:
Leanne Friedrich,
Wai Sze Lam,
Lyle Gordon,
Paul Smeets,
Robert Free,
Lesley Brooker,
Russell Chipman,
Derk Joester
Abstract:
The chiton Acanthopleura granulata uses aragonitic lenses embedded in its shell to focus light onto photoreceptors. Because aragonite is biaxially birefringent, the microstructure of the lens greatly impacts the optical performance. In addition, the chiton lives in the intertidal, so lenses experience two environments with different refractive indices: air and water. Using EBSD, we find that the l…
▽ More
The chiton Acanthopleura granulata uses aragonitic lenses embedded in its shell to focus light onto photoreceptors. Because aragonite is biaxially birefringent, the microstructure of the lens greatly impacts the optical performance. In addition, the chiton lives in the intertidal, so lenses experience two environments with different refractive indices: air and water. Using EBSD, we find that the lens is polycrystalline and contains curved grain boundaries. A combination of large, twinned grains and nanotwins ensure that the aragonitic <001> axis is consistent across the lens. However, the orientation of the <001> axis relative to the shell varies between lenses. Ray tracing simulations predict the optical performance of lenses of various microstructures in wet and dry environments. Though twinning helps to limit birefringence-induced aberrations, variations in the orientation of the <001> axis between lenses lead to variations in focal lengths between lenses and cause image doubling in some lenses. As such, the birefringence of aragonite does not help the lens to transmit focused images in both air and water.
△ Less
Submitted 12 September, 2021;
originally announced October 2021.
-
The Impact of the New $^{65\!}$As(p,$γ$)$^{66\!}$Se Reaction Rate on the Two-Proton Sequential Capture of $^{64}\!$Ge, Weak GeAs Cycles, and Type-I X-Ray Bursts such as the Clocked Burster GS 1826$-$24
Authors:
Yi Hua Lam,
Zi Xin Liu,
Alexander Heger,
Ning Lu,
Adam Michael Jacobs,
Zac Johnston
Abstract:
We re-assess $^{65}$As(p,$γ$)$^{66}$Se reaction rates based on a set of proton thresholds of $^{66}$Se, $S_\mathrm{p}$($^{66}$Se), estimated from the experimental mirror nuclear masses, theoretical mirror displacement energies, and full $pf$-model space shell-model calculation. The self-consistent relativistic Hartree-Bogoliubov theory is employed to obtain the mirror displacement energies with mu…
▽ More
We re-assess $^{65}$As(p,$γ$)$^{66}$Se reaction rates based on a set of proton thresholds of $^{66}$Se, $S_\mathrm{p}$($^{66}$Se), estimated from the experimental mirror nuclear masses, theoretical mirror displacement energies, and full $pf$-model space shell-model calculation. The self-consistent relativistic Hartree-Bogoliubov theory is employed to obtain the mirror displacement energies with much reduced uncertainty, and thus reducing the proton-threshold uncertainty up to 161 keV compared to the AME2020 evaluation. Using the simulation instantiated by the one-dimensional multi-zone hydrodynamic code, KEPLER, that closely reproduces the observed GS 1826$-$24 clocked bursts, the present forward and reverse $^{65}$As(p,$γ$)$^{66}$Se reaction rates based on a selected $S_\mathrm{p}$($^{66}$Se) = 2.469$\pm$0.054 MeV, and the latest $^{22}$Mg($α$,p)$^{25}$Al, $^{56}$Ni(p,$γ$)$^{57}$Cu(p,$γ$)$^{58}$Zn, $^{55}$Ni(p,$γ$)$^{56}$Cu, and $^{64}$Ge(p,$γ$)$^{65}$As reaction rates, we find that though the GeAs cycles is weakly established in the rapid-proton capture process path, the $^{65}$As(p,$γ$)$^{66}$Se reaction still strongly characterizes the burst tail end due to the two-proton sequential capture on $^{64}$Ge, not found by Cyburt et al. (2016) sensitivity study. The $^{65}$As(p,$γ$)$^{66}$Se reaction influences the abundances of nuclei $A$ = 64, 68, 72, 76, and 80 up to a factor of 1.4. The new $S_\mathrm{p}$($^{66}$Se) and the inclusion of the updated $^{22}$Mg($α$,p)$^{25}$Al reaction rate increases the production of $^{12}$C up to a factor of $4.5$ that is not observable and could be the main fuel for superburst. The waiting point status of and two-proton sequential capture on $^{64}$Ge, weak-cycle feature of GeAs at region heavier than $^{64}$Ge, and impact of other possible $S_\mathrm{p}$($^{66}$Se) are also discussed.
△ Less
Submitted 11 January, 2022; v1 submitted 26 October, 2021;
originally announced October 2021.
-
Aspect Sentiment Quad Prediction as Paraphrase Generation
Authors:
Wenxuan Zhang,
Yang Deng,
Xin Li,
Yifei Yuan,
Lidong Bing,
Wai Lam
Abstract:
Aspect-based sentiment analysis (ABSA) has been extensively studied in recent years, which typically involves four fundamental sentiment elements, including the aspect category, aspect term, opinion term, and sentiment polarity. Existing studies usually consider the detection of partial sentiment elements, instead of predicting the four elements in one shot. In this work, we introduce the Aspect S…
▽ More
Aspect-based sentiment analysis (ABSA) has been extensively studied in recent years, which typically involves four fundamental sentiment elements, including the aspect category, aspect term, opinion term, and sentiment polarity. Existing studies usually consider the detection of partial sentiment elements, instead of predicting the four elements in one shot. In this work, we introduce the Aspect Sentiment Quad Prediction (ASQP) task, aiming to jointly detect all sentiment elements in quads for a given opinionated sentence, which can reveal a more comprehensive and complete aspect-level sentiment structure. We further propose a novel \textsc{Paraphrase} modeling paradigm to cast the ASQP task to a paraphrase generation process. On one hand, the generation formulation allows solving ASQP in an end-to-end manner, alleviating the potential error propagation in the pipeline solution. On the other hand, the semantics of the sentiment elements can be fully exploited by learning to generate them in the natural language form. Extensive experiments on benchmark datasets show the superiority of our proposed method and the capacity of cross-task transfer with the proposed unified \textsc{Paraphrase} modeling framework.
△ Less
Submitted 2 October, 2021;
originally announced October 2021.
-
Multilingual AMR Parsing with Noisy Knowledge Distillation
Authors:
Deng Cai,
Xin Li,
Jackie Chun-Sing Ho,
Lidong Bing,
Wai Lam
Abstract:
We study multilingual AMR parsing from the perspective of knowledge distillation, where the aim is to learn and improve a multilingual AMR parser by using an existing English parser as its teacher. We constrain our exploration in a strict multilingual setting: there is but one model to parse all different languages including English. We identify that noisy input and precise output are the key to s…
▽ More
We study multilingual AMR parsing from the perspective of knowledge distillation, where the aim is to learn and improve a multilingual AMR parser by using an existing English parser as its teacher. We constrain our exploration in a strict multilingual setting: there is but one model to parse all different languages including English. We identify that noisy input and precise output are the key to successful distillation. Together with extensive pre-training, we obtain an AMR parser whose performances surpass all previously published results on four different foreign languages, including German, Spanish, Italian, and Chinese, by large margins (up to 18.8 \textsc{Smatch} points on Chinese and on average 11.3 \textsc{Smatch} points). Our parser also achieves comparable performance on English to the latest state-of-the-art English-only parser.
△ Less
Submitted 13 October, 2021; v1 submitted 30 September, 2021;
originally announced September 2021.
-
Isobaric Multiplet Mass Equation for $A \le 71$ Revisited
Authors:
Yi Hua Lam,
Bertram Blank,
Nadezda A. Smirnova,
Jean Bernard Bueb,
Maria Susai Antony
Abstract:
Accurate mass determination of short-lived nuclides by Penning-trap spectrometers and progress in the spectroscopy of proton-rich nuclei have triggered renewed interest in the isobaric multiplet mass equation (IMME). The energy levels of the members of $T=1/2, 1, 3/2,$ and 2 multiplets and the coefficients of the IMME are tabulated for $A\le 71$. The new compilation is based on the most recent mas…
▽ More
Accurate mass determination of short-lived nuclides by Penning-trap spectrometers and progress in the spectroscopy of proton-rich nuclei have triggered renewed interest in the isobaric multiplet mass equation (IMME). The energy levels of the members of $T=1/2, 1, 3/2,$ and 2 multiplets and the coefficients of the IMME are tabulated for $A\le 71$. The new compilation is based on the most recent mass evaluation (AME2011) and it includes the experimental results on energies of the states evaluated up to end of 2011. Taking into account the error bars, a significant deviation from the quadratic form of the IMME for the $A=9, 35$ quartets and the $A=32$ quintet is observed.
△ Less
Submitted 27 September, 2021;
originally announced September 2021.
-
Exploiting Reasoning Chains for Multi-hop Science Question Answering
Authors:
Weiwen Xu,
Yang Deng,
Huihui Zhang,
Deng Cai,
Wai Lam
Abstract:
We propose a novel Chain Guided Retriever-reader ({\tt CGR}) framework to model the reasoning chain for multi-hop Science Question Answering. Our framework is capable of performing explainable reasoning without the need of any corpus-specific annotations, such as the ground-truth reasoning chain, or human-annotated entity mentions. Specifically, we first generate reasoning chains from a semantic g…
▽ More
We propose a novel Chain Guided Retriever-reader ({\tt CGR}) framework to model the reasoning chain for multi-hop Science Question Answering. Our framework is capable of performing explainable reasoning without the need of any corpus-specific annotations, such as the ground-truth reasoning chain, or human-annotated entity mentions. Specifically, we first generate reasoning chains from a semantic graph constructed by Abstract Meaning Representation of retrieved evidence facts. A \textit{Chain-aware loss}, concerning both local and global chain information, is also designed to enable the generated chains to serve as distant supervision signals for training the retriever, where reinforcement learning is also adopted to maximize the utility of the reasoning chains. Our framework allows the retriever to capture step-by-step clues of the entire reasoning process, which is not only shown to be effective on two challenging multi-hop Science QA tasks, namely OpenBookQA and ARC-Challenge, but also favors explainability.
△ Less
Submitted 7 September, 2021;
originally announced September 2021.
-
Contrastive Representation Learning for Exemplar-Guided Paraphrase Generation
Authors:
Haoran Yang,
Wai Lam,
Piji Li
Abstract:
Exemplar-Guided Paraphrase Generation (EGPG) aims to generate a target sentence which conforms to the style of the given exemplar while encapsulating the content information of the source sentence. In this paper, we propose a new method with the goal of learning a better representation of the style andthe content. This method is mainly motivated by the recent success of contrastive learning which…
▽ More
Exemplar-Guided Paraphrase Generation (EGPG) aims to generate a target sentence which conforms to the style of the given exemplar while encapsulating the content information of the source sentence. In this paper, we propose a new method with the goal of learning a better representation of the style andthe content. This method is mainly motivated by the recent success of contrastive learning which has demonstrated its power in unsupervised feature extraction tasks. The idea is to design two contrastive losses with respect to the content and the style by considering two problem characteristics during training. One characteristic is that the target sentence shares the same content with the source sentence, and the second characteristic is that the target sentence shares the same style with the exemplar. These two contrastive losses are incorporated into the general encoder-decoder paradigm. Experiments on two datasets, namely QQP-Pos and ParaNMT, demonstrate the effectiveness of our proposed constrastive losses.
△ Less
Submitted 3 September, 2021;
originally announced September 2021.
-
Transitions for exceptional times in dynamical first-passage percolation
Authors:
Michael Damron,
Jack Hanson,
David Harper,
Wai-Kit Lam
Abstract:
In first-passage percolation (FPP), we let $(τ_v)$ be i.i.d. nonnegative weights on the vertices of a graph and study the weight of the minimal path between distant vertices. If $F$ is the distribution function of $τ_v$, there are different regimes: if $F(0)$ is small, this weight typically grows like a linear function of the distance, and when $F(0)$ is large, the weight is typically of order one…
▽ More
In first-passage percolation (FPP), we let $(τ_v)$ be i.i.d. nonnegative weights on the vertices of a graph and study the weight of the minimal path between distant vertices. If $F$ is the distribution function of $τ_v$, there are different regimes: if $F(0)$ is small, this weight typically grows like a linear function of the distance, and when $F(0)$ is large, the weight is typically of order one. In between these is the critical regime in which the weight can diverge, but does so sublinearly. We study a dynamical version of critical FPP on the triangular lattice where vertices resample their weights according to independent rate-one Poisson processes. We prove that if $\sum F^{-1}(1/2+1/2^k) = \infty$, then a.s. there are exceptional times at which the weight grows atypically, but if $\sum k^{7/8} F^{-1}(1/2+1/2^k) <\infty$, then a.s. there are no such times. Furthermore, in the former case, we compute the Hausdorff and Minkowski dimensions of the exceptional set and show that they can be but need not be equal. These results show a wider range of dynamical behavior than one sees in subcritical (usual) FPP.
△ Less
Submitted 30 August, 2021;
originally announced August 2021.
-
Sentence Structure and Word Relationship Modeling for Emphasis Selection
Authors:
Haoran Yang,
Wai Lam
Abstract:
Emphasis Selection is a newly proposed task which focuses on choosing words for emphasis in short sentences. Traditional methods only consider the sequence information of a sentence while ignoring the rich sentence structure and word relationship information. In this paper, we propose a new framework that considers sentence structure via a sentence structure graph and word relationship via a word…
▽ More
Emphasis Selection is a newly proposed task which focuses on choosing words for emphasis in short sentences. Traditional methods only consider the sequence information of a sentence while ignoring the rich sentence structure and word relationship information. In this paper, we propose a new framework that considers sentence structure via a sentence structure graph and word relationship via a word similarity graph. The sentence structure graph is derived from the parse tree of a sentence. The word similarity graph allows nodes to share information with their neighbors since we argue that in emphasis selection, similar words are more likely to be emphasized together. Graph neural networks are employed to learn the representation of each node of these two graphs. Experimental results demonstrate that our framework can achieve superior performance.
△ Less
Submitted 29 August, 2021;
originally announced August 2021.
-
Enhancing condensation on soft substrates through bulk lubricant infusion
Authors:
Chander Shekhar Sharma,
Athanasios Milionis,
Abhinav Naga,
Cheuk Wing Edmond Lam,
Gabriel Rodriguez,
Marco Francesco Del Ponte,
Valentina Negri,
Hopf Raoul,
Maria D'Acunzi,
Hans-Jürgen Butt,
Doris Vollmer,
Dimos Poulikakos
Abstract:
Soft substrates such as polydimethylsiloxane (PDMS) enhance droplet nucleation during the condensation of water vapour, because their deformability inherently reduces the energetic threshold for heterogeneous nucleation relative to rigid substrates. However, this enhanced droplet nucleation is counteracted later in the condensation cycle, when the viscoelastic dissipation inhibits condensate dropl…
▽ More
Soft substrates such as polydimethylsiloxane (PDMS) enhance droplet nucleation during the condensation of water vapour, because their deformability inherently reduces the energetic threshold for heterogeneous nucleation relative to rigid substrates. However, this enhanced droplet nucleation is counteracted later in the condensation cycle, when the viscoelastic dissipation inhibits condensate droplet shedding from the substrate. Here, we show that bulk lubricant infusion in the soft substrate is a potential pathway for overcoming this limitation. We demonstrate that even 5% bulk lubricant infusion in PDMS reduces viscoelastic dissipation in the substrate by more than 30 times and more than doubles the droplet nucleation density. We correlate the droplet nucleation and growth rate with the material properties controlled by design, i.e. the fraction and composition of uncrosslinked chains, shear modulus, and viscoelastic dissipation. Through in-situ, microscale condensation on the substrates, we show that the increase in nucleation density and reduction in pre-coalescence droplet growth rate is insensitive to the percentage of lubricant in PDMS. Our results indicate the presence of a lubricant layer on the substrate surface that cloaks the growing condensate droplets. We visualize the cloaking effect and show that lubricant infusion in PDMS significantly increases the rate of cloaking compared to PDMS without any lubricant infusion. Finally, we show that the overall enhanced condensation due to bulk lubricant infusion in PDMS leads to more than 40% increase in dewing on the substrate.
△ Less
Submitted 26 August, 2021;
originally announced August 2021.
-
The power of wavelets in analysis of transit and phase curves in presence of stellar variability and instrumental noise I. Method and validation
Authors:
Sz. Csizmadia,
A. M. S. Smith,
J. Cabrera,
P. Klagyivik,
A. Chaushev,
K. W. F. Lam
Abstract:
Stellar photometric variability and instrumental effects, like cosmic ray hits, data discontinuities, data leaks, instrument aging etc. cause difficulties in the characterization of exoplanets and have an impact on the accuracy and precision of the modelling and detectability of transits, occultations and phase curves. This paper aims to make an attempt to improve the transit, occultation and phas…
▽ More
Stellar photometric variability and instrumental effects, like cosmic ray hits, data discontinuities, data leaks, instrument aging etc. cause difficulties in the characterization of exoplanets and have an impact on the accuracy and precision of the modelling and detectability of transits, occultations and phase curves. This paper aims to make an attempt to improve the transit, occultation and phase-curve modelling in the presence of strong stellar variability and instrumental noise. We invoke the wavelet-formulation to reach this goal. We explore the capabilities of the software package Transit and Light Curve Modeller (TLCM). It is able to perform a joint radial velocity and light curve fit or light curve fit only. It models the transit, occultation, beaming, ellipsoidal and reflection effects in the light curves (including the gravity darkening effect, too). The red-noise, the stellar variability and instrumental effects are modelled via wavelets. The wavelet-fit is constrained by prescribing that the final white noise level must be equal to the average of the uncertainties of the photometric data points. This helps to avoid the overfit and regularizes the noise model. The approach was tested by injecting synthetic light curves into Kepler's short cadence data and then modelling them. The method performs well over a certain signal-to-noise (S/N) ratio. In general a S/N ratio of 10 is needed to get good results but some parameters requires larger S/N, some others can be retrieved at lower S/Ns. We give limits in terms of signal-to-noise ratio for every studied system parameter which is needed to accurate parameter retrieval. The wavelet-approach is able to manage and to remove the impacts of data discontinuities, cosmic ray events, long-term stellar variability and instrument ageing, short term stellar variability and pulsation and flares among others. (...)
△ Less
Submitted 26 August, 2021;
originally announced August 2021.
-
Bilateral Denoising Diffusion Models
Authors:
Max W. Y. Lam,
Jun Wang,
Rongjie Huang,
Dan Su,
Dong Yu
Abstract:
Denoising diffusion probabilistic models (DDPMs) have emerged as competitive generative models yet brought challenges to efficient sampling. In this paper, we propose novel bilateral denoising diffusion models (BDDMs), which take significantly fewer steps to generate high-quality samples. From a bilateral modeling objective, BDDMs parameterize the forward and reverse processes with a score network…
▽ More
Denoising diffusion probabilistic models (DDPMs) have emerged as competitive generative models yet brought challenges to efficient sampling. In this paper, we propose novel bilateral denoising diffusion models (BDDMs), which take significantly fewer steps to generate high-quality samples. From a bilateral modeling objective, BDDMs parameterize the forward and reverse processes with a score network and a scheduling network, respectively. We show that a new lower bound tighter than the standard evidence lower bound can be derived as a surrogate objective for training the two networks. In particular, BDDMs are efficient, simple-to-train, and capable of further improving any pre-trained DDPM by optimizing the inference noise schedules. Our experiments demonstrated that BDDMs can generate high-fidelity samples with as few as 3 sampling steps and produce comparable or even higher quality samples than DDPMs using 1000 steps with only 16 sampling steps (a 62x speedup).
△ Less
Submitted 14 September, 2021; v1 submitted 26 August, 2021;
originally announced August 2021.
-
37 New Validated Planets in Overlapping K2 Campaigns
Authors:
J. P. de Leon,
J. Livingston,
M. Endl,
W. D. Cochran,
T. Hirano,
R. A. Garcia,
S. Mathur,
K. W. F. Lam,
J. Korth,
A. A. Trani,
F. Dai,
E. Diez Alonso,
A. Castro-Gonzalez,
M. Fridlund,
A. Fukui,
D. Gandolfi,
P. Kabath,
M. Kuzuhara,
R. Luque,
A. B. Savel,
H. Gill,
C. Dressing,
S. Giacalone,
N. Narita,
E. Palle
, et al. (2 additional authors not shown)
Abstract:
We analysed 68 candidate planetary systems first identified during Campaigns 5 and 6 (C5 and C6) of the NASA \textit{K2} mission. We set out to validate these systems by using a suite of follow-up observations, including adaptive optics, speckle imaging, and reconnaissance spectroscopy. The overlap between C5 with C16 and C18, and C6 with C17, yields lightcurves with long baselines that allow us t…
▽ More
We analysed 68 candidate planetary systems first identified during Campaigns 5 and 6 (C5 and C6) of the NASA \textit{K2} mission. We set out to validate these systems by using a suite of follow-up observations, including adaptive optics, speckle imaging, and reconnaissance spectroscopy. The overlap between C5 with C16 and C18, and C6 with C17, yields lightcurves with long baselines that allow us to measure the transit ephemeris very precisely, revisit single transit candidates identified in earlier campaigns, and search for additional transiting planets with longer periods not detectable in previous works. Using \texttt{vespa}, we compute false positive probabilities of less than 1\% for 37 candidates orbiting 29 unique host stars and hence statistically validate them as planets. These planets have a typical size of $2.2R_{\oplus}$ and orbital periods between 1.99 and 52.71 days. We highlight interesting systems including a sub-Neptune with the longest period detected by \textit{K2}, sub-Saturns around F stars, several multi-planetary systems in a variety of architectures. These results show that a wealth of planetary systems still remains in the \textit{K2} data, some of which can be validated using minimal follow-up observations and taking advantage of analyses presented in previous catalogs.
△ Less
Submitted 6 September, 2021; v1 submitted 12 August, 2021;
originally announced August 2021.
-
Advancement of Photospheric Radius Expansion and Clocked Type-I X-Ray Burst Models with the New $^{22}$Mg$(α,p)^{25}$Al Reaction Rate Determined at Gamow Energy
Authors:
J. Hu,
H. Yamaguchi,
Y. H. Lam,
A. Heger,
D. Kahl,
A. M. Jacobs,
Z. Johnston,
S. W. Xu,
N. T. Zhang,
S. B. Ma,
L. H. Ru,
E. Q. Liu,
T. Liu,
S. Hayakawa,
L. Yang,
H. Shimizu,
C. B. Hamill,
A. St J. Murphy,
J. Su,
X. Fang,
K. Y. Chae,
M. S. Kwag,
S. M. Cha,
N. N. Duy,
N. K. Uyen
, et al. (12 additional authors not shown)
Abstract:
We report the first (in)elastic scattering measurement of $^{25}\mathrm{Al}+p$ with the capability to select and measure in a broad energy range the proton resonances in $^{26}$Si contributing to the $^{22}$Mg$(α,p)$ reaction at type I x-ray burst energies. We measured spin-parities of four resonances above the $α$ threshold of $^{26}$Si that are found to strongly impact the $^{22}$Mg$(α,p)$ rate.…
▽ More
We report the first (in)elastic scattering measurement of $^{25}\mathrm{Al}+p$ with the capability to select and measure in a broad energy range the proton resonances in $^{26}$Si contributing to the $^{22}$Mg$(α,p)$ reaction at type I x-ray burst energies. We measured spin-parities of four resonances above the $α$ threshold of $^{26}$Si that are found to strongly impact the $^{22}$Mg$(α,p)$ rate. The new rate advances a state-of-the-art model to remarkably reproduce light curves of the GS 1826$-$24 clocked burster with mean deviation $<9$ % and permits us to discover a strong correlation between the He abundance in the accreting envelope of photospheric radius expansion burster and the dominance of $^{22}$Mg$(α,p)$ branch.
△ Less
Submitted 20 October, 2021; v1 submitted 10 August, 2021;
originally announced August 2021.
-
The Regulated NiCu Cycles with the new $^{57}$Cu(p,$γ$)$^{58}$Zn reaction rate and the Influence on Type-I X-Ray Bursts: GS 1826$-$24 Clocked Burster
Authors:
Yi Hua Lam,
Ning Lu,
Alexander Heger,
Adam Michael Jacobs,
Nadezda A. Smirnova,
Teresa Kurtukian Nieto,
Zac Johnston,
Shigeru Kubono
Abstract:
During the X-ray bursts of GS 1826$-$24, "clocked burster", the nuclear reaction flow that surges through the rapid-proton capture process path has to pass through the NiCu cycles before reaching the ZnGa cycles that moderate the further extent of hydrogen burning in the region above germanium and selenium isotopes. The $^{57}$Cu(p,$γ$)$^{58}$Zn reaction located in the NiCu cycles plays an importa…
▽ More
During the X-ray bursts of GS 1826$-$24, "clocked burster", the nuclear reaction flow that surges through the rapid-proton capture process path has to pass through the NiCu cycles before reaching the ZnGa cycles that moderate the further extent of hydrogen burning in the region above germanium and selenium isotopes. The $^{57}$Cu(p,$γ$)$^{58}$Zn reaction located in the NiCu cycles plays an important role in influencing the burst light curves as found by Cyburt et al. (2016). We deduce the $^{57}$Cu(p,$γ$)$^{58}$Zn reaction rate based on the experimentally determined important nuclear structure information, isobaric-multiplet-mass equation, and large-scale shell model calculations. Based on the isobaric-multiplet-mass equation, we propose a possible order of $1^+_1$ and $2^+_3$ dominant resonance states and constrain the resonance energy of the $1^+_2$ state. The latter reduces the contribution of the $1^+_2$ dominant resonance state. The new reaction rate is up to a factor of four lower than the Forstner et al. (2001) rate recommended by JINA REACLIB v2.2 at the temperature regime sensitive to clocked bursts of GS 1826$-$24. Using the simulation from the one-dimensional implicit hydrodynamic code, KEPLER, to model the thermonuclear X-ray bursts of GS 1826$-$24 clocked burster, we find that the new $^{57}$Cu(p,$γ$)$^{58}$Zn coupled with the latest $^{56}$Ni(p,$γ$)$^{57}$Cu and $^{55}$Ni(p,$γ$)$^{56}$Cu reaction rates redistributes the reaction flow in the NiCu cycles and strongly influences the burst ash composition, whereas the $^{59}$Cu(p,$α$)$^{56}$Ni and $^{59}$Cu(p,$γ$)$^{60}$Zn reactions suppress the influence of the $^{57}$Cu(p,$γ$)$^{58}$Zn reaction and diminish the impact of nuclear reaction flow that by-passes the important $^{56}$Ni waiting point induced by the $^{55}$Ni(p,$γ$)$^{56}$Cu reaction on burst light curve.
△ Less
Submitted 15 January, 2022; v1 submitted 24 July, 2021;
originally announced July 2021.
-
Learning to Rank Question Answer Pairs with Bilateral Contrastive Data Augmentation
Authors:
Yang Deng,
Wenxuan Zhang,
Wai Lam
Abstract:
In this work, we propose a novel and easy-to-apply data augmentation strategy, namely Bilateral Generation (BiG), with a contrastive training objective for improving the performance of ranking question answer pairs with existing labeled data. In specific, we synthesize pseudo-positive QA pairs in contrast to the original negative QA pairs with two pre-trained generation models, one for question ge…
▽ More
In this work, we propose a novel and easy-to-apply data augmentation strategy, namely Bilateral Generation (BiG), with a contrastive training objective for improving the performance of ranking question answer pairs with existing labeled data. In specific, we synthesize pseudo-positive QA pairs in contrast to the original negative QA pairs with two pre-trained generation models, one for question generation, the other for answer generation, which are fine-tuned on the limited positive QA pairs from the original dataset. With the augmented dataset, we design a contrastive training objective for learning to rank question answer pairs. Experimental results on three benchmark datasets show that our method significantly improves the performance of ranking models by making full use of existing labeled data and can be easily applied to different ranking models.
△ Less
Submitted 29 September, 2021; v1 submitted 21 June, 2021;
originally announced June 2021.
-
Near-critical avalanches in 2D frozen percolation and forest fires
Authors:
Wai-Kit Lam,
Pierre Nolin
Abstract:
We study two closely related processes on the triangular lattice: frozen percolation, where connected components of occupied vertices freeze (they stop growing) as soon as they contain at least $N$ vertices, and forest fire processes, where connected components burn (they become entirely vacant) at rate $ζ> 0$. In this paper, we prove that when the density of occupied sites approaches the critical…
▽ More
We study two closely related processes on the triangular lattice: frozen percolation, where connected components of occupied vertices freeze (they stop growing) as soon as they contain at least $N$ vertices, and forest fire processes, where connected components burn (they become entirely vacant) at rate $ζ> 0$. In this paper, we prove that when the density of occupied sites approaches the critical threshold for Bernoulli percolation, both processes display a striking phenomenon: the appearance of near-critical "avalanches".
More specifically, we analyze the avalanches, all the way up to the natural characteristic scale of each model, which constitutes an important step toward understanding the self-organized critical behavior of such processes. For frozen percolation, we show in particular that the number of frozen clusters surrounding a given vertex is asymptotically equivalent to $(\log(96/5))^{-1} \log \log N$ as $N \to \infty$. A similar mechanism underlies forest fires, enabling us to obtain an analogous result for these processes, but with substantially more work: the number of burnt clusters is equivalent to $(\log(96/41))^{-1} \log \log (ζ^{-1})$ as $ζ\searrow 0$. Moreover, almost all of these clusters have a volume $ζ^{- 91/55 + o(1)}$.
For forest fires, the percolation process with impurities introduced in arXiv:1810.08181 plays a crucial role in our proofs, and we extend the results in that paper, up to a positive density of impurities. In addition, we develop a novel exploration procedure to couple full-plane forest fires with processes in finite but large enough (compared to the characteristic scale) domains.
△ Less
Submitted 3 November, 2021; v1 submitted 18 June, 2021;
originally announced June 2021.
-
A General Purpose Transpiler for Fully Homomorphic Encryption
Authors:
Shruthi Gorantala,
Rob Springer,
Sean Purser-Haskell,
William Lam,
Royce Wilson,
Asra Ali,
Eric P. Astor,
Itai Zukerman,
Sam Ruth,
Christoph Dibak,
Phillipp Schoppmann,
Sasha Kulankhina,
Alain Forget,
David Marn,
Cameron Tew,
Rafael Misoczki,
Bernat Guillen,
Xinyu Ye,
Dennis Kraft,
Damien Desfontaines,
Aishe Krishnamurthy,
Miguel Guevara,
Irippuge Milinda Perera,
Yurii Sushko,
Bryant Gipson
Abstract:
Fully homomorphic encryption (FHE) is an encryption scheme which enables computation on encrypted data without revealing the underlying data. While there have been many advances in the field of FHE, developing programs using FHE still requires expertise in cryptography. In this white paper, we present a fully homomorphic encryption transpiler that allows developers to convert high-level code (e.g.…
▽ More
Fully homomorphic encryption (FHE) is an encryption scheme which enables computation on encrypted data without revealing the underlying data. While there have been many advances in the field of FHE, developing programs using FHE still requires expertise in cryptography. In this white paper, we present a fully homomorphic encryption transpiler that allows developers to convert high-level code (e.g., C++) that works on unencrypted data into high-level code that operates on encrypted data. Thus, our transpiler makes transformations possible on encrypted data.
Our transpiler builds on Google's open-source XLS SDK (https://github.com/google/xls) and uses an off-the-shelf FHE library, TFHE (https://tfhe.github.io/tfhe/), to perform low-level FHE operations. The transpiler design is modular, which means the underlying FHE library as well as the high-level input and output languages can vary. This modularity will help accelerate FHE research by providing an easy way to compare arbitrary programs in different FHE schemes side-by-side. We hope this lays the groundwork for eventual easy adoption of FHE by software developers. As a proof-of-concept, we are releasing an experimental transpiler (https://github.com/google/fully-homomorphic-encryption/tree/main/transpiler) as open-source software.
△ Less
Submitted 15 June, 2021;
originally announced June 2021.
-
Raw Waveform Encoder with Multi-Scale Globally Attentive Locally Recurrent Networks for End-to-End Speech Recognition
Authors:
Max W. Y. Lam,
Jun Wang,
Chao Weng,
Dan Su,
Dong Yu
Abstract:
End-to-end speech recognition generally uses hand-engineered acoustic features as input and excludes the feature extraction module from its joint optimization. To extract learnable and adaptive features and mitigate information loss, we propose a new encoder that adopts globally attentive locally recurrent (GALR) networks and directly takes raw waveform as input. We observe improved ASR performanc…
▽ More
End-to-end speech recognition generally uses hand-engineered acoustic features as input and excludes the feature extraction module from its joint optimization. To extract learnable and adaptive features and mitigate information loss, we propose a new encoder that adopts globally attentive locally recurrent (GALR) networks and directly takes raw waveform as input. We observe improved ASR performance and robustness by applying GALR on different window lengths to aggregate fine-grain temporal information into multi-scale acoustic features. Experiments are conducted on a benchmark dataset AISHELL-2 and two large-scale Mandarin speech corpus of 5,000 hours and 21,000 hours. With faster speed and comparable model size, our proposed multi-scale GALR waveform encoder achieved consistent character error rate reductions (CERRs) from 7.9% to 28.1% relative over strong baselines, including Conformer and TDNN-Conformer. In particular, our approach demonstrated notable robustness than the traditional handcrafted features and outperformed the baseline MFCC-based TDNN-Conformer model by a 15.2% CERR on a music-mixed real-world speech test set.
△ Less
Submitted 8 June, 2021;
originally announced June 2021.
-
Conversational Fashion Image Retrieval via Multiturn Natural Language Feedback
Authors:
Yifei Yuan,
Wai Lam
Abstract:
We study the task of conversational fashion image retrieval via multiturn natural language feedback. Most previous studies are based on single-turn settings. Existing models on multiturn conversational fashion image retrieval have limitations, such as employing traditional models, and leading to ineffective performance. We propose a novel framework that can effectively handle conversational fashio…
▽ More
We study the task of conversational fashion image retrieval via multiturn natural language feedback. Most previous studies are based on single-turn settings. Existing models on multiturn conversational fashion image retrieval have limitations, such as employing traditional models, and leading to ineffective performance. We propose a novel framework that can effectively handle conversational fashion image retrieval with multiturn natural language feedback texts. One characteristic of the framework is that it searches for candidate images based on exploitation of the encoded reference image and feedback text information together with the conversation history. Furthermore, the image fashion attribute information is leveraged via a mutual attention strategy. Since there is no existing fashion dataset suitable for the multiturn setting of our task, we derive a large-scale multiturn fashion dataset via additional manual annotation efforts on an existing single-turn dataset. The experiments show that our proposed model significantly outperforms existing state-of-the-art methods.
△ Less
Submitted 8 June, 2021;
originally announced June 2021.
-
Dynamic Semantic Graph Construction and Reasoning for Explainable Multi-hop Science Question Answering
Authors:
Weiwen Xu,
Huihui Zhang,
Deng Cai,
Wai Lam
Abstract:
Knowledge retrieval and reasoning are two key stages in multi-hop question answering (QA) at web scale. Existing approaches suffer from low confidence when retrieving evidence facts to fill the knowledge gap and lack transparent reasoning process. In this paper, we propose a new framework to exploit more valid facts while obtaining explainability for multi-hop QA by dynamically constructing a sema…
▽ More
Knowledge retrieval and reasoning are two key stages in multi-hop question answering (QA) at web scale. Existing approaches suffer from low confidence when retrieving evidence facts to fill the knowledge gap and lack transparent reasoning process. In this paper, we propose a new framework to exploit more valid facts while obtaining explainability for multi-hop QA by dynamically constructing a semantic graph and reasoning over it. We employ Abstract Meaning Representation (AMR) as semantic graph representation. Our framework contains three new ideas: (a) {\tt AMR-SG}, an AMR-based Semantic Graph, constructed by candidate fact AMRs to uncover any hop relations among question, answer and multiple facts. (b) A novel path-based fact analytics approach exploiting {\tt AMR-SG} to extract active facts from a large fact pool to answer questions. (c) A fact-level relation modeling leveraging graph convolution network (GCN) to guide the reasoning process. Results on two scientific multi-hop QA datasets show that we can surpass recent approaches including those using additional knowledge graphs while maintaining high explainability on OpenBookQA and achieve a new state-of-the-art result on ARC-Challenge in a computationally practicable setting.
△ Less
Submitted 25 May, 2021;
originally announced May 2021.
-
Neural Machine Translation with Monolingual Translation Memory
Authors:
Deng Cai,
Yan Wang,
Huayang Li,
Wai Lam,
Lemao Liu
Abstract:
Prior work has proved that Translation memory (TM) can boost the performance of Neural Machine Translation (NMT). In contrast to existing work that uses bilingual corpus as TM and employs source-side similarity search for memory retrieval, we propose a new framework that uses monolingual memory and performs learnable memory retrieval in a cross-lingual manner. Our framework has unique advantages.…
▽ More
Prior work has proved that Translation memory (TM) can boost the performance of Neural Machine Translation (NMT). In contrast to existing work that uses bilingual corpus as TM and employs source-side similarity search for memory retrieval, we propose a new framework that uses monolingual memory and performs learnable memory retrieval in a cross-lingual manner. Our framework has unique advantages. First, the cross-lingual memory retriever allows abundant monolingual data to be TM. Second, the memory retriever and NMT model can be jointly optimized for the ultimate translation goal. Experiments show that the proposed method obtains substantial improvements. Remarkably, it even outperforms strong TM-augmented NMT baselines using bilingual TM. Owning to the ability to leverage monolingual data, our model also demonstrates effectiveness in low-resource and domain adaptation scenarios.
△ Less
Submitted 2 June, 2021; v1 submitted 24 May, 2021;
originally announced May 2021.
-
Unified Conversational Recommendation Policy Learning via Graph-based Reinforcement Learning
Authors:
Yang Deng,
Yaliang Li,
Fei Sun,
Bolin Ding,
Wai Lam
Abstract:
Conversational recommender systems (CRS) enable the traditional recommender systems to explicitly acquire user preferences towards items and attributes through interactive conversations. Reinforcement learning (RL) is widely adopted to learn conversational recommendation policies to decide what attributes to ask, which items to recommend, and when to ask or recommend, at each conversation turn. Ho…
▽ More
Conversational recommender systems (CRS) enable the traditional recommender systems to explicitly acquire user preferences towards items and attributes through interactive conversations. Reinforcement learning (RL) is widely adopted to learn conversational recommendation policies to decide what attributes to ask, which items to recommend, and when to ask or recommend, at each conversation turn. However, existing methods mainly target at solving one or two of these three decision-making problems in CRS with separated conversation and recommendation components, which restrict the scalability and generality of CRS and fall short of preserving a stable training procedure. In the light of these challenges, we propose to formulate these three decision-making problems in CRS as a unified policy learning task. In order to systematically integrate conversation and recommendation components, we develop a dynamic weighted graph based RL method to learn a policy to select the action at each conversation turn, either asking an attribute or recommending items. Further, to deal with the sample efficiency issue, we propose two action selection strategies for reducing the candidate action space according to the preference and entropy information. Experimental results on two benchmark CRS datasets and a real-world E-Commerce application show that the proposed method not only significantly outperforms state-of-the-art methods but also enhances the scalability and stability of CRS.
△ Less
Submitted 20 May, 2021;
originally announced May 2021.
-
Hot planets around cool stars -- two short-period mini-Neptunes transiting the late K-dwarf TOI-1260
Authors:
I. Y. Georgieva,
C. M. Persson,
O. Barragán,
G. Nowak,
M. Fridlund,
D. Locci,
E. Palle,
R. Luque,
I. Carleo,
D. Gandolfi,
S. R. Kane,
J. Korth,
K. G. Stassun,
J. Livingston,
E. C. Matthews,
K. A. Collins,
S. B. Howell,
L. M. Serrano,
S. Albrecht,
A. Bieryla,
C. E. Brasseur,
D. Ciardi,
W. D. Cochran,
K. D. Colon,
I. J. M. Crossfield
, et al. (34 additional authors not shown)
Abstract:
We present the discovery and characterization of two sub-Neptunes in close orbits, as well as a tentative outer planet of a similar size, orbiting TOI-1260 - a low metallicity K6V dwarf star. Photometry from TESS yields radii of $R_{\rm b} = 2.33 \pm 0.10$ $R_{\oplus}$ and $R_{\rm c} = 2.82 \pm 0.15$ $R_{\oplus}$, and periods of 3.13 and 7.49 days for TOI-1260b and TOI-1260c, respectively. We comb…
▽ More
We present the discovery and characterization of two sub-Neptunes in close orbits, as well as a tentative outer planet of a similar size, orbiting TOI-1260 - a low metallicity K6V dwarf star. Photometry from TESS yields radii of $R_{\rm b} = 2.33 \pm 0.10$ $R_{\oplus}$ and $R_{\rm c} = 2.82 \pm 0.15$ $R_{\oplus}$, and periods of 3.13 and 7.49 days for TOI-1260b and TOI-1260c, respectively. We combined the TESS data with a series of ground-based follow-up observations to characterize the planetary system. From HARPS-N high-precision radial velocities we obtain $M_{\rm b} = 8.61_{ - 1.46 } ^ { + 1.36 }$ $M_{\oplus}$ and $M_{\rm c} = 11.84_{ - 3.23 } ^ { + 3.38 }$ $M_{\oplus}$. The star is moderately active with a complex activity pattern, which necessitated the use of Gaussian process regression for both the light curve detrending and the radial velocity modelling, in the latter case guided by suitable activity indicators. We successfully disentangle the stellar-induced signal from the planetary signals, underlining the importance and usefulness of the Gaussian Process approach. We test the system's stability against atmospheric photoevaporation and find that the TOI-1260 planets are classic examples of the structure and composition ambiguity typical for the $2-3$ $R_{\oplus}$ range.
△ Less
Submitted 4 August, 2021; v1 submitted 12 April, 2021;
originally announced April 2021.
-
Contextualized Knowledge-aware Attentive Neural Network: Enhancing Answer Selection with Knowledge
Authors:
Yang Deng,
Yuexiang Xie,
Yaliang Li,
Min Yang,
Wai Lam,
Ying Shen
Abstract:
Answer selection, which is involved in many natural language processing applications such as dialog systems and question answering (QA), is an important yet challenging task in practice, since conventional methods typically suffer from the issues of ignoring diverse real-world background knowledge. In this paper, we extensively investigate approaches to enhancing the answer selection model with ex…
▽ More
Answer selection, which is involved in many natural language processing applications such as dialog systems and question answering (QA), is an important yet challenging task in practice, since conventional methods typically suffer from the issues of ignoring diverse real-world background knowledge. In this paper, we extensively investigate approaches to enhancing the answer selection model with external knowledge from knowledge graph (KG). First, we present a context-knowledge interaction learning framework, Knowledge-aware Neural Network (KNN), which learns the QA sentence representations by considering a tight interaction with the external knowledge from KG and the textual information. Then, we develop two kinds of knowledge-aware attention mechanism to summarize both the context-based and knowledge-based interactions between questions and answers. To handle the diversity and complexity of KG information, we further propose a Contextualized Knowledge-aware Attentive Neural Network (CKANN), which improves the knowledge representation learning with structure information via a customized Graph Convolutional Network (GCN) and comprehensively learns context-based and knowledge-based sentence representation via the multi-view knowledge-aware attention mechanism. We evaluate our method on four widely-used benchmark QA datasets, including WikiQA, TREC QA, InsuranceQA and Yahoo QA. Results verify the benefits of incorporating external knowledge from KG, and show the robust superiority and extensive applicability of our method.
△ Less
Submitted 12 April, 2021;
originally announced April 2021.
-
Ultra-Thin Lubricant-Infused Vertical Graphene Nanoscaffolds for High-Performance Dropwise Condensation
Authors:
Abinash Tripathy,
Cheuk Wing Edmond Lam,
Diana Davila,
Matteo Donati,
Athanasios Milionis,
Chander Shekhar Sharma,
Dimos Poulikakos
Abstract:
Lubricant-infused surfaces (LIS) are highly efficient in repelling water and constitute a very promising family of materials for condensation processes occurring in a broad range of energy applications. However, the performance of LIS in such processes is limited by the inherent thermal resistance imposed by the thickness of the lubricant and supporting surface structure, as well as by the gradual…
▽ More
Lubricant-infused surfaces (LIS) are highly efficient in repelling water and constitute a very promising family of materials for condensation processes occurring in a broad range of energy applications. However, the performance of LIS in such processes is limited by the inherent thermal resistance imposed by the thickness of the lubricant and supporting surface structure, as well as by the gradual depletion of the lubricant over time. Here we present a remarkable, ultra-thin (~70 nm) and conductive LIS architecture, obtained by infusing lubricant into a vertically grown graphene nanoscaffold on copper. The ultra-thin nature of the scaffold, combined with the high in-plane thermal conductivity of graphene, drastically minimize earlier limitations, effectively doubling the heat transfer performance compared to a state-of-the-art CuO LIS surface. We show that the effect of the thermal resistance to the heat transfer performance of a LIS surface, although often overlooked, can be so detrimental that a simple nanostructured CuO surface can outperform a CuO LIS surface, despite film condensation on the former. The present vertical graphene LIS is also found to be resistant to lubricant depletion, maintaining stable dropwise condensation for at least ~7 hours with no significant change of advancing contact angle and contact angle hysteresis. The lubricant consumed by the vertical graphene LIS is 52.6% less than the existing state-of-the-art CuO LIS, making also the fabrication process more economical.
△ Less
Submitted 7 April, 2021;
originally announced April 2021.