subscribe to arXiv mailings

WildGuard: Open One-Stop Moderation Tools for Safety Risks, Jailbreaks, and Refusals of LLMs

Authors: Seungju Han, Kavel Rao, Allyson Ettinger, Liwei Jiang, Bill Yuchen Lin, Nathan Lambert, Yejin Choi, Nouha Dziri

Abstract: We introduce WildGuard -- an open, light-weight moderation tool for LLM safety that achieves three goals: (1) identifying malicious intent in user prompts, (2) detecting safety risks of model responses, and (3) determining model refusal rate. Together, WildGuard serves the increasing needs for automatic safety moderation and evaluation of LLM interactions, providing a one-stop tool with enhanced a… ▽ More We introduce WildGuard -- an open, light-weight moderation tool for LLM safety that achieves three goals: (1) identifying malicious intent in user prompts, (2) detecting safety risks of model responses, and (3) determining model refusal rate. Together, WildGuard serves the increasing needs for automatic safety moderation and evaluation of LLM interactions, providing a one-stop tool with enhanced accuracy and broad coverage across 13 risk categories. While existing open moderation tools such as Llama-Guard2 score reasonably well in classifying straightforward model interactions, they lag far behind a prompted GPT-4, especially in identifying adversarial jailbreaks and in evaluating models' refusals, a key measure for evaluating safety behaviors in model responses. To address these challenges, we construct WildGuardMix, a large-scale and carefully balanced multi-task safety moderation dataset with 92K labeled examples that cover vanilla (direct) prompts and adversarial jailbreaks, paired with various refusal and compliance responses. WildGuardMix is a combination of WildGuardTrain, the training data of WildGuard, and WildGuardTest, a high-quality human-annotated moderation test set with 5K labeled items covering broad risk scenarios. Through extensive evaluations on WildGuardTest and ten existing public benchmarks, we show that WildGuard establishes state-of-the-art performance in open-source safety moderation across all the three tasks compared to ten strong existing open-source moderation models (e.g., up to 26.4% improvement on refusal detection). Importantly, WildGuard matches and sometimes exceeds GPT-4 performance (e.g., up to 3.9% improvement on prompt harmfulness identification). WildGuard serves as a highly effective safety moderator in an LLM interface, reducing the success rate of jailbreak attacks from 79.8% to 2.4%. △ Less

Submitted 9 July, 2024; v1 submitted 26 June, 2024; originally announced June 2024.

Comments: First two authors contributed equally. Third and fourth authors contributed equally

arXiv:2406.18362 [pdf, other]

Non-Markovian Quantum Exceptional Points

Authors: Jhen-Dong Lin, Po-Chen Kuo, Neill Lambert, Adam Miranowicz, Franco Nori, Yueh-Nan Chen

Abstract: Exceptional points (EPs) are singularities in the spectra of non-Hermitian operators, where eigenvalues and eigenvectors coalesce. Recently, open quantum systems have been increasingly explored as EP testbeds due to their natural non-Hermitian nature. However, existing works mostly focus on the Markovian limit, leaving a gap in understanding EPs in the non-Markovian regime. In this work, we addres… ▽ More Exceptional points (EPs) are singularities in the spectra of non-Hermitian operators, where eigenvalues and eigenvectors coalesce. Recently, open quantum systems have been increasingly explored as EP testbeds due to their natural non-Hermitian nature. However, existing works mostly focus on the Markovian limit, leaving a gap in understanding EPs in the non-Markovian regime. In this work, we address this gap by proposing a theoretical framework based on two numerically exact descriptions of non-Markovian dynamics: the pseudomode mapping and the hierarchical equations of motion. The proposed framework enables conventional spectral analysis for EP identification, establishing direct links between EPs and dynamic manifestations in open systems, such as non-exponential decays and enhanced sensitivity to external perturbations. We unveil pure non-Markovian EPs that are unobservable in the Markovian limit. Remarkably, the EP aligns with the Markovian-to-non-Markovian transition, and the EP condition is adjustable by modifying environmental spectral properties. Moreover, we show that structured environments can elevate EP order, thereby enhancing the system's sensitivity. These findings lay a theoretical foundation and open new avenues for non-Markovian reservoir engineering and non-Hermitian physics. △ Less

Submitted 26 June, 2024; originally announced June 2024.

Comments: 10+5 pages, 2 figures

arXiv:2406.13956 [pdf]

Orbit symmetry breaking in MXene implements enhanced soft bioelectronic implants

Authors: Yizhang Wu, Yuan Li, Yihan Liu, Dashuai Zhu, Sicheng Xing, Noah Lambert, Hannah Weisbecker, Siyuan Liu, Brayden Davis, Lin Zhang, Meixiang Wang, Gongkai Yuan, Chris Zhoufan You, Anran Zhang, Cate Duncan, Wanrong Xie, Yihang Wang, Yong Wang, Sreya Kanamurlapudi, Garcia-Guzman Evert, Arjun Putcha, Michael D. Dickey, Ke Huang, Wubin Bai

Abstract: Bioelectronic implants with soft mechanics, biocompatibility, and excellent electrical performance enable biomedical implants to record electrophysiological signals and execute interventions within internal organs, promising to revolutionize the diagnosing, monitoring, and treatment of various pathological conditions. However, challenges remain in improving excessive impedance at the bioelectronic… ▽ More Bioelectronic implants with soft mechanics, biocompatibility, and excellent electrical performance enable biomedical implants to record electrophysiological signals and execute interventions within internal organs, promising to revolutionize the diagnosing, monitoring, and treatment of various pathological conditions. However, challenges remain in improving excessive impedance at the bioelectronic-tissue interface and thus the efficacy of electrophysiological signaling and intervention. Here, we devise orbit symmetry breaking in MXene (a low-cost scalability, biocompatible, and conductive 2D layered material, that we refer to as OBXene), that exhibits low bioelectronic-tissue impedance, originating from the out-of-plane charge transfer. Furthermore, the Schottky-induced piezoelectricity stemming from the asymmetric orbital configuration of OBXene facilitates interlayered charge transport in the device. In this study, we report an OBXene-based cardiac patch applied on the left ventricular epicardium of both rodent and porcine models to enable spatiotemporal epicardium mapping and pacing, while coupling the wireless and battery-free operation for long-term real-time recording and closed-loop stimulation. △ Less

Submitted 19 June, 2024; originally announced June 2024.

arXiv:2406.09279 [pdf, other]

Unpacking DPO and PPO: Disentangling Best Practices for Learning from Preference Feedback

Authors: Hamish Ivison, Yizhong Wang, Jiacheng Liu, Zeqiu Wu, Valentina Pyatkin, Nathan Lambert, Noah A. Smith, Yejin Choi, Hannaneh Hajishirzi

Abstract: Learning from preference feedback has emerged as an essential step for improving the generation quality and performance of modern language models (LMs). Despite its widespread use, the way preference-based learning is applied varies wildly, with differing data, learning algorithms, and evaluations used, making disentangling the impact of each aspect difficult. In this work, we identify four core a… ▽ More Learning from preference feedback has emerged as an essential step for improving the generation quality and performance of modern language models (LMs). Despite its widespread use, the way preference-based learning is applied varies wildly, with differing data, learning algorithms, and evaluations used, making disentangling the impact of each aspect difficult. In this work, we identify four core aspects of preference-based learning: preference data, learning algorithm, reward model, and policy training prompts, systematically investigate the impact of these components on downstream model performance, and suggest a recipe for strong learning for preference feedback. Our findings indicate that all aspects are important for performance, with better preference data leading to the largest improvements, followed by the choice of learning algorithm, the use of improved reward models, and finally the use of additional unlabeled prompts for policy training. Notably, PPO outperforms DPO by up to 2.5% in math and 1.2% in general domains. High-quality preference data leads to improvements of up to 8% in instruction following and truthfulness. Despite significant gains of up to 5% in mathematical evaluation when scaling up reward models, we surprisingly observe marginal improvements in other categories. We publicly release the code used for training (https://github.com/hamishivi/EasyLM) and evaluating (https://github.com/allenai/open-instruct) our models, along with the models and datasets themselves (https://huggingface.co/collections/allenai/tulu-v25-suite-66676520fd578080e126f618). △ Less

Submitted 13 June, 2024; originally announced June 2024.

Comments: Preprint

arXiv:2405.15802 [pdf]

Towards a Framework for Openness in Foundation Models: Proceedings from the Columbia Convening on Openness in Artificial Intelligence

Authors: Adrien Basdevant, Camille François, Victor Storchan, Kevin Bankston, Ayah Bdeir, Brian Behlendorf, Merouane Debbah, Sayash Kapoor, Yann LeCun, Mark Surman, Helen King-Turvey, Nathan Lambert, Stefano Maffulli, Nik Marda, Govind Shivkumar, Justine Tunney

Abstract: Over the past year, there has been a robust debate about the benefits and risks of open sourcing foundation models. However, this discussion has often taken place at a high level of generality or with a narrow focus on specific technical attributes. In part, this is because defining open source for foundation models has proven tricky, given its significant differences from traditional software dev… ▽ More Over the past year, there has been a robust debate about the benefits and risks of open sourcing foundation models. However, this discussion has often taken place at a high level of generality or with a narrow focus on specific technical attributes. In part, this is because defining open source for foundation models has proven tricky, given its significant differences from traditional software development. In order to inform more practical and nuanced decisions about opening AI systems, including foundation models, this paper presents a framework for grappling with openness across the AI stack. It summarizes previous work on this topic, analyzes the various potential reasons to pursue openness, and outlines how openness varies in different parts of the AI stack, both at the model and at the system level. In doing so, its authors hope to provide a common descriptive framework to deepen a nuanced and rigorous understanding of openness in AI and enable further work around definitions of openness and safety in AI. △ Less

Submitted 17 May, 2024; originally announced May 2024.

arXiv:2405.13276 [pdf, other]

Lee-Yang theory of the superradiant phase transition in the open Dicke model

Authors: Fredrik Brange, Neill Lambert, Franco Nori, Christian Flindt

Abstract: The Dicke model describes an ensemble of two-level atoms that are coupled to a confined light mode of an optical cavity. Above a critical coupling, the cavity becomes macroscopically occupied, and the system enters the superradiant phase. This phase transition can be observed by detecting the photons that are emitted from the cavity; however, it only becomes apparent in the limit of long observati… ▽ More The Dicke model describes an ensemble of two-level atoms that are coupled to a confined light mode of an optical cavity. Above a critical coupling, the cavity becomes macroscopically occupied, and the system enters the superradiant phase. This phase transition can be observed by detecting the photons that are emitted from the cavity; however, it only becomes apparent in the limit of long observation times, while actual experiments are of a finite duration. To circumvent this problem, we here make use of recent advances in Lee-Yang theories of phase transitions to show that the superradiant phase transition can be inferred from the factorial cumulants of the photon emission statistics obtained during a finite measurement time. Specifically, from the factorial cumulants, we can determine the complex singularities of generating functions that describe the photon emission statistics, and by extrapolating their positions to the long-time limit, one can detect the superradiant phase transition. We also show that the convergence points determine the tails of the large-deviation statistics of the photon current. Our work demonstrates how phase transitions in the Dicke model and in other quantum many-body systems can be detected from measurements of a finite duration. △ Less

Submitted 21 May, 2024; originally announced May 2024.

Comments: 10 pages, 4 figures

arXiv:2405.06552 [pdf, other]

Non-Relativistic Intersecting Branes, Newton-Cartan Geometry and AdS/CFT

Authors: Neil Lambert, Joseph Smith

Abstract: We discuss non-relativistic variants of four-dimensional ${\cal N}$=4 super-Yang-Mills theory obtained from generalised Newton-Cartan geometric limits of D3-branes in ten-dimensional spacetime. We argue that the natural interpretation of these limits is that they correspond to non-relativistic D1-branes or D3-branes intersecting the original D3-branes. The resulting gauge theories have dynamics th… ▽ More We discuss non-relativistic variants of four-dimensional ${\cal N}$=4 super-Yang-Mills theory obtained from generalised Newton-Cartan geometric limits of D3-branes in ten-dimensional spacetime. We argue that the natural interpretation of these limits is that they correspond to non-relativistic D1-branes or D3-branes intersecting the original D3-branes. The resulting gauge theories have dynamics that reduce to quantum mechanics on monopole moduli space or two-dimensional sigma-models on Hitchin moduli space respectively. We show that these theories possess interesting infinite-dimensional symmetries and we discuss the dual $AdS$ geometries. △ Less

Submitted 28 June, 2024; v1 submitted 10 May, 2024; originally announced May 2024.

Comments: 43 pages; v2: improved discussion of S-duality, typos corrected, version accepted for publication in JHEP

arXiv:2405.01511 [pdf, other]

D2PO: Discriminator-Guided DPO with Response Evaluation Models

Authors: Prasann Singhal, Nathan Lambert, Scott Niekum, Tanya Goyal, Greg Durrett

Abstract: Varied approaches for aligning language models have been proposed, including supervised fine-tuning, RLHF, and direct optimization methods such as DPO. Although DPO has rapidly gained popularity due to its straightforward training process and competitive results, there is an open question of whether there remain practical advantages of using a discriminator, like a reward model, to evaluate respon… ▽ More Varied approaches for aligning language models have been proposed, including supervised fine-tuning, RLHF, and direct optimization methods such as DPO. Although DPO has rapidly gained popularity due to its straightforward training process and competitive results, there is an open question of whether there remain practical advantages of using a discriminator, like a reward model, to evaluate responses. We propose D2PO, discriminator-guided DPO, an approach for the online setting where preferences are being collected throughout learning. As we collect gold preferences, we use these not only to train our policy, but to train a discriminative response evaluation model to silver-label even more synthetic data for policy training. We explore this approach across a set of diverse tasks, including a realistic chat setting, we find that our approach leads to higher-quality outputs compared to DPO with the same data budget, and greater efficiency in terms of preference data requirements. Furthermore, we show conditions under which silver labeling is most helpful: it is most effective when training the policy with DPO, outperforming traditional PPO, and benefits from maintaining a separate discriminator from the policy model. △ Less

Submitted 2 May, 2024; originally announced May 2024.

Comments: 20 pages, 12 figures

arXiv:2404.10271 [pdf, other]

Social Choice Should Guide AI Alignment in Dealing with Diverse Human Feedback

Authors: Vincent Conitzer, Rachel Freedman, Jobst Heitzig, Wesley H. Holliday, Bob M. Jacobs, Nathan Lambert, Milan Mossé, Eric Pacuit, Stuart Russell, Hailey Schoelkopf, Emanuel Tewolde, William S. Zwicker

Abstract: Foundation models such as GPT-4 are fine-tuned to avoid unsafe or otherwise problematic behavior, such as helping to commit crimes or producing racist text. One approach to fine-tuning, called reinforcement learning from human feedback, learns from humans' expressed preferences over multiple outputs. Another approach is constitutional AI, in which the input from humans is a list of high-level prin… ▽ More Foundation models such as GPT-4 are fine-tuned to avoid unsafe or otherwise problematic behavior, such as helping to commit crimes or producing racist text. One approach to fine-tuning, called reinforcement learning from human feedback, learns from humans' expressed preferences over multiple outputs. Another approach is constitutional AI, in which the input from humans is a list of high-level principles. But how do we deal with potentially diverging input from humans? How can we aggregate the input into consistent data about "collective" preferences or otherwise use it to make collective choices about model behavior? In this paper, we argue that the field of social choice is well positioned to address these questions, and we discuss ways forward for this agenda, drawing on discussions in a recent workshop on Social Choice for AI Ethics and Safety held in Berkeley, CA, USA in December 2023. △ Less

Submitted 4 June, 2024; v1 submitted 15 April, 2024; originally announced April 2024.

Comments: 15 pages, 4 figures

MSC Class: 68T01; 68T50; 91B14; 91B12 ACM Class: I.2.0; I.2.7; K.4.2; I.2.m; J.4

arXiv:2403.14455 [pdf, other]

Non-Markovian skin effect

Authors: Po-Chen Kuo, Shen-Liang Yang, Neill Lambert, Jhen-Dong Lin, Yi-Te Huang, Franco Nori, Yueh-Nan Chen

Abstract: The Liouvillian skin effect and the non-Hermitian skin effect have both been used to explain the localization of eigenmodes near system boundaries, though the former is arguably more accurate in some regimes due to its incorporation of quantum jumps. However, these frameworks predominantly focus on weak Markovian interactions, neglecting the potentially crucial role of memory effects. To address t… ▽ More The Liouvillian skin effect and the non-Hermitian skin effect have both been used to explain the localization of eigenmodes near system boundaries, though the former is arguably more accurate in some regimes due to its incorporation of quantum jumps. However, these frameworks predominantly focus on weak Markovian interactions, neglecting the potentially crucial role of memory effects. To address this, we investigate, utilizing the powerful hierarchical equations of motion method, how a non-Markovian environment can modify the Liouvillian skin effect. We demonstrate that a non-Markovian environment can induce not only a ``thick skin effect", where the skin mode broadens and shifts into the bulk, but also skin-mode coherence, leading to the coherence-delocalization and oscillatory relaxation with a characteristic linear scaling with system size. Remarkably, both the skin-mode and steady-state coherence exhibit resistance to decoherence from additional environmental noise. These findings highlight the profound impact of system-bath correlations on relaxation and localization, revealing unique phenomena beyond conventional Markovian approximations. △ Less

Submitted 21 March, 2024; originally announced March 2024.

Comments: 15 pages, 9 figures

arXiv:2403.13787 [pdf, other]

RewardBench: Evaluating Reward Models for Language Modeling

Authors: Nathan Lambert, Valentina Pyatkin, Jacob Morrison, LJ Miranda, Bill Yuchen Lin, Khyathi Chandu, Nouha Dziri, Sachin Kumar, Tom Zick, Yejin Choi, Noah A. Smith, Hannaneh Hajishirzi

Abstract: Reward models (RMs) are at the crux of successfully using RLHF to align pretrained models to human preferences, yet there has been relatively little study that focuses on evaluation of those models. Evaluating reward models presents an opportunity to understand the opaque technologies used for alignment of language models and which values are embedded in them. Resources for reward model training a… ▽ More Reward models (RMs) are at the crux of successfully using RLHF to align pretrained models to human preferences, yet there has been relatively little study that focuses on evaluation of those models. Evaluating reward models presents an opportunity to understand the opaque technologies used for alignment of language models and which values are embedded in them. Resources for reward model training and understanding are sparse in the nascent open-source community around them. To enhance scientific understanding of reward models, we present RewardBench, a benchmark dataset and code-base for evaluation. The RewardBench dataset is a collection of prompt-chosen-rejected trios spanning chat, reasoning, and safety, to benchmark how reward models perform on challenging, structured and out-of-distribution queries. We create specific comparison datasets for RMs that have subtle, but verifiable reasons (e.g. bugs, incorrect facts) why one answer should be preferred to another. On the RewardBench leaderboard, we evaluate reward models trained with a variety of methods, such as the direct MLE training of classifiers and the implicit reward modeling of Direct Preference Optimization (DPO). We present many findings on propensity for refusals, reasoning limitations, and instruction following shortcomings of various reward models towards a better understanding of the RLHF process. △ Less

Submitted 8 June, 2024; v1 submitted 20 March, 2024; originally announced March 2024.

Comments: 44 pages, 19 figures, 12 tables

arXiv:2402.16827 [pdf, other]

A Survey on Data Selection for Language Models

Authors: Alon Albalak, Yanai Elazar, Sang Michael Xie, Shayne Longpre, Nathan Lambert, Xinyi Wang, Niklas Muennighoff, Bairu Hou, Liangming Pan, Haewon Jeong, Colin Raffel, Shiyu Chang, Tatsunori Hashimoto, William Yang Wang

Abstract: A major factor in the recent success of large language models is the use of enormous and ever-growing text datasets for unsupervised pre-training. However, naively training a model on all available data may not be optimal (or feasible), as the quality of available text data can vary. Filtering out data can also decrease the carbon footprint and financial costs of training models by reducing the am… ▽ More A major factor in the recent success of large language models is the use of enormous and ever-growing text datasets for unsupervised pre-training. However, naively training a model on all available data may not be optimal (or feasible), as the quality of available text data can vary. Filtering out data can also decrease the carbon footprint and financial costs of training models by reducing the amount of training required. Data selection methods aim to determine which candidate data points to include in the training dataset and how to appropriately sample from the selected data points. The promise of improved data selection methods has caused the volume of research in the area to rapidly expand. However, because deep learning is mostly driven by empirical evidence and experimentation on large-scale data is expensive, few organizations have the resources for extensive data selection research. Consequently, knowledge of effective data selection practices has become concentrated within a few organizations, many of which do not openly share their findings and methodologies. To narrow this gap in knowledge, we present a comprehensive review of existing literature on data selection methods and related research areas, providing a taxonomy of existing approaches. By describing the current landscape of research, this work aims to accelerate progress in data selection by establishing an entry point for new and established researchers. Additionally, throughout this review we draw attention to noticeable holes in the literature and conclude the paper by proposing promising avenues for future research. △ Less

Submitted 8 March, 2024; v1 submitted 26 February, 2024; originally announced February 2024.

Comments: Paper list available at https://github.com/alon-albalak/data-selection-survey

arXiv:2402.00838 [pdf, other]

OLMo: Accelerating the Science of Language Models

Authors: Dirk Groeneveld, Iz Beltagy, Pete Walsh, Akshita Bhagia, Rodney Kinney, Oyvind Tafjord, Ananya Harsh Jha, Hamish Ivison, Ian Magnusson, Yizhong Wang, Shane Arora, David Atkinson, Russell Authur, Khyathi Raghavi Chandu, Arman Cohan, Jennifer Dumas, Yanai Elazar, Yuling Gu, Jack Hessel, Tushar Khot, William Merrill, Jacob Morrison, Niklas Muennighoff, Aakanksha Naik, Crystal Nam , et al. (18 additional authors not shown)

Abstract: Language models (LMs) have become ubiquitous in both NLP research and in commercial product offerings. As their commercial importance has surged, the most powerful models have become closed off, gated behind proprietary interfaces, with important details of their training data, architectures, and development undisclosed. Given the importance of these details in scientifically studying these models… ▽ More Language models (LMs) have become ubiquitous in both NLP research and in commercial product offerings. As their commercial importance has surged, the most powerful models have become closed off, gated behind proprietary interfaces, with important details of their training data, architectures, and development undisclosed. Given the importance of these details in scientifically studying these models, including their biases and potential risks, we believe it is essential for the research community to have access to powerful, truly open LMs. To this end, we have built OLMo, a competitive, truly Open Language Model, to enable the scientific study of language models. Unlike most prior efforts that have only released model weights and inference code, we release OLMo alongside open training data and training and evaluation code. We hope this release will empower the open research community and inspire a new wave of innovation. △ Less

Submitted 7 June, 2024; v1 submitted 1 February, 2024; originally announced February 2024.

arXiv:2402.00159 [pdf, other]

Dolma: an Open Corpus of Three Trillion Tokens for Language Model Pretraining Research

Authors: Luca Soldaini, Rodney Kinney, Akshita Bhagia, Dustin Schwenk, David Atkinson, Russell Authur, Ben Bogin, Khyathi Chandu, Jennifer Dumas, Yanai Elazar, Valentin Hofmann, Ananya Harsh Jha, Sachin Kumar, Li Lucy, Xinxi Lyu, Nathan Lambert, Ian Magnusson, Jacob Morrison, Niklas Muennighoff, Aakanksha Naik, Crystal Nam, Matthew E. Peters, Abhilasha Ravichander, Kyle Richardson, Zejiang Shen , et al. (11 additional authors not shown)

Abstract: Information about pretraining corpora used to train the current best-performing language models is seldom discussed: commercial models rarely detail their data, and even open models are often released without accompanying training data or recipes to reproduce them. As a result, it is challenging to conduct and advance scientific research on language modeling, such as understanding how training dat… ▽ More Information about pretraining corpora used to train the current best-performing language models is seldom discussed: commercial models rarely detail their data, and even open models are often released without accompanying training data or recipes to reproduce them. As a result, it is challenging to conduct and advance scientific research on language modeling, such as understanding how training data impacts model capabilities and limitations. To facilitate scientific research on language model pretraining, we curate and release Dolma, a three-trillion-token English corpus, built from a diverse mixture of web content, scientific papers, code, public-domain books, social media, and encyclopedic materials. We extensively document Dolma, including its design principles, details about its construction, and a summary of its contents. We present analyses and experimental results on intermediate states of Dolma to share what we have learned about important data curation practices. Finally, we open-source our data curation toolkit to enable reproduction of our work as well as support further research in large-scale data curation. △ Less

Submitted 6 June, 2024; v1 submitted 31 January, 2024; originally announced February 2024.

Comments: Accepted at ACL 2024; Dataset: https://hf.co/datasets/allenai/dolma; Code: https://github.com/allenai/dolma

arXiv:2401.14955 [pdf, ps, other]

Non-Relativistic M2-Branes and the AdS/CFT Correspondence

Authors: Neil Lambert, Joseph Smith

Abstract: A non-relativistic limit of the AdS/CFT correspondence is studied in the context of M2-branes. On the field theory side this corresponds to a near-BPS limit of ABJM that localises onto solutions of Hitchin's equations. It is shown that the symmetries of the theory include an infinite-dimensional enhancement of the spatial symmetry algebra corresponding to time-dependent holomorphic transformations… ▽ More A non-relativistic limit of the AdS/CFT correspondence is studied in the context of M2-branes. On the field theory side this corresponds to a near-BPS limit of ABJM that localises onto solutions of Hitchin's equations. It is shown that the symmetries of the theory include an infinite-dimensional enhancement of the spatial symmetry algebra corresponding to time-dependent holomorphic transformations. Taking the limit of the gravitational dual splits the geometry into three 'large' directions and eight 'small' directions and corresponds to the Membrane-Newton-Cartan limit of eleven-dimensional supergravity. This has the effect of reducing the $AdS_4$ factor to an $AdS_2$ factor for the near-horizon limit of the M2-brane metric. Evidence is presented that the duality is maintained after the limit. △ Less

Submitted 7 May, 2024; v1 submitted 26 January, 2024; originally announced January 2024.

Comments: 50 pages; v2: improved discussion of subleading fields, references added, minor corrections, v3: added discussion of limit on sphere, improved explanations, typos corrected, version accepted for publication in JHEP

arXiv:2401.11830 [pdf, other]

Non-Hermitian Pseudomodes for Strongly Coupled Open Quantum Systems: Unravelings, Correlations and Thermodynamics

Authors: Paul Menczel, Ken Funo, Mauro Cirio, Neill Lambert, Franco Nori

Abstract: The pseudomode framework provides an exact description of the dynamics of an open quantum system coupled to a non-Markovian environment. Using this framework, the influence of the environment on the system is studied in an equivalent model, where the open system is coupled to a finite number of unphysical pseudomodes that follow a time-local master equation. Building on the insight that this maste… ▽ More The pseudomode framework provides an exact description of the dynamics of an open quantum system coupled to a non-Markovian environment. Using this framework, the influence of the environment on the system is studied in an equivalent model, where the open system is coupled to a finite number of unphysical pseudomodes that follow a time-local master equation. Building on the insight that this master equation does not need to conserve the hermiticity of the pseudomode state, we here ask for the most general conditions on the master equation that guarantee the correct reproduction of the system's original dynamics. We demonstrate that our generalized approach decreases the number of pseudomodes that are required to model, for example, underdamped environments at finite temperature. We also provide an unraveling of the master equation into quantum jump trajectories of non-Hermitian states, which further facilitates the utilization of the pseudomode technique for numerical calculations by enabling the use of easily parallelizable Monte Carlo simulations. Finally, we show that pseudomodes, despite their unphysical nature, provide a natural picture in which physical processes, such as the creation of system-bath correlations or the exchange of heat, can be studied. Hence, our results pave the way for future investigations of the system-environment interaction leading to a better understanding of open quantum systems far from the Markovian weak-coupling limit. △ Less

Submitted 26 February, 2024; v1 submitted 22 January, 2024; originally announced January 2024.

Comments: 24 pages, 6 figures

arXiv:2401.07932 [pdf, ps, other]

Six-Dimensional Correlators From a Five-Dimensional Operator Product Expansion

Authors: N. Lambert, A. Lipstein, R. Mouland

Abstract: In this letter we discuss the operator product expansion of scalar operators in five-dimensional field theories with an $SU(1,3)\times U(1)$ spacetime symmetry. Such theories arise by a novel conformal null reduction of six-dimensional Lorentzian conformal field theories. Unlike Lorentzian conformal field theories, three-point functions of generic operators in such theories are not completely fixe… ▽ More In this letter we discuss the operator product expansion of scalar operators in five-dimensional field theories with an $SU(1,3)\times U(1)$ spacetime symmetry. Such theories arise by a novel conformal null reduction of six-dimensional Lorentzian conformal field theories. Unlike Lorentzian conformal field theories, three-point functions of generic operators in such theories are not completely fixed by $SU(1,3)\times U(1)$ symmetry. However, we show that in a special case the functional form of the OPE coefficients can be fully determined, and we use them to fix the form of the three-point function. The result is shown to agree with correlation functions obtained by reduction of six-dimensional conformal field theories. △ Less

Submitted 15 May, 2024; v1 submitted 15 January, 2024; originally announced January 2024.

Comments: 17 pages, Typo corrected and some further clarifications. To appear in JHEP

arXiv:2311.15240 [pdf, other]

Modeling the unphysical pseudomode model with physical ensembles: simulation, mitigation, and restructuring of non-Markovian quantum noise

Authors: Mauro Cirio, Si Luo, Pengfei Liang, Franco Nori, Neill Lambert

Abstract: The influence of a Gaussian environment on a quantum system can be described by effectively replacing the continuum with a discrete set of ancillary quantum and classical degrees of freedom. This defines a pseudomode model which can be used to classically simulate the reduced system dynamics. Here, we consider an alternative point of view and analyze the potential benefits of an analog or digital… ▽ More The influence of a Gaussian environment on a quantum system can be described by effectively replacing the continuum with a discrete set of ancillary quantum and classical degrees of freedom. This defines a pseudomode model which can be used to classically simulate the reduced system dynamics. Here, we consider an alternative point of view and analyze the potential benefits of an analog or digital quantum simulation of the pseudomode model itself. Superficially, such a direct experimental implementation is, in general, impossible due to the unphysical properties of the effective degrees of freedom involved. However, we show that the effects of the unphysical pseudomode model can still be reproduced using measurement results over an ensemble of physical systems involving ancillary harmonic modes and an optional stochastic driving field. This is done by introducing an extrapolation technique whose efficiency is limited by stability against imprecision in the measurement data. We examine how such a simulation would allow us to (i) perform accurate quantum simulation of the effects of complex non-perturbative and non-Markovian environments in regimes that are challenging for classical simulation, (ii) conversely, mitigate potential unwanted non-Markovian noise present in quantum devices, and (iii) restructure some of some of the properties of a given physical bath, such as its temperature. △ Less

Submitted 26 November, 2023; originally announced November 2023.

Comments: 30 pages, 10 figures

arXiv:2311.10702 [pdf, other]

Camels in a Changing Climate: Enhancing LM Adaptation with Tulu 2

Authors: Hamish Ivison, Yizhong Wang, Valentina Pyatkin, Nathan Lambert, Matthew Peters, Pradeep Dasigi, Joel Jang, David Wadden, Noah A. Smith, Iz Beltagy, Hannaneh Hajishirzi

Abstract: Since the release of TÜLU [Wang et al., 2023b], open resources for instruction tuning have developed quickly, from better base models to new finetuning techniques. We test and incorporate a number of these advances into TÜLU, resulting in TÜLU 2, a suite of improved TÜLU models for advancing the understanding and best practices of adapting pretrained language models to downstream tasks and user pr… ▽ More Since the release of TÜLU [Wang et al., 2023b], open resources for instruction tuning have developed quickly, from better base models to new finetuning techniques. We test and incorporate a number of these advances into TÜLU, resulting in TÜLU 2, a suite of improved TÜLU models for advancing the understanding and best practices of adapting pretrained language models to downstream tasks and user preferences. Concretely, we release: (1) TÜLU-V2-mix, an improved collection of high-quality instruction datasets; (2) TÜLU 2, LLAMA-2 models finetuned on the V2 mixture; (3) TÜLU 2+DPO, TÜLU 2 models trained with direct preference optimization (DPO), including the largest DPO-trained model to date (TÜLU 2+DPO 70B); (4) CODE TÜLU 2, CODE LLAMA models finetuned on our V2 mix that outperform CODE LLAMA and its instruction-tuned variant, CODE LLAMA-Instruct. Our evaluation from multiple perspectives shows that the TÜLU 2 suite achieves state-of-the-art performance among open models and matches or exceeds the performance of GPT-3.5-turbo-0301 on several benchmarks. We release all the checkpoints, data, training and evaluation code to facilitate future open efforts on adapting large language models. △ Less

Submitted 19 November, 2023; v1 submitted 17 November, 2023; originally announced November 2023.

Comments: technical report; fixed zephyr numbers

arXiv:2311.00168 [pdf, other]

The Alignment Ceiling: Objective Mismatch in Reinforcement Learning from Human Feedback

Authors: Nathan Lambert, Roberto Calandra

Abstract: Reinforcement learning from human feedback (RLHF) has emerged as a powerful technique to make large language models (LLMs) more capable in complex settings. RLHF proceeds as collecting human preference data, training a reward model on said data, and optimizing a base ML model with respect to said reward for extrinsic evaluation metrics (e.g. MMLU, GSM8k). RLHF relies on many assumptions about how… ▽ More Reinforcement learning from human feedback (RLHF) has emerged as a powerful technique to make large language models (LLMs) more capable in complex settings. RLHF proceeds as collecting human preference data, training a reward model on said data, and optimizing a base ML model with respect to said reward for extrinsic evaluation metrics (e.g. MMLU, GSM8k). RLHF relies on many assumptions about how the various pieces fit together, such as a reward model capturing human preferences and an RL optimizer extracting the right signal from a reward model. As the RLHF process involves many distinct design decisions, it is easy to assume that multiple processes are correlated and therefore numerically linked. This apparent correlation is often not true, where reward models are easily overoptimized or RL optimizers can reduce performance on tasks not modeled in the data. Notable manifestations of models trained with imperfect RLHF systems are those that are prone to refusing basic requests for safety reasons or appearing lazy in generations. As chat model evaluation becomes increasingly nuanced, the reliance on a perceived link between reward model training, RL scores, and downstream performance drives these issues, which we describe as an objective mismatch. In this paper, we illustrate the causes of this issue, reviewing relevant literature from model-based reinforcement learning, and argue for solutions. By solving objective mismatch in RLHF, the ML models of the future will be more precisely aligned to user instructions for both safety and helpfulness. △ Less

Submitted 1 February, 2024; v1 submitted 31 October, 2023; originally announced November 2023.

Comments: 11 pages, 5 figures

arXiv:2310.16944 [pdf, other]

Zephyr: Direct Distillation of LM Alignment

Authors: Lewis Tunstall, Edward Beeching, Nathan Lambert, Nazneen Rajani, Kashif Rasul, Younes Belkada, Shengyi Huang, Leandro von Werra, Clémentine Fourrier, Nathan Habib, Nathan Sarrazin, Omar Sanseviero, Alexander M. Rush, Thomas Wolf

Abstract: We aim to produce a smaller language model that is aligned to user intent. Previous research has shown that applying distilled supervised fine-tuning (dSFT) on larger models significantly improves task accuracy; however, these models are unaligned, i.e. they do not respond well to natural prompts. To distill this property, we experiment with the use of preference data from AI Feedback (AIF). Start… ▽ More We aim to produce a smaller language model that is aligned to user intent. Previous research has shown that applying distilled supervised fine-tuning (dSFT) on larger models significantly improves task accuracy; however, these models are unaligned, i.e. they do not respond well to natural prompts. To distill this property, we experiment with the use of preference data from AI Feedback (AIF). Starting from a dataset of outputs ranked by a teacher model, we apply distilled direct preference optimization (dDPO) to learn a chat model with significantly improved intent alignment. The approach requires only a few hours of training without any additional sampling during fine-tuning. The final result, Zephyr-7B, sets the state-of-the-art on chat benchmarks for 7B parameter models, and requires no human annotation. In particular, results on MT-Bench show that Zephyr-7B surpasses Llama2-Chat-70B, the best open-access RLHF-based model. Code, models, data, and tutorials for the system are available at https://github.com/huggingface/alignment-handbook. △ Less

Submitted 25 October, 2023; originally announced October 2023.

arXiv:2310.13595 [pdf, other]

The History and Risks of Reinforcement Learning and Human Feedback

Authors: Nathan Lambert, Thomas Krendl Gilbert, Tom Zick

Abstract: Reinforcement learning from human feedback (RLHF) has emerged as a powerful technique to make large language models (LLMs) easier to use and more effective. A core piece of the RLHF process is the training and utilization of a model of human preferences that acts as a reward function for optimization. This approach, which operates at the intersection of many stakeholders and academic disciplines,… ▽ More Reinforcement learning from human feedback (RLHF) has emerged as a powerful technique to make large language models (LLMs) easier to use and more effective. A core piece of the RLHF process is the training and utilization of a model of human preferences that acts as a reward function for optimization. This approach, which operates at the intersection of many stakeholders and academic disciplines, remains poorly understood. RLHF reward models are often cited as being central to achieving performance, yet very few descriptors of capabilities, evaluations, training methods, or open-source models exist. Given this lack of information, further study and transparency is needed for learned RLHF reward models. In this paper, we illustrate the complex history of optimizing preferences, and articulate lines of inquiry to understand the sociotechnical context of reward models. In particular, we highlight the ontological differences between costs, rewards, and preferences at stake in RLHF's foundations, related methodological tensions, and possible research directions to improve general understanding of how reward models function. △ Less

Submitted 28 November, 2023; v1 submitted 20 October, 2023; originally announced October 2023.

Comments: 14 pages, 3 figures

arXiv:2310.12700 [pdf, other]

doi 10.1093/nar/gkae289

Towards Parsimonious Generative Modeling of RNA Families

Authors: Francesco Calvanese, Camille N. Lambert, Philippe Nghe, Francesco Zamponi, Martin Weigt

Abstract: Generative probabilistic models emerge as a new paradigm in data-driven, evolution-informed design of biomolecular sequences. This paper introduces a novel approach, called Edge Activation Direct Coupling Analysis (eaDCA), tailored to the characteristics of RNA sequences, with a strong emphasis on simplicity, efficiency, and interpretability. eaDCA explicitly constructs sparse coevolutionary model… ▽ More Generative probabilistic models emerge as a new paradigm in data-driven, evolution-informed design of biomolecular sequences. This paper introduces a novel approach, called Edge Activation Direct Coupling Analysis (eaDCA), tailored to the characteristics of RNA sequences, with a strong emphasis on simplicity, efficiency, and interpretability. eaDCA explicitly constructs sparse coevolutionary models for RNA families, achieving performance levels comparable to more complex methods while utilizing a significantly lower number of parameters. Our approach demonstrates efficiency in generating artificial RNA sequences that closely resemble their natural counterparts in both statistical analyses and SHAPE-MaP experiments, and in predicting the effect of mutations. Notably, eaDCA provides a unique feature: estimating the number of potential functional sequences within a given RNA family. For example, in the case of cyclic di-AMP riboswitches (RF00379), our analysis suggests the existence of approximately $\mathbf{10^{39}}$ functional nucleotide sequences. While huge compared to the known $< \mathbf{4,000}$ natural sequences, this number represents only a tiny fraction of the vast pool of nearly $\mathbf{10^{82}}$ possible nucleotide sequences of the same length (136 nucleotides). These results underscore the promise of sparse and interpretable generative models, such as eaDCA, in enhancing our understanding of the expansive RNA sequence space. △ Less

Submitted 19 October, 2023; originally announced October 2023.

Comments: 33 pages (including SI)

Journal ref: Nucleic Acids Research, gkae289 (2024)

arXiv:2310.12539 [pdf, ps, other]

Fixing detailed balance in ancilla-based dissipative state engineering

Authors: Neill Lambert, Mauro Cirio, Jhen-dong Lin, Paul Menczel, Pengfei Liang, Franco Nori

Abstract: Dissipative state engineering is a general term for a protocol which prepares the ground state of a complex many-body Hamiltonian using engineered dissipation or engineered environments. Recently, it was shown that a version of this protocol, where the engineered environment consists of one or more dissipative qubit ancillas tuned to be resonant with the low-energy transitions of a many-body syste… ▽ More Dissipative state engineering is a general term for a protocol which prepares the ground state of a complex many-body Hamiltonian using engineered dissipation or engineered environments. Recently, it was shown that a version of this protocol, where the engineered environment consists of one or more dissipative qubit ancillas tuned to be resonant with the low-energy transitions of a many-body system, resulted in the combined system evolving to reasonable approximation to the ground state. This potentially broadens the applicability of the method beyond non-frustrated systems, to which it was previously restricted. Here we argue that this approach has an intrinsic limitation because the ancillas, seen as an effective bath by the system in the weak-coupling limit, do not give the detailed balance expected for a true zero-temperature environment. Our argument is based on the study of a similar approach employing linear coupling to bosonic ancillas. We explore overcoming this limitation using a recently developed technique from open-quantum-systems called pseudomodes. With a simple example model of a 1D quantum Ising chain, we show that detailed balance can be fixed, and a more accurate estimation of the ground state obtained, at the cost of two additional unphysical dissipative modes and the extrapolation error of implementing those modes in physical systems. △ Less

Submitted 19 October, 2023; originally announced October 2023.

Comments: 10 pages, 6 figures

arXiv:2310.06253 [pdf, other]

A Unified View on Solving Objective Mismatch in Model-Based Reinforcement Learning

Authors: Ran Wei, Nathan Lambert, Anthony McDonald, Alfredo Garcia, Roberto Calandra

Abstract: Model-based Reinforcement Learning (MBRL) aims to make agents more sample-efficient, adaptive, and explainable by learning an explicit model of the environment. While the capabilities of MBRL agents have significantly improved in recent years, how to best learn the model is still an unresolved question. The majority of MBRL algorithms aim at training the model to make accurate predictions about th… ▽ More Model-based Reinforcement Learning (MBRL) aims to make agents more sample-efficient, adaptive, and explainable by learning an explicit model of the environment. While the capabilities of MBRL agents have significantly improved in recent years, how to best learn the model is still an unresolved question. The majority of MBRL algorithms aim at training the model to make accurate predictions about the environment and subsequently using the model to determine the most rewarding actions. However, recent research has shown that model predictive accuracy is often not correlated with action quality, tracing the root cause to the objective mismatch between accurate dynamics model learning and policy optimization of rewards. A number of interrelated solution categories to the objective mismatch problem have emerged as MBRL continues to mature as a research area. In this work, we provide an in-depth survey of these solution categories and propose a taxonomy to foster future research. △ Less

Submitted 6 April, 2024; v1 submitted 9 October, 2023; originally announced October 2023.

arXiv:2308.00862 [pdf, ps, other]

Confidence-Building Measures for Artificial Intelligence: Workshop Proceedings

Authors: Sarah Shoker, Andrew Reddie, Sarah Barrington, Ruby Booth, Miles Brundage, Husanjot Chahal, Michael Depp, Bill Drexel, Ritwik Gupta, Marina Favaro, Jake Hecla, Alan Hickey, Margarita Konaev, Kirthi Kumar, Nathan Lambert, Andrew Lohn, Cullen O'Keefe, Nazneen Rajani, Michael Sellitto, Robert Trager, Leah Walker, Alexa Wehsener, Jessica Young

Abstract: Foundation models could eventually introduce several pathways for undermining state security: accidents, inadvertent escalation, unintentional conflict, the proliferation of weapons, and the interference with human diplomacy are just a few on a long list. The Confidence-Building Measures for Artificial Intelligence workshop hosted by the Geopolitics Team at OpenAI and the Berkeley Risk and Securit… ▽ More Foundation models could eventually introduce several pathways for undermining state security: accidents, inadvertent escalation, unintentional conflict, the proliferation of weapons, and the interference with human diplomacy are just a few on a long list. The Confidence-Building Measures for Artificial Intelligence workshop hosted by the Geopolitics Team at OpenAI and the Berkeley Risk and Security Lab at the University of California brought together a multistakeholder group to think through the tools and strategies to mitigate the potential risks introduced by foundation models to international security. Originating in the Cold War, confidence-building measures (CBMs) are actions that reduce hostility, prevent conflict escalation, and improve trust between parties. The flexibility of CBMs make them a key instrument for navigating the rapid changes in the foundation model landscape. Participants identified the following CBMs that directly apply to foundation models and which are further explained in this conference proceedings: 1. crisis hotlines 2. incident sharing 3. model, transparency, and system cards 4. content provenance and watermarks 5. collaborative red teaming and table-top exercises and 6. dataset and evaluation sharing. Because most foundation model developers are non-government entities, many CBMs will need to involve a wider stakeholder community. These measures can be implemented either by AI labs or by relevant government actors. △ Less

Submitted 3 August, 2023; v1 submitted 1 August, 2023; originally announced August 2023.

arXiv:2307.11226 [pdf, other]

BLISS: Interplanetary Exploration with Swarms of Low-Cost Spacecraft

Authors: Alexander N. Alvara, Lydia Lee, Emmanuel Sin, Nathan Lambert, Andrew J. Westphal, Kristofer S. J. Pister

Abstract: Leveraging advancements in micro-scale technology, we propose a fleet of autonomous, low-cost, small solar sails for interplanetary exploration. The Berkeley Low-cost Interplanetary Solar Sail (BLISS) project aims to utilize small-scale technologies to create a fleet of tiny interplanetary femto-spacecraft for rapid, low-cost exploration of the inner solar system. This paper describes the hardware… ▽ More Leveraging advancements in micro-scale technology, we propose a fleet of autonomous, low-cost, small solar sails for interplanetary exploration. The Berkeley Low-cost Interplanetary Solar Sail (BLISS) project aims to utilize small-scale technologies to create a fleet of tiny interplanetary femto-spacecraft for rapid, low-cost exploration of the inner solar system. This paper describes the hardware required to build a nearly 10 g spacecraft using a 1 m$^2$ solar sail steered by micro-electromechanical systems (MEMS) inchworm actuators. The trajectory control to a NEO, here 101955 Bennu, is detailed along with the low-level actuation control of the solar sail and the specifications of proposed onboard communication and computation. Two other applications are also shortly considered: sample return from dozens of Jupiter-family comets and interstellar comet rendezvous and imaging. The paper concludes by discussing the fundamental scaling limits and future directions for steerable autonomous miniature solar sails with onboard custom computers and sensors. △ Less

Submitted 20 July, 2023; originally announced July 2023.

Comments: 16 pages, 13 figures, 5 tables, 23 equations, and just over 10 years

arXiv:2306.10862 [pdf]

doi 10.51926/ISTE.9091.ch3

Enjeux de communication dans la multirepr{é}sentation cartographique reproductible

Authors: Nicolas Lambert, Timothée Giraud, Ronan Ysebaert

Abstract: This chapter deepens cartographic communication through a cartographic multirepresentation exercise. Using a single dataset on World population data, the chapter presents a series of 13 different maps to illustrate how mapping is primarily a matter of choices and methods. This chapter deepens cartographic communication through a cartographic multirepresentation exercise. Using a single dataset on World population data, the chapter presents a series of 13 different maps to illustrate how mapping is primarily a matter of choices and methods. △ Less

Submitted 19 June, 2023; originally announced June 2023.

Comments: in French language

Journal ref: Communication cartographique, ISTE Group, pp.73-102, 2022

arXiv:2306.07522 [pdf, other]

doi 10.1038/s42005-023-01427-2

HierarchicalEOM.jl: An efficient Julia framework for hierarchical equations of motion in open quantum systems

Authors: Yi-Te Huang, Po-Chen Kuo, Neill Lambert, Mauro Cirio, Simon Cross, Shen-Liang Yang, Franco Nori, Yueh-Nan Chen

Abstract: The hierarchical equations of motion (HEOM) approach can describe the reduced dynamics of a system simultaneously coupled to multiple bosonic and fermionic environments. The complexity of exactly describing the system-environment interaction with the HEOM method usually results in time-consuming calculations and a large memory cost. Here, we introduce an open-source software package called Hierarc… ▽ More The hierarchical equations of motion (HEOM) approach can describe the reduced dynamics of a system simultaneously coupled to multiple bosonic and fermionic environments. The complexity of exactly describing the system-environment interaction with the HEOM method usually results in time-consuming calculations and a large memory cost. Here, we introduce an open-source software package called HierarchicalEOM.jl: a Julia framework integrating the HEOM approach. HierarchicalEOM.jl features a collection of methods to compute bosonic and fermionic spectra, stationary states, and the full dynamics in the extended space of all auxiliary density operators (ADOs). The required handling of the ADOs multi-indexes is achieved through a user-friendly interface. We exemplify the functionalities of the package by analyzing a single impurity Anderson model, and an ultra-strongly coupled charge-cavity system interacting with bosonic and fermionic reservoirs. HierarchicalEOM.jl achieves a significant speedup with respect to the corresponding method in the Quantum Toolbox in Python (QuTiP), upon which this package is founded. △ Less

Submitted 23 October, 2023; v1 submitted 12 June, 2023; originally announced June 2023.

Comments: 20 pages, 7 figures, 4 tables

Journal ref: Commun. Phys. 6, 313 (2023)

arXiv:2303.11758 [pdf, other]

The Closed and Open Unbalanced Dicke Trimer Model: Critical Properties and Nonlinear Semiclassical Dynamics

Authors: Cheng Zhang, Pengfei Liang, Neill Lambert, Mauro Cirio

Abstract: We study a generalization of a recently introduced Dicke trimer model [Phys. Rev. Lett. 128, 163601, Phys. Rev. Research 5, L042016], which allows for cavity losses and unbalanced light-matter interactions (in which rotating and counter-rotating terms can be tuned independently). We find that in the extreme unbalanced limit, the $U(1)$ symmetry of the Tavis-Cummings model is restored, qualitativel… ▽ More We study a generalization of a recently introduced Dicke trimer model [Phys. Rev. Lett. 128, 163601, Phys. Rev. Research 5, L042016], which allows for cavity losses and unbalanced light-matter interactions (in which rotating and counter-rotating terms can be tuned independently). We find that in the extreme unbalanced limit, the $U(1)$ symmetry of the Tavis-Cummings model is restored, qualitatively altering the critical phenomena in the superradiant phase due to the presence of a zero-energy mode. To analyze this general regime, we develop a semiclassical theory based on a re-quantization technique. This theory also provides further physical insight on a recently reported anomalous finite critical fluctuations in the time-reversal broken regime. Moving to the open-Dicke case, by introducing local dissipation to the cavities, we observe the emergence of a rich range of nonequilibrium phases characterized by trivial and non-trivial dynamical signatures. In the former case, we identify, when time-reversal symmetry is present, a new stationary phase that features superradiant states in two of the three cavities and a normal state in the other cavity. In the latter case, we observe the emergence of dynamical phases in which the system exhibits superradiant oscillations, characterized by periodic or chaotic phase space patterns. The landscape of transitions associated with these dynamical phases features a wide range of qualitatively different behaviours such as Hopf bifurcations, anomalous Hopf bifurcations, collisions between basins of attraction, and exterior crises. We highlight how the two-critical-scalings feature of the closed model is robust under dissipation while the phenomenon of anomalous finite critical fluctuations becomes a mean-field scaling in the open model. △ Less

Submitted 2 November, 2023; v1 submitted 21 March, 2023; originally announced March 2023.

Comments: 22 pages, 13 figures

arXiv:2302.10955 [pdf, ps, other]

doi 10.1016/j.physletb.2023.137888

Duality and Fluxes in the Sen Formulation of Self-Dual Fields

Authors: Neil Lambert

Abstract: In Sen's formulation of self-dual fields one finds two closed forms: $H^{(g)}$ and $H^{(s)}$. Only the former couples to sources and the spacetime metric. The latter has the wrong sign kinetic term but decouples and hence might be regarded as an unphysical artifact. In this letter we illustrate how an electromagnetic duality associated to the potential for $H^{(s)}$ gives rise to a T-like duality… ▽ More In Sen's formulation of self-dual fields one finds two closed forms: $H^{(g)}$ and $H^{(s)}$. Only the former couples to sources and the spacetime metric. The latter has the wrong sign kinetic term but decouples and hence might be regarded as an unphysical artifact. In this letter we illustrate how an electromagnetic duality associated to the potential for $H^{(s)}$ gives rise to a T-like duality in the partition function for $H^{(g)}$. We then compute the partition function on a (4k+2)-dimensional torus highlighting its dependence on the choice of flux of $H^{(s)}$. Lastly we compute the two-point function of Wilson Surface operators. △ Less

Submitted 4 April, 2023; v1 submitted 21 February, 2023; originally announced February 2023.

Comments: 20 pages, Typos corrected, to appear in PLB

arXiv:2302.01044 [pdf, other]

doi 10.1103/PhysRevResearch.5.043177

Kondo QED: The Kondo effect and photon trapping in a two-impurity Anderson model ultra-strongly coupled to light

Authors: Po-Chen Kuo, Neill Lambert, Mauro Cirio, Yi-Te Huang, Franco Nori, Yueh-Nan Chen

Abstract: The Kondo effect is one of the most studied examples of strongly correlated quantum many-body physics. Another type of strongly correlated physics that has only recently been explored in detail (and become experimentally accessible) is that of ultrastrong coupling between light and matter. Here, we study a system which we denote as "Kondo QED") that combines both phenomena, consisting of a two-imp… ▽ More The Kondo effect is one of the most studied examples of strongly correlated quantum many-body physics. Another type of strongly correlated physics that has only recently been explored in detail (and become experimentally accessible) is that of ultrastrong coupling between light and matter. Here, we study a system which we denote as "Kondo QED") that combines both phenomena, consisting of a two-impurity Anderson model ultra-strongly coupled to a single-mode cavity. While presented as an abstract model, it is relevant for a range of future hybrid cavity-QED systems. Using the hierarchical equations of motion approach we show that the ultrastrong coupling of cavity photons to the electronic states (impurity) noticeably suppresses the electronic Kondo resonance due to the destruction of many-body correlations of the Kondo cloud. We observe this transfer of correlations from the Kondo cloud to the cavity by computing the entropy and mutual information of the impurity-cavity subsystems. In addition, in the weak lead-coupling limit and at zero-bias, the model exhibits a ground-state photon accumulation effect originating entirely from counter-rotating terms in the impurity-cavity interaction. Interestingly, in the strong lead-coupling limit, this accumulation is ``Kondo-enhanced'' by new transition paths opening when increasing the hybridization to the leads. This suggests a new mechanism for the generation of real photons from virtual states. We further show that the suppression of the Kondo effect is stable under broadening of the cavity resonance as a consequence of the interaction to an external bosonic continuum. Our findings pave the way for the simultaneous control of both the Kondo QED effect and a photon accumulation effect using the ultrastrong coupling of light and matter. △ Less

Submitted 19 July, 2023; v1 submitted 2 February, 2023; originally announced February 2023.

Comments: 23 pages, 11 figures, 2 tables

Journal ref: Phys. Rev. Research 5, 043177 (2023)

arXiv:2301.11492 [pdf, ps, other]

Recovering utility

Authors: Christopher P. Chambers, Federico Echenique, Nicolas S. Lambert

Abstract: We provide sufficient conditions under which a utility function may be recovered from a finite choice experiment. Identification, as is commonly understood in decision theory, is not enough. We provide a general recoverability result that is widely applicable to modern theories of choice under uncertainty. Key is to allow for a monetary environment, in which an objective notion of monotonicity is… ▽ More We provide sufficient conditions under which a utility function may be recovered from a finite choice experiment. Identification, as is commonly understood in decision theory, is not enough. We provide a general recoverability result that is widely applicable to modern theories of choice under uncertainty. Key is to allow for a monetary environment, in which an objective notion of monotonicity is meaningful. In such environments, we show that subjective expected utility, as well as variational preferences, and other parametrizations of utilities over uncertain acts are recoverable. We also consider utility recovery in a statistical model with noise and random deviations from utility maximization. △ Less

Submitted 26 January, 2023; originally announced January 2023.

arXiv:2301.07554 [pdf, other]

doi 10.1103/PRXQuantum.4.030316

A quantum-classical decomposition of Gaussian quantum environments: a stochastic pseudomode model

Authors: Si Luo, Neill Lambert, Pengfei Liang, Mauro Cirio

Abstract: We show that the effect of a Gaussian Bosonic environment linearly coupled to a quantum system can be simulated by a stochastic Lindblad master equation characterized by a set of ancillary Bosonic modes initially at zero temperature and classical stochastic fields. We test the method for Ohmic environments with exponential and polynomial cut-offs against, respectively, the Hierarchical Equations o… ▽ More We show that the effect of a Gaussian Bosonic environment linearly coupled to a quantum system can be simulated by a stochastic Lindblad master equation characterized by a set of ancillary Bosonic modes initially at zero temperature and classical stochastic fields. We test the method for Ohmic environments with exponential and polynomial cut-offs against, respectively, the Hierarchical Equations of Motion and the deterministic pseudomode model with respect to which the number of ancillary quantum degrees of freedom is reduced. For a subset of rational spectral densities, all parameters are explicitly specified without the need of any fitting procedure, thereby simplifying the modeling strategy. Interestingly, the classical fields in this decomposition must sometimes be imaginary-valued, which can have counter-intuitive effects on the system properties which we demonstrate by showing that they can decrease the entropy of the system, in contrast to real-valued fields. △ Less

Submitted 14 June, 2023; v1 submitted 18 January, 2023; originally announced January 2023.

Comments: 41 pages, 11 figures

Journal ref: PRX Quantum 4, 030316 (2023)

arXiv:2212.07717 [pdf, ps, other]

doi 10.1007/JHEP03(2023)069

RG Flows and Symmetry Enhancement in Five-Dimensional Lifshitz Gauge Theories

Authors: Neil Lambert, Joseph Smith

Abstract: Lagrangian gauge theories with a z=2 Lifshitz scaling provide a family of interacting, asymptotically free five-dimensional field theories. We examine some of their quantum properties, extending previous results to include matter. We present no-go theorems that, in the absence of constraints, such theories cannot admit a spinorial supersymmetry or a boost symmetry. However, we argue that there exi… ▽ More Lagrangian gauge theories with a z=2 Lifshitz scaling provide a family of interacting, asymptotically free five-dimensional field theories. We examine some of their quantum properties, extending previous results to include matter. We present no-go theorems that, in the absence of constraints, such theories cannot admit a spinorial supersymmetry or a boost symmetry. However, we argue that there exist renormalization group flows whose fixed points can admit supersymmetry and boosts, i.e. super-Schrodinger symmetry. We also present examples of Lifshitz gauge theories with a scalar supersymmetry. △ Less

Submitted 13 March, 2023; v1 submitted 15 December, 2022; originally announced December 2022.

Comments: Minor corrections and some improved explanations. Version to appear in JHEP

arXiv:2212.05129 [pdf, other]

Measuring Data

Authors: Margaret Mitchell, Alexandra Sasha Luccioni, Nathan Lambert, Marissa Gerchick, Angelina McMillan-Major, Ezinwanne Ozoani, Nazneen Rajani, Tristan Thrush, Yacine Jernite, Douwe Kiela

Abstract: We identify the task of measuring data to quantitatively characterize the composition of machine learning data and datasets. Similar to an object's height, width, and volume, data measurements quantify different attributes of data along common dimensions that support comparison. Several lines of research have proposed what we refer to as measurements, with differing terminology; we bring some of t… ▽ More We identify the task of measuring data to quantitatively characterize the composition of machine learning data and datasets. Similar to an object's height, width, and volume, data measurements quantify different attributes of data along common dimensions that support comparison. Several lines of research have proposed what we refer to as measurements, with differing terminology; we bring some of this work together, particularly in fields of computer vision and language, and build from it to motivate measuring data as a critical component of responsible AI development. Measuring data aids in systematically building and analyzing machine learning (ML) data towards specific goals and gaining better control of what modern ML systems will learn. We conclude with a discussion of the many avenues of future work, the limitations of data measurements, and how to leverage these measurement approaches in research and practice. △ Less

Submitted 13 February, 2023; v1 submitted 9 December, 2022; originally announced December 2022.

arXiv:2209.02409 [pdf]

Ultra-Low Temperature Li/CFx Batteries Enabled by Fast-transport and Anion-pairing Liquefied Gas Electrolytes

Authors: Yijie Yin, John Holoubek, Alex Liu, Baharak Sayahpour, Ganesh Raghavendran, Guorui Cai, Bing Han, Matthew Mayer, Noah B. Schorr, Timothy N. Lambert, Katharine L. Harrison, Weikang Li, Zheng Chen, Y. Shirley Meng

Abstract: Lithium fluorinated carbon is one of the most promising chemistries for high-energy-density primary energy storage systems in applications where rechargeability is not required. Though Li/CFx demonstrates high energy density under ambient conditions, achieving such a high energy density when exposed to subzero temperatures remains a challenge, particularly under high current density. Here, we repo… ▽ More Lithium fluorinated carbon is one of the most promising chemistries for high-energy-density primary energy storage systems in applications where rechargeability is not required. Though Li/CFx demonstrates high energy density under ambient conditions, achieving such a high energy density when exposed to subzero temperatures remains a challenge, particularly under high current density. Here, we report a liquefied gas electrolyte with an anion-pair solvation structure based on dimethyl ether with a low melting point and low viscosity, leading to high ionic conductivity between a wide temperature range. Besides that, through systematic X-ray photoelectron spectroscopy integrated with transmission electron microscopy characterizations, we evaluate the interface of CFx for low-temperature performance. We conclude that the fast transport and anion-pairing solvation structure of the electrolyte bring about reduced charge transfer resistance at low temperatures, which resulted in significantly enhanced performance of Li/CFx cells. Utilizing 50 mg/cm2 loading electrodes, the Li/CFx still displayed 1530 Wh/kg at reduced temperature. This work provides insights into the electrolyte design that may overcome the operational limits of batteries in extreme environments. △ Less

Submitted 1 September, 2022; originally announced September 2022.

arXiv:2208.02472 [pdf, other]

doi 10.1103/PhysRevResearch.4.033143

Space-time dual quantum Zeno effect: Interferometric engineering of open quantum system dynamics

Authors: Jhen-Dong Lin, Ching-Yu Huang, Neill Lambert, Guang-Yin Chen, Franco Nori, Yueh-Nan Chen

Abstract: Superposition of trajectories, which modify quantum evolutions by superposing paths through interferometry, has been utilized to enhance various quantum communication tasks. However, little is known about its impact from the viewpoint of open quantum systems. Thus, we examine this subject from the perspective of system-environment interactions. We show that the superposition of multiple trajectori… ▽ More Superposition of trajectories, which modify quantum evolutions by superposing paths through interferometry, has been utilized to enhance various quantum communication tasks. However, little is known about its impact from the viewpoint of open quantum systems. Thus, we examine this subject from the perspective of system-environment interactions. We show that the superposition of multiple trajectories can result in quantum state freezing, suggesting a space-time dual to the quantum Zeno effect. Moreover, non-trivial Dicke-like super(sub)radiance can be triggered without utilizing multi-atom correlations. △ Less

Submitted 4 August, 2022; originally announced August 2022.

Comments: 12 pages and 4 figures

Journal ref: PhysRevResearch.4.033143 (2022)

arXiv:2207.12156 [pdf, other]

doi 10.1038/s42005-023-01457-w

Sudden change of the photon output field marks phase transitions in the quantum Rabi model

Authors: Ye-Hong Chen, Yuan Qiu, Adam Miranowicz, Neill Lambert, Wei Qin, Roberto Stassi, Yan Xia, Shi-Biao Zheng, Franco Nori

Abstract: The experimental observation of quantum phase transitions predicted by the quantum Rabi model in quantum critical systems is usually challenging due to the lack of signature experimental observables associated with them. Here, we describe a method to identify the dynamical critical phenomenon in the quantum Rabi model consisting of a three-level atom and a cavity at the quantum phase transition. S… ▽ More The experimental observation of quantum phase transitions predicted by the quantum Rabi model in quantum critical systems is usually challenging due to the lack of signature experimental observables associated with them. Here, we describe a method to identify the dynamical critical phenomenon in the quantum Rabi model consisting of a three-level atom and a cavity at the quantum phase transition. Such a critical phenomenon manifests itself as a sudden change of steady-state output photons in the system driven by two classical fields, when both the atom and the cavity are initially unexcited. The process occurs as the high-frequency pump field is converted into the low-frequency Stokes field and multiple cavity photons in the normal phase, while this conversion cannot occur in the superradiant phase. The sudden change of steady-state output photons is an experimentally accessible measure to probe quantum phase transitions, as it does not require preparing the equilibrium state. △ Less

Submitted 6 January, 2024; v1 submitted 25 July, 2022; originally announced July 2022.

Comments: has been published in Communcations Physics as a regular article

Journal ref: Communications Physics, 7, 5 (2024)

arXiv:2207.05780 [pdf, other]

doi 10.1103/PhysRevResearch.5.033011

A pseudo-fermion method for the exact description of fermionic environments: from single-molecule electronics to Kondo resonance

Authors: Mauro Cirio, Neill Lambert, Pengfei Liang, Po-Chen Kuo, Yueh-Nan Chen, Paul Menczel, Ken Funo, Franco Nori

Abstract: We develop a discrete fermion approach for modelling the strong interaction of an arbitrary system interacting with continuum electronic reservoirs. The approach is based on a pseudo-fermion decomposition of the continuum bath correlation functions, and is only limited by the accuracy of this decomposition. We show that to obtain this decomposition one can allow for imaginary pseudo-fermion parame… ▽ More We develop a discrete fermion approach for modelling the strong interaction of an arbitrary system interacting with continuum electronic reservoirs. The approach is based on a pseudo-fermion decomposition of the continuum bath correlation functions, and is only limited by the accuracy of this decomposition. We show that to obtain this decomposition one can allow for imaginary pseudo-fermion parameters, and strong damping in individual pseudo-fermions, without introducing unwanted approximations. For a non-interacting single-resonant level, we benchmark our approach against an analytical solution and an exact hierachical-equations-of-motion approach. We also show that, for the interacting case, this simple method can capture the strongly correlated low-temperature physics of Kondo resonance. △ Less

Submitted 30 January, 2023; v1 submitted 12 July, 2022; originally announced July 2022.

Comments: 15 pages, 4 figures

Journal ref: Phys. Rev. Research 5, 033011 (2023)

arXiv:2207.05512 [pdf, other]

doi 10.1103/PhysRevLett.131.113601

Observation of a superradiant phase transition with emergent cat states

Authors: Ri-Hua Zheng, Wen Ning, Ye-Hong Chen, Jia-Hao Lü, Li-Tuo Shen, Kai Xu, Yu-Ran Zhang, Da Xu, Hekang Li, Yan Xia, Fan Wu, Zhen-Biao Yang, Adam Miranowicz, Neill Lambert, Dongning Zheng, Heng Fan, Franco Nori, Shi-Biao Zheng

Abstract: Superradiant phase transitions (SPTs) are important for understanding light-matter interactions at the quantum level, and play a central role in criticality-enhanced quantum sensing. So far, SPTs have been observed in driven-dissipative systems, but the emergent light fields did not show any nonclassical characteristic due to the presence of strong dissipation. Here we report an experimental demon… ▽ More Superradiant phase transitions (SPTs) are important for understanding light-matter interactions at the quantum level, and play a central role in criticality-enhanced quantum sensing. So far, SPTs have been observed in driven-dissipative systems, but the emergent light fields did not show any nonclassical characteristic due to the presence of strong dissipation. Here we report an experimental demonstration of the SPT featuring the emergence of a highly nonclassical photonic field, realized with a resonator coupled to a superconducting qubit, implementing the quantum Rabi model. We fully characterize the light-matter state by Wigner matrix tomography. The measured matrix elements exhibit quantum interference intrinsic of a photonic mesoscopic superposition, and reveal light-matter entanglement △ Less

Submitted 11 September, 2023; v1 submitted 12 July, 2022; originally announced July 2022.

Comments: 20 pages, 19 figures, 2 tables

Journal ref: Phys. Rev. Lett. 131, 113601 (2023)

arXiv:2206.00647 [pdf, other]

doi 10.1016/j.jpowsour.2022.231893

A Pseudo-Two-Dimensional (P2D) Model for FeS2 Conversion Cathode Batteries

Authors: Jeffrey S. Horner, Grace Whang, Igor V. Kolesnichenko, Timothy N. Lambert, Bruce S. Dunn, Scott A. Roberts

Abstract: Conversion cathode materials are gaining interest for secondary batteries due to their high theoretical energy and power density. However, practical application as a secondary battery material is currently limited by practical issues such as poor cyclability. To better understand these materials, we have developed a pseudo-two-dimensional model for conversion cathodes. We apply this model to FeS2… ▽ More Conversion cathode materials are gaining interest for secondary batteries due to their high theoretical energy and power density. However, practical application as a secondary battery material is currently limited by practical issues such as poor cyclability. To better understand these materials, we have developed a pseudo-two-dimensional model for conversion cathodes. We apply this model to FeS2 - a material that undergoes intercalation followed by conversion during discharge. The model is derived from the half-cell Doyle-Fuller-Newman model with additional loss terms added to reflect the converted shell resistance as the reaction progresses. We also account for polydisperse active material particles by incorporating a variable active surface area and effective particle radius. Using the model, we show that the leading loss mechanisms for FeS2 are associated with solid-state diffusion and electrical transport limitations through the converted shell material. The polydisperse simulations are also compared to a monodisperse system, and we show that polydispersity has very little effect on the intercalation behavior yet leads to capacity loss during the conversion reaction. We provide the code as an open-source Python Battery Mathematical Modelling (PyBaMM) model that can be used to identify performance limitations for other conversion cathode materials. △ Less

Submitted 16 July, 2022; v1 submitted 1 June, 2022; originally announced June 2022.

Journal ref: Journal of Power Sources Volume 544, 1 October 2022, 231893

arXiv:2204.10817 [pdf, other]

Reward Reports for Reinforcement Learning

Authors: Thomas Krendl Gilbert, Nathan Lambert, Sarah Dean, Tom Zick, Aaron Snoswell

Abstract: Building systems that are good for society in the face of complex societal effects requires a dynamic approach. Recent approaches to machine learning (ML) documentation have demonstrated the promise of discursive frameworks for deliberation about these complexities. However, these developments have been grounded in a static ML paradigm, leaving the role of feedback and post-deployment performance… ▽ More Building systems that are good for society in the face of complex societal effects requires a dynamic approach. Recent approaches to machine learning (ML) documentation have demonstrated the promise of discursive frameworks for deliberation about these complexities. However, these developments have been grounded in a static ML paradigm, leaving the role of feedback and post-deployment performance unexamined. Meanwhile, recent work in reinforcement learning has shown that the effects of feedback and optimization objectives on system behavior can be wide-ranging and unpredictable. In this paper we sketch a framework for documenting deployed and iteratively updated learning systems, which we call Reward Reports. Taking inspiration from various contributions to the technical literature on reinforcement learning, we outline Reward Reports as living documents that track updates to design choices and assumptions behind what a particular automated system is optimizing for. They are intended to track dynamic phenomena arising from system deployment, rather than merely static properties of models or data. After presenting the elements of a Reward Report, we discuss a concrete example: Meta's BlenderBot 3 chatbot. Several others for game-playing (DeepMind's MuZero), content recommendation (MovieLens), and traffic control (Project Flow) are included in the appendix. △ Less

Submitted 19 March, 2023; v1 submitted 22 April, 2022; originally announced April 2022.

arXiv:2203.09637 [pdf, other]

Investigating Compounding Prediction Errors in Learned Dynamics Models

Authors: Nathan Lambert, Kristofer Pister, Roberto Calandra

Abstract: Accurately predicting the consequences of agents' actions is a key prerequisite for planning in robotic control. Model-based reinforcement learning (MBRL) is one paradigm which relies on the iterative learning and prediction of state-action transitions to solve a task. Deep MBRL has become a popular candidate, using a neural network to learn a dynamics model that predicts with each pass from high-… ▽ More Accurately predicting the consequences of agents' actions is a key prerequisite for planning in robotic control. Model-based reinforcement learning (MBRL) is one paradigm which relies on the iterative learning and prediction of state-action transitions to solve a task. Deep MBRL has become a popular candidate, using a neural network to learn a dynamics model that predicts with each pass from high-dimensional states to actions. These "one-step" predictions are known to become inaccurate over longer horizons of composed prediction - called the compounding error problem. Given the prevalence of the compounding error problem in MBRL and related fields of data-driven control, we set out to understand the properties of and conditions causing these long-horizon errors. In this paper, we explore the effects of subcomponents of a control problem on long term prediction error: including choosing a system, collecting data, and training a model. These detailed quantitative studies on simulated and real-world data show that the underlying dynamics of a system are the strongest factor determining the shape and magnitude of prediction error. Given a clearer understanding of compounding prediction error, researchers can implement new types of models beyond "one-step" that are more useful for control. △ Less

Submitted 17 March, 2022; originally announced March 2022.

Comments: 25 pages, 19 figures

arXiv:2202.05716 [pdf]

Choices, Risks, and Reward Reports: Charting Public Policy for Reinforcement Learning Systems

Authors: Thomas Krendl Gilbert, Sarah Dean, Tom Zick, Nathan Lambert

Abstract: In the long term, reinforcement learning (RL) is considered by many AI theorists to be the most promising path to artificial general intelligence. This places RL practitioners in a position to design systems that have never existed before and lack prior documentation in law and policy. Public agencies could intervene on complex dynamics that were previously too opaque to deliberate about, and long… ▽ More In the long term, reinforcement learning (RL) is considered by many AI theorists to be the most promising path to artificial general intelligence. This places RL practitioners in a position to design systems that have never existed before and lack prior documentation in law and policy. Public agencies could intervene on complex dynamics that were previously too opaque to deliberate about, and long-held policy ambitions would finally be made tractable. In this whitepaper we illustrate this potential and how it might be technically enacted in the domains of energy infrastructure, social media recommender systems, and transportation. Alongside these unprecedented interventions come new forms of risk that exacerbate the harms already generated by standard machine learning tools. We correspondingly present a new typology of risks arising from RL design choices, falling under four categories: scoping the horizon, defining rewards, pruning information, and training multiple agents. Rather than allowing RL systems to unilaterally reshape human domains, policymakers need new mechanisms for the rule of reason, foreseeability, and interoperability that match the risks these systems pose. We argue that criteria for these choices may be drawn from emerging subfields within antitrust, tort, and administrative law. It will then be possible for courts, federal and state agencies, and non-governmental organizations to play more active roles in RL specification and evaluation. Building on the "model cards" and "datasheets" frameworks proposed by Mitchell et al. and Gebru et al., we argue the need for Reward Reports for AI systems. Reward Reports are living documents for proposed RL deployments that demarcate design choices. △ Less

Submitted 11 February, 2022; originally announced February 2022.

Comments: 60 pages

Journal ref: Center for Long Term Cybersecurity Whitepaper Series Feb. 2022; see release https://cltc.berkeley.edu/2022/02/08/reward-reports/

arXiv:2201.11861 [pdf, other]

The Challenges of Exploration for Offline Reinforcement Learning

Authors: Nathan Lambert, Markus Wulfmeier, William Whitney, Arunkumar Byravan, Michael Bloesch, Vibhavari Dasagi, Tim Hertweck, Martin Riedmiller

Abstract: Offline Reinforcement Learning (ORL) enablesus to separately study the two interlinked processes of reinforcement learning: collecting informative experience and inferring optimal behaviour. The second step has been widely studied in the offline setting, but just as critical to data-efficient RL is the collection of informative data. The task-agnostic setting for data collection, where the task is… ▽ More Offline Reinforcement Learning (ORL) enablesus to separately study the two interlinked processes of reinforcement learning: collecting informative experience and inferring optimal behaviour. The second step has been widely studied in the offline setting, but just as critical to data-efficient RL is the collection of informative data. The task-agnostic setting for data collection, where the task is not known a priori, is of particular interest due to the possibility of collecting a single dataset and using it to solve several downstream tasks as they arise. We investigate this setting via curiosity-based intrinsic motivation, a family of exploration methods which encourage the agent to explore those states or transitions it has not yet learned to model. With Explore2Offline, we propose to evaluate the quality of collected data by transferring the collected data and inferring policies with reward relabelling and standard offline RL algorithms. We evaluate a wide variety of data collection strategies, including a new exploration agent, Intrinsic Model Predictive Control (IMPC), using this scheme and demonstrate their performance on various tasks. We use this decoupled framework to strengthen intuitions about exploration and the data prerequisites for effective offline RL. △ Less

Submitted 18 February, 2022; v1 submitted 27 January, 2022; originally announced January 2022.

arXiv:2112.14860 [pdf, ps, other]

Non-Lorentzian $SU(1,n)$ Spacetime Symmetry in Various Dimensions

Authors: Neil Lambert, Rishi Mouland, Tristan Orchard

Abstract: We discuss non-Lorentzian Lagrangian field theories in $2n-1$ dimensions that admit an $SU(1,n)$ spacetime symmetry which includes a scaling transformation. These can be obtained by a conformal compactification of a $2n$-dimensional Minkowskian conformal field theory. We discuss the symmetry algebra, its representations including primary fields and unitarity bounds. We also give various examples o… ▽ More We discuss non-Lorentzian Lagrangian field theories in $2n-1$ dimensions that admit an $SU(1,n)$ spacetime symmetry which includes a scaling transformation. These can be obtained by a conformal compactification of a $2n$-dimensional Minkowskian conformal field theory. We discuss the symmetry algebra, its representations including primary fields and unitarity bounds. We also give various examples of free theories in a variety of dimensions and a discussion of how to reconstruct the parent $2n$-dimensional theory. △ Less

Submitted 4 March, 2022; v1 submitted 29 December, 2021; originally announced December 2021.

Comments: 31 pages, appendix added. Matches published version

arXiv:2112.00040 [pdf, ps, other]

doi 10.1007/JHEP04(2022)115

A Path Integral for the Chiral-Form Partition Function

Authors: Enrico Andriolo, Neil Lambert, Tristan Orchard, Constantinos Papageorgakis

Abstract: Starting from the recent action proposed by Sen [1,2], we evaluate the partition function of the compact chiral boson on a two-dimensional torus using a path-integral formulation. Crucially, we use a Wick-rotation procedure obtained from a complex deformation of the physical spacetime metric. This directly reproduces the expected result including general characteristics for the theta functions. We… ▽ More Starting from the recent action proposed by Sen [1,2], we evaluate the partition function of the compact chiral boson on a two-dimensional torus using a path-integral formulation. Crucially, we use a Wick-rotation procedure obtained from a complex deformation of the physical spacetime metric. This directly reproduces the expected result including general characteristics for the theta functions. We also present results for the chiral 2-form potential in six dimensions which can be readily extended to 4k+2 dimensions. △ Less

Submitted 14 April, 2022; v1 submitted 30 November, 2021; originally announced December 2021.

Comments: 40 pages; v2: minor corrections; v3: references added and presentation streamlined

arXiv:2110.14441 [pdf, other]

doi 10.1007/JHEP04(2022)114

Gauging Discrete Symmetries of $T_N$-theories in Five Dimensions

Authors: Bobby Acharya, Neil Lambert, Marwan Najjar, Eirik Eik Svanes, Jiahua Tian

Abstract: We study the gauging of a discrete $\mathbb{Z}_3$ symmetry in the five-dimensional superconformal $T_N$ theories. We argue that this leads to an infinite sequence of five-dimensional superconformal theories with either $E_6 \times SU(N)$ or $SU(3)\times SU(N)$ global symmetry group. In the $M$-theory realisation of $T_N$ theories as residing at the origin in the Calabi-Yau orbifolds… ▽ More We study the gauging of a discrete $\mathbb{Z}_3$ symmetry in the five-dimensional superconformal $T_N$ theories. We argue that this leads to an infinite sequence of five-dimensional superconformal theories with either $E_6 \times SU(N)$ or $SU(3)\times SU(N)$ global symmetry group. In the $M$-theory realisation of $T_N$ theories as residing at the origin in the Calabi-Yau orbifolds ${\mathbb{C}^3 \over {\mathbb{Z}_N \times \mathbb{Z}_N}}$ we identify the $\mathbb{Z}_3$ symmetry geometrically and the new theories arise from $M$-theory on the non-Abelian orbifolds $({\mathbb{C}^3 \over {\mathbb{Z}_N \times \mathbb{Z}_N}})/{\mathbb{Z}_3}$. On the other hand, in the $(p,q)$ 5-brane web description in Type IIB theory, the symmetry combines the $U$-duality symmetry with a rotation in space, defining a so-called $U$-fold background, where the $E_6$ symmetry is manifest. △ Less

Submitted 20 March, 2022; v1 submitted 27 October, 2021; originally announced October 2021.

arXiv:2110.02674 [pdf, other]

Unveiling and veiling a Schrödinger cat state from the vacuum

Authors: Roberto Stassi, Mauro Cirio, Ken Funo, Neill Lambert, Jorge Puebla, Franco Nori

Abstract: Deep in the ultrastrong light-matter coupling regime, it has been predicted that the ground state of a two-level atom interacting with a cavity mode takes the form of a "virtual" Schrödinger cat entangled state between light and matter. We propose a method to convert this Schrödinger cat state from virtual to real, and back again, by driving the atom with optimally chosen pulses. Our system consis… ▽ More Deep in the ultrastrong light-matter coupling regime, it has been predicted that the ground state of a two-level atom interacting with a cavity mode takes the form of a "virtual" Schrödinger cat entangled state between light and matter. We propose a method to convert this Schrödinger cat state from virtual to real, and back again, by driving the atom with optimally chosen pulses. Our system consists of a four-level atom, with two of these levels ultrastrongly coupled to a cavity mode. We show that the Schrödinger cat state can be converted between virtual and real by making use of either an ideal ultrafast pulse or a multi-tone π-pulse. In addition to allowing us to observe these unusual virtual states this method could also be used to generate entangled cat states on demand for quantum information processing. △ Less

Submitted 6 October, 2021; originally announced October 2021.

Showing 1–50 of 239 results for author: Lambert, N