Showing 1–9 of 9 results for author: Dalton, A

  1. arXiv:2310.04610  [pdf, other]

    cs.AI cs.LG

    DeepSpeed4Science Initiative: Enabling Large-Scale Scientific Discovery through Sophisticated AI System Technologies

    Authors: Shuaiwen Leon Song, Bonnie Kruft, Minjia Zhang, Conglong Li, Shiyang Chen, Chengming Zhang, Masahiro Tanaka, Xiaoxia Wu, Jeff Rasley, Ammar Ahmad Awan, Connor Holmes, Martin Cai, Adam Ghanem, Zhongzhu Zhou, Yuxiong He, Pete Luferenko, Divya Kumar, Jonathan Weyn, Ruixiong Zhang, Sylwester Klocek, Volodymyr Vragov, Mohammed AlQuraishi, Gustaf Ahdritz, Christina Floristean, Cristina Negri, et al. (67 additional authors not shown)

    Abstract: In the upcoming decade, deep learning may revolutionize the natural sciences, enhancing our capacity to model and predict natural occurrences. This could herald a new era of scientific exploration, bringing significant advancements across sectors from drug development to renewable energy. To answer this call, we present DeepSpeed4Science initiative (deepspeed4science.ai) which aims to build unique…

    Submitted 11 October, 2023; v1 submitted 6 October, 2023; originally announced October 2023.

  2. arXiv:2203.10659  [pdf, other]

    cs.CL cs.AI

    From Stance to Concern: Adaptation of Propositional Analysis to New Tasks and Domains

    Authors: Brodie Mather, Bonnie J Dorr, Adam Dalton, William de Beaumont, Owen Rambow, Sonja M. Schmer-Galunder

    Abstract: We present a generalized paradigm for adaptation of propositional analysis (predicate-argument pairs) to new tasks and domains. We leverage an analogy between stances (belief-driven sentiment) and concerns (topical issues with moral dimensions/endorsements) to produce an explanatory representation. A key contribution is the combination of semi-automatic resource building for extraction of domain-d…

    Submitted 20 March, 2022; originally announced March 2022.

    Comments: Accepted to Findings of the Association for Computational Linguistics, 2022

    MSC Class: 68T50 ACM Class: I.2.7

  3. arXiv:2009.12506  [pdf, other]

    cs.CL

    Learning to Plan and Realize Separately for Open-Ended Dialogue Systems

    Authors: Sashank Santhanam, Zhuo Cheng, Brodie Mather, Bonnie Dorr, Archna Bhatia, Bryanna Hebenstreit, Alan Zemel, Adam Dalton, Tomek Strzalkowski, Samira Shaikh

    Abstract: Achieving true human-like ability to conduct a conversation remains an elusive goal for open-ended dialogue systems. We posit this is because extant approaches towards natural language generation (NLG) are typically construed as end-to-end architectures that do not adequately model human generation processes. To investigate, we decouple generation into two separate phases: planning and realization…

    Submitted 4 October, 2020; v1 submitted 25 September, 2020; originally announced September 2020.

    Comments: Accepted at EMNLP 2020 (Findings)

  4. arXiv:2004.09662  [pdf, other]

    cs.CL cs.CR

    The Panacea Threat Intelligence and Active Defense Platform

    Authors: Adam Dalton, Ehsan Aghaei, Ehab Al-Shaer, Archna Bhatia, Esteban Castillo, Zhuo Cheng, Sreekar Dhaduvai, Qi Duan, Md Mazharul Islam, Younes Karimi, Amir Masoumzadeh, Brodie Mather, Sashank Santhanam, Samira Shaikh, Tomek Strzalkowski, Bonnie J. Dorr

    Abstract: We describe Panacea, a system that supports natural language processing (NLP) components for active defenses against social engineering attacks. We deploy a pipeline of human language technology, including Ask and Framing Detection, Named Entity Recognition, Dialogue Engineering, and Stylometry. Panacea processes modern message formats through a plug-in architecture to accommodate innovative appro…

    Submitted 20 April, 2020; originally announced April 2020.

    Comments: Accepted at STOC

  5. arXiv:2004.09050  [pdf, ps, other]

    cs.CL

    Adaptation of a Lexical Organization for Social Engineering Detection and Response Generation

    Authors: Archna Bhatia, Adam Dalton, Brodie Mather, Sashank Santhanam, Samira Shaikh, Alan Zemel, Tomek Strzalkowski, Bonnie J. Dorr

    Abstract: We present a paradigm for extensible lexicon development based on Lexical Conceptual Structure to support social engineering detection and response generation. We leverage the central notions of ask (elicitation of behaviors such as providing access to money) and framing (risk/reward implied by the ask). We demonstrate improvements in ask/framing detection through refinements to our lexical organi…

    Submitted 20 April, 2020; originally announced April 2020.

    Comments: Accepted at STOC

  6. arXiv:2002.10931  [pdf, other]

    cs.CL

    Detecting Asks in SE attacks: Impact of Linguistic and Structural Knowledge

    Authors: Bonnie J. Dorr, Archna Bhatia, Adam Dalton, Brodie Mather, Bryanna Hebenstreit, Sashank Santhanam, Zhuo Cheng, Samira Shaikh, Alan Zemel, Tomek Strzalkowski

    Abstract: Social engineers attempt to manipulate users into undertaking actions such as downloading malware by clicking links or providing access to money or sensitive information. Natural language processing, computational sociolinguistics, and media-specific structural clues provide a means for detecting both the ask (e.g., buy gift card) and the risk/reward implied by the ask, which we call framing (e.g.…

    Submitted 25 February, 2020; originally announced February 2020.

    Comments: Accepted at AAAI 2020

  7. arXiv:2002.00120  [pdf, ps, other]

    stat.ML cs.CV cs.LG

    On the Consistency of Optimal Bayesian Feature Selection in the Presence of Correlations

    Authors: Ali Foroughi pour, Lori A. Dalton

    Abstract: Optimal Bayesian feature selection (OBFS) is a multivariate supervised screening method designed from the ground up for biomarker discovery. In this work, we prove that Gaussian OBFS is strongly consistent under mild conditions, and provide rates of convergence for key posteriors in the framework. These results are of enormous importance, since they identify precisely what features are selected by…

    Submitted 31 January, 2020; originally announced February 2020.

    Comments: 33 pages, 1 figure

    MSC Class: 62F15; 62C10; 62F07; 92C37

  8. arXiv:1909.03637  [pdf, other]

    stat.ML cs.CV cs.LG

    Theory of Optimal Bayesian Feature Filtering

    Authors: Ali Foroughi pour, Lori A. Dalton

    Abstract: Optimal Bayesian feature filtering (OBF) is a supervised screening method designed for biomarker discovery. In this article, we prove two major theoretical properties of OBF. First, optimal Bayesian feature selection under a general family of Bayesian models reduces to filtering if and only if the underlying Bayesian model assumes all features are mutually independent. Therefore, OBF is optimal if…

    Submitted 9 September, 2019; originally announced September 2019.

    Comments: 51 pages, 5 figures, 6 tables

    MSC Class: 62F15; 62C10; 62F07; 92C37

  9. arXiv:1806.00672  [pdf, ps, other]

    stat.ML cs.CV cs.LG eess.IV

    Optimal Clustering under Uncertainty

    Authors: Lori A. Dalton, Marco E. Benalcázar, Edward R. Dougherty

    Abstract: Classical clustering algorithms typically either lack an underlying probability framework to make them predictive or focus on parameter estimation rather than defining and minimizing a notion of error. Recent work addresses these issues by developing a probabilistic framework based on the theory of random labeled point processes and characterizing a Bayes clusterer that minimizes the number of mis…

    Submitted 2 June, 2018; originally announced June 2018.

    Comments: 19 pages, 5 eps figures, 1 table

    MSC Class: 62H30; 62F35  ACM Class: I.5.3; G.1.6; I.4.9; I.2.6