Skip to main content

Showing 1–19 of 19 results for author: Mysore, S

  1. arXiv:2406.19928  [pdf, other

    cs.CL cs.HC cs.IR

    Interactive Topic Models with Optimal Transport

    Authors: Garima Dhanania, Sheshera Mysore, Chau Minh Pham, Mohit Iyyer, Hamed Zamani, Andrew McCallum

    Abstract: Topic models are widely used to analyze document collections. While they are valuable for discovering latent topics in a corpus when analysts are unfamiliar with the corpus, analysts also commonly start with an understanding of the content present in a corpus. This may be through categories obtained from an initial pass over the corpus or a desire to analyze the corpus through a predefined set of… ▽ More

    Submitted 28 June, 2024; originally announced June 2024.

    Comments: Pre-print; Work in progress

  2. arXiv:2405.14139  [pdf, other

    q-bio.NC cs.LG cs.NE

    Contribute to balance, wire in accordance: Emergence of backpropagation from a simple, bio-plausible neuroplasticity rule

    Authors: Xinhao Fan, Shreesh P Mysore

    Abstract: Backpropagation (BP) has been pivotal in advancing machine learning and remains essential in computational applications and comparative studies of biological and artificial neural networks. Despite its widespread use, the implementation of BP in the brain remains elusive, and its biological plausibility is often questioned due to inherent issues such as the need for symmetry of weights between for… ▽ More

    Submitted 22 May, 2024; originally announced May 2024.

  3. arXiv:2405.04656  [pdf, other

    cs.HC

    Corporate Communication Companion (CCC): An LLM-empowered Writing Assistant for Workplace Social Media

    Authors: Zhuoran Lu, Sheshera Mysore, Tara Safavi, Jennifer Neville, Longqi Yang, Mengting Wan

    Abstract: Workplace social media platforms enable employees to cultivate their professional image and connect with colleagues in a semi-formal environment. While semi-formal corporate communication poses a unique set of challenges, large language models (LLMs) have shown great promise in helping users draft and edit their social media posts. However, LLMs may fail to capture individualized tones and voices… ▽ More

    Submitted 7 May, 2024; originally announced May 2024.

  4. arXiv:2311.09180  [pdf, other

    cs.CL cs.HC cs.IR

    PEARL: Personalizing Large Language Model Writing Assistants with Generation-Calibrated Retrievers

    Authors: Sheshera Mysore, Zhuoran Lu, Mengting Wan, Longqi Yang, Steve Menezes, Tina Baghaee, Emmanuel Barajas Gonzalez, Jennifer Neville, Tara Safavi

    Abstract: Powerful large language models have facilitated the development of writing assistants that promise to significantly improve the quality and efficiency of composition and communication. However, a barrier to effective assistance is the lack of personalization in LLM outputs to the author's communication style and specialized knowledge. In this paper, we address this challenge by proposing PEARL, a… ▽ More

    Submitted 15 November, 2023; originally announced November 2023.

    Comments: Pre-print, work in progress

  5. arXiv:2306.02250  [pdf, other

    cs.IR cs.CL

    Large Language Model Augmented Narrative Driven Recommendations

    Authors: Sheshera Mysore, Andrew McCallum, Hamed Zamani

    Abstract: Narrative-driven recommendation (NDR) presents an information access problem where users solicit recommendations with verbose descriptions of their preferences and context, for example, travelers soliciting recommendations for points of interest while describing their likes/dislikes and travel circumstances. These requests are increasingly important with the rise of natural language-based conversa… ▽ More

    Submitted 21 July, 2023; v1 submitted 3 June, 2023; originally announced June 2023.

    Comments: RecSys 2023 Camera-ready

  6. arXiv:2304.11406  [pdf, other

    cs.CL

    LaMP: When Large Language Models Meet Personalization

    Authors: Alireza Salemi, Sheshera Mysore, Michael Bendersky, Hamed Zamani

    Abstract: This paper highlights the importance of personalization in large language models and introduces the LaMP benchmark -- a novel benchmark for training and evaluating language models for producing personalized outputs. LaMP offers a comprehensive evaluation framework with diverse language tasks and multiple entries for each user profile. It consists of seven personalized tasks, spanning three text cl… ▽ More

    Submitted 4 June, 2024; v1 submitted 22 April, 2023; originally announced April 2023.

  7. arXiv:2304.04250  [pdf, other

    cs.IR cs.CL cs.HC cs.LG

    Editable User Profiles for Controllable Text Recommendation

    Authors: Sheshera Mysore, Mahmood Jasim, Andrew McCallum, Hamed Zamani

    Abstract: Methods for making high-quality recommendations often rely on learning latent representations from interaction data. These methods, while performant, do not provide ready mechanisms for users to control the recommendation they receive. Our work tackles this problem by proposing LACE, a novel concept value bottleneck model for controllable text recommendations. LACE represents each user with a succ… ▽ More

    Submitted 16 October, 2023; v1 submitted 9 April, 2023; originally announced April 2023.

    Comments: SIGIR-2023 paper with extended results

  8. arXiv:2301.06987  [pdf, other

    cs.RO cs.LG

    The SwaNNFlight System: On-the-Fly Sim-to-Real Adaptation via Anchored Learning

    Authors: Bassel El Mabsout, Shahin Roozkhosh, Siddharth Mysore, Kate Saenko, Renato Mancuso

    Abstract: Reinforcement Learning (RL) agents trained in simulated environments and then deployed in the real world are often sensitive to the differences in dynamics presented, commonly termed the sim-to-real gap. With the goal of minimizing this gap on resource-constrained embedded systems, we train and live-adapt agents on quadrotors built from off-the-shelf hardware. In achieving this we developed three… ▽ More

    Submitted 17 January, 2023; originally announced January 2023.

  9. arXiv:2301.03774  [pdf, other

    cs.IR cs.AI cs.DL cs.HC

    How Data Scientists Review the Scholarly Literature

    Authors: Sheshera Mysore, Mahmood Jasim, Haoru Song, Sarah Akbar, Andre Kenneth Chase Randall, Narges Mahyar

    Abstract: Keeping up with the research literature plays an important role in the workflow of scientists - allowing them to understand a field, formulate the problems they focus on, and develop the solutions that they contribute, which in turn shape the nature of the discipline. In this paper, we examine the literature review practices of data scientists. Data science represents a field seeing an exponential… ▽ More

    Submitted 9 January, 2023; originally announced January 2023.

    Comments: CHIIR 2023 camera-ready

  10. arXiv:2206.01328  [pdf, other

    cs.IR cs.CL

    Augmenting Scientific Creativity with Retrieval across Knowledge Domains

    Authors: Hyeonsu B. Kang, Sheshera Mysore, Kevin Huang, Haw-Shiuan Chang, Thorben Prein, Andrew McCallum, Aniket Kittur, Elsa Olivetti

    Abstract: Exposure to ideas in domains outside a scientist's own may benefit her in reformulating existing research problems in novel ways and discovering new application domains for existing solution ideas. While improved performance in scholarly search engines can help scientists efficiently identify relevant advances in domains they may already be familiar with, it may fall short of helping them explore… ▽ More

    Submitted 14 December, 2022; v1 submitted 2 June, 2022; originally announced June 2022.

    Comments: NLP+HCI Workshop at NAACL 2022

  11. arXiv:2111.08366  [pdf, other

    cs.CL cs.IR

    Multi-Vector Models with Textual Guidance for Fine-Grained Scientific Document Similarity

    Authors: Sheshera Mysore, Arman Cohan, Tom Hope

    Abstract: We present a new scientific document similarity model based on matching fine-grained aspects of texts. To train our model, we exploit a naturally-occurring source of supervision: sentences in the full-text of papers that cite multiple papers together (co-citations). Such co-citations not only reflect close paper relatedness, but also provide textual descriptions of how the co-cited papers are rela… ▽ More

    Submitted 4 May, 2022; v1 submitted 16 November, 2021; originally announced November 2021.

    Comments: NAACL 2022 camera-ready

  12. arXiv:2110.05260  [pdf, other

    cond-mat.mtrl-sci cs.GR cs.LG

    Designing Composites with Target Effective Young's Modulus using Reinforcement Learning

    Authors: Aldair E. Gongora, Siddharth Mysore, Beichen Li, Wan Shou, Wojciech Matusik, Elise F. Morgan, Keith A. Brown, Emily Whiting

    Abstract: Advancements in additive manufacturing have enabled design and fabrication of materials and structures not previously realizable. In particular, the design space of composite materials and structures has vastly expanded, and the resulting size and complexity has challenged traditional design methodologies, such as brute force exploration and one factor at a time (OFAT) exploration, to find optimum… ▽ More

    Submitted 7 October, 2021; originally announced October 2021.

    Comments: Accepted to the Symposium on Computational Fabrication (SCF) 2021

  13. arXiv:2103.12906  [pdf, other

    cs.IR cs.CL

    CSFCube -- A Test Collection of Computer Science Research Articles for Faceted Query by Example

    Authors: Sheshera Mysore, Tim O'Gorman, Andrew McCallum, Hamed Zamani

    Abstract: Query by Example is a well-known information retrieval task in which a document is chosen by the user as the search query and the goal is to retrieve relevant documents from a large collection. However, a document often covers multiple aspects of a topic. To address this scenario we introduce the task of faceted Query by Example in which users can also specify a finer grained aspect in addition to… ▽ More

    Submitted 7 November, 2021; v1 submitted 23 March, 2021; originally announced March 2021.

    Comments: Accepted to the NeurIPS 2021 Track on Datasets and Benchmarks

  14. arXiv:2102.11893  [pdf, other

    cs.LG cs.RO

    Honey, I Shrunk The Actor: A Case Study on Preserving Performance with Smaller Actors in Actor-Critic RL

    Authors: Siddharth Mysore, Bassel Mabsout, Renato Mancuso, Kate Saenko

    Abstract: Actors and critics in actor-critic reinforcement learning algorithms are functionally separate, yet they often use the same network architectures. This case study explores the performance impact of network sizes when considering actor and critic architectures independently. By relaxing the assumption of architectural symmetry, it is often possible for smaller actors to achieve comparable policy pe… ▽ More

    Submitted 18 June, 2021; v1 submitted 23 February, 2021; originally announced February 2021.

    Comments: Accepted to the IEEE Conference on Games 2021

  15. arXiv:2012.06656  [pdf, other

    cs.RO eess.SY

    How to Train your Quadrotor: A Framework for Consistently Smooth and Responsive Flight Control via Reinforcement Learning

    Authors: Siddharth Mysore, Bassel Mabsout, Kate Saenko, Renato Mancuso

    Abstract: We focus on the problem of reliably training Reinforcement Learning (RL) models (agents) for stable low-level control in embedded systems and test our methods on a high-performance, custom-built quadrotor platform. A common but often under-studied problem in developing RL agents for continuous control is that the control policies developed are not always smooth. This lack of smoothness can be a ma… ▽ More

    Submitted 22 February, 2021; v1 submitted 11 December, 2020; originally announced December 2020.

    Journal ref: ACM Transactions on Cyber-Physical Systems, Volume 5, Issue 4, October 2021

  16. arXiv:2012.06644  [pdf, other

    cs.RO cs.LG eess.SY

    Regularizing Action Policies for Smooth Control with Reinforcement Learning

    Authors: Siddharth Mysore, Bassel Mabsout, Renato Mancuso, Kate Saenko

    Abstract: A critical problem with the practical utility of controllers trained with deep Reinforcement Learning (RL) is the notable lack of smoothness in the actions learned by the RL policies. This trend often presents itself in the form of control signal oscillation and can result in poor control, high power consumption, and undue system wear. We introduce Conditioning for Action Policy Smoothness (CAPS),… ▽ More

    Submitted 26 May, 2021; v1 submitted 11 December, 2020; originally announced December 2020.

    Comments: Accepted for publication to ICRA 2021

  17. arXiv:1905.06939  [pdf, other

    cs.CL cs.LG

    The Materials Science Procedural Text Corpus: Annotating Materials Synthesis Procedures with Shallow Semantic Structures

    Authors: Sheshera Mysore, Zach Jensen, Edward Kim, Kevin Huang, Haw-Shiuan Chang, Emma Strubell, Jeffrey Flanigan, Andrew McCallum, Elsa Olivetti

    Abstract: Materials science literature contains millions of materials synthesis procedures described in unstructured natural language text. Large-scale analysis of these synthesis procedures would facilitate deeper scientific understanding of materials synthesis and enable automated synthesis planning. Such analysis requires extracting structured representations of synthesis procedures from the raw text as… ▽ More

    Submitted 13 July, 2019; v1 submitted 16 May, 2019; originally announced May 2019.

    Comments: Accepted as a long paper at the Linguistic Annotation Workshop (LAW) at ACL 2019

  18. arXiv:1901.00032  [pdf, other

    cond-mat.mtrl-sci cs.AI stat.ML

    Inorganic Materials Synthesis Planning with Literature-Trained Neural Networks

    Authors: Edward Kim, Zach Jensen, Alexander van Grootel, Kevin Huang, Matthew Staib, Sheshera Mysore, Haw-Shiuan Chang, Emma Strubell, Andrew McCallum, Stefanie Jegelka, Elsa Olivetti

    Abstract: Leveraging new data sources is a key step in accelerating the pace of materials design and discovery. To complement the strides in synthesis planning driven by historical, experimental, and computed data, we present an automated method for connecting scientific literature to synthesis insights. Starting from natural language text, we apply word embeddings from language models, which are fed into a… ▽ More

    Submitted 17 February, 2019; v1 submitted 31 December, 2018; originally announced January 2019.

    Comments: Added new funding support to the acknowledgments section in this version

  19. arXiv:1711.06872  [pdf, other

    cs.CL

    Automatically Extracting Action Graphs from Materials Science Synthesis Procedures

    Authors: Sheshera Mysore, Edward Kim, Emma Strubell, Ao Liu, Haw-Shiuan Chang, Srikrishna Kompella, Kevin Huang, Andrew McCallum, Elsa Olivetti

    Abstract: Computational synthesis planning approaches have achieved recent success in organic chemistry, where tabulated synthesis procedures are readily available for supervised learning. The syntheses of inorganic materials, however, exist primarily as natural language narratives contained within scientific journal articles. This synthesis information must first be extracted from the text in order to enab… ▽ More

    Submitted 28 November, 2017; v1 submitted 18 November, 2017; originally announced November 2017.

    Comments: NIPS Workshop on Machine Learning for Molecules and Materials