Skip to main content

Showing 1–5 of 5 results for author: Pham, C M

  1. arXiv:2407.08792  [pdf, other

    cs.CR

    ProxyGPT: Enabling Anonymous Queries in AI Chatbots with (Un)Trustworthy Browser Proxies

    Authors: Dzung Pham, Jade Sheffey, Chau Minh Pham, Amir Houmansadr

    Abstract: AI-powered chatbots (ChatGPT, Claude, etc.) require users to create an account using their email and phone number, thereby linking their personally identifiable information to their conversational data and usage patterns. As these chatbots are increasingly being used for tasks involving sensitive information, privacy concerns have been raised about how chatbot providers handle user data. To addres… ▽ More

    Submitted 11 July, 2024; originally announced July 2024.

  2. arXiv:2406.19928  [pdf, other

    cs.CL cs.HC cs.IR

    Interactive Topic Models with Optimal Transport

    Authors: Garima Dhanania, Sheshera Mysore, Chau Minh Pham, Mohit Iyyer, Hamed Zamani, Andrew McCallum

    Abstract: Topic models are widely used to analyze document collections. While they are valuable for discovering latent topics in a corpus when analysts are unfamiliar with the corpus, analysts also commonly start with an understanding of the content present in a corpus. This may be through categories obtained from an initial pass over the corpus or a desire to analyze the corpus through a predefined set of… ▽ More

    Submitted 28 June, 2024; originally announced June 2024.

    Comments: Pre-print; Work in progress

  3. arXiv:2406.19371  [pdf, other

    cs.CL

    Suri: Multi-constraint Instruction Following for Long-form Text Generation

    Authors: Chau Minh Pham, Simeng Sun, Mohit Iyyer

    Abstract: Existing research on instruction following largely focuses on tasks with simple instructions and short responses. In this work, we explore multi-constraint instruction following for generating long-form text. We create Suri, a dataset with 20K human-written long-form texts paired with LLM-generated backtranslated instructions that contain multiple complex constraints. Because of prohibitive challe… ▽ More

    Submitted 27 June, 2024; originally announced June 2024.

  4. arXiv:2311.01449  [pdf, other

    cs.CL

    TopicGPT: A Prompt-based Topic Modeling Framework

    Authors: Chau Minh Pham, Alexander Hoyle, Simeng Sun, Philip Resnik, Mohit Iyyer

    Abstract: Topic modeling is a well-established technique for exploring text corpora. Conventional topic models (e.g., LDA) represent topics as bags of words that often require "reading the tea leaves" to interpret; additionally, they offer users minimal control over the formatting and specificity of resulting topics. To tackle these issues, we introduce TopicGPT, a prompt-based framework that uses large lan… ▽ More

    Submitted 1 April, 2024; v1 submitted 2 November, 2023; originally announced November 2023.

    Comments: Accepted to NAACL 2024 (Main conference)

  5. arXiv:2306.03280  [pdf, other

    cs.HC

    AHA!: Facilitating AI Impact Assessment by Generating Examples of Harms

    Authors: Zana Buçinca, Chau Minh Pham, Maurice Jakesch, Marco Tulio Ribeiro, Alexandra Olteanu, Saleema Amershi

    Abstract: While demands for change and accountability for harmful AI consequences mount, foreseeing the downstream effects of deploying AI systems remains a challenging task. We developed AHA! (Anticipating Harms of AI), a generative framework to assist AI practitioners and decision-makers in anticipating potential harms and unintended consequences of AI systems prior to development or deployment. Given an… ▽ More

    Submitted 5 June, 2023; originally announced June 2023.