Skip to main content

Showing 1–7 of 7 results for author: Patel, O

  1. arXiv:2407.03518  [pdf

    cs.CL cs.AI

    Improving LLM Abilities in Idiomatic Translation

    Authors: Sundesh Donthi, Maximilian Spencer, Om Patel, Joon Doh, Eid Rodan

    Abstract: For large language models (LLMs) like NLLB and GPT, translating idioms remains a challenge. Our goal is to enhance translation fidelity by improving LLM processing of idiomatic language while preserving the original linguistic style. This has a significant social impact, as it preserves cultural nuances and ensures translated texts retain their intent and emotional resonance, fostering better cros… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

  2. arXiv:2406.07882  [pdf, other

    cs.CL cs.AI cs.HC

    Designing a Dashboard for Transparency and Control of Conversational AI

    Authors: Yida Chen, Aoyu Wu, Trevor DePodesta, Catherine Yeh, Kenneth Li, Nicholas Castillo Marin, Oam Patel, Jan Riecke, Shivam Raval, Olivia Seow, Martin Wattenberg, Fernanda Viégas

    Abstract: Conversational LLMs function as black box systems, leaving users guessing about why they see the output they do. This lack of transparency is potentially problematic, especially given concerns around bias and truthfulness. To address this issue, we present an end-to-end prototype-connecting interpretability techniques with user experience design-that seeks to make chatbots more transparent. We beg… ▽ More

    Submitted 15 June, 2024; v1 submitted 12 June, 2024; originally announced June 2024.

    Comments: Project page: https://bit.ly/talktuner-project-page 38 pages, 23 figures

  3. arXiv:2403.05030  [pdf, other

    cs.CR cs.AI cs.LG

    Defending Against Unforeseen Failure Modes with Latent Adversarial Training

    Authors: Stephen Casper, Lennart Schulze, Oam Patel, Dylan Hadfield-Menell

    Abstract: Despite extensive diagnostics and debugging by developers, AI systems sometimes exhibit harmful unintended behaviors. Finding and fixing these is challenging because the attack surface is so large -- it is not tractable to exhaustively search for inputs that may elicit harmful behaviors. Red-teaming and adversarial training (AT) are commonly used to improve robustness, however, they empirically st… ▽ More

    Submitted 1 April, 2024; v1 submitted 7 March, 2024; originally announced March 2024.

  4. arXiv:2403.03218  [pdf, other

    cs.LG cs.AI cs.CL cs.CY

    The WMDP Benchmark: Measuring and Reducing Malicious Use With Unlearning

    Authors: Nathaniel Li, Alexander Pan, Anjali Gopal, Summer Yue, Daniel Berrios, Alice Gatti, Justin D. Li, Ann-Kathrin Dombrowski, Shashwat Goel, Long Phan, Gabriel Mukobi, Nathan Helm-Burger, Rassin Lababidi, Lennart Justen, Andrew B. Liu, Michael Chen, Isabelle Barrass, Oliver Zhang, Xiaoyuan Zhu, Rishub Tamirisa, Bhrugu Bharathi, Adam Khoja, Zhenqi Zhao, Ariel Herbert-Voss, Cort B. Breuer , et al. (32 additional authors not shown)

    Abstract: The White House Executive Order on Artificial Intelligence highlights the risks of large language models (LLMs) empowering malicious actors in developing biological, cyber, and chemical weapons. To measure these risks of malicious use, government institutions and major AI labs are developing evaluations for hazardous capabilities in LLMs. However, current evaluations are private, preventing furthe… ▽ More

    Submitted 15 May, 2024; v1 submitted 5 March, 2024; originally announced March 2024.

    Comments: See the project page at https://wmdp.ai

  5. arXiv:2306.03341  [pdf, other

    cs.LG cs.AI cs.CL

    Inference-Time Intervention: Eliciting Truthful Answers from a Language Model

    Authors: Kenneth Li, Oam Patel, Fernanda Viégas, Hanspeter Pfister, Martin Wattenberg

    Abstract: We introduce Inference-Time Intervention (ITI), a technique designed to enhance the "truthfulness" of large language models (LLMs). ITI operates by shifting model activations during inference, following a set of directions across a limited number of attention heads. This intervention significantly improves the performance of LLaMA models on the TruthfulQA benchmark. On an instruction-finetuned LLa… ▽ More

    Submitted 26 June, 2024; v1 submitted 5 June, 2023; originally announced June 2023.

    Comments: NeurIPS 2023 spotlight; code: https://github.com/likenneth/honest_llama

  6. arXiv:2204.11835  [pdf, other

    q-bio.QM cs.AI cs.LG

    A Novel Scalable Apache Spark Based Feature Extraction Approaches for Huge Protein Sequence and their Clustering Performance Analysis

    Authors: Preeti Jha, Aruna Tiwari, Neha Bharill, Milind Ratnaparkhe, Om Prakash Patel, Nilagiri Harshith, Mukkamalla Mounika, Neha Nagendra

    Abstract: Genome sequencing projects are rapidly increasing the number of high-dimensional protein sequence datasets. Clustering a high-dimensional protein sequence dataset using traditional machine learning approaches poses many challenges. Many different feature extraction methods exist and are widely used. However, extracting features from millions of protein sequences becomes impractical because they ar… ▽ More

    Submitted 21 April, 2022; originally announced April 2022.

  7. A Comparative Study of Histogram Equalization Based Image Enhancement Techniques for Brightness Preservation and Contrast Enhancement

    Authors: Omprakash Patel, Yogendra P. S. Maravi, Sanjeev Sharma

    Abstract: Histogram Equalization is a contrast enhancement technique in the image processing which uses the histogram of image. However histogram equalization is not the best method for contrast enhancement because the mean brightness of the output image is significantly different from the input image. There are several extensions of histogram equalization has been proposed to overcome the brightness preser… ▽ More

    Submitted 16 November, 2013; originally announced November 2013.

    Comments: 15 pages, 5 figures, 4 tables, Signal & Image Processing : An International Journal (SIPIJ)

    Journal ref: Signal & Image Processing : An International Journal (SIPIJ) Vol.4, No.5, October 2013