Skip to main content

Showing 1–11 of 11 results for author: Bai, A

  1. arXiv:2402.16459  [pdf, other

    cs.CL cs.AI

    Defending LLMs against Jailbreaking Attacks via Backtranslation

    Authors: Yihan Wang, Zhouxing Shi, Andrew Bai, Cho-Jui Hsieh

    Abstract: Although many large language models (LLMs) have been trained to refuse harmful requests, they are still vulnerable to jailbreaking attacks which rewrite the original prompt to conceal its harmful intent. In this paper, we propose a new method for defending LLMs against jailbreaking attacks by ``backtranslation''. Specifically, given an initial response generated by the target LLM from an input pro… ▽ More

    Submitted 6 June, 2024; v1 submitted 26 February, 2024; originally announced February 2024.

  2. arXiv:2402.08096  [pdf, other

    cs.LG

    Which Pretrain Samples to Rehearse when Finetuning Pretrained Models?

    Authors: Andrew Bai, Chih-Kuan Yeh, Cho-Jui Hsieh, Ankur Taly

    Abstract: Fine-tuning pretrained foundational models on specific tasks is now the de facto approach for text and vision tasks. A known pitfall of this approach is the forgetting of pretraining knowledge that happens during finetuning. Rehearsing samples randomly from the pretrain dataset is a common approach to alleviate such forgetting. However, we find that random mixing unintentionally includes samples w… ▽ More

    Submitted 12 February, 2024; originally announced February 2024.

    Comments: 17 pages, 13 figures

  3. arXiv:2401.09031  [pdf, other

    cs.LG

    Data Attribution for Diffusion Models: Timestep-induced Bias in Influence Estimation

    Authors: Tong Xie, Haoyu Li, Andrew Bai, Cho-Jui Hsieh

    Abstract: Data attribution methods trace model behavior back to its training dataset, offering an effective approach to better understand ''black-box'' neural networks. While prior research has established quantifiable links between model output and training data in diverse settings, interpreting diffusion model outputs in relation to training samples remains underexplored. In particular, diffusion models o… ▽ More

    Submitted 21 January, 2024; v1 submitted 17 January, 2024; originally announced January 2024.

  4. arXiv:2310.13861  [pdf

    cs.HC

    Examining the Influence of Job Satisfaction on Individual Innovation and Its Components: Considering the Moderating Role of Technostress

    Authors: Fatemeh Daneshmandi, Hassan Hessari, Tahmineh Nategh, Ali Bai

    Abstract: Background: Employee innovation is a crucial aspect of organizations in the current era. Therefore, studying the factors influencing individual innovation is vital and unavoidable. Undoubtedly, job satisfaction is a significant variable in management sciences. Nowadays, all organizations are interconnected with technology. Objective: This research explores the relationship between job satisfaction… ▽ More

    Submitted 20 October, 2023; originally announced October 2023.

    Comments: 13 pages, 2 figures, 4 tables

  5. arXiv:2306.04455  [pdf, ps, other

    cs.IR

    RD-Suite: A Benchmark for Ranking Distillation

    Authors: Zhen Qin, Rolf Jagerman, Rama Pasumarthi, Honglei Zhuang, He Zhang, Aijun Bai, Kai Hui, Le Yan, Xuanhui Wang

    Abstract: The distillation of ranking models has become an important topic in both academia and industry. In recent years, several advanced methods have been proposed to tackle this problem, often leveraging ranking information from teacher rankers that is absent in traditional classification settings. To date, there is no well-established consensus on how to evaluate this class of models. Moreover, inconsi… ▽ More

    Submitted 12 June, 2023; v1 submitted 7 June, 2023; originally announced June 2023.

    Comments: 15 pages, 2 figures. arXiv admin note: text overlap with arXiv:2011.04006 by other authors

    ACM Class: H.3.3

  6. arXiv:2211.01494  [pdf, other

    cs.IR

    Regression Compatible Listwise Objectives for Calibrated Ranking with Binary Relevance

    Authors: Aijun Bai, Rolf Jagerman, Zhen Qin, Le Yan, Pratyush Kar, Bing-Rong Lin, Xuanhui Wang, Michael Bendersky, Marc Najork

    Abstract: As Learning-to-Rank (LTR) approaches primarily seek to improve ranking quality, their output scores are not scale-calibrated by design. This fundamentally limits LTR usage in score-sensitive applications. Though a simple multi-objective approach that combines a regression and a ranking objective can effectively learn scale-calibrated scores, we argue that the two objectives are not necessarily com… ▽ More

    Submitted 21 August, 2023; v1 submitted 2 November, 2022; originally announced November 2022.

  7. arXiv:2210.12231  [pdf, other

    cs.LG

    Reducing Training Sample Memorization in GANs by Training with Memorization Rejection

    Authors: Andrew Bai, Cho-Jui Hsieh, Wendy Kan, Hsuan-Tien Lin

    Abstract: Generative adversarial network (GAN) continues to be a popular research direction due to its high generation quality. It is observed that many state-of-the-art GANs generate samples that are more similar to the training set than a holdout testing set from the same distribution, hinting some training samples are implicitly memorized in these models. This memorization behavior is unfavorable in many… ▽ More

    Submitted 21 October, 2022; originally announced October 2022.

  8. arXiv:2208.14966  [pdf, other

    cs.LG

    Concept Gradient: Concept-based Interpretation Without Linear Assumption

    Authors: Andrew Bai, Chih-Kuan Yeh, Pradeep Ravikumar, Neil Y. C. Lin, Cho-Jui Hsieh

    Abstract: Concept-based interpretations of black-box models are often more intuitive for humans to understand. The most widely adopted approach for concept-based interpretation is Concept Activation Vector (CAV). CAV relies on learning a linear relation between some latent representation of a given model and concepts. The linear separability is usually implicitly assumed but does not hold true in general. I… ▽ More

    Submitted 5 February, 2024; v1 submitted 31 August, 2022; originally announced August 2022.

    Comments: 21 pages, 7 figures, published in ICLR 2023

  9. Million.js: A Fast Compiler-Augmented Virtual DOM for the Web

    Authors: Aiden Bai

    Abstract: Interactive web applications created with declarative JavaScript User Interface (UI) libraries have increasingly dominated the modern internet. However, existing libraries are primarily made for run-time execution, and rely on the user to load and render web applications. This led us to create Million.js, a fast compiler-augmented virtual Document Object Model (DOM) for the web. Million.js reduces… ▽ More

    Submitted 1 January, 2023; v1 submitted 16 February, 2022; originally announced February 2022.

    Comments: 8 pages, 12 figures. Accepted to ACM SAC

  10. arXiv:1706.04315  [pdf, ps, other

    cs.MA

    RoboCup 2D Soccer Simulation League: Evaluation Challenges

    Authors: Mikhail Prokopenko, Peter Wang, Sebastian Marian, Aijun Bai, Xiao Li, Xiaoping Chen

    Abstract: We summarise the results of RoboCup 2D Soccer Simulation League in 2016 (Leipzig), including the main competition and the evaluation round. The evaluation round held in Leipzig confirmed the strength of RoboCup-2015 champion (WrightEagle, i.e. WE2015) in the League, with only eventual finalists of 2016 competition capable of defeating WE2015. An extended, post-Leipzig, round-robin tournament which… ▽ More

    Submitted 14 June, 2017; originally announced June 2017.

    Comments: 12 pages, RoboCup-2017, Nagoya, Japan, July 2017

  11. arXiv:1605.07960  [pdf, other

    cs.CV cs.RO

    Multi-Object Tracking and Identification over Sets

    Authors: Aijun Bai

    Abstract: The ability for an autonomous agent or robot to track and identify potentially multiple objects in a dynamic environment is essential for many applications, such as automated surveillance, traffic monitoring, human-robot interaction, etc. The main challenge is due to the noisy and incomplete perception including inevitable false negative and false positive errors from a low-level detector. In this… ▽ More

    Submitted 25 May, 2016; originally announced May 2016.

    Comments: Draft version