Skip to main content

Showing 1–50 of 789 results for author: Jain, S

  1. arXiv:2407.10264  [pdf, other

    cs.LG cs.CL

    What Makes and Breaks Safety Fine-tuning? Mechanistic Study

    Authors: Samyak Jain, Ekdeep Singh Lubana, Kemal Oksuz, Tom Joy, Philip H. S. Torr, Amartya Sanyal, Puneet K. Dokania

    Abstract: Safety fine-tuning helps align Large Language Models (LLMs) with human preferences for their safe deployment. To better understand the underlying factors that make models safe via safety fine-tuning, we design a synthetic data generation framework that captures salient aspects of an unsafe input by modeling the interaction between the task the model is asked to perform (e.g., ``design'') versus th… ▽ More

    Submitted 14 July, 2024; originally announced July 2024.

    Comments: Preprint

  2. arXiv:2407.09473  [pdf, other

    cs.CV

    StyleSplat: 3D Object Style Transfer with Gaussian Splatting

    Authors: Sahil Jain, Avik Kuthiala, Prabhdeep Singh Sethi, Prakanshul Saxena

    Abstract: Recent advancements in radiance fields have opened new avenues for creating high-quality 3D assets and scenes. Style transfer can enhance these 3D assets with diverse artistic styles, transforming creative expression. However, existing techniques are often slow or unable to localize style transfer to specific objects. We introduce StyleSplat, a lightweight method for stylizing 3D objects in scenes… ▽ More

    Submitted 12 July, 2024; originally announced July 2024.

    Comments: for code and results, see http://bernard0047.github.io/stylesplat

  3. arXiv:2407.04302  [pdf, other

    cs.LG

    Fair Federated Data Clustering through Personalization: Bridging the Gap between Diverse Data Distributions

    Authors: Shivam Gupta, Tarushi, Tsering Wangzes, Shweta Jain

    Abstract: The rapid growth of data from edge devices has catalyzed the performance of machine learning algorithms. However, the data generated resides at client devices thus there are majorly two challenge faced by traditional machine learning paradigms - centralization of data for training and secondly for most the generated data the class labels are missing and there is very poor incentives to clients to… ▽ More

    Submitted 12 July, 2024; v1 submitted 5 July, 2024; originally announced July 2024.

  4. arXiv:2407.03677  [pdf, other

    math.DS math.NA nlin.PS

    Nonlinear Model Reduction to Random Spectral Submanifolds in Random Vibrations

    Authors: Zhenwei Xu, Roshan S. Kaundinya, Shobhit Jain, George Haller

    Abstract: Dynamical systems in engineering and physics are often subject to irregular excitations that are best modeled as random. Monte Carlo simulations are routinely performed on such random models to obtain statistics on their long-term response. Such simulations, however, are prohibitively expensive and time consuming for high-dimensional nonlinear systems. Here we propose to decrease this numerical bu… ▽ More

    Submitted 4 July, 2024; originally announced July 2024.

    Comments: 26 pages, 15 figures

  5. arXiv:2407.01351  [pdf, other

    astro-ph.HE

    Probing the connection between IceCube neutrinos and MOJAVE AGN

    Authors: R. Abbasi, M. Ackermann, J. Adams, S. K. Agarwalla, J. A. Aguilar, M. Ahlers, J. M. Alameddine, N. M. Amin, K. Andeen, C. Argüelles, Y. Ashida, S. Athanasiadou, L. Ausborm, S. N. Axani, X. Bai, A. Balagopal V., M. Baricevic, S. W. Barwick, S. Bash, V. Basu, R. Bay, J. J. Beatty, J. Becker Tjus, J. Beise, C. Bellenghi , et al. (399 additional authors not shown)

    Abstract: Active Galactic Nuclei (AGN) are prime candidate sources of the high-energy, astrophysical neutrinos detected by IceCube. This is demonstrated by the real-time multi-messenger detection of the blazar TXS 0506+056 and the recent evidence of neutrino emission from NGC 1068 from a separate time-averaged study. However, the production mechanism of the astrophysical neutrinos in AGN is not well establi… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

    Comments: 14 Pages 7 Figures

  6. arXiv:2407.01314  [pdf, other

    hep-ex

    Search for a light sterile neutrino with 7.5 years of IceCube DeepCore data

    Authors: R. Abbasi, M. Ackermann, J. Adams, S. K. Agarwalla, J. A. Aguilar, M. Ahlers, J. M. Alameddine, N. M. Amin, K. Andeen, C. Argüelles, Y. Ashida, S. Athanasiadou, L. Ausborm, S. N. Axani, X. Bai, A. Balagopal V., M. Baricevic, S. W. Barwick, S. Bash, V. Basu, R. Bay, J. J. Beatty, J. Becker Tjus, J. Beise, C. Bellenghi , et al. (399 additional authors not shown)

    Abstract: We present a search for an eV-scale sterile neutrino using 7.5 years of data from the IceCube DeepCore detector. The analysis uses a sample of 21,914 events with energies between 5 and 150 GeV to search for sterile neutrinos through atmospheric muon neutrino disappearance. Improvements in event selection and treatment of systematic uncertainties provide greater statistical power compared to previo… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

    Comments: 11 pages, 5 figures. To be submitted to Physical Review D

  7. arXiv:2406.19314  [pdf, other

    cs.CL cs.AI cs.LG

    LiveBench: A Challenging, Contamination-Free LLM Benchmark

    Authors: Colin White, Samuel Dooley, Manley Roberts, Arka Pal, Ben Feuer, Siddhartha Jain, Ravid Shwartz-Ziv, Neel Jain, Khalid Saifullah, Siddartha Naidu, Chinmay Hegde, Yann LeCun, Tom Goldstein, Willie Neiswanger, Micah Goldblum

    Abstract: Test set contamination, wherein test data from a benchmark ends up in a newer model's training set, is a well-documented obstacle for fair LLM evaluation and can quickly render benchmarks obsolete. To mitigate this, many recent benchmarks crowdsource new prompts and evaluations from human or LLM judges; however, these can introduce significant biases, and break down when scoring hard questions. In… ▽ More

    Submitted 27 June, 2024; originally announced June 2024.

  8. arXiv:2406.17975  [pdf, ps, other

    cs.CL cs.CR cs.LG

    Inherent Challenges of Post-Hoc Membership Inference for Large Language Models

    Authors: Matthieu Meeus, Shubham Jain, Marek Rei, Yves-Alexandre de Montjoye

    Abstract: Large Language Models (LLMs) are often trained on vast amounts of undisclosed data, motivating the development of post-hoc Membership Inference Attacks (MIAs) to gain insight into their training data composition. However, in this paper, we identify inherent challenges in post-hoc MIA evaluation due to potential distribution shifts between collected member and non-member datasets. Using a simple ba… ▽ More

    Submitted 25 June, 2024; originally announced June 2024.

  9. arXiv:2406.17447  [pdf, other

    quant-ph hep-th

    Constraints on local processes

    Authors: Abhijit Gadde, Shraiyance Jain, Harshal Kulkarni

    Abstract: If we want to transform the quantum of state of a system to another using local processes, what is the probability of success? It turns out that this probability can be bounded by quantifying entanglement within both the states. In this paper, we construct a family of multipartite entanglement measures that are monotonic under local operations and classical communication on average. The measures a… ▽ More

    Submitted 25 June, 2024; originally announced June 2024.

    Comments: 33 pages, 15 figures

    Report number: TIFR/TH/24-10

  10. arXiv:2406.17025  [pdf, other

    astro-ph.CO hep-th

    Time Non-locality in Dark Matter and LSS

    Authors: Arhum Ansari, Arka Banerjee, Sachin Jain, Shaunak Padhyegurjar

    Abstract: We explore the intriguing phenomenon of time non-locality in the evolution of dark matter and Large Scale Structure (LSS). Recently in\,\cite{Donath:2023sav}, it was shown that time non-locality emerges in bias tracer fluctuations, which are $SO(3)$ scalars in real space, at fifth order in the perturbation expansion in dark matter overdensity. We demonstrate that by breaking the symmetry down to… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

    Comments: 20 pages + 2 appendices

  11. arXiv:2406.16846  [pdf, other

    cs.LG cs.CY stat.ML

    Data Debiasing with Datamodels (D3M): Improving Subgroup Robustness via Data Selection

    Authors: Saachi Jain, Kimia Hamidieh, Kristian Georgiev, Andrew Ilyas, Marzyeh Ghassemi, Aleksander Madry

    Abstract: Machine learning models can fail on subgroups that are underrepresented during training. While techniques such as dataset balancing can improve performance on underperforming groups, they require access to training group annotations and can end up removing large portions of the dataset. In this paper, we introduce Data Debiasing with Datamodels (D3M), a debiasing approach which isolates and remove… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

  12. arXiv:2406.13383  [pdf, other

    nlin.CG cs.ET

    Emergent Dynamics in Heterogeneous Life-Like Cellular Automata

    Authors: Aarati Shrestha, Felix Reimers, Sanyam Jain, Paolo Baldini, Michele Braccini, Andrea Roli, Stefano Nichele

    Abstract: The Game of Life (GoL), one well known 2D cellular automaton, does not typically ensure interesting long-term phenotypic dynamics. Therefore, while being Turing complete, GoL cannot be said to be open-ended. In this work, we extend GoL with the opportunity for local mutations, thus enabling a heterogeneous life-like cellular automaton guided by an evolutionary inner loop. Additionally, we introduc… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

    Comments: 16 pages, 9 Figures

  13. arXiv:2406.11937  [pdf, other

    physics.ins-det hep-ex physics.data-an

    Using graph neural networks to reconstruct charged pion showers in the CMS High Granularity Calorimeter

    Authors: M. Aamir, B. Acar, G. Adamov, T. Adams, C. Adloff, S. Afanasiev, C. Agrawal, C. Agrawal, A. Ahmad, H. A. Ahmed, S. Akbar, N. Akchurin, B. Akgul, B. Akgun, R. O. Akpinar, E. Aktas, A. AlKadhim, V. Alexakhin, J. Alimena, J. Alison, A. Alpana, W. Alshehri, P. Alvarez Dominguez, M. Alyari, C. Amendola , et al. (550 additional authors not shown)

    Abstract: A novel method to reconstruct the energy of hadronic showers in the CMS High Granularity Calorimeter (HGCAL) is presented. The HGCAL is a sampling calorimeter with very fine transverse and longitudinal granularity. The active media are silicon sensors and scintillator tiles readout by SiPMs and the absorbers are a combination of lead and Cu/CuW in the electromagnetic section, and steel in the hadr… ▽ More

    Submitted 30 June, 2024; v1 submitted 17 June, 2024; originally announced June 2024.

    Comments: Prepared for submission to JINST

  14. arXiv:2406.06798  [pdf, other

    eess.AS cs.SD

    The Reasonable Effectiveness of Speaker Embeddings for Violence Detection

    Authors: Sarthak Jain, Orchid Chetia Phukan, Arun Balaji Buduru, Rajesh Sharma

    Abstract: In this paper, we focus on audio violence detection (AVD). AVD is necessary for several reasons, especially in the context of maintaining safety, preventing harm, and ensuring security in various environments. This calls for accurate AVD systems. Like many related applications in audio processing, the most common approach for improving the performance, would be by leveraging self-supervised (SSL)… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: Accepted to INTERSPEECH 24 Show & Tell Demonstrations

  15. arXiv:2406.06781  [pdf, other

    eess.AS cs.SD

    PERSONA: An Application for Emotion Recognition, Gender Recognition and Age Estimation

    Authors: Devyani Koshal, Orchid Chetia Phukan, Sarthak Jain, Arun Balaji Buduru, Rajesh Sharma

    Abstract: Emotion Recognition (ER), Gender Recognition (GR), and Age Estimation (AE) constitute paralinguistic tasks that rely not on the spoken content but primarily on speech characteristics such as pitch and tone. While previous research has made significant strides in developing models for each task individually, there has been comparatively less emphasis on concurrently learning these tasks, despite th… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: Accepted to INTERSPEECH 2024 Show & Tell Demonstrations

  16. arXiv:2406.06774  [pdf, other

    eess.AS cs.SD

    ComFeAT: Combination of Neural and Spectral Features for Improved Depression Detection

    Authors: Orchid Chetia Phukan, Sarthak Jain, Shubham Singh, Muskaan Singh, Arun Balaji Buduru, Rajesh Sharma

    Abstract: In this work, we focus on the detection of depression through speech analysis. Previous research has widely explored features extracted from pre-trained models (PTMs) primarily trained for paralinguistic tasks. Although these features have led to sufficient advances in speech-based depression detection, their performance declines in real-world settings. To address this, in this paper, we introduce… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: Accepted to INTERSPEECH 2024 Show & Tell Demonstrations

  17. arXiv:2406.06684  [pdf, other

    astro-ph.HE

    Search for neutrino emission from hard X-ray AGN with IceCube

    Authors: R. Abbasi, M. Ackermann, J. Adams, S. K. Agarwalla, J. A. Aguilar, M. Ahlers, J. M. Alameddine, N. M. Amin, K. Andeen, C. Argüelles, Y. Ashida, S. Athanasiadou, L. Ausborm, S. N. Axani, X. Bai, A. Balagopal V., M. Baricevic, S. W. Barwick, S. Bash, V. Basu, R. Bay, J. J. Beatty, J. Becker Tjus, J. Beise, C. Bellenghi , et al. (401 additional authors not shown)

    Abstract: Active Galactic Nuclei (AGN) are promising candidate sources of high-energy astrophysical neutrinos since they provide environments rich in matter and photon targets where cosmic ray interactions may lead to the production of gamma rays and neutrinos. We searched for high-energy neutrino emission from AGN using the $\textit{Swift}$-BAT Spectroscopic Survey (BASS) catalog of hard X-ray sources and… ▽ More

    Submitted 12 June, 2024; v1 submitted 10 June, 2024; originally announced June 2024.

  18. arXiv:2406.06461  [pdf, other

    cs.CL

    Reasoning in Token Economies: Budget-Aware Evaluation of LLM Reasoning Strategies

    Authors: Junlin Wang, Siddhartha Jain, Dejiao Zhang, Baishakhi Ray, Varun Kumar, Ben Athiwaratkun

    Abstract: A diverse array of reasoning strategies has been proposed to elicit the capabilities of large language models. However, in this paper, we point out that traditional evaluations which focus solely on performance metrics miss a key factor: the increased effectiveness due to additional compute. By overlooking this aspect, a skewed view of strategy efficiency is often presented. This paper introduces… ▽ More

    Submitted 14 June, 2024; v1 submitted 10 June, 2024; originally announced June 2024.

  19. arXiv:2406.05331  [pdf, other

    cs.RO

    Autonomous Robotic Assembly: From Part Singulation to Precise Assembly

    Authors: Kei Ota, Devesh K. Jha, Siddarth Jain, Bill Yerazunis, Radu Corcodel, Yash Shukla, Antonia Bronars, Diego Romeres

    Abstract: Imagine a robot that can assemble a functional product from the individual parts presented in any configuration to the robot. Designing such a robotic system is a complex problem which presents several open challenges. To bypass these challenges, the current generation of assembly systems is built with a lot of system integration effort to provide the structure and precision necessary for assembly… ▽ More

    Submitted 11 June, 2024; v1 submitted 7 June, 2024; originally announced June 2024.

    Comments: Under submission

  20. arXiv:2406.00905  [pdf, other

    hep-ex

    Exploration of mass splitting and muon/tau mixing parameters for an eV-scale sterile neutrino with IceCube

    Authors: R. Abbasi, M. Ackermann, J. Adams, S. K. Agarwalla, J. A. Aguilar, M. Ahlers, J. M. Alameddine, N. M. Amin, K. Andeen, C. Argüelles, Y. Ashida, S. Athanasiadou, L. Ausborm, S. N. Axani, X. Bai, A. Balagopal V., M. Baricevic, S. W. Barwick, S. Bash, V. Basu, R. Bay, J. J. Beatty, J. Becker Tjus, J. Beise, C. Bellenghi , et al. (400 additional authors not shown)

    Abstract: We present the first three-parameter fit to a 3+1 sterile neutrino model using 7.634 years of data from the IceCube Neutrino Observatory on $ν_μ+\overlineν_μ$ charged-current interactions in the energy range 500-9976 GeV. Our analysis is sensitive to the mass-squared splitting between the heaviest and lightest mass state ($Δm_{41}^2$), the mixing matrix element connecting muon flavor to the fourth… ▽ More

    Submitted 2 June, 2024; originally announced June 2024.

  21. arXiv:2405.21074  [pdf, other

    cs.CV

    Latent Intrinsics Emerge from Training to Relight

    Authors: Xiao Zhang, William Gao, Seemandhar Jain, Michael Maire, David. A. Forsyth, Anand Bhattad

    Abstract: Image relighting is the task of showing what a scene from a source image would look like if illuminated differently. Inverse graphics schemes recover an explicit representation of geometry and a set of chosen intrinsics, then relight with some form of renderer. However error control for inverse graphics is difficult, and inverse graphics methods can represent only the effects of the chosen intrins… ▽ More

    Submitted 31 May, 2024; originally announced May 2024.

  22. arXiv:2405.19569  [pdf, other

    cs.CV

    Improved Convex Decomposition with Ensembling and Boolean Primitives

    Authors: Vaibhav Vavilala, Florian Kluger, Seemandhar Jain, Bodo Rosenhahn, David Forsyth

    Abstract: Describing a scene in terms of primitives -- geometrically simple shapes that offer a parsimonious but accurate abstraction of structure -- is an established vision problem. This is a good model of a difficult fitting problem: different scenes require different numbers of primitives and primitives interact strongly, but any proposed solution can be evaluated at inference time. The state of the art… ▽ More

    Submitted 9 June, 2024; v1 submitted 29 May, 2024; originally announced May 2024.

    Comments: 18 pages, 9 figures, 7 tables

  23. arXiv:2405.15788  [pdf, other

    cs.IR cs.HC cs.LG

    Towards Fairness in Provably Communication-Efficient Federated Recommender Systems

    Authors: Kirandeep Kaur, Sujit Gujar, Shweta Jain

    Abstract: To reduce the communication overhead caused by parallel training of multiple clients, various federated learning (FL) techniques use random client sampling. Nonetheless, ensuring the efficacy of random sampling and determining the optimal number of clients to sample in federated recommender systems (FRSs) remains challenging due to the isolated nature of each user as a separate client. This challe… ▽ More

    Submitted 2 May, 2024; originally announced May 2024.

  24. arXiv:2405.14812  [pdf, other

    cs.CY

    As an AI Language Model, "Yes I Would Recommend Calling the Police'': Norm Inconsistency in LLM Decision-Making

    Authors: Shomik Jain, D Calacci, Ashia Wilson

    Abstract: We investigate the phenomenon of norm inconsistency: where LLMs apply different norms in similar situations. Specifically, we focus on the high-risk application of deciding whether to call the police in Amazon Ring home surveillance videos. We evaluate the decisions of three state-of-the-art LLMs -- GPT-4, Gemini 1.0, and Claude 3 Sonnet -- in relation to the activities portrayed in the videos, th… ▽ More

    Submitted 23 May, 2024; originally announced May 2024.

  25. arXiv:2405.14614  [pdf, ps, other

    cs.CY cs.ET cs.IR

    Push and Pull: A Framework for Measuring Attentional Agency

    Authors: Zachary Wojtowicz, Shrey Jain, Nicholas Vincent

    Abstract: We propose a framework for measuring attentional agency - the ability to allocate one's attention according to personal desires, goals, and intentions - on digital platforms. Platforms extend people's limited powers of attention by extrapolating their preferences to large collections of previously unconsidered informational objects. However, platforms typically also allow people to influence one a… ▽ More

    Submitted 23 May, 2024; originally announced May 2024.

  26. arXiv:2405.07518  [pdf, other

    cs.AR cs.AI

    SambaNova SN40L: Scaling the AI Memory Wall with Dataflow and Composition of Experts

    Authors: Raghu Prabhakar, Ram Sivaramakrishnan, Darshan Gandhi, Yun Du, Mingran Wang, Xiangyu Song, Kejie Zhang, Tianren Gao, Angela Wang, Karen Li, Yongning Sheng, Joshua Brot, Denis Sokolov, Apurv Vivek, Calvin Leung, Arjun Sabnis, Jiayu Bai, Tuowen Zhao, Mark Gottscho, David Jackson, Mark Luttrell, Manish K. Shah, Edison Chen, Kaizhao Liang, Swayambhoo Jain , et al. (5 additional authors not shown)

    Abstract: Monolithic large language models (LLMs) like GPT-4 have paved the way for modern generative AI applications. Training, serving, and maintaining monolithic LLMs at scale, however, remains prohibitively expensive and challenging. The disproportionate increase in compute-to-memory ratio of modern AI accelerators have created a memory wall, necessitating new methods to deploy AI. Composition of Expert… ▽ More

    Submitted 13 May, 2024; originally announced May 2024.

  27. arXiv:2405.05530  [pdf, other

    cs.CV

    NurtureNet: A Multi-task Video-based Approach for Newborn Anthropometry

    Authors: Yash Khandelwal, Mayur Arvind, Sriram Kumar, Ashish Gupta, Sachin Kumar Danisetty, Piyush Bagad, Anish Madan, Mayank Lunayach, Aditya Annavajjala, Abhishek Maiti, Sansiddh Jain, Aman Dalmia, Namrata Deka, Jerome White, Jigar Doshi, Angjoo Kanazawa, Rahul Panicker, Alpan Raval, Srinivas Rana, Makarand Tapaswi

    Abstract: Malnutrition among newborns is a top public health concern in developing countries. Identification and subsequent growth monitoring are key to successful interventions. However, this is challenging in rural communities where health systems tend to be inaccessible and under-equipped, with poor adherence to protocol. Our goal is to equip health workers and public health systems with a solution for c… ▽ More

    Submitted 8 May, 2024; originally announced May 2024.

    Comments: Accepted at CVPM Workshop at CVPR 2024

  28. arXiv:2405.03643  [pdf, other

    cs.CV

    Collecting Consistently High Quality Object Tracks with Minimal Human Involvement by Using Self-Supervised Learning to Detect Tracker Errors

    Authors: Samreen Anjum, Suyog Jain, Danna Gurari

    Abstract: We propose a hybrid framework for consistently producing high-quality object tracks by combining an automated object tracker with little human input. The key idea is to tailor a module for each dataset to intelligently decide when an object tracker is failing and so humans should be brought in to re-localize an object for continued tracking. Our approach leverages self-supervised learning on unlab… ▽ More

    Submitted 6 May, 2024; originally announced May 2024.

  29. arXiv:2405.01602  [pdf

    physics.soc-ph

    A District level Flood Severity Index for India

    Authors: Manabendra Saharia, Sharad K Jain, Ved Prakash, Harshul Malik, O P Sreejith

    Abstract: India is one of the worst affected countries in the world in terms of fatalities and economic damage due to natural disasters, particularly floods. For planning flood mitigating and relief measures, granular historical information on a pan-India basis is required, which has been missing. Through recent efforts, a few national scale datasets have been created, but they lack the requisite informatio… ▽ More

    Submitted 1 May, 2024; originally announced May 2024.

  30. arXiv:2405.01481  [pdf, other

    cs.CL cs.AI cs.LG

    NeMo-Aligner: Scalable Toolkit for Efficient Model Alignment

    Authors: Gerald Shen, Zhilin Wang, Olivier Delalleau, Jiaqi Zeng, Yi Dong, Daniel Egert, Shengyang Sun, Jimmy Zhang, Sahil Jain, Ali Taghibakhshi, Markel Sanz Ausin, Ashwath Aithal, Oleksii Kuchaiev

    Abstract: Aligning Large Language Models (LLMs) with human values and preferences is essential for making them helpful and safe. However, building efficient tools to perform alignment can be challenging, especially for the largest and most competent LLMs which often contain tens or hundreds of billions of parameters. We create NeMo-Aligner, a toolkit for model alignment that can efficiently scale to using h… ▽ More

    Submitted 2 May, 2024; originally announced May 2024.

    Comments: 13 pages, 4 figures

  31. arXiv:2405.00773  [pdf, other

    hep-th

    Hidden sectors of Chern-Simons Matter theories and Exact Holography

    Authors: Sachin Jain, Dhruva K. S, Evgeny Skvortsov

    Abstract: Chiral higher-spin gravity is a higher-spin extension of both self-dual Yang-Mills and self-dual gravity and is a unique local higher-spin gravity in four dimensions. Its existence implies that there are two closed subsectors in Chern-Simons matter theories. We make first steps in identifying these (anti-)chiral subsectors directly on the CFT side, which should result in a holographically dual pai… ▽ More

    Submitted 1 May, 2024; originally announced May 2024.

    Comments: 27 pages+17 pages appendices, 1 figure

  32. Spectroscopic Investigation of Nebular Gas (SING): Instrument Design, Assembly and Calibration

    Authors: Bharat Chandra P, Binukumar G. Nair, Shubham Jankiram Ghatul, Shubhangi Jain, S. Sriram, Mahesh Babu S., Rekhesh Mohan, Margarita Safonova, Jayant Murthy, Mikhail Sachkov

    Abstract: The Spectroscopic Investigation of Nebular Gas (SING) is a near-ultraviolet (NUV) low-resolution spectrograph payload designed to operate in the NUV range, 1400 $\unicode{x212B}$ -- 2700 $\unicode{x212B}$, from a stable space platform. SING telescope has a primary aperture of 298 mm, feeding the light to the long-slit UV spectrograph. SING has a field of view (FOV) of 1$^{\circ}$, achieving a spat… ▽ More

    Submitted 26 April, 2024; originally announced April 2024.

    Journal ref: Exp Astron 57, 18 (2024)

  33. arXiv:2404.08592  [pdf, other

    cs.CY

    Scarce Resource Allocations That Rely On Machine Learning Should Be Randomized

    Authors: Shomik Jain, Kathleen Creel, Ashia Wilson

    Abstract: Contrary to traditional deterministic notions of algorithmic fairness, this paper argues that fairly allocating scarce resources using machine learning often requires randomness. We address why, when, and how to randomize by proposing stochastic procedures that more adequately account for all of the claims that individuals have to allocations of social goods or opportunities.

    Submitted 19 June, 2024; v1 submitted 12 April, 2024; originally announced April 2024.

    Comments: To appear in the proceedings of the International Conference on Machine Learning (ICML 2024)

    ACM Class: K.4.0

  34. arXiv:2404.06768  [pdf, ps, other

    cs.IT math.RA

    A new approach to construct minimal linear codes over $\mathbb{F}_{3}$

    Authors: Wajid M. Shaikh, Rupali S. Jain, B. Surendranath Reddy, Bhagyashri S. Patil, Sahar M. A. Maqbol

    Abstract: In this article, we present two new approaches to construct minimal linear codes of dimension $n+1$ over $\mathbb{F}_{3}$ using characteristic and ternary functions. We also obtain the weight distributions of these constructed minimal linear codes. We further show that a specific class of these codes violates Ashikhmin-Barg condition.

    Submitted 10 April, 2024; originally announced April 2024.

    Journal ref: MJMS-2024-0154

  35. arXiv:2404.05981  [pdf, other

    cs.LG cs.CV

    A Lightweight Measure of Classification Difficulty from Application Dataset Characteristics

    Authors: Bryan Bo Cao, Abhinav Sharma, Lawrence O'Gorman, Michael Coss, Shubham Jain

    Abstract: Despite accuracy and computation benchmarks being widely available to help choose among neural network models, these are usually trained on datasets with many classes, and do not give a precise idea of performance for applications of few (< 10) classes. The conventional procedure to predict performance is to train and test repeatedly on the different models and dataset variations of interest. Howe… ▽ More

    Submitted 8 April, 2024; originally announced April 2024.

    Comments: 13 pages, 3 figures

    MSC Class: 65D19

  36. arXiv:2404.03245  [pdf, other

    cs.ET cs.OS

    Memory Sharing with CXL: Hardware and Software Design Approaches

    Authors: Sunita Jain, Nagaradhesh Yeleswarapu, Hasan Al Maruf, Rita Gupta

    Abstract: Compute Express Link (CXL) is a rapidly emerging coherent interconnect standard that provides opportunities for memory pooling and sharing. Memory sharing is a well-established software feature that improves memory utilization by avoiding unnecessary data movement. In this paper, we discuss multiple approaches to enable memory sharing with different generations of CXL protocol (i.e., CXL 2.0 and C… ▽ More

    Submitted 4 April, 2024; originally announced April 2024.

    Comments: Presented at the 3rd Workshop on Heterogeneous Composable and Disaggregated Systems (HCDS 2024)

  37. arXiv:2404.03150  [pdf, other

    cs.CL cs.AI

    NLP at UC Santa Cruz at SemEval-2024 Task 5: Legal Answer Validation using Few-Shot Multi-Choice QA

    Authors: Anish Pahilajani, Samyak Rajesh Jain, Devasha Trivedi

    Abstract: This paper presents our submission to the SemEval 2024 Task 5: The Legal Argument Reasoning Task in Civil Procedure. We present two approaches to solving the task of legal answer validation, given an introduction to the case, a question and an answer candidate. Firstly, we fine-tuned pre-trained BERT-based models and found that models trained on domain knowledge perform better. Secondly, we perfor… ▽ More

    Submitted 3 April, 2024; originally announced April 2024.

  38. arXiv:2404.02371  [pdf, ps, other

    math.DS

    Piecewise Contractions

    Authors: Sakshi Jain, Carlangelo Liverani

    Abstract: We study piecewise injective, but not necessarily globally injective, contracting maps on a compact subset of \(\bR^d\). We prove that generically the attractor and the set of discontinuities are disjoint, and hence the attractor consists of periodic orbits. In addition, we prove that piecewise injective contractions are generically topologically stable.

    Submitted 15 April, 2024; v1 submitted 2 April, 2024; originally announced April 2024.

  39. arXiv:2404.01203  [pdf, other

    cs.CV

    Video Interpolation with Diffusion Models

    Authors: Siddhant Jain, Daniel Watson, Eric Tabellion, Aleksander Hołyński, Ben Poole, Janne Kontkanen

    Abstract: We present VIDIM, a generative model for video interpolation, which creates short videos given a start and end frame. In order to achieve high fidelity and generate motions unseen in the input data, VIDIM uses cascaded diffusion models to first generate the target video at low resolution, and then generate the high-resolution video conditioned on the low-resolution generated video. We compare VIDI… ▽ More

    Submitted 1 April, 2024; originally announced April 2024.

    Comments: CVPR 2024, Project page at https://vidim-interpolation.github.io/

  40. arXiv:2403.19850  [pdf

    physics.optics physics.app-ph

    Incubating Advances in Integrated Photonics with Emerging Sensing and Computational Capabilities

    Authors: Sourabh Jain, May Hlaing, Kang Chieh Fan, Jason Midkiff, Shupeng Ning, Chenghao Feng, Po Yu Hsiao, Patrick Camp, Ray Chen

    Abstract: As photonic technologies continue to grow in multidimensional aspects, integrated photonics holds a unique position and continuously presents enormous possibilities to research communities. Applications span across data centers, environmental monitoring, medical diagnosis, and highly compact communication components, with further possibilities growing endlessly. Here, we provide a review of state… ▽ More

    Submitted 28 March, 2024; originally announced March 2024.

  41. arXiv:2403.15966  [pdf, other

    eess.SY

    Fisher Information Approach for Masking the Sensing Plan: Applications in Multifunction Radars

    Authors: Shashwat Jain, Vikram Krishnamurthy, Muralidhar Rangaswamy, Bosung Kang, Sandeep Gogineni

    Abstract: How to design a Markov Decision Process (MDP) based radar controller that makes small sacrifices in performance to mask its sensing plan from an adversary? The radar controller purposefully minimizes the Fisher information of its emissions so that an adversary cannot identify the controller's model parameters accurately. Unlike classical open loop statistical inference, where the Fisher informatio… ▽ More

    Submitted 23 March, 2024; originally announced March 2024.

  42. arXiv:2403.15484  [pdf, other

    cs.CL cs.LG

    RakutenAI-7B: Extending Large Language Models for Japanese

    Authors: Rakuten Group, Aaron Levine, Connie Huang, Chenguang Wang, Eduardo Batista, Ewa Szymanska, Hongyi Ding, Hou Wei Chou, Jean-François Pessiot, Johanes Effendi, Justin Chiu, Kai Torben Ohlhus, Karan Chopra, Keiji Shinzato, Koji Murakami, Lee Xiong, Lei Chen, Maki Kubota, Maksim Tkachenko, Miroku Lee, Naoki Takahashi, Prathyusha Jwalapuram, Ryutaro Tatsushima, Saurabh Jain, Sunil Kumar Yadav , et al. (5 additional authors not shown)

    Abstract: We introduce RakutenAI-7B, a suite of Japanese-oriented large language models that achieve the best performance on the Japanese LM Harness benchmarks among the open 7B models. Along with the foundation model, we release instruction- and chat-tuned models, RakutenAI-7B-instruct and RakutenAI-7B-chat respectively, under the Apache 2.0 license.

    Submitted 21 March, 2024; originally announced March 2024.

  43. arXiv:2403.14806  [pdf, other

    cs.ET physics.app-ph physics.optics

    Photonic-Electronic Integrated Circuits for High-Performance Computing and AI Accelerators

    Authors: Shupeng Ning, Hanqing Zhu, Chenghao Feng, Jiaqi Gu, Zhixing Jiang, Zhoufeng Ying, Jason Midkiff, Sourabh Jain, May H. Hlaing, David Z. Pan, Ray T. Chen

    Abstract: In recent decades, the demand for computational power has surged, particularly with the rapid expansion of artificial intelligence (AI). As we navigate the post-Moore's law era, the limitations of traditional electrical digital computing, including process bottlenecks and power consumption issues, are propelling the search for alternative computing paradigms. Among various emerging technologies, i… ▽ More

    Submitted 11 July, 2024; v1 submitted 21 March, 2024; originally announced March 2024.

  44. arXiv:2403.14484  [pdf, other

    cs.LG cs.AI cs.CV cs.NE

    HyperGALE: ASD Classification via Hypergraph Gated Attention with Learnable Hyperedges

    Authors: Mehul Arora, Chirag Shantilal Jain, Lalith Bharadwaj Baru, Kamalaker Dadi, Bapi Raju Surampudi

    Abstract: Autism Spectrum Disorder (ASD) is a neurodevelopmental condition characterized by varied social cognitive challenges and repetitive behavioral patterns. Identifying reliable brain imaging-based biomarkers for ASD has been a persistent challenge due to the spectrum's diverse symptomatology. Existing baselines in the field have made significant strides in this direction, yet there remains room for i… ▽ More

    Submitted 21 March, 2024; originally announced March 2024.

    Comments: Accepted to IJCNN 2024

  45. arXiv:2403.13350  [pdf, ps, other

    cs.IT math.RA

    Construction of Minimal Binary Linear Codes of dimension $n+3$

    Authors: Wajid M. Shaikh, Rupali S. Jain, B. Surendranath Reddy, Bhagyashri S. Patil

    Abstract: In this paper, we will give the generic construction of a binary linear code of dimension $n+3$ and derive the necessary and sufficient conditions for the constructed code to be minimal. Using generic construction, a new family of minimal binary linear code will be constructed from a special class of Boolean functions violating the Ashikhmin-Barg condition. We also obtain the weight distribution o… ▽ More

    Submitted 20 March, 2024; originally announced March 2024.

    MSC Class: 94B05; 94C10; 94A60

  46. arXiv:2403.12419  [pdf, ps, other

    cs.IT

    Sparsity-Constrained Community-Based Group Testing

    Authors: Sarthak Jain, Martina Cardone, Soheil Mohajer

    Abstract: In this work, we consider the sparsity-constrained community-based group testing problem, where the population follows a community structure. In particular, the community consists of $F$ families, each with $M$ members. A number $k_f$ out of the $F$ families are infected, and a family is said to be infected if $k_m$ out of its $M$ members are infected. Furthermore, the sparsity constraint allows a… ▽ More

    Submitted 19 March, 2024; originally announced March 2024.

  47. arXiv:2403.10513  [pdf, other

    hep-th astro-ph.CO gr-qc

    Inflationary non-Gaussianities in alpha vacua and consistency with conformal symmetries

    Authors: Arhum Ansari, Pinak Banerjee, Prateksh Dhivakar, Sachin Jain, Nilay Kundu

    Abstract: We study the conformal invariance of inflationary non-Gaussianities associated with scalar fluctuations in a non-Bunch-Davies initial state, known as the $α$-vacuum, in single-field slow-roll inflation. The $α$-vacuum is a one-parameter family of states, including the Bunch-Davies one, that preserves the conformal symmetry of inflationary dynamics in a nearly de-Sitter space-time. Working within t… ▽ More

    Submitted 15 March, 2024; originally announced March 2024.

    Comments: 40 pages + appendices; 2 figures

  48. arXiv:2403.08623  [pdf, ps, other

    math.GR math.GT

    The algebraic structure of hyperbolic graph braid groups

    Authors: B. Appiah, P. Dani, W. Ge, C. Hudson, S. Jain, M. Lemoine, J. Murphy, J. Murray, A. Pandikkadan, K. Schreve, H. Vo

    Abstract: Genevois recently classified which graph braid groups on $\ge 3$ strands are word hyperbolic. In the $3$-strand case, he asked whether all such word hyperbolic groups are actually free; this reduced to checking two infinite classes of graphs: sun and pulsar graphs. We prove that $3$-strand braid groups of sun graphs are free. On the other hand, it was known to experts that $3$-strand braid groups… ▽ More

    Submitted 21 March, 2024; v1 submitted 13 March, 2024; originally announced March 2024.

    Comments: Based on work from a Louisiana State University VIR (Vertically Integrated Research) course. In v2, we reworded the introduction to better reflect what was previously known in the pulsar case and corrected some typos

  49. arXiv:2403.07911  [pdf

    cs.CY cs.AI

    Standing on FURM ground -- A framework for evaluating Fair, Useful, and Reliable AI Models in healthcare systems

    Authors: Alison Callahan, Duncan McElfresh, Juan M. Banda, Gabrielle Bunney, Danton Char, Jonathan Chen, Conor K. Corbin, Debadutta Dash, Norman L. Downing, Sneha S. Jain, Nikesh Kotecha, Jonathan Masterson, Michelle M. Mello, Keith Morse, Srikar Nallan, Abby Pandya, Anurang Revri, Aditya Sharma, Christopher Sharp, Rahul Thapa, Michael Wornow, Alaa Youssef, Michael A. Pfeffer, Nigam H. Shah

    Abstract: The impact of using artificial intelligence (AI) to guide patient care or operational processes is an interplay of the AI model's output, the decision-making protocol based on that output, and the capacity of the stakeholders involved to take the necessary subsequent action. Estimating the effects of this interplay before deployment, and studying it in real time afterwards, are essential to bridge… ▽ More

    Submitted 14 March, 2024; v1 submitted 26 February, 2024; originally announced March 2024.

  50. arXiv:2403.06350  [pdf, other

    cs.CL

    IndicLLMSuite: A Blueprint for Creating Pre-training and Fine-Tuning Datasets for Indian Languages

    Authors: Mohammed Safi Ur Rahman Khan, Priyam Mehta, Ananth Sankar, Umashankar Kumaravelan, Sumanth Doddapaneni, Suriyaprasaad G, Varun Balan G, Sparsh Jain, Anoop Kunchukuttan, Pratyush Kumar, Raj Dabre, Mitesh M. Khapra

    Abstract: Despite the considerable advancements in English LLMs, the progress in building comparable models for other languages has been hindered due to the scarcity of tailored resources. Our work aims to bridge this divide by introducing an expansive suite of resources specifically designed for the development of Indic LLMs, covering 22 languages, containing a total of 251B tokens and 74.8M instruction-re… ▽ More

    Submitted 10 March, 2024; originally announced March 2024.