subscribe to arXiv mailings

Synthetic Data: Revisiting the Privacy-Utility Trade-off

Authors: Fatima Jahan Sarmin, Atiquer Rahman Sarkar, Yang Wang, Noman Mohammed

Abstract: Synthetic data has been considered a better privacy-preserving alternative to traditionally sanitized data across various applications. However, a recent article challenges this notion, stating that synthetic data does not provide a better trade-off between privacy and utility than traditional anonymization techniques, and that it leads to unpredictable utility loss and highly unpredictable privac… ▽ More Synthetic data has been considered a better privacy-preserving alternative to traditionally sanitized data across various applications. However, a recent article challenges this notion, stating that synthetic data does not provide a better trade-off between privacy and utility than traditional anonymization techniques, and that it leads to unpredictable utility loss and highly unpredictable privacy gain. The article also claims to have identified a breach in the differential privacy guarantees provided by PATEGAN and PrivBayes. When a study claims to refute or invalidate prior findings, it is crucial to verify and validate the study. In our work, we analyzed the implementation of the privacy game described in the article and found that it operated in a highly specialized and constrained environment, which limits the applicability of its findings to general cases. Our exploration also revealed that the game did not satisfy a crucial precondition concerning data distributions, which contributed to the perceived violation of the differential privacy guarantees offered by PATEGAN and PrivBayes. We also conducted a privacy-utility trade-off analysis in a more general and unconstrained environment. Our experimentation demonstrated that synthetic data achieves a more favorable privacy-utility trade-off compared to the provided implementation of k-anonymization, thereby reaffirming earlier conclusions. △ Less

Submitted 9 July, 2024; originally announced July 2024.

arXiv:2407.04849 [pdf, other]

MUSIC-lite: Efficient MUSIC using Approximate Computing: An OFDM Radar Case Study

Authors: Rajat Bhattacharjya, Arnab Sarkar, Biswadip Maity, Nikil Dutt

Abstract: Multiple Signal Classification (MUSIC) is a widely used Direction of Arrival (DoA)/Angle of Arrival (AoA) estimation algorithm applied to various application domains such as autonomous driving, medical imaging, and astronomy. However, MUSIC is computationally expensive and challenging to implement in low-power hardware, requiring exploration of trade-offs between accuracy, cost, and power. We pres… ▽ More Multiple Signal Classification (MUSIC) is a widely used Direction of Arrival (DoA)/Angle of Arrival (AoA) estimation algorithm applied to various application domains such as autonomous driving, medical imaging, and astronomy. However, MUSIC is computationally expensive and challenging to implement in low-power hardware, requiring exploration of trade-offs between accuracy, cost, and power. We present MUSIC-lite, which exploits approximate computing to generate a design space exploring accuracy-area-power trade-offs. This is specifically applied to the computationally intensive singular value decomposition (SVD) component of the MUSIC algorithm in an orthogonal frequency-division multiplexing (OFDM) radar use case. MUSIC-lite incorporates approximate adders into the iterative CORDIC algorithm that is used for hardware implementation of MUSIC, generating interesting accuracy-area-power trade-offs. Our experiments demonstrate MUSIC-lite's ability to save an average of 17.25% on-chip area and 19.4% power with a minimal 0.14% error for efficient MUSIC implementations. △ Less

Submitted 5 July, 2024; originally announced July 2024.

Comments: Paper accepted at ESWEEK-CASES 2024 as a Late Breaking (LB) Result paper. The definitive version of the work will appear in IEEE Embedded Systems Letters

arXiv:2407.02903 [pdf, other]

doi 10.1145/3663384.3663389

"It's like a rubber duck that talks back": Understanding Generative AI-Assisted Data Analysis Workflows through a Participatory Prompting Study

Authors: Ian Drosos, Advait Sarkar, Xiaotong Xu, Carina Negreanu, Sean Rintel, Lev Tankelevitch

Abstract: Generative AI tools can help users with many tasks. One such task is data analysis, which is notoriously challenging for non-expert end-users due to its expertise requirements, and where AI holds much potential, such as finding relevant data sources, proposing analysis strategies, and writing analysis code. To understand how data analysis workflows can be assisted or impaired by generative AI, we… ▽ More Generative AI tools can help users with many tasks. One such task is data analysis, which is notoriously challenging for non-expert end-users due to its expertise requirements, and where AI holds much potential, such as finding relevant data sources, proposing analysis strategies, and writing analysis code. To understand how data analysis workflows can be assisted or impaired by generative AI, we conducted a study (n=15) using Bing Chat via participatory prompting. Participatory prompting is a recently developed methodology in which users and researchers reflect together on tasks through co-engagement with generative AI. In this paper we demonstrate the value of the participatory prompting method. We found that generative AI benefits the information foraging and sensemaking loops of data analysis in specific ways, but also introduces its own barriers and challenges, arising from the difficulties of query formulation, specifying context, and verifying results. △ Less

Submitted 3 July, 2024; originally announced July 2024.

Comments: Ian Drosos, Advait Sarkar, Xiaotong Xu, Carina Negreanu, Sean Rintel, and Lev Tankelevitch. 2024. "It's like a rubber duck that talks back": Understanding Generative AI-Assisted Data Analysis Workflows through a Participatory Prompting Study. In Proceedings of the 3rd Annual Meeting of the Symposium on Human-Computer Interaction for Work (CHIWORK 2024)

Journal ref: Proceedings of the 3rd Annual Meeting of the Symposium on Human-Computer Interaction for Work (CHIWORK 2024)

arXiv:2407.02651 [pdf, other]

Improving Steering and Verification in AI-Assisted Data Analysis with Interactive Task Decomposition

Authors: Majeed Kazemitabaar, Jack Williams, Ian Drosos, Tovi Grossman, Austin Henley, Carina Negreanu, Advait Sarkar

Abstract: LLM-powered tools like ChatGPT Data Analysis, have the potential to help users tackle the challenging task of data analysis programming, which requires expertise in data processing, programming, and statistics. However, our formative study (n=15) uncovered serious challenges in verifying AI-generated results and steering the AI (i.e., guiding the AI system to produce the desired output). We develo… ▽ More LLM-powered tools like ChatGPT Data Analysis, have the potential to help users tackle the challenging task of data analysis programming, which requires expertise in data processing, programming, and statistics. However, our formative study (n=15) uncovered serious challenges in verifying AI-generated results and steering the AI (i.e., guiding the AI system to produce the desired output). We developed two contrasting approaches to address these challenges. The first (Stepwise) decomposes the problem into step-by-step subgoals with pairs of editable assumptions and code until task completion, while the second (Phasewise) decomposes the entire problem into three editable, logical phases: structured input/output assumptions, execution plan, and code. A controlled, within-subjects experiment (n=18) compared these systems against a conversational baseline. Users reported significantly greater control with the Stepwise and Phasewise systems, and found intervention, correction, and verification easier, compared to the baseline. The results suggest design guidelines and trade-offs for AI-assisted data analysis tools. △ Less

Submitted 2 July, 2024; originally announced July 2024.

Comments: Conditionally Accepted to UIST 2024; 19 pages, 8 figures, and 2 tables

arXiv:2407.02360 [pdf, other]

Physics of 1 keV line in X-ray binaries

Authors: Priyanka Chakraborty, Gary Ferland, Andrew Fabian, Arnab Sarkar, Renee Ludlam, Stefano Bianchi, Hayden Hall, Peter Kosec

Abstract: X-ray binaries (XRBs) often exhibit spectral residuals in the 0.5 to 2 keV range, known as the "1 keV residual/1 keV feature", with variable centroid and intensity across different systems. Yet a comprehensive scientific explanation of the variability of the 1 keV feature has remained largely elusive. In this paper, we explain for the first time the origin and variability of the 1 keV feature in X… ▽ More X-ray binaries (XRBs) often exhibit spectral residuals in the 0.5 to 2 keV range, known as the "1 keV residual/1 keV feature", with variable centroid and intensity across different systems. Yet a comprehensive scientific explanation of the variability of the 1 keV feature has remained largely elusive. In this paper, we explain for the first time the origin and variability of the 1 keV feature in XRBs using the spectral synthesis code \textsc{Cloudy}. We constructed line blends for the emission and absorption lines and study the variability of these blends with ionization parameters, temperature, and column density. We conducted a sample study involving five XRBs including two ultraluminous X-ray sources (ULXs): NGC 247 ULX-1, NGC 1313 X-1, a binary X-ray pulsar: Hercules X-1, and two typical low-mass X-ray binaries (LMXBs): Cygnus X-2, and Serpens X-1, providing a comprehensive explanation of the 1 keV feature observed across these targets. △ Less

Submitted 2 July, 2024; originally announced July 2024.

Comments: 22 pages, 12 figures, Submitted to MNRAS

arXiv:2406.17896 [pdf, other]

Prospects of Detecting the Global HI-21cm Signal at uGMRT through the Gravitational Lensing by an isolated Neutron Star

Authors: Rupa Basu, Siddhartha Bhattacharyya, Anjan Kumar Sarkar, Shibaji Banerjee, Debasis Majumder

Abstract: The strength of the global HI-21cm signal is several orders of magnitude lower than the foreground and background noise and hence it is difficult to observe this signal at a given radio telescope. However, a few recent studies reported the detection of that signal at the radio band suggests the strength of this signal is somehow magnified. In this analysis, we study the prospects of detecting this… ▽ More The strength of the global HI-21cm signal is several orders of magnitude lower than the foreground and background noise and hence it is difficult to observe this signal at a given radio telescope. However, a few recent studies reported the detection of that signal at the radio band suggests the strength of this signal is somehow magnified. In this analysis, we study the prospects of detecting this global signal at different frequency bands of uGMRT where this global signal is supposed to be amplified through the strong gravitational lensing by an isolated neutron star located in a cosmological distance. Our study shows the effects of the lensing parameters on the observables of that amplified global signal and discusses its variation with the frequency bands considered here. We present a method to estimate the position and size of an isolated neutron star using the signal-to-noise ratio of that global signal supposed to be detected at different frequency bands of uGMRT. We discuss the scope of multi-messenger astronomy in the era of HI-21cm observation where the estimated lensing parameters can be cross-validated using the pulsar detection at the X-ray band from the same location in the sky. Our analysis is equally applicable to any radio telescope with given specifications. △ Less

Submitted 25 June, 2024; originally announced June 2024.

Comments: 17 Pages, 4 figures, comments and suggestions are welcome

arXiv:2406.17630 [pdf, other]

KANQAS: Kolmogorov Arnold Network for Quantum Architecture Search

Authors: Akash Kundu, Aritra Sarkar, Abhishek Sadhu

Abstract: Quantum architecture search~(QAS) is a promising direction for optimization and automated design of quantum circuits towards quantum advantage. Recent techniques in QAS focus on machine learning-based approaches from reinforcement learning, like deep Q-network. While multi-layer perceptron-based deep Q-networks have been applied for QAS, their interpretability remains challenging due to the high n… ▽ More Quantum architecture search~(QAS) is a promising direction for optimization and automated design of quantum circuits towards quantum advantage. Recent techniques in QAS focus on machine learning-based approaches from reinforcement learning, like deep Q-network. While multi-layer perceptron-based deep Q-networks have been applied for QAS, their interpretability remains challenging due to the high number of parameters. In this work, we evaluate the practicality of KANs in quantum architecture search problems, analyzing their efficiency in terms of the probability of success, frequency of optimal solutions and their dependencies on various degrees of freedom of the network. In a noiseless scenario, the probability of success and the number of optimal quantum circuit configurations to generate the multi-qubit maximally entangled states are significantly higher than MLPs. Moreover in noisy scenarios, KAN can achieve a better fidelity in approximating maximally entangled state than MLPs, where the performance of the MLP significantly depends on the choice of activation function. Further investigation reveals that KAN requires a very small number of learnable parameters compared to MLPs, however, the average time of executing each episode for KAN is much higher. △ Less

Submitted 25 June, 2024; originally announced June 2024.

Comments: 10 pages and 4 figures

arXiv:2406.17610 [pdf, other]

YAQQ: Yet Another Quantum Quantizer -- Design Space Exploration of Quantum Gate Sets using Novelty Search

Authors: Aritra Sarkar, Akash Kundu, Matthew Steinberg, Sibasish Mishra, Sebastiaan Fauquenot, Tamal Acharya, Jarosław A. Miszczak, Sebastian Feld

Abstract: In the standard circuit model of quantum computation, the number and quality of the quantum gates composing the circuit influence the runtime and fidelity of the computation. The fidelity of the decomposition of quantum algorithms, represented as unitary matrices, to bounded depth quantum circuits depends strongly on the set of gates available for the decomposition routine. To investigate this dep… ▽ More In the standard circuit model of quantum computation, the number and quality of the quantum gates composing the circuit influence the runtime and fidelity of the computation. The fidelity of the decomposition of quantum algorithms, represented as unitary matrices, to bounded depth quantum circuits depends strongly on the set of gates available for the decomposition routine. To investigate this dependence, we explore the design space of discrete quantum gate sets and present a software tool for comparative analysis of quantum processing units and control protocols based on their native gates. The evaluation is conditioned on a set of unitary transformations representing target use cases on the quantum processors. The cost function considers three key factors: (i) the statistical distribution of the decomposed circuits' depth, (ii) the statistical distribution of process fidelities for the approximate decomposition, and (iii) the relative novelty of a gate set compared to other gate sets in terms of the aforementioned properties. The developed software, YAQQ (Yet Another Quantum Quantizer), enables the discovery of an optimized set of quantum gates through this tunable joint cost function. To identify these gate sets, we use the novelty search algorithm, circuit decomposition techniques, and stochastic optimization to implement YAQQ within the Qiskit quantum simulator environment. YAQQ exploits reachability tradeoffs conceptually derived from quantum algorithmic information theory. Our results demonstrate the pragmatic application of identifying gate sets that are advantageous to popularly used quantum gate sets in representing quantum algorithms. Consequently, we demonstrate pragmatic use cases of YAQQ in comparing transversal logical gate sets in quantum error correction codes, designing optimal quantum instruction sets, and compiling to specific quantum processors. △ Less

Submitted 25 June, 2024; originally announced June 2024.

arXiv:2406.16542 [pdf, other]

Thermal Evolution of the IGM due to Lyman-α photons during the Cosmic Dawn

Authors: Janakee Raste, Anjan Kumar Sarkar, Shiv K. Sethi

Abstract: The first star-forming objects which formed at high redshifts during the cosmic dawn (CD) also emitted photons between Lyman-$α$ and Lyman-limit frequencies. These photons are instrumental in coupling the spin temperature of the neutral hydrogen (HI) atoms with the kinetic temperature of the intergalactic medium (IGM). Along with this coupling effect, these photons also impact the kinetic temperat… ▽ More The first star-forming objects which formed at high redshifts during the cosmic dawn (CD) also emitted photons between Lyman-$α$ and Lyman-limit frequencies. These photons are instrumental in coupling the spin temperature of the neutral hydrogen (HI) atoms with the kinetic temperature of the intergalactic medium (IGM). Along with this coupling effect, these photons also impact the kinetic temperature by exchanging energy with the HI atoms. The injected Lyman-$α$ photons in general cool the medium, while the continuum photons heat the medium. While studying this effect in the literature, quasi-static profile around the Lyman-$α$ frequency is assumed. In this paper, we solve the time-dependent coupled dynamics of the photon intensity profile along with the evolution of the thermal state of the IGM and HI spin temperature. It is expected that, during the CD era, the IGM has a mix of continuum photons with 10-20% of injected photons. For this case, we show that the system reaches thermal equilibrium in around 1 Myr, with final temperature in the range 50-100 K. This time scale is comparable to the source lifetime of PopIII stars at high redshifts. One impact of switching off short-lived sources is that it can keep the system heated above the temperature of the quasi-static state. We also show that the quasi-static equilibrium for the continuum photons is only achieved on time scales of 100 Myr at $z\simeq 20$, comparable to the age of the Universe. We also briefly discuss how the Lyman-$α$ induced heating can impact the 21 cm signal from CD. △ Less

Submitted 24 June, 2024; originally announced June 2024.

Comments: 18 pages, 6 figures, submitted to ApJ

arXiv:2406.15298 [pdf, ps, other]

Visibility property in one and several variables and its applications

Authors: Vikramjeet Singh Chandel, Sushil Gorai, Anwoy Maitra, Amar Deep Sarkar

Abstract: In this paper we report our investigations on visibility with respect to the Kobayashi distance and its applications, with a special focus on planar domains. We prove that totally disconnected subsets of the boundary are removable in the context of visibility. We also show that a domain in $\mathbb{C}^n$ is a local weak visibility domain if and only if it is a weak visibility domain. The above hol… ▽ More In this paper we report our investigations on visibility with respect to the Kobayashi distance and its applications, with a special focus on planar domains. We prove that totally disconnected subsets of the boundary are removable in the context of visibility. We also show that a domain in $\mathbb{C}^n$ is a local weak visibility domain if and only if it is a weak visibility domain. The above holds also for visibility. Along the way, we prove an intrinsic localization result for the Kobayashi distance. Moreover, we observe some interesting consequences of weak visibility; for example, weak visibility implies compactness of the end topology of the closure of the domain. For planar domains: (i) We provide examples of visibility domains that are not locally Goldilocks at any boundary point. (ii) We provide certain general conditions on planar domains that yield the continuous extension of conformal maps, generalizing the Carathéodory extension theorem. Our conditions are quite general and assume very little regularity of the boundary. We demonstrate this through examples. (iii) We also provide conditions for the homeomorphic extension of biholomorphic maps up to the boundary. (iv) We prove that a hyperbolic, simply connected domain possesses the visibility property if and only if its boundary is locally connected. This leads us to reformulate the MLC conjecture in terms of visibility. (v) We provide a characterization of visibility for a large class of planar domains including certain uncountably connected domains. △ Less

Submitted 21 June, 2024; originally announced June 2024.

Comments: 66 pages. Comments are welcome

MSC Class: Primary: 32F45; 30D40; Secondary: 32Q45; 53C23

arXiv:2406.11963 [pdf, other]

Lepton Collider as a window to Reheating

Authors: Basabendu Barman, Subhaditya Bhattacharya, Sahabub Jahedi, Dipankar Pradhan, Abhik Sarkar

Abstract: We propose a search strategy for MeV-scale feebly interacting massive particle (FIMP) dark matter (DM) at the $e^+e^-$ collider. We argue, detection of a mono-$γ$ signal plus missing energy can indicate to an MeV-scale reheating temperature of the Universe, after addressing observed DM abundance and other relevant constraints. We propose a search strategy for MeV-scale feebly interacting massive particle (FIMP) dark matter (DM) at the $e^+e^-$ collider. We argue, detection of a mono-$γ$ signal plus missing energy can indicate to an MeV-scale reheating temperature of the Universe, after addressing observed DM abundance and other relevant constraints. △ Less

Submitted 17 June, 2024; originally announced June 2024.

Comments: 9 pages, 7 figures, and 1 table

arXiv:2406.05793 [pdf, ps, other]

Existence of Positive Solutions for Generalized Fractional Brézis-Nirenberg Problem

Authors: Rohit Kumar, Abhishek Sarkar

Abstract: In this article, we study the fractional Brézis-Nirenberg type problem on whole domain $\mathbb{R}^N$ associated with the fractional $p$-Laplace operator. To be precise, we want to study the following problem: \begin{equation*} (-Δ)_{p}^{s}u - λw |u|^{p-2}u= |u|^{p_{s}^{*}-2}u \quad \text{in} ~\mathcal{D}^{s,p}(\mathbb{R}^{N}), \end{equation*} where… ▽ More In this article, we study the fractional Brézis-Nirenberg type problem on whole domain $\mathbb{R}^N$ associated with the fractional $p$-Laplace operator. To be precise, we want to study the following problem: \begin{equation*} (-Δ)_{p}^{s}u - λw |u|^{p-2}u= |u|^{p_{s}^{*}-2}u \quad \text{in} ~\mathcal{D}^{s,p}(\mathbb{R}^{N}), \end{equation*} where $s\in (0,1),~p \in (1,\frac{N}{s}), ~p_{s}^{*}= \frac{Np}{N-sp}$ and the operator $(-Δ)_{p}^{s}$ is the fractional $p$-Laplace operator. The space $\mathcal{D}^{s,p}(\mathbb{R}^{N})$ is the completion of $C_c^\infty(\mathbb{R}^N)$ with respect to the Gaglairdo semi-norm. In this article, we prove the existence of a positive solution to this problem by allowing the Hardy weight $w$ to change its sign. △ Less

Submitted 9 June, 2024; originally announced June 2024.

Comments: 28 pages

MSC Class: 35B09; 35R11

arXiv:2406.05306 [pdf, other]

Detection of New Galaxy Candidates at $z\ >$ 11 in the JADES Field Using JWST NIRCam

Authors: Priyanka Chakraborty, Arnab Sarkar, Scott Wolk, Benjamin Schneider, Nancy Brickhouse, Kenneth Lanzetta, Adam Foster, Randall Smith

Abstract: We report the detection of seven new galaxy candidates with redshift $z$ $>$ 11 within the JWST Advanced Deep Extragalactic Survey (JADES) GOODS-S and GOODS-N fields. These new candidates are detected through meticulous analysis of NIRCam photometry in eight filters spanning a wavelength range of 0.8-5.0 $μ$m. Photometric redshifts of these galaxy candidates are independently measured utilizing sp… ▽ More We report the detection of seven new galaxy candidates with redshift $z$ $>$ 11 within the JWST Advanced Deep Extragalactic Survey (JADES) GOODS-S and GOODS-N fields. These new candidates are detected through meticulous analysis of NIRCam photometry in eight filters spanning a wavelength range of 0.8-5.0 $μ$m. Photometric redshifts of these galaxy candidates are independently measured utilizing spectral energy distribution (SED) fitting techniques using \texttt{EAZY} and \texttt{BAGPIPES} codes, followed by visual scrutiny. Two of these galaxy candidates are located in GOODS-S field, while the remaining five galaxies are located in GOODS-N field. Our analysis reveals that the stellar masses of these galaxies typically range from log $M_{\ast}$/$M_{\odot}$ = 7.75--8.75. Futhermore, these galaxies are typically young with their mass-weighted ages spanning from 80 to 240 Myr. Their specific star formation rates (sSFR), quantified as $\log (\text{sSFR}/\text{Gyr}$), are measured to vary between $\sim 0.95$ to 1.46. These new galaxy candidates offer a robust sample for probing the physical properties of galaxies within the first few hundred Myr of the history of the Universe. We also analyze the relationship between star formation rate (SFR) and stellar mass ($M_\ast$) within our sample. Using linear regression, our analysis yields a slope of $0.71 \pm 0.12$, which we then compare with results from previous studies. Continued investigation through spectroscopic analysis using JWST/NIRSpec is needed to spectroscopically confirm these high-redshift galaxy candidates and investigate further into their physical properties. We plan to follow up on these candidates with future NIRSpec observations. △ Less

Submitted 16 June, 2024; v1 submitted 7 June, 2024; originally announced June 2024.

Comments: 12 pages, 10 figures, Submitted to MNRAS

arXiv:2406.02928 [pdf, other]

doi 10.1021/jacs.4c05478

Unveiling a Family of Dimerized Quantum Magnets in Ternary Metal Borides

Authors: Zhen Zhang, Andrew P. Porter, Yang Sun, Kirill D. Belashchenko, Gayatri Viswanathan, Arka Sarkar, Kirill Kovnir, Kai-Ming Ho, Vladimir Antropov

Abstract: Dimerized quantum magnets are exotic crystalline materials where Bose-Einstein condensation of magnetic excitations can happen. However, known dimerized quantum magnets are limited to only a few oxides and halides. Here, we unveil 9 dimerized quantum magnets and 11 conventional antiferromagnets in ternary metal borides MTB$_4$ (M = Sc, Y, La, Ce, Lu, Mg, Ca, Al; T = V, Cr, Mn, Fe, Co, Ni). In this… ▽ More Dimerized quantum magnets are exotic crystalline materials where Bose-Einstein condensation of magnetic excitations can happen. However, known dimerized quantum magnets are limited to only a few oxides and halides. Here, we unveil 9 dimerized quantum magnets and 11 conventional antiferromagnets in ternary metal borides MTB$_4$ (M = Sc, Y, La, Ce, Lu, Mg, Ca, Al; T = V, Cr, Mn, Fe, Co, Ni). In this type of structure, 3d transition-metal atoms T are arranged in dimers. Quantum magnetism in these compounds is dominated by strong antiferromagnetic interactions between Cr (both Cr and Mn for M = Mg and Ca) atoms within the structural dimers, with much weaker interactions between the dimers. These systems are proposed to be close to a quantum critical point between a disordered singlet spin-dimer phase, with a spin gap, and the ordered conventional Néel antiferromagnetic phase. This new family of dimerized quantum magnets greatly enriches the materials inventory that allows investigations of the spin-gap phase. All the quantum-, conventionally-, and non-magnetic systems identified, together with experimental synthesis methods of a phase suitable for characterization, provide a platform with abundant possibilities to tune the magnetic exchange coupling by doping and study this unconventional type of quantum phase transition. This work opens up new avenues for studying the quantum magnetism of spin dimers in borides and establishes a theoretical workflow for future searches for dimerized quantum magnets in other families or types of materials. △ Less

Submitted 5 June, 2024; originally announced June 2024.

arXiv:2406.02499 [pdf, other]

Electronic properties of magnetic semiconductor $\textrm{CuMnO}_{2}$ : a first principles study

Authors: Apurba Sarkar, Joydeep Chatterjee, Arghya Taraphder, Nandan Pakhira

Abstract: Geometrically frustrated magnetic semiconductor $\textrm{CuMnO}_{2}$ has potential applications as photo-catalyst, in photochemical cells and multi-ferroic devices. Electronic band structure in the antiferromagnetic and ferromagnetic phases of $\textrm{CuMnO}_{2}$ were calculated using first principle density functional theory (DFT) as implemented in VASP. Electronic band structure in the antiferr… ▽ More Geometrically frustrated magnetic semiconductor $\textrm{CuMnO}_{2}$ has potential applications as photo-catalyst, in photochemical cells and multi-ferroic devices. Electronic band structure in the antiferromagnetic and ferromagnetic phases of $\textrm{CuMnO}_{2}$ were calculated using first principle density functional theory (DFT) as implemented in VASP. Electronic band structure in the antiferromagnetic state shows indirect band gap ($\sim 0.53$ eV) where as in the ferromagnetic state it shows half-metallic state with 100\% spin polarization. The half-metallic state arises due to \textit{double exchange} mechanism. In the half-metallic state the density of states for the up spin channel shows asymmetric power law behaviour near the Fermi level while the down spin channel shows fully gapped behaviour. The calculated magnetic moment of Mn atom in the ferromagnetic (3.70 $μ_{B}$) and antiferromagnetic (3.57 $μ_{B}$) states are consistent with experimental values. Our calculation predicts potential application of $\textrm{CuMnO}_{2}$ in spintronic devices especially in the ferromagnetic state, as a spin injector for spin valves in spintronic devices. △ Less

Submitted 4 June, 2024; originally announced June 2024.

Comments: 7 pages 8 figures

arXiv:2406.01917 [pdf, other]

GOMAA-Geo: GOal Modality Agnostic Active Geo-localization

Authors: Anindya Sarkar, Srikumar Sastry, Aleksis Pirinen, Chongjie Zhang, Nathan Jacobs, Yevgeniy Vorobeychik

Abstract: We consider the task of active geo-localization (AGL) in which an agent uses a sequence of visual cues observed during aerial navigation to find a target specified through multiple possible modalities. This could emulate a UAV involved in a search-and-rescue operation navigating through an area, observing a stream of aerial images as it goes. The AGL task is associated with two important challenge… ▽ More We consider the task of active geo-localization (AGL) in which an agent uses a sequence of visual cues observed during aerial navigation to find a target specified through multiple possible modalities. This could emulate a UAV involved in a search-and-rescue operation navigating through an area, observing a stream of aerial images as it goes. The AGL task is associated with two important challenges. Firstly, an agent must deal with a goal specification in one of multiple modalities (e.g., through a natural language description) while the search cues are provided in other modalities (aerial imagery). The second challenge is limited localization time (e.g., limited battery life, urgency) so that the goal must be localized as efficiently as possible, i.e. the agent must effectively leverage its sequentially observed aerial views when searching for the goal. To address these challenges, we propose GOMAA-Geo - a goal modality agnostic active geo-localization agent - for zero-shot generalization between different goal modalities. Our approach combines cross-modality contrastive learning to align representations across modalities with supervised foundation model pretraining and reinforcement learning to obtain highly effective navigation and localization policies. Through extensive evaluations, we show that GOMAA-Geo outperforms alternative learnable approaches and that it generalizes across datasets - e.g., to disaster-hit areas without seeing a single disaster scenario during training - and goal modalities - e.g., to ground-level imagery or textual descriptions, despite only being trained with goals specified as aerial views. Code and models are publicly available at https://github.com/mvrl/GOMAA-Geo/tree/main. △ Less

Submitted 3 June, 2024; originally announced June 2024.

Comments: 23 pages, 17 figures

arXiv:2406.00748 [pdf]

Augmenting the FedProx Algorithm by Minimizing Convergence

Authors: Anomitra Sarkar, Lavanya Vajpayee

Abstract: The Internet of Things has experienced significant growth and has become an integral part of various industries. This expansion has given rise to the Industrial IoT initiative where industries are utilizing IoT technology to enhance communication and connectivity through innovative solutions such as data analytics and cloud computing. However this widespread adoption of IoT is demanding of algorit… ▽ More The Internet of Things has experienced significant growth and has become an integral part of various industries. This expansion has given rise to the Industrial IoT initiative where industries are utilizing IoT technology to enhance communication and connectivity through innovative solutions such as data analytics and cloud computing. However this widespread adoption of IoT is demanding of algorithms that provide better efficiency for the same training environment without speed being a factor. In this paper we present a novel approach called G Federated Proximity. Building upon the existing FedProx technique our implementation introduces slight modifications to enhance its efficiency and effectiveness. By leveraging FTL our proposed system aims to improve the accuracy of model obtained after the training dataset with the help of normalization techniques such that it performs better on real time devices and heterogeneous networks Our results indicate a significant increase in the throughput of approximately 90% better convergence compared to existing model performance. △ Less

Submitted 2 June, 2024; originally announced June 2024.

ACM Class: F.2.2; I.2.7

arXiv:2405.19328 [pdf, other]

Normative Modules: A Generative Agent Architecture for Learning Norms that Supports Multi-Agent Cooperation

Authors: Atrisha Sarkar, Andrei Ioan Muresanu, Carter Blair, Aaryam Sharma, Rakshit S Trivedi, Gillian K Hadfield

Abstract: Generative agents, which implement behaviors using a large language model (LLM) to interpret and evaluate an environment, has demonstrated the capacity to solve complex tasks across many social and technological domains. However, when these agents interact with other agents and humans in presence of social structures such as existing norms, fostering cooperation between them is a fundamental chall… ▽ More Generative agents, which implement behaviors using a large language model (LLM) to interpret and evaluate an environment, has demonstrated the capacity to solve complex tasks across many social and technological domains. However, when these agents interact with other agents and humans in presence of social structures such as existing norms, fostering cooperation between them is a fundamental challenge. In this paper, we develop the framework of a 'Normative Module': an architecture designed to enhance cooperation by enabling agents to recognize and adapt to the normative infrastructure of a given environment. We focus on the equilibrium selection aspect of the cooperation problem and inform our agent design based on the existence of classification institutions that implement correlated equilibrium to provide effective resolution of the equilibrium selection problem. Specifically, the normative module enables agents to learn through peer interactions which of multiple candidate institutions in the environment, does a group treat as authoritative. By enabling normative competence in this sense, agents gain ability to coordinate their sanctioning behaviour; coordinated sanctioning behaviour in turn shapes primary behaviour within a social environment, leading to higher average welfare. We design a new environment that supports institutions and evaluate the proposed framework based on two key criteria derived from agent interactions with peers and institutions: (i) the agent's ability to disregard non-authoritative institutions and (ii) the agent's ability to identify authoritative institutions among several options. We show that these capabilities allow the agent to achieve more stable cooperative outcomes compared to baseline agents without the normative module, paving the way for research in a new avenue of designing environments and agents that account for normative infrastructure. △ Less

Submitted 29 May, 2024; originally announced May 2024.

arXiv:2405.17281 [pdf]

Anisotropic Third Harmonic Generation in Two-Dimensional Tin Sulfide

Authors: George Miltos Maragkakis, Sotiris Psilodimitrakopoulos, Leonidas Mouchliadis, Abdus Salam Sarkar, Andreas Lemonis, George Kioseoglou, Emmanuel Stratakis

Abstract: The in-plane anisotropic properties of two-dimensional (2D) group IV monochalcogenides provide an additional degree of freedom which can be used in future optoelectronic devices. Here, it is shown that the third harmonic generation (THG) signal produced by ultrathin tin (II) sulfide (SnS) is in-plane anisotropic with respect to the incident linear polarization of the laser field. We fit the experi… ▽ More The in-plane anisotropic properties of two-dimensional (2D) group IV monochalcogenides provide an additional degree of freedom which can be used in future optoelectronic devices. Here, it is shown that the third harmonic generation (THG) signal produced by ultrathin tin (II) sulfide (SnS) is in-plane anisotropic with respect to the incident linear polarization of the laser field. We fit the experimental polarization-resolved THG (P-THG) measurements with a nonlinear optics model, which accounts for the orthorhombic crystal structure of 2D SnS. We calculate the relative magnitudes of the \{chi}^(3) tensor components by recording and simultaneously fitting both orthogonal components of the P-THG intensity. Furthermore, we introduce a THG anisotropy ratio, whose calculated values compare the total THG intensity when the excitation linear polarization is along the armchair crystallographic direction with the case when it is along the zigzag direction. Our results provide quantitative information on the anisotropic nature of the THG process in SnS, paving the way to a better understanding of anisotropic nonlinear light-matter interactions, and the development of polarization-sensitive nonlinear optical devices. △ Less

Submitted 27 May, 2024; originally announced May 2024.

arXiv:2405.15031 [pdf, other]

Amortized nonmyopic active search via deep imitation learning

Authors: Quan Nguyen, Anindya Sarkar, Roman Garnett

Abstract: Active search formalizes a specialized active learning setting where the goal is to collect members of a rare, valuable class. The state-of-the-art algorithm approximates the optimal Bayesian policy in a budget-aware manner, and has been shown to achieve impressive empirical performance in previous work. However, even this approximate policy has a superlinear computational complexity with respect… ▽ More Active search formalizes a specialized active learning setting where the goal is to collect members of a rare, valuable class. The state-of-the-art algorithm approximates the optimal Bayesian policy in a budget-aware manner, and has been shown to achieve impressive empirical performance in previous work. However, even this approximate policy has a superlinear computational complexity with respect to the size of the search problem, rendering its application impractical in large spaces or in real-time systems where decisions must be made quickly. We study the amortization of this policy by training a neural network to learn to search. To circumvent the difficulty of learning from scratch, we appeal to imitation learning techniques to mimic the behavior of the expert, expensive-to-compute policy. Our policy network, trained on synthetic data, learns a beneficial search strategy that yields nonmyopic decisions carefully balancing exploration and exploitation. Extensive experiments demonstrate our policy achieves competitive performance at real-world tasks that closely approximates the expert's at a fraction of the cost, while outperforming cheaper baselines. △ Less

Submitted 23 May, 2024; originally announced May 2024.

arXiv:2405.12418 [pdf, ps, other]

Learning models on rooted regular trees with majority update policy: convergence and phase transition

Authors: Moumanti Podder, Anish Sarkar

Abstract: We study a learning model in which an agent is stationed at each vertex of $\mathbb{T}_{m}$, the rooted tree in which each vertex has $m$ children. At any time-step $t \in \mathbb{N}_{0}$, they are allowed to select one of two available technologies: $B$ and $R$. Let the technology chosen by the agent at vertex $v\in\mathbb{T}_{m}$, at time-step $t$, be $C_{t}(v)$. Let… ▽ More We study a learning model in which an agent is stationed at each vertex of $\mathbb{T}_{m}$, the rooted tree in which each vertex has $m$ children. At any time-step $t \in \mathbb{N}_{0}$, they are allowed to select one of two available technologies: $B$ and $R$. Let the technology chosen by the agent at vertex $v\in\mathbb{T}_{m}$, at time-step $t$, be $C_{t}(v)$. Let $\{C_{0}(v):v\in\mathbb{T}_{m}\}$ be i.i.d., where $C_{0}(v)=B$ with probability $π_{0}$. During epoch $t$, the agent at $v$ performs an experiment that results in success with probability $p_{B}$ if $C_{t}(v)=B$, and with probability $p_{R}$ if $C_{t}(v)=R$. If the children of $v$ are $v_{1},\ldots,v_{m}$, the agent at $v$ updates their technology to $C_{t+1}(v)=B$ if the number of successes among all $v_{i}$ with $C_{t}(v_{i})=B$ exceeds, strictly, the number of successes among all $v_{j}$ with $C_{t}(v_{j})=R$. If these numbers are equal, then the agent at $v$ sets $C_{t+1}(v)=B$ with probability $1/2$. Else, $C_{t+1}(v)=R$. We show that $\{C_{t}(v):v\in\mathbb{T}_{m}\}$ is i.i.d., where $C_{t}(v)=B$ with probability $π_{t}$, and $\{π_{t}\}_{t \in \mathbb{N}_{0}}$ converges to a fixed point $π$ of a function $g_{m}$. For $m \geqslant 3$, there exists a $p(m) \in (0,1)$ such that $g_{m}$ has a unique fixed point, $1/2$, when $p \leqslant p(m)$, and three distinct fixed points, of the form $α$, $1/2$ and $1-α$, when $p > p(m)$. When $m=3$, $p_{B}=1$ and $p_{R} \in [0,1)$, we show that $g_{3}$ has a unique fixed point, $1$, when $p_{R} < \sqrt{3}-1$, two distinct fixed points, one of which is $1$, when $p_{R} = \sqrt{3}-1$, and three distinct fixed points, one of which is $1$, when $p_{R} > \sqrt{3}-1$. When $g_{m}$ has multiple fixed points, we also specify which of these fixed points $π$ equals, depending on $π_{0}$. For $m=2$, we describe the behaviour of $g_{3}$ for all $p_{B}$ and $p_{R}$. △ Less

Submitted 20 May, 2024; originally announced May 2024.

arXiv:2405.09309 [pdf, ps, other]

Identification via Permutation Channels

Authors: Abhishek Sarkar, Bikash Kumar Dey

Abstract: We study message identification over a $q$-ary uniform permutation channel, where the transmitted vector is permuted by a permutation chosen uniformly at random. For discrete memoryless channels (DMCs), the number of identifiable messages grows doubly exponentially. Identification capacity, the maximum second-order exponent, is known to be the same as the Shannon capacity of the DMC. Permutation c… ▽ More We study message identification over a $q$-ary uniform permutation channel, where the transmitted vector is permuted by a permutation chosen uniformly at random. For discrete memoryless channels (DMCs), the number of identifiable messages grows doubly exponentially. Identification capacity, the maximum second-order exponent, is known to be the same as the Shannon capacity of the DMC. Permutation channels support reliable communication of only polynomially many messages. A simple achievability result shows that message sizes growing as $2^{c_nn^{q-1}}$ are identifiable for any $c_n\rightarrow 0$. We prove two converse results. A ``soft'' converse shows that for any $R>0$, there is no sequence of identification codes with message size growing as $2^{Rn^{q-1}}$ with a power-law decay ($n^{-μ}$) of the error probability. We also prove a ``strong" converse showing that for any sequence of identification codes with message size $2^{Rn^{q-1}\log n}$ ($R>0$), the sum of type I and type II error probabilities approaches at least $1$ as $n\rightarrow \infty$. To prove the soft converse, we use a sequence of steps to construct a new identification code with a simpler structure which relates to a set system, and then use a lower bound on the normalized maximum pairwise intersection of a set system. To prove the strong converse, we use results on approximation of distributions. △ Less

Submitted 4 June, 2024; v1 submitted 15 May, 2024; originally announced May 2024.

Comments: 9 pages. Extended and generalized version of submission to ITW 2024

MSC Class: 68P30; 94A15

arXiv:2405.06667 [pdf, other]

Sentiment Polarity Analysis of Bangla Food Reviews Using Machine and Deep Learning Algorithms

Authors: Al Amin, Anik Sarkar, Md Mahamodul Islam, Asif Ahammad Miazee, Md Robiul Islam, Md Mahmudul Hoque

Abstract: The Internet has become an essential tool for people in the modern world. Humans, like all living organisms, have essential requirements for survival. These include access to atmospheric oxygen, potable water, protective shelter, and sustenance. The constant flux of the world is making our existence less complicated. A significant portion of the population utilizes online food ordering services to… ▽ More The Internet has become an essential tool for people in the modern world. Humans, like all living organisms, have essential requirements for survival. These include access to atmospheric oxygen, potable water, protective shelter, and sustenance. The constant flux of the world is making our existence less complicated. A significant portion of the population utilizes online food ordering services to have meals delivered to their residences. Although there are numerous methods for ordering food, customers sometimes experience disappointment with the food they receive. Our endeavor was to establish a model that could determine if food is of good or poor quality. We compiled an extensive dataset of over 1484 online reviews from prominent food ordering platforms, including Food Panda and HungryNaki. Leveraging the collected data, a rigorous assessment of various deep learning and machine learning techniques was performed to determine the most accurate approach for predicting food quality. Out of all the algorithms evaluated, logistic regression emerged as the most accurate, achieving an impressive 90.91% accuracy. The review offers valuable insights that will guide the user in deciding whether or not to order the food. △ Less

Submitted 3 May, 2024; originally announced May 2024.

arXiv:2405.06602 [pdf, other]

doi 10.3847/1538-4357/ad47c6

Advancing Precision Particle Background Estimation for Future X-ray Missions: Correlated Variability between AMS and Chandra/XMM-Newton

Authors: Arnab Sarkar, Catherine E. Grant, Eric D. Miller, Mark Bautz, Benjamin Schneider, Rick F. Foster, Gerrit Schellenberger, Steven Allen, Ralph P. Kraft, Dan Wilkins, Abe Falcone, Andrew Ptak

Abstract: Galactic cosmic ray (GCR) particles have a significant impact on the particle-induced background of X-ray observatories, and their flux exhibits substantial temporal variability, potentially influencing background levels. In this study, we present one-day binned high-energy reject rates derived from the Chandra-ACIS and XMM-Newton EPIC-pn instruments, serving as proxies for GCR particle flux. We s… ▽ More Galactic cosmic ray (GCR) particles have a significant impact on the particle-induced background of X-ray observatories, and their flux exhibits substantial temporal variability, potentially influencing background levels. In this study, we present one-day binned high-energy reject rates derived from the Chandra-ACIS and XMM-Newton EPIC-pn instruments, serving as proxies for GCR particle flux. We systematically analyze the ACIS and EPIC-pn reject rates and compare them with the AMS proton flux. Our analysis initially reveals robust correlations between the AMS proton flux and the ACIS/EPIC-pn reject rates when binned over 27-day intervals. However, a closer examination reveals substantial fluctuations within each 27-day bin, indicating shorter-term variability. Upon daily binning, we observe finer. temporal structures in the datasets, demonstrating the presence of recurrent variations with periods of $\sim$ 25 days and 23 days in ACIS and EPIC-pn reject rates, respectively, spanning the years 2014 to 2018. Notably, during the 2016--2017 period, we additionally detect periodicities of $\sim$13.5 days and 9 days in the ACIS and EPIC-pn reject rates, respectively. Intriguingly, we observe a time lag of $\sim$ 6 days between the AMS proton flux and the ACIS/EPIC-pn reject rates during the second half of 2016. This time lag is not visible before 2016 and aftern2017. The underlying physical mechanisms responsible for this time lag remain a subject of ongoing investigation. △ Less

Submitted 10 May, 2024; originally announced May 2024.

Comments: 16 pages, 8 figures, accepted for publication in ApJ

arXiv:2405.04382 [pdf, ps, other]

Large Language Models Cannot Explain Themselves

Authors: Advait Sarkar

Abstract: Large language models can be prompted to produce text. They can also be prompted to produce "explanations" of their output. But these are not really explanations, because they do not accurately reflect the mechanical process underlying the prediction. The illusion that they reflect the reasoning process can result in significant harms. These "explanations" can be valuable, but for promoting critic… ▽ More Large language models can be prompted to produce text. They can also be prompted to produce "explanations" of their output. But these are not really explanations, because they do not accurately reflect the mechanical process underlying the prediction. The illusion that they reflect the reasoning process can result in significant harms. These "explanations" can be valuable, but for promoting critical thinking rather than for understanding the model. I propose a recontextualisation of these "explanations", using the term "exoplanations" to draw attention to their exogenous nature. I discuss some implications for design and technology, such as the inclusion of appropriate guardrails and responses when models are prompted to generate explanations. △ Less

Submitted 7 May, 2024; originally announced May 2024.

Comments: In Proceedings of the ACM CHI 2024 Workshop on Human-Centered Explainable AI (HCXAI 2024)

arXiv:2405.02097 [pdf, other]

Transformer Models for Quantum Gate Set Tomography

Authors: King Yiu Yu, Aritra Sarkar, Ryoichi Ishihara, Sebastian Feld

Abstract: Quantum computation represents a promising frontier in the domain of high-performance computing, blending quantum information theory with practical applications to overcome the limitations of classical computation. This study investigates the challenges of manufacturing high-fidelity and scalable quantum processors. Quantum gate set tomography (QGST) is a critical method for characterizing quantum… ▽ More Quantum computation represents a promising frontier in the domain of high-performance computing, blending quantum information theory with practical applications to overcome the limitations of classical computation. This study investigates the challenges of manufacturing high-fidelity and scalable quantum processors. Quantum gate set tomography (QGST) is a critical method for characterizing quantum processors and understanding their operational capabilities and limitations. This paper introduces ML4QGST as a novel approach to QGST by integrating machine learning techniques, specifically utilizing a transformer neural network model. Adapting the transformer model for QGST addresses the computational complexity of modeling quantum systems. Advanced training strategies, including data grouping and curriculum learning, are employed to enhance model performance, demonstrating significant congruence with ground-truth values. We benchmark this training pipeline on the constructed learning model, to successfully perform QGST for $3$ gates on a $1$ qubit system with over-rotation error and depolarizing noise estimation with comparable accuracy to pyGSTi. This research marks a pioneering step in applying deep neural networks to the complex problem of quantum gate set tomography, showcasing the potential of machine learning to tackle nonlinear tomography challenges in quantum computing. △ Less

Submitted 3 May, 2024; originally announced May 2024.

Comments: 14 pages

arXiv:2404.19313 [pdf, other]

High-precision chemical quantum sensing in flowing monodisperse microdroplets

Authors: Adrisha Sarkar, Zachary Jones, Madhur Parashar, Emanuel Druga, Amala Akkiraju, Sophie Conti, Pranav Krishnamoorthi, Srisai Nachuri, Parker Aman, Mohammad Hashemi, Nicholas Nunn, Marco Torelli, Benjamin Gilbert, Kevin R. Wilson, Olga Shenderova, Deepti Tanjore, Ashok Ajoy

Abstract: We report on a novel flow-based method for high-precision chemical detection that integrates quantum sensing with droplet microfluidics. We deploy nanodiamond particles hosting fluorescent nitrogen vacancy defects as quantum sensors in flowing, monodisperse, picoliter-volume microdroplets containing analyte molecules. ND motion within these microcompartments facilitates close sensor-analyte intera… ▽ More We report on a novel flow-based method for high-precision chemical detection that integrates quantum sensing with droplet microfluidics. We deploy nanodiamond particles hosting fluorescent nitrogen vacancy defects as quantum sensors in flowing, monodisperse, picoliter-volume microdroplets containing analyte molecules. ND motion within these microcompartments facilitates close sensor-analyte interaction and mitigates particle heterogeneity. Microdroplet flow rates are rapid (upto 4cm/s) and with minimal drift. Pairing this controlled flow with microwave control of NV electronic spins, we introduce a new noise-suppressed mode of Optically Detected Magnetic Resonance that is sensitive to chemical analytes while resilient against experimental variations, achieving detection of analyte-induced signals at an unprecedented level of a few hundredths of a percent of the ND fluorescence. We demonstrate its application to detecting paramagnetic ions in droplets with simultaneously low limit-of-detection and low analyte volumes, in a manner significantly better than existing technologies. This is combined with exceptional measurement stability over >103s and across hundreds of thousands of droplets, while utilizing minimal sensor volumes and incurring low ND costs (<$0.70 for an hour of operation). Additionally, we demonstrate using these droplets as micro-confinement chambers by co-encapsulating ND quantum sensors with analytes, including single cells. This versatility suggests wide-ranging applications, like single-cell metabolomics and real-time intracellular measurements in bioreactors. Our work paves the way for portable, high-sensitivity, amplification-free, chemical assays with high throughput; introduces a new chemical imaging tool for probing chemical reactions in microenvironments; and establishes the foundation for developing movable, arrayed quantum sensors through droplet microfluidics. △ Less

Submitted 30 April, 2024; originally announced April 2024.

arXiv:2404.19032 [pdf, other]

Fermionic Machine Learning

Authors: Jérémie Gince, Jean-Michel Pagé, Marco Armenta, Ayana Sarkar, Stefanos Kourtis

Abstract: We introduce fermionic machine learning (FermiML), a machine learning framework based on fermionic quantum computation. FermiML models are expressed in terms of parameterized matchgate circuits, a restricted class of quantum circuits that map exactly to systems of free Majorana fermions. The FermiML framework allows for building fermionic counterparts of any quantum machine learning (QML) model ba… ▽ More We introduce fermionic machine learning (FermiML), a machine learning framework based on fermionic quantum computation. FermiML models are expressed in terms of parameterized matchgate circuits, a restricted class of quantum circuits that map exactly to systems of free Majorana fermions. The FermiML framework allows for building fermionic counterparts of any quantum machine learning (QML) model based on parameterized quantum circuits, including models that produce highly entangled quantum states. Importantly, matchgate circuits are efficiently simulable classically, thus rendering FermiML a flexible framework for utility benchmarks of QML methods on large real-world datasets. We initiate the exploration of FermiML by benchmarking it against unrestricted PQCs in the context of classification with random quantum kernels. Through experiments on standard datasets (Digits and Wisconsin Breast Cancer), we demonstrate that FermiML kernels are on-par with unrestricted PQC kernels in classification tasks using support-vector machines. Furthermore, we find that FermiML kernels outperform their unrestricted candidates on multi-class classification, including on datasets with several tens of relevant features. We thus show how FermiML enables us to explore regimes previously inaccessible to QML methods. △ Less

Submitted 29 April, 2024; originally announced April 2024.

arXiv:2404.14858 [pdf, other]

A resource-efficient variational quantum algorithm for mRNA codon optimization

Authors: Hongfeng Zhang, Aritra Sarkar, Koen Bertels

Abstract: Optimizing the mRNA codon has an essential impact on gene expression for a specific target protein. It is an NP-hard problem; thus, exact solutions to such optimization problems become computationally intractable for realistic problem sizes on both classical and quantum computers. However, approximate solutions via heuristics can substantially impact the application they enable. Quantum approximat… ▽ More Optimizing the mRNA codon has an essential impact on gene expression for a specific target protein. It is an NP-hard problem; thus, exact solutions to such optimization problems become computationally intractable for realistic problem sizes on both classical and quantum computers. However, approximate solutions via heuristics can substantially impact the application they enable. Quantum approximate optimization is an alternative computation paradigm promising for tackling such problems. Recently, there has been some research in quantum algorithms for bioinformatics, specifically for mRNA codon optimization. This research presents a denser way to encode codons for implementing mRNA codon optimization via the variational quantum eigensolver algorithms on a gate-based quantum computer. This reduces the qubit requirement by half compared to the existing quantum approach, thus allowing longer sequences to be executed on existing quantum processors. The performance of the proposed algorithm is evaluated by comparing its results to exact solutions, showing well-matching results. △ Less

Submitted 10 May, 2024; v1 submitted 23 April, 2024; originally announced April 2024.

Comments: Code available at https://github.com/Advanced-Research-Centre/mRNA-CodonOpt

arXiv:2404.11061 [pdf, other]

Unified Examination of Entity Linking in Absence of Candidate Sets

Authors: Nicolas Ong, Hassan Shavarani, Anoop Sarkar

Abstract: Despite remarkable strides made in the development of entity linking systems in recent years, a comprehensive comparative analysis of these systems using a unified framework is notably absent. This paper addresses this oversight by introducing a new black-box benchmark and conducting a comprehensive evaluation of all state-of-the-art entity linking methods. We use an ablation study to investigate… ▽ More Despite remarkable strides made in the development of entity linking systems in recent years, a comprehensive comparative analysis of these systems using a unified framework is notably absent. This paper addresses this oversight by introducing a new black-box benchmark and conducting a comprehensive evaluation of all state-of-the-art entity linking methods. We use an ablation study to investigate the impact of candidate sets on the performance of entity linking. Our findings uncover exactly how much such entity linking systems depend on candidate sets, and how much this limits the general applicability of each system. We present an alternative approach to candidate sets, demonstrating that leveraging the entire in-domain candidate set can serve as a viable substitute for certain models. We show the trade-off between less restrictive candidate sets, increased inference time and memory footprint for some models. △ Less

Submitted 17 April, 2024; originally announced April 2024.

arXiv:2404.07114 [pdf, other]

"My toxic trait is thinking I'll remember this": gaps in the learner experience of video tutorials for feature-rich software

Authors: Ian Drosos, Advait Sarkar, Andrew D. Gordon

Abstract: Video tutorials are a popular medium for informal and formal learning. However, when learners attempt to view and follow along with these tutorials, they encounter what we call gaps, that is, issues that can prevent learning. We examine the gaps encountered by users of video tutorials for feature-rich software, such as spreadsheets. We develop a theory and taxonomy of such gaps, identifying how th… ▽ More Video tutorials are a popular medium for informal and formal learning. However, when learners attempt to view and follow along with these tutorials, they encounter what we call gaps, that is, issues that can prevent learning. We examine the gaps encountered by users of video tutorials for feature-rich software, such as spreadsheets. We develop a theory and taxonomy of such gaps, identifying how they act as barriers to learning, by collecting and analyzing 360 viewer comments from 90 Microsoft Excel video tutorials published by 43 creators across YouTube, TikTok, and Instagram. We conducted contextual interviews with 8 highly influential tutorial creators to investigate the gaps their viewers experience and how they address them. Further, we obtain insights into their creative process and frustrations when creating video tutorials. Finally, we present creators with two designs that aim to address gaps identified in the comment analysis for feedback and alternative design ideas. △ Less

Submitted 10 April, 2024; originally announced April 2024.

arXiv:2404.06174 [pdf, other]

A quantum information theoretic analysis of reinforcement learning-assisted quantum architecture search

Authors: Abhishek Sadhu, Aritra Sarkar, Akash Kundu

Abstract: In the field of quantum computing, variational quantum algorithms (VQAs) represent a pivotal category of quantum solutions across a broad spectrum of applications. These algorithms demonstrate significant potential for realising quantum computational advantage. A fundamental aspect of VQAs involves formulating expressive and efficient quantum circuits (namely ansatz) and automating the search of s… ▽ More In the field of quantum computing, variational quantum algorithms (VQAs) represent a pivotal category of quantum solutions across a broad spectrum of applications. These algorithms demonstrate significant potential for realising quantum computational advantage. A fundamental aspect of VQAs involves formulating expressive and efficient quantum circuits (namely ansatz) and automating the search of such ansatz is known as quantum architecture search (QAS). RL-QAS involves optimising QAS using reinforcement learning techniques. This study investigates RL-QAS for crafting ansatzes tailored to the variational quantum state diagonalisation problem. Our investigation includes a comprehensive analysis of various dimensions, such as the entanglement thresholds of the resultant states, the impact of initial conditions on the performance of RL-agent, the phase change behaviour of correlation in concurrence bounds, and the discrete contributions of qubits in deducing eigenvalues through conditional entropy metrics. We leverage these insights to devise entanglement-guided admissible ansatz in QAS to diagonalise random quantum states using optimal resources. Furthermore, the methodologies presented herein offer a generalised framework for constructing reward functions within RL-QAS applicable to variational quantum algorithms. △ Less

Submitted 15 April, 2024; v1 submitted 9 April, 2024; originally announced April 2024.

Comments: 10 pages, 8 figures. Revised version

arXiv:2404.03574 [pdf, other]

TinyVQA: Compact Multimodal Deep Neural Network for Visual Question Answering on Resource-Constrained Devices

Authors: Hasib-Al Rashid, Argho Sarkar, Aryya Gangopadhyay, Maryam Rahnemoonfar, Tinoosh Mohsenin

Abstract: Traditional machine learning models often require powerful hardware, making them unsuitable for deployment on resource-limited devices. Tiny Machine Learning (tinyML) has emerged as a promising approach for running machine learning models on these devices, but integrating multiple data modalities into tinyML models still remains a challenge due to increased complexity, latency, and power consumpti… ▽ More Traditional machine learning models often require powerful hardware, making them unsuitable for deployment on resource-limited devices. Tiny Machine Learning (tinyML) has emerged as a promising approach for running machine learning models on these devices, but integrating multiple data modalities into tinyML models still remains a challenge due to increased complexity, latency, and power consumption. This paper proposes TinyVQA, a novel multimodal deep neural network for visual question answering tasks that can be deployed on resource-constrained tinyML hardware. TinyVQA leverages a supervised attention-based model to learn how to answer questions about images using both vision and language modalities. Distilled knowledge from the supervised attention-based VQA model trains the memory aware compact TinyVQA model and low bit-width quantization technique is employed to further compress the model for deployment on tinyML devices. The TinyVQA model was evaluated on the FloodNet dataset, which is used for post-disaster damage assessment. The compact model achieved an accuracy of 79.5%, demonstrating the effectiveness of TinyVQA for real-world applications. Additionally, the model was deployed on a Crazyflie 2.0 drone, equipped with an AI deck and GAP8 microprocessor. The TinyVQA model achieved low latencies of 56 ms and consumes 693 mW power while deployed on the tiny drone, showcasing its suitability for resource-constrained embedded systems. △ Less

Submitted 4 April, 2024; originally announced April 2024.

Comments: Accepted as a full paper by the tinyML Research Symposium 2024

arXiv:2404.00933 [pdf, other]

Constraining ultra slow roll inflation using cosmological datasets

Authors: H. V. Ragavendra, Anjan Kumar Sarkar, Shiv K. Sethi

Abstract: In recent years, the detection of gravitational waves by LIGO and PTA collaborations have raised the intriguing possibility of excess matter power at small scales. Such an increase can be achieved by ultra slow roll (USR) phase during inflationary epoch. We constrain excess power over small scales within the framework of such models using cosmological datasets, particularly of CMB anisotropies and… ▽ More In recent years, the detection of gravitational waves by LIGO and PTA collaborations have raised the intriguing possibility of excess matter power at small scales. Such an increase can be achieved by ultra slow roll (USR) phase during inflationary epoch. We constrain excess power over small scales within the framework of such models using cosmological datasets, particularly of CMB anisotropies and Lyman-$α$. We parameterize the USR phase in terms of the e-fold at the onset of USR (counted from the end of inflation) $\bar N_1$ and the duration of USR phase $ΔN$. The former dictates the scale of enhancement in the primordial power spectrum, while the latter determines the amplitude of such an enhancement. From a joint dataset of CMB, SNIa and galaxy surveys, we obtain $\bar N_1 \lesssim 45$ with no bound on $ΔN$. This in turn implies that the scales over which the power spectrum can deviate significantly from the nearly scale invariant behavior of a typical slow-roll model is $k \gtrsim 1 \, \rm Mpc^{-1}$. On the other hand, the Lyman-$α$ data is sensitive to baryonic power spectrum along the line of sight. We consider a semi-analytic theoretical method and high spectral-resolution Lyman-$α$ data to constrain the model. The Lyman-$α$ data limits both the USR parameters: $\bar N_1 \lesssim 41$ and $ΔN \lesssim 0.4$. This constrains the amplitude of the power spectrum enhancement to be less than a factor of hundred over scales $1 \lesssim k/{\rm Mpc^{-1}} \lesssim 100$, thereby considerably improving the constraint on power over these scales as compared to the bounds arrived at from CMB spectral distortion. △ Less

Submitted 22 June, 2024; v1 submitted 1 April, 2024; originally announced April 2024.

Comments: v1: 27 pages, 8 figures; v2: 24 pages, 7 figures, updated dataset, discussion and references, accepted in JCAP

arXiv:2403.11539 [pdf, ps, other]

Detecting superfluid transition in the pulsar core

Authors: Partha Bagchi, Biswanath Layek, Dheeraj Saini, Anjishnu Sarkar, Ajit M. Srivastava, Deepthi Godaba Venkata

Abstract: It is believed that the core of a neutron star can be host to various novel phases of matter, from nucleon superfluid phase to exotic high baryon density QCD phases. Different observational signals for such phase transitions have been discussed in the literature. Here, we point out a unique phenomenon associated with phase transition to a superfluid phase, which may be the nucleon superfluid phase… ▽ More It is believed that the core of a neutron star can be host to various novel phases of matter, from nucleon superfluid phase to exotic high baryon density QCD phases. Different observational signals for such phase transitions have been discussed in the literature. Here, we point out a unique phenomenon associated with phase transition to a superfluid phase, which may be the nucleon superfluid phase or a phase like the CFL phase, allowing for superfluid vortices. In any superfluid phase transition, a random network of vortices forms via the so-called Kibble-Zurek mechanism, which eventually mostly decays away, finally leaving primarily vortices arising from the initial angular momentum of the core. This transient, random vortex network can have a non-zero net angular momentum for the superfluid component, which will generally be oriented in an arbitrary direction. This is in contrast to the final vortices, which arise from initial rotation and hence have the initial angular momentum of the neutron star. The angular momentum of the random vortex network is balanced by an equal and opposite angular momentum in the normal fluid due to the conservation of angular momentum, thereby imparting an arbitrarily oriented angular momentum component to the outer shell of the neutron star. This will affect the pulse timing and pulse profile of a pulsar. These changes in the pulses will decay away in a characteristic manner as the random vortex network decays, obeying specific scaling laws leading to universal features for the detection of superfluid transitions occurring in a pulsar core. △ Less

Submitted 18 March, 2024; originally announced March 2024.

Comments: 9 pages, no figures

arXiv:2403.08769 [pdf, ps, other]

The thermalization of $γ$-rays in radioactive expanding ejecta: A simple model and its application for Kilonovae and Ia SNe

Authors: Or Guttman, Ben Shenhar, Arnab Sarkar, Eli Waxman

Abstract: A semi-analytic approximation is derived for the time-dependent fraction $f_γ(t)$ of the energy deposited by radioactive decay $γ$-rays in a homologously expanding plasma of general structure. An analytic approximation is given for spherically symmetric plasma distributions. Applied to Kilonovae (KNe) associated with neutron stars mergers and Type Ia supernovae, our semi-analytic and analytic appr… ▽ More A semi-analytic approximation is derived for the time-dependent fraction $f_γ(t)$ of the energy deposited by radioactive decay $γ$-rays in a homologously expanding plasma of general structure. An analytic approximation is given for spherically symmetric plasma distributions. Applied to Kilonovae (KNe) associated with neutron stars mergers and Type Ia supernovae, our semi-analytic and analytic approximations reproduce, with a few percent and 10% accuracy, respectively, the energy deposition rates, $\dot{Q}_\text{dep}$, obtained in numeric Monte Carlo calculations. The time $t_γ$ beyond which $γ$-ray deposition is inefficient is determined by an effective frequency-independent $γ$-ray opacity $κ_{γ,\text{eff}}$, $t_γ= \sqrt{κ_{γ,\text{eff}}\langleΣ\rangle t^2}$, where $\langleΣ\rangle\propto t^{-2}$ is the average plasma column density. For $β$-decay dominated energy release, $κ_{γ,\text{eff}}$ is typically close to the effective Compton scattering opacity, $κ_{γ,\text{eff}} \approx 0.025~{\rm {cm}^{2}\,g^{-1}}$ with a weak dependence on composition. For KNe, $κ_{γ,\text{eff}}$ depends mainly on the initial electron fraction $Y_e$, $κ_{γ,\text{eff}} \approx 0.03(0.05)~{\rm {cm}^{2}\,g^{-1}}$ for $Y_e \gtrsim (\lesssim) 0.25$ (in contrast with earlier work that found $κ_{γ,\text{eff}}$ larger by 1-2 orders of magnitude for low $Y_e$), and is insensitive to the (large) nuclear physics uncertainties. Determining $t_γ$ from observations will therefore measure the ejecta $\langleΣ\rangle t^2$, providing a stringent test of models. For $\langleΣ\rangle t^2=2\times10^{11}~{\rm g\,{cm}^{-2}\,s^2}$, a typical value expected for KNe, $t_γ\approx1$ d. △ Less

Submitted 13 March, 2024; originally announced March 2024.

Comments: 18 pages, 15 figures, 1 table. submitted to MNRAS

arXiv:2403.06264 [pdf, other]

Dynamics of Polarization Under Normative Institutions and Opinion Expression Stewarding

Authors: Atrisha Sarkar, Gillian K. Hadfield

Abstract: Although there is mounting empirical evidence for the increase in affective polarization, few mechanistic models can explain its emergence at the population level. The question of how such a phenomenon can emerge from divergent opinions of a population on an ideological issue is still an open issue. In this paper, we establish that human normativity, that is, individual expression of normative opi… ▽ More Although there is mounting empirical evidence for the increase in affective polarization, few mechanistic models can explain its emergence at the population level. The question of how such a phenomenon can emerge from divergent opinions of a population on an ideological issue is still an open issue. In this paper, we establish that human normativity, that is, individual expression of normative opinions based on beliefs about the population, can lead to population-level polarization when ideological institutions distort beliefs in accordance with their objective of moving expressed opinion to one extreme. Using a game-theoretic model, we establish that individuals with more extreme opinions will have more extreme rhetoric and higher misperceptions about their outgroup members. Our model also shows that when social recommendation systems mediate institutional signals, we can observe the formation of different institutional communities, each with its unique community structure and characteristics. Using the model, we identify practical strategies platforms can implement, such as reducing exposure to signals from ideological institutions and a tailored approach to content moderation, both of which can rectify the affective polarization problem within its purview. △ Less

Submitted 10 March, 2024; originally announced March 2024.

ACM Class: J.4

arXiv:2403.04857 [pdf, other]

Dark Matter Line Searches with the Cherenkov Telescope Array

Authors: S. Abe, J. Abhir, A. Abhishek, F. Acero, A. Acharyya, R. Adam, A. Aguasca-Cabot, I. Agudo, A. Aguirre-Santaella, J. Alfaro, R. Alfaro, N. Alvarez-Crespo, R. Alves Batista, J. -P. Amans, E. Amato, G. Ambrosi, L. Angel, C. Aramo, C. Arcaro, T. T. H. Arnesen, L. Arrabito, K. Asano, Y. Ascasibar, J. Aschersleben, H. Ashkar , et al. (540 additional authors not shown)

Abstract: Monochromatic gamma-ray signals constitute a potential smoking gun signature for annihilating or decaying dark matter particles that could relatively easily be distinguished from astrophysical or instrumental backgrounds. We provide an updated assessment of the sensitivity of the Cherenkov Telescope Array (CTA) to such signals, based on observations of the Galactic centre region as well as of sele… ▽ More Monochromatic gamma-ray signals constitute a potential smoking gun signature for annihilating or decaying dark matter particles that could relatively easily be distinguished from astrophysical or instrumental backgrounds. We provide an updated assessment of the sensitivity of the Cherenkov Telescope Array (CTA) to such signals, based on observations of the Galactic centre region as well as of selected dwarf spheroidal galaxies. We find that current limits and detection prospects for dark matter masses above 300 GeV will be significantly improved, by up to an order of magnitude in the multi-TeV range. This demonstrates that CTA will set a new standard for gamma-ray astronomy also in this respect, as the world's largest and most sensitive high-energy gamma-ray observatory, in particular due to its exquisite energy resolution at TeV energies and the adopted observational strategy focussing on regions with large dark matter densities. Throughout our analysis, we use up-to-date instrument response functions, and we thoroughly model the effect of instrumental systematic uncertainties in our statistical treatment. We further present results for other potential signatures with sharp spectral features, e.g.~box-shaped spectra, that would likewise very clearly point to a particle dark matter origin. △ Less

Submitted 7 March, 2024; originally announced March 2024.

Comments: 43 pages JCAP style (excluding author list and references), 19 figures

arXiv:2403.03001 [pdf, other]

Higgs couplings in SMEFT via Zh production at the HL-LHC

Authors: Subhaditya Bhattacharya, Abhik Sarkar, Sanjoy Biswas

Abstract: We study the Higgs couplings present in the $Zh$ associated production mode at the Large Hadron Collider (LHC) in presence of both CP even and CP odd dimension 6 Standard Model Effective Theory (SMEFT) operators. The analysis is performed mainly in context of the HL-LHC (with $\sqrt{s}=$14 TeV and luminosity 3000 $fb^{-1}$) setup using cut based as well as machine learning techniques. The analysis… ▽ More We study the Higgs couplings present in the $Zh$ associated production mode at the Large Hadron Collider (LHC) in presence of both CP even and CP odd dimension 6 Standard Model Effective Theory (SMEFT) operators. The analysis is performed mainly in context of the HL-LHC (with $\sqrt{s}=$14 TeV and luminosity 3000 $fb^{-1}$) setup using cut based as well as machine learning techniques. The analysis shows significant betterment in the signal significance by using the machine learning technique. We also do a $χ^2$ analysis, which reveals a significant change in the sensitivity of the coupling modifiers due to the presence of effective operators, in particular due to the four point $qqZh$ interaction. The presence of dimension six CP odd four point operators, which contributes at $\mathcal{O} (Λ^{-4})$ order due to lack of interference with the SM contributions, can only have sensitivity with smaller NP scale at the HL-LHC, after addressing the effective limit and constraints. △ Less

Submitted 5 March, 2024; originally announced March 2024.

Comments: 24 pages, 11 figures, 10 tables

arXiv:2402.18914 [pdf, ps, other]

Smooth Structures on $M^n\times\mathbb{S}^k$

Authors: Samik Basu, Ramesh Kasilingam, Ankur Sarkar

Abstract: This paper explores various differentiable structures on the product manifold $M \times \mathbb{S}^k$, where $M$ is either a 4-dimensional closed oriented manifold or a simply connected 5-dimensional closed manifold. We identify the possible stable homotopy types of $M$ and use it to calculate the concordance inertia group and the concordance structure set of $M\times\mathbb{S}^k$ for… ▽ More This paper explores various differentiable structures on the product manifold $M \times \mathbb{S}^k$, where $M$ is either a 4-dimensional closed oriented manifold or a simply connected 5-dimensional closed manifold. We identify the possible stable homotopy types of $M$ and use it to calculate the concordance inertia group and the concordance structure set of $M\times\mathbb{S}^k$ for $1\leq k\leq 10.$ These calculations enable us to further classify all manifolds that are homeomorphic to $\mathbb{C}P^2\times\mathbb{S}^k$, up to diffeomorphism, for each $4\leq k\leq 6$. △ Less

Submitted 29 February, 2024; originally announced February 2024.

MSC Class: 57R55; 55P42 (Primary) 57R65; 55Q45 (Secondary)

arXiv:2402.18297 [pdf, other]

Sums, Differences and Dilates

Authors: Jonathan Cutler, Luke Pebody, Amites Sarkar

Abstract: Given a set of integers $A$ and an integer $k$, write $A+k\cdot A$ for the set $\{a+kb:a\in A,b\in A\}$. Hanson and Petridis showed that if $|A+A|\le K|A|$ then $|A+2\cdot A|\le K^{2.95}|A|$. At a presentation of this result, Petridis stated that the highest known value for $\frac{\log(|A+2\cdot A|/|A|)}{\log(|A+A|/|A|)}$ (bounded above by 2.95) was $\frac{\log 4}{\log 3}$. We show that, for all… ▽ More Given a set of integers $A$ and an integer $k$, write $A+k\cdot A$ for the set $\{a+kb:a\in A,b\in A\}$. Hanson and Petridis showed that if $|A+A|\le K|A|$ then $|A+2\cdot A|\le K^{2.95}|A|$. At a presentation of this result, Petridis stated that the highest known value for $\frac{\log(|A+2\cdot A|/|A|)}{\log(|A+A|/|A|)}$ (bounded above by 2.95) was $\frac{\log 4}{\log 3}$. We show that, for all $ε>0$, there exist $A$ and $K$ with $|A+A|\le K|A|$ but with $|A+2\cdot A|\ge K^{2-ε}|A|$. Further, we analyse a method of Ruzsa, and generalise it to give continuous analogues of the sizes of sumsets, differences and dilates. We apply this method to a construction of Hennecart, Robert and Yudin to prove that, for all $ε>0$, there exists a set $A$ with $|A-A|\ge |A|^{2-ε}$ but with $|A+A|<|A|^{1.7354+ε}$. The second author would like to thank E. Papavassilopoulos for useful discussions about how to improve the efficiency of his computer searches. △ Less

Submitted 28 February, 2024; originally announced February 2024.

arXiv:2402.13352 [pdf, other]

KetGPT - Dataset Augmentation of Quantum Circuits using Transformers

Authors: Boran Apak, Medina Bandic, Aritra Sarkar, Sebastian Feld

Abstract: Quantum algorithms, represented as quantum circuits, can be used as benchmarks for assessing the performance of quantum systems. Existing datasets, widely utilized in the field, suffer from limitations in size and versatility, leading researchers to employ randomly generated circuits. Random circuits are, however, not representative benchmarks as they lack the inherent properties of real quantum a… ▽ More Quantum algorithms, represented as quantum circuits, can be used as benchmarks for assessing the performance of quantum systems. Existing datasets, widely utilized in the field, suffer from limitations in size and versatility, leading researchers to employ randomly generated circuits. Random circuits are, however, not representative benchmarks as they lack the inherent properties of real quantum algorithms for which the quantum systems are manufactured. This shortage of `useful' quantum benchmarks poses a challenge to advancing the development and comparison of quantum compilers and hardware. This research aims to enhance the existing quantum circuit datasets by generating what we refer to as `realistic-looking' circuits by employing the Transformer machine learning architecture. For this purpose, we introduce KetGPT, a tool that generates synthetic circuits in OpenQASM language, whose structure is based on quantum circuits derived from existing quantum algorithms and follows the typical patterns of human-written algorithm-based code (e.g., order of gates and qubits). Our three-fold verification process, involving manual inspection and Qiskit framework execution, transformer-based classification, and structural analysis, demonstrates the efficacy of KetGPT in producing large amounts of additional circuits that closely align with algorithm-based structures. Beyond benchmarking, we envision KetGPT contributing substantially to AI-driven quantum compilers and systems. △ Less

Submitted 23 February, 2024; v1 submitted 20 February, 2024; originally announced February 2024.

arXiv:2402.12426 [pdf]

Attacks on Node Attributes in Graph Neural Networks

Authors: Ying Xu, Michael Lanier, Anindya Sarkar, Yevgeniy Vorobeychik

Abstract: Graphs are commonly used to model complex networks prevalent in modern social media and literacy applications. Our research investigates the vulnerability of these graphs through the application of feature based adversarial attacks, focusing on both decision time attacks and poisoning attacks. In contrast to state of the art models like Net Attack and Meta Attack, which target node attributes and… ▽ More Graphs are commonly used to model complex networks prevalent in modern social media and literacy applications. Our research investigates the vulnerability of these graphs through the application of feature based adversarial attacks, focusing on both decision time attacks and poisoning attacks. In contrast to state of the art models like Net Attack and Meta Attack, which target node attributes and graph structure, our study specifically targets node attributes. For our analysis, we utilized the text dataset Hellaswag and graph datasets Cora and CiteSeer, providing a diverse basis for evaluation. Our findings indicate that decision time attacks using Projected Gradient Descent (PGD) are more potent compared to poisoning attacks that employ Mean Node Embeddings and Graph Contrastive Learning strategies. This provides insights for graph data security, pinpointing where graph-based models are most vulnerable and thereby informing the development of stronger defense mechanisms against such attacks. △ Less

Submitted 5 March, 2024; v1 submitted 19 February, 2024; originally announced February 2024.

Comments: Accepted to AAAI 2024 AICS workshop

arXiv:2402.11734 [pdf, other]

Solving Data-centric Tasks using Large Language Models

Authors: Shraddha Barke, Christian Poelitz, Carina Suzana Negreanu, Benjamin Zorn, José Cambronero, Andrew D. Gordon, Vu Le, Elnaz Nouri, Nadia Polikarpova, Advait Sarkar, Brian Slininger, Neil Toronto, Jack Williams

Abstract: Large language models (LLMs) are rapidly replacing help forums like StackOverflow, and are especially helpful for non-professional programmers and end users. These users are often interested in data-centric tasks, such as spreadsheet manipulation and data wrangling, which are hard to solve if the intent is only communicated using a natural-language description, without including the data. But how… ▽ More Large language models (LLMs) are rapidly replacing help forums like StackOverflow, and are especially helpful for non-professional programmers and end users. These users are often interested in data-centric tasks, such as spreadsheet manipulation and data wrangling, which are hard to solve if the intent is only communicated using a natural-language description, without including the data. But how do we decide how much data and which data to include in the prompt? This paper makes two contributions towards answering this question. First, we create a dataset of real-world NL-to-code tasks manipulating tabular data, mined from StackOverflow posts. Second, we introduce a cluster-then-select prompting technique, which adds the most representative rows from the input data to the LLM prompt. Our experiments show that LLM performance is indeed sensitive to the amount of data passed in the prompt, and that for tasks with a lot of syntactic variation in the input table, our cluster-then-select technique outperforms a random selection baseline. △ Less

Submitted 24 March, 2024; v1 submitted 18 February, 2024; originally announced February 2024.

Comments: Paper accepted to NAACL 2024 (Findings)

arXiv:2402.10585 [pdf]

Coherent X-ray Imaging of Stochastic Dynamics

Authors: Arnab Sarkar, Allan S. Johnson

Abstract: Condensed phase systems often exhibit a mixture of deterministic and stochastic dynamics at the nanoscale which are essential to understanding their function, but can be challenging to study directly using conventional imaging methods. Coherent X-ray imaging has emerged as a powerful tool for studying both nanoscale structures and dynamics in condensed phase systems, including stochastic dynamics,… ▽ More Condensed phase systems often exhibit a mixture of deterministic and stochastic dynamics at the nanoscale which are essential to understanding their function, but can be challenging to study directly using conventional imaging methods. Coherent X-ray imaging has emerged as a powerful tool for studying both nanoscale structures and dynamics in condensed phase systems, including stochastic dynamics, but the requirement to obtain single-shot images in order to obtain freeze-frame images of the stochastic dynamics means the X-ray fluxes used must be very high, potentially destroying the samples. This prevents coherent imaging from being applied to complex systems like tracking the motion of charge carriers or domain fluctuations in quantum materials. Here we show that, by leveraging the coherence intrinsic to these methods, we can separate out the stochastic and deterministic contributions to a coherent X-ray scattering pattern, returning real space images of the deterministic contributions and the momentum spectrum of the stochastic contributions. We further show that, for several typical and important classes of fluctuations, we can return real space images of the mean fluctuations. We demonstrate this approach by numerically simulating the imaging of stochastic polaron separation following photoexcitation and by recovering the spectral properties of fluctuating domain walls. Our versatile approach will enable the direct recovery of the spatial, spectral and temporal properties of stochastic material dynamics in a wide variety of systems currently unobtainable with existing methods. △ Less

Submitted 16 February, 2024; originally announced February 2024.

Comments: 14

arXiv:2402.09352 [pdf, other]

Sign of the $hZZ$ coupling and implication for new physics

Authors: Dipankar Das, Anirban Kundu, Miguel Levy, Anugrah M. Prasad, Ipsita Saha, Agnivo Sarkar

Abstract: The magnitudes of the couplings of the scalar resonance at 125 GeV with the SM particles are found to be consistent with those of the SM Higgs boson. However, the signs are not experimentally determined in most of the cases, a prime example being that with the $Z$-boson pair. In other words, $κ_Z^h$, the ratio of the couplings of the actual 125 GeV resonance with $ZZ$ and that of the SM Higgs boso… ▽ More The magnitudes of the couplings of the scalar resonance at 125 GeV with the SM particles are found to be consistent with those of the SM Higgs boson. However, the signs are not experimentally determined in most of the cases, a prime example being that with the $Z$-boson pair. In other words, $κ_Z^h$, the ratio of the couplings of the actual 125 GeV resonance with $ZZ$ and that of the SM Higgs boson with the same, is consistent with both $+1$ and $-1$, the latter being the `wrong-sign'. We argue that the wrong-sign $hZZ$ coupling will necessitate the intervention of new physics below $\mathcal{O}\left(620\right)$ GeV to safeguard the underlying theory from unitarity violation. The strength of the new nonstandard couplings can be derived from the unitarity sum rules, which are comparable to the SM-Higgs couplings in magnitude. Thus the strong limits from the direct searches at the LHC can help us rule out the existence of such nonstandard particles with unusually large couplings thereby disfavoring the possibility of a wrong-sign $hZZ$ coupling. △ Less

Submitted 14 February, 2024; originally announced February 2024.

Comments: 8 pages, 3 figures

Report number: HRI-RECAPP-2024-01

arXiv:2402.07457 [pdf, other]

A Note on Kernel Functions of Dirichlet Spaces

Authors: Sahil Gehlawat, Aakanksha Jain, Amar Deep Sarkar

Abstract: For a planar domain $Ω$, we consider the Dirichlet spaces with respect to a base point $ζ\inΩ$ and the corresponding kernel functions. It is not known how these kernel functions behave as we vary the base point. In this note, we prove that these kernel functions vary smoothly. As an application of the smoothness result, we prove a Ramadanov-type theorem for these kernel functions on $Ω\timesΩ$. Th… ▽ More For a planar domain $Ω$, we consider the Dirichlet spaces with respect to a base point $ζ\inΩ$ and the corresponding kernel functions. It is not known how these kernel functions behave as we vary the base point. In this note, we prove that these kernel functions vary smoothly. As an application of the smoothness result, we prove a Ramadanov-type theorem for these kernel functions on $Ω\timesΩ$. This extends the previously known convergence results of these kernel functions. In fact, we have made these observations in a more general setting, that is, for weighted kernel functions and their higher-order counterparts. △ Less

Submitted 12 February, 2024; originally announced February 2024.

Comments: 9 pages. Comments are welcome

MSC Class: 30H20; 46E22 (Primary) 30C40 (Secondary)

arXiv:2402.05912 [pdf, other]

doi 10.1093/mnras/stae1704

Towards a holistic magnetic braking model -- II: explaining several long-term internal- and surface-spin properties of solar-like stars and the Sun

Authors: Arnab Sarkar, Patrick Eggenberger, Lev Yungelson, Christopher A. Tout

Abstract: We extend our model of magnetic braking (MB), driven by an $α-Ω$ dynamo mechanism, from fully convective M-dwarfs (FCMDs) to explain the surface and internal spin $P_\mathrm{spin}$ evolution of partly convective dwarfs (PCDs) starting from the disc-dispersal stage to the main-sequence turnoff. In our model, the spin of the core is governed by shear at the core-envelope boundary while the spin of t… ▽ More We extend our model of magnetic braking (MB), driven by an $α-Ω$ dynamo mechanism, from fully convective M-dwarfs (FCMDs) to explain the surface and internal spin $P_\mathrm{spin}$ evolution of partly convective dwarfs (PCDs) starting from the disc-dispersal stage to the main-sequence turnoff. In our model, the spin of the core is governed by shear at the core-envelope boundary while the spin of the envelope is governed by MB and shear. We show that (1) the most massive FCMDs experience a stronger spin-down than PCDs and less massive FCMDs, (2) the stalled spin-down and enhanced activity of K-dwarfs and the pileup of G-dwarfs older than a few Gyr are stellar-structure- and MB-dependent, and weakly dependent on core-envelope coupling effects, (3) our expression of the core-envelope convergence time-scale $τ_\mathrm{converge}(M_\ast,\,P_\mathrm{spin})$ between a few 10 to 100~Myr strongly depends on stellar structure but weakly on MB strength and shear, such that fast and massive rotators achieve corotation earlier, (4) our estimates of the surface magnetic fields are in general agreement with observations and our wind mass loss evolution explains the weak winds from the solar analog $π^1$ UMa and (5) with our model the massive young Sun hypothesis as a solution to the faint young Sun problem can likely be ruled out, because the maximum mass lost by winds from our Sun with our model is about an order of magnitude smaller than required to solve the problem. △ Less

Submitted 9 July, 2024; v1 submitted 8 February, 2024; originally announced February 2024.

Comments: Accepted for publication in MNRAS. Minor changes from the original pre-print

arXiv:2402.02125 [pdf, other]

DCE-FORMER: A Transformer-based Model With Mutual Information And Frequency-based Loss Functions For Early And Late Response Prediction In Prostate DCE-MRI

Authors: Sadhana S, Sriprabha Ramanarayanan, Arunima Sarkar, Matcha Naga Gayathri, Keerthi Ram, Mohanasankar Sivaprakasam

Abstract: Dynamic Contrast Enhanced Magnetic Resonance Imaging aids in the detection and assessment of tumor aggressiveness by using a Gadolinium-based contrast agent (GBCA). However, GBCA is known to have potential toxic effects. This risk can be avoided if we obtain DCE-MRI images without using GBCA. We propose, DCE-former, a transformer-based neural network to generate early and late response prostate DC… ▽ More Dynamic Contrast Enhanced Magnetic Resonance Imaging aids in the detection and assessment of tumor aggressiveness by using a Gadolinium-based contrast agent (GBCA). However, GBCA is known to have potential toxic effects. This risk can be avoided if we obtain DCE-MRI images without using GBCA. We propose, DCE-former, a transformer-based neural network to generate early and late response prostate DCE-MRI images from non-contrast multimodal inputs (T2 weighted, Apparent Diffusion Coefficient, and T1 pre-contrast MRI). Additionally, we introduce (i) a mutual information loss function to capture the complementary information about contrast uptake, and (ii) a frequency-based loss function in the pixel and Fourier space to learn local and global hyper-intensity patterns in DCE-MRI. Extensive experiments show that DCE-former outperforms other methods with improvement margins of +1.39 dB and +1.19 db in PSNR, +0.068 and +0.055 in SSIM, and -0.012 and -0.013 in Mean Absolute Error for early and late response DCE-MRI, respectively. △ Less

Submitted 3 February, 2024; originally announced February 2024.

Comments: Accepted at IEEE ISBI 2024

arXiv:2402.02018 [pdf, other]

The Landscape and Challenges of HPC Research and LLMs

Authors: Le Chen, Nesreen K. Ahmed, Akash Dutta, Arijit Bhattacharjee, Sixing Yu, Quazi Ishtiaque Mahmud, Waqwoya Abebe, Hung Phan, Aishwarya Sarkar, Branden Butler, Niranjan Hasabnis, Gal Oren, Vy A. Vo, Juan Pablo Munoz, Theodore L. Willke, Tim Mattson, Ali Jannesari

Abstract: Recently, language models (LMs), especially large language models (LLMs), have revolutionized the field of deep learning. Both encoder-decoder models and prompt-based techniques have shown immense potential for natural language processing and code-based tasks. Over the past several years, many research labs and institutions have invested heavily in high-performance computing, approaching or breach… ▽ More Recently, language models (LMs), especially large language models (LLMs), have revolutionized the field of deep learning. Both encoder-decoder models and prompt-based techniques have shown immense potential for natural language processing and code-based tasks. Over the past several years, many research labs and institutions have invested heavily in high-performance computing, approaching or breaching exascale performance levels. In this paper, we posit that adapting and utilizing such language model-based techniques for tasks in high-performance computing (HPC) would be very beneficial. This study presents our reasoning behind the aforementioned position and highlights how existing ideas can be improved and adapted for HPC tasks. △ Less

Submitted 6 February, 2024; v1 submitted 2 February, 2024; originally announced February 2024.

Showing 1–50 of 589 results for author: Sarkar, A