Skip to main content

Showing 1–50 of 165 results for author: Sanyal, S

  1. arXiv:2407.00931  [pdf, other

    cs.RO

    Real-Time Neuromorphic Navigation: Integrating Event-Based Vision and Physics-Driven Planning on a Parrot Bebop2 Quadrotor

    Authors: Amogh Joshi, Sourav Sanyal, Kaushik Roy

    Abstract: In autonomous aerial navigation, real-time and energy-efficient obstacle avoidance remains a significant challenge, especially in dynamic and complex indoor environments. This work presents a novel integration of neuromorphic event cameras with physics-driven planning algorithms implemented on a Parrot Bebop2 quadrotor. Neuromorphic event cameras, characterized by their high dynamic range and low… ▽ More

    Submitted 30 June, 2024; originally announced July 2024.

  2. arXiv:2406.18700  [pdf, other

    cs.CC

    On Fourier analysis of sparse Boolean functions over certain Abelian groups

    Authors: Sourav Chakraborty, Swarnalipa Datta, Pranjal Dutta, Arijit Ghosh, Swagato Sanyal

    Abstract: Given an Abelian group G, a Boolean-valued function f: G -> {-1,+1}, is said to be s-sparse, if it has at most s-many non-zero Fourier coefficients over the domain G. In a seminal paper, Gopalan et al. proved "Granularity" for Fourier coefficients of Boolean valued functions over Z_2^n, that have found many diverse applications in theoretical computer science and combinatorics. They also studied s… ▽ More

    Submitted 26 June, 2024; originally announced June 2024.

  3. arXiv:2406.11794  [pdf, other

    cs.LG cs.CL

    DataComp-LM: In search of the next generation of training sets for language models

    Authors: Jeffrey Li, Alex Fang, Georgios Smyrnis, Maor Ivgi, Matt Jordan, Samir Gadre, Hritik Bansal, Etash Guha, Sedrick Keh, Kushal Arora, Saurabh Garg, Rui Xin, Niklas Muennighoff, Reinhard Heckel, Jean Mercat, Mayee Chen, Suchin Gururangan, Mitchell Wortsman, Alon Albalak, Yonatan Bitton, Marianna Nezhurina, Amro Abbas, Cheng-Yu Hsieh, Dhruba Ghosh, Josh Gardner , et al. (34 additional authors not shown)

    Abstract: We introduce DataComp for Language Models (DCLM), a testbed for controlled dataset experiments with the goal of improving language models. As part of DCLM, we provide a standardized corpus of 240T tokens extracted from Common Crawl, effective pretraining recipes based on the OpenLM framework, and a broad suite of 53 downstream evaluations. Participants in the DCLM benchmark can experiment with dat… ▽ More

    Submitted 20 June, 2024; v1 submitted 17 June, 2024; originally announced June 2024.

    Comments: Project page: https://www.datacomp.ai/dclm/

  4. arXiv:2404.08634  [pdf, other

    cs.CL cs.AI cs.LG

    Pre-training Small Base LMs with Fewer Tokens

    Authors: Sunny Sanyal, Sujay Sanghavi, Alexandros G. Dimakis

    Abstract: We study the effectiveness of a simple approach to develop a small base language model (LM) starting from an existing large base LM: first inherit a few transformer blocks from the larger LM, and then train this smaller model on a very small subset (0.1\%) of the raw pretraining data of the larger model. We call our simple recipe Inheritune and first demonstrate it for building a small base LM wit… ▽ More

    Submitted 12 April, 2024; originally announced April 2024.

    Comments: 15 pages, 6 figures, 10 tables

  5. arXiv:2403.04379  [pdf, other

    cs.NI

    Performance evaluation of conditional handover in 5G systems under fading scenario

    Authors: Souvik Deb, Megh Rathod, Rishi Balamurugan, Shankar K. Ghosh, Rajeev K. Singh, Samriddha Sanyal

    Abstract: To enhance the handover performance in fifth generation (5G) cellular systems, conditional handover (CHO) has been evolved as a promising solution. Unlike A3 based handover where handover execution is certain after receiving handover command from the serving access network, in CHO, handover execution is conditional on the RSRP measurements from both current and target access networks, as well as o… ▽ More

    Submitted 7 March, 2024; originally announced March 2024.

  6. arXiv:2402.14751  [pdf, ps, other

    cs.CC quant-ph

    On the communication complexity of finding a king in a tournament

    Authors: Nikhil S. Mande, Manaswi Paraashar, Swagato Sanyal, Nitin Saurabh

    Abstract: A tournament is a complete directed graph. A king in a tournament is a vertex v such that every other vertex is reachable from v via a path of length at most 2. It is well known that every tournament has at least one king, one of which is a maximum out-degree vertex. The tasks of finding a king, a maximum out-degree vertex and a source in a tournament has been relatively well studied in the contex… ▽ More

    Submitted 22 February, 2024; originally announced February 2024.

  7. arXiv:2402.03686  [pdf, other

    cs.CL cs.AI

    Are Machines Better at Complex Reasoning? Unveiling Human-Machine Inference Gaps in Entailment Verification

    Authors: Soumya Sanyal, Tianyi Xiao, Jiacheng Liu, Wenya Wang, Xiang Ren

    Abstract: Making inferences in text comprehension to understand the meaning is essential in language processing. This work studies the entailment verification (EV) problem of multi-sentence premises that requires a system to make multiple inferences implicitly. Studying EV for such complex premises is important because modern NLP problems, such as detecting inconsistent model-generated rationales, require c… ▽ More

    Submitted 27 May, 2024; v1 submitted 5 February, 2024; originally announced February 2024.

  8. arXiv:2401.15352  [pdf, ps, other

    cs.CC cs.DS

    Randomized query composition and product distributions

    Authors: Swagato Sanyal

    Abstract: Let R_eps denote randomized query complexity for error probability eps, and R:=R_{1/3}. In this work we investigate whether a perfect composition theorem R(f o g^n)=Omega(R(f).R(g)) holds for a relation f in {0,1}^n * S and a total inner function g:{0,1}^m \to {0, 1}. Let D^(prod) denote the maximum distributional query complexity with respect to any product (over variables) distribution. In thi… ▽ More

    Submitted 27 January, 2024; originally announced January 2024.

    Comments: Accepted to STACS 2024

  9. arXiv:2401.06035  [pdf, other

    cs.CV cs.LG

    RAVEN: Rethinking Adversarial Video Generation with Efficient Tri-plane Networks

    Authors: Partha Ghosh, Soubhik Sanyal, Cordelia Schmid, Bernhard Schölkopf

    Abstract: We present a novel unconditional video generative model designed to address long-term spatial and temporal dependencies. To capture these dependencies, our approach incorporates a hybrid explicit-implicit tri-plane representation inspired by 3D-aware generative frameworks developed for three-dimensional object representation and employs a singular latent code to model an entire video sequence. Ind… ▽ More

    Submitted 11 January, 2024; originally announced January 2024.

  10. arXiv:2312.14579  [pdf, other

    cs.CV

    Environment-Specific People

    Authors: Mirela Ostrek, Soubhik Sanyal, Carol O'Sullivan, Michael J. Black, Justus Thies

    Abstract: Despite significant progress in generative image synthesis and full-body generation in particular, state-of-the-art methods are either context-independent, overly reliant to text prompts, or bound to the curated training datasets, such as fashion images with monotonous backgrounds. Here, our goal is to generate people in clothing that is semantically appropriate for a given scene. To this end, we… ▽ More

    Submitted 22 December, 2023; originally announced December 2023.

  11. arXiv:2311.16294  [pdf, other

    cs.CV

    Aligning Non-Causal Factors for Transformer-Based Source-Free Domain Adaptation

    Authors: Sunandini Sanyal, Ashish Ramayee Asokan, Suvaansh Bhambri, Pradyumna YM, Akshay Kulkarni, Jogendra Nath Kundu, R Venkatesh Babu

    Abstract: Conventional domain adaptation algorithms aim to achieve better generalization by aligning only the task-discriminative causal factors between a source and target domain. However, we find that retaining the spurious correlation between causal and non-causal factors plays a vital role in bridging the domain gap and improving target adaptation. Therefore, we propose to build a framework that disenta… ▽ More

    Submitted 27 November, 2023; originally announced November 2023.

    Comments: WACV 2024. Project Page: https://val.cds.iisc.ac.in/C-SFTrans/

  12. arXiv:2311.13159  [pdf, other

    cs.LG math.OC stat.ML

    Multi-Objective Optimization via Wasserstein-Fisher-Rao Gradient Flow

    Authors: Yinuo Ren, Tesi Xiao, Tanmay Gangwani, Anshuka Rangi, Holakou Rahmanian, Lexing Ying, Subhajit Sanyal

    Abstract: Multi-objective optimization (MOO) aims to optimize multiple, possibly conflicting objectives with widespread applications. We introduce a novel interacting particle method for MOO inspired by molecular dynamics simulations. Our approach combines overdamped Langevin and birth-death dynamics, incorporating a "dominance potential" to steer particles toward global Pareto optimality. In contrast to pr… ▽ More

    Submitted 21 November, 2023; originally announced November 2023.

  13. arXiv:2311.09603  [pdf, other

    cs.CL

    Self-Contradictory Reasoning Evaluation and Detection

    Authors: Ziyi Liu, Isabelle Lee, Yongkang Du, Soumya Sanyal, Jieyu Zhao

    Abstract: In a plethora of recent work, large language models (LLMs) demonstrated impressive reasoning ability, but many proposed downstream reasoning tasks focus on performance-wise evaluation. Two fundamental questions persist: 1) how reliable is the quality of reasoning, and 2) can models detect unreliable reasoning? In this paper, we investigate self-contradictory (Self-Contra) reasoning, where the mode… ▽ More

    Submitted 19 February, 2024; v1 submitted 16 November, 2023; originally announced November 2023.

  14. arXiv:2308.14023  [pdf, other

    cs.CV

    Domain-Specificity Inducing Transformers for Source-Free Domain Adaptation

    Authors: Sunandini Sanyal, Ashish Ramayee Asokan, Suvaansh Bhambri, Akshay Kulkarni, Jogendra Nath Kundu, R. Venkatesh Babu

    Abstract: Conventional Domain Adaptation (DA) methods aim to learn domain-invariant feature representations to improve the target adaptation performance. However, we motivate that domain-specificity is equally important since in-domain trained models hold crucial domain-specific properties that are beneficial for adaptation. Hence, we propose to build a framework that supports disentanglement and learning o… ▽ More

    Submitted 27 August, 2023; originally announced August 2023.

    Comments: ICCV 2023. Project page: http://val.cds.iisc.ac.in/DSiT-SFDA

  15. arXiv:2308.10638  [pdf, other

    cs.CV cs.AI cs.GR cs.LG

    SCULPT: Shape-Conditioned Unpaired Learning of Pose-dependent Clothed and Textured Human Meshes

    Authors: Soubhik Sanyal, Partha Ghosh, Jinlong Yang, Michael J. Black, Justus Thies, Timo Bolkart

    Abstract: We present SCULPT, a novel 3D generative model for clothed and textured 3D meshes of humans. Specifically, we devise a deep neural network that learns to represent the geometry and appearance distribution of clothed human bodies. Training such a model is challenging, as datasets of textured 3D meshes for humans are limited in size and accessibility. Our key observation is that there exist medium-s… ▽ More

    Submitted 6 May, 2024; v1 submitted 21 August, 2023; originally announced August 2023.

    Comments: Updated to camera ready version of CVPR 2024

  16. arXiv:2307.11349  [pdf, other

    cs.RO

    EV-Planner: Energy-Efficient Robot Navigation via Event-Based Physics-Guided Neuromorphic Planner

    Authors: Sourav Sanyal, Rohan Kumar Manna, Kaushik Roy

    Abstract: Vision-based object tracking is an essential precursor to performing autonomous aerial navigation in order to avoid obstacles. Biologically inspired neuromorphic event cameras are emerging as a powerful alternative to frame-based cameras, due to their ability to asynchronously detect varying intensities (even in poor lighting conditions), high dynamic range, and robustness to motion blur. Spiking… ▽ More

    Submitted 3 January, 2024; v1 submitted 21 July, 2023; originally announced July 2023.

    Comments: accepted for publication at IEEE Robotics and Automation Letters

  17. arXiv:2307.03900  [pdf, ps, other

    cs.CC

    On the Composition of Randomized Query Complexity and Approximate Degree

    Authors: Sourav Chakraborty, Chandrima Kayal, Rajat Mittal, Manaswi Paraashar, Swagato Sanyal, Nitin Saurabh

    Abstract: For any Boolean functions $f$ and $g$, the question whether $R(f\circ g) = \tildeΘ(R(f)R(g))$, is known as the composition question for the randomized query complexity. Similarly, the composition question for the approximate degree asks whether $\widetilde{deg}(f\circ g) = \tildeΘ(\widetilde{deg}(f)\cdot\widetilde{deg}(g))$. These questions are two of the most important and well-studied problems,… ▽ More

    Submitted 11 July, 2023; v1 submitted 8 July, 2023; originally announced July 2023.

  18. arXiv:2306.03241  [pdf, other

    cs.LG cs.AI cs.CL

    Early Weight Averaging meets High Learning Rates for LLM Pre-training

    Authors: Sunny Sanyal, Atula Neerkaje, Jean Kaddour, Abhishek Kumar, Sujay Sanghavi

    Abstract: Training Large Language Models (LLMs) incurs significant cost; hence, any strategy that accelerates model convergence is helpful. In this paper, we investigate the ability of a simple idea checkpoint averaging along the trajectory of a training run to improve both convergence and generalization quite early on during training. Here we show that models trained with high learning rates observe higher… ▽ More

    Submitted 11 December, 2023; v1 submitted 5 June, 2023; originally announced June 2023.

    Comments: 17 pages, 13 figures, presented at NeurIPs 2023 WANT workshop

  19. arXiv:2306.02680  [pdf, other

    cs.CL cs.LG cs.SD eess.AS

    BeAts: Bengali Speech Acts Recognition using Multimodal Attention Fusion

    Authors: Ahana Deb, Sayan Nag, Ayan Mahapatra, Soumitri Chattopadhyay, Aritra Marik, Pijush Kanti Gayen, Shankha Sanyal, Archi Banerjee, Samir Karmakar

    Abstract: Spoken languages often utilise intonation, rhythm, intensity, and structure, to communicate intention, which can be interpreted differently depending on the rhythm of speech of their utterance. These speech acts provide the foundation of communication and are unique in expression to the language. Recent advancements in attention-based models, demonstrating their ability to learn powerful represent… ▽ More

    Submitted 5 June, 2023; originally announced June 2023.

    Comments: Accepted at INTERSPEECH 2023

  20. arXiv:2305.19472  [pdf, other

    cs.CL cs.AI cs.LG

    PlaSma: Making Small Language Models Better Procedural Knowledge Models for (Counterfactual) Planning

    Authors: Faeze Brahman, Chandra Bhagavatula, Valentina Pyatkin, Jena D. Hwang, Xiang Lorraine Li, Hirona J. Arai, Soumya Sanyal, Keisuke Sakaguchi, Xiang Ren, Yejin Choi

    Abstract: Procedural planning, which entails decomposing a high-level goal into a sequence of temporally ordered steps, is an important yet intricate task for machines. It involves integrating common-sense knowledge to reason about complex contextualized situations that are often counterfactual, e.g. "scheduling a doctor's appointment without a phone". While current approaches show encouraging results using… ▽ More

    Submitted 26 July, 2023; v1 submitted 30 May, 2023; originally announced May 2023.

    Comments: cited new paper, 27 pages

  21. arXiv:2305.18654  [pdf, other

    cs.CL cs.AI cs.LG

    Faith and Fate: Limits of Transformers on Compositionality

    Authors: Nouha Dziri, Ximing Lu, Melanie Sclar, Xiang Lorraine Li, Liwei Jiang, Bill Yuchen Lin, Peter West, Chandra Bhagavatula, Ronan Le Bras, Jena D. Hwang, Soumya Sanyal, Sean Welleck, Xiang Ren, Allyson Ettinger, Zaid Harchaoui, Yejin Choi

    Abstract: Transformer large language models (LLMs) have sparked admiration for their exceptional performance on tasks that demand intricate multi-step reasoning. Yet, these models simultaneously show failures on surprisingly trivial problems. This begs the question: Are these errors incidental, or do they signal more substantial limitations? In an attempt to demystify transformer LLMs, we investigate the li… ▽ More

    Submitted 31 October, 2023; v1 submitted 29 May, 2023; originally announced May 2023.

    Comments: 10 pages + appendix (40 pages)

  22. arXiv:2304.07560  [pdf, other

    cs.CV

    Continual Domain Adaptation through Pruning-aided Domain-specific Weight Modulation

    Authors: Prasanna B, Sunandini Sanyal, R. Venkatesh Babu

    Abstract: In this paper, we propose to develop a method to address unsupervised domain adaptation (UDA) in a practical setting of continual learning (CL). The goal is to update the model on continually changing domains while preserving domain-specific knowledge to prevent catastrophic forgetting of past-seen domains. To this end, we build a framework for preserving domain-specific features utilizing the inh… ▽ More

    Submitted 15 April, 2023; originally announced April 2023.

    Comments: CVPR CLVision Workshop 2023, For code see https://github.com/PrasannaB29/PACDA

  23. Binomial Line Cox Processes: Statistical Characterization and Applications in Wireless Network Analysis

    Authors: Mohammad Taha Shah, Gourab Ghatak, Souradip Sanyal, Martin Haenggi

    Abstract: The current analysis of wireless networks whose transceivers are confined to streets is largely based on Poissonian models, such as Poisson line processes and Poisson line Cox processes. We demonstrate important scenarios where a model with a finite and deterministic number of streets, termed binomial line process, is more accurate. We characterize the statistical properties of the BLP and the cor… ▽ More

    Submitted 10 February, 2023; originally announced February 2023.

    Comments: Submitted to IEEE Transactions on Wireless Communications

  24. arXiv:2212.09282  [pdf, other

    cs.CL cs.AI cs.LG

    APOLLO: A Simple Approach for Adaptive Pretraining of Language Models for Logical Reasoning

    Authors: Soumya Sanyal, Yichong Xu, Shuohang Wang, Ziyi Yang, Reid Pryzant, Wenhao Yu, Chenguang Zhu, Xiang Ren

    Abstract: Logical reasoning of text is an important ability that requires understanding the information present in the text, their interconnections, and then reasoning through them to infer new conclusions. Prior works on improving the logical reasoning ability of language models require complex processing of training data (e.g., aligning symbolic knowledge to text), yielding task-specific data augmentation… ▽ More

    Submitted 4 June, 2023; v1 submitted 19 December, 2022; originally announced December 2022.

    Comments: Accepted at ACL 2023, code available at https://github.com/INK-USC/APOLLO

  25. Lifting to Parity Decision Trees Via Stifling

    Authors: Arkadev Chattopadhyay, Nikhil S. Mande, Swagato Sanyal, Suhail Sherif

    Abstract: We show that the deterministic decision tree complexity of a (partial) function or relation $f$ lifts to the deterministic parity decision tree (PDT) size complexity of the composed function/relation $f \circ g$ as long as the gadget $g$ satisfies a property that we call stifling. We observe that several simple gadgets of constant size, like Indexing on 3 input bits, Inner Product on 4 input bits,… ▽ More

    Submitted 30 November, 2022; originally announced November 2022.

  26. arXiv:2209.10063  [pdf, other

    cs.CL cs.AI

    Generate rather than Retrieve: Large Language Models are Strong Context Generators

    Authors: Wenhao Yu, Dan Iter, Shuohang Wang, Yichong Xu, Mingxuan Ju, Soumya Sanyal, Chenguang Zhu, Michael Zeng, Meng Jiang

    Abstract: Knowledge-intensive tasks, such as open-domain question answering (QA), require access to a large amount of world or domain knowledge. A common approach for knowledge-intensive tasks is to employ a retrieve-then-read pipeline that first retrieves a handful of relevant contextual documents from an external corpus such as Wikipedia and then predicts an answer conditioned on the retrieved documents.… ▽ More

    Submitted 25 January, 2023; v1 submitted 20 September, 2022; originally announced September 2022.

    Comments: Accepted at ICLR 2023 (v3, add code and implementation details)

  27. arXiv:2209.09025  [pdf, other

    cs.RO cs.AI eess.SY

    RAMP-Net: A Robust Adaptive MPC for Quadrotors via Physics-informed Neural Network

    Authors: Sourav Sanyal, Kaushik Roy

    Abstract: Model Predictive Control (MPC) is a state-of-the-art (SOTA) control technique which requires solving hard constrained optimization problems iteratively. For uncertain dynamics, analytical model based robust MPC imposes additional constraints, increasing the hardness of the problem. The problem exacerbates in performance-critical applications, when more compute is required in lesser time. Data-driv… ▽ More

    Submitted 24 February, 2023; v1 submitted 19 September, 2022; originally announced September 2022.

    Comments: This work has been accepted for presentation at the 2023 IEEE International Conference on Robotics and Automation (ICRA), May 29 - June 2, 2023, London, UK. arXiv version will be merged with the conference proceeding once available

  28. arXiv:2209.08042  [pdf, other

    cs.CC

    Decision Tree Complexity versus Block Sensitivity and Degree

    Authors: Rahul Chugh, Supartha Podder, Swagato Sanyal

    Abstract: Relations between the decision tree complexity and various other complexity measures of Boolean functions is a thriving topic of research in computational complexity. It is known that decision tree complexity is bounded above by the cube of block sensitivity, and the cube of polynomial degree. However, the widest separation between decision tree complexity and each of block sensitivity and degree… ▽ More

    Submitted 16 September, 2022; originally announced September 2022.

  29. arXiv:2205.12598  [pdf, other

    cs.CL cs.LG cs.LO

    RobustLR: Evaluating Robustness to Logical Perturbation in Deductive Reasoning

    Authors: Soumya Sanyal, Zeyi Liao, Xiang Ren

    Abstract: Transformers have been shown to be able to perform deductive reasoning on a logical rulebase containing rules and statements written in English natural language. While the progress is promising, it is currently unclear if these models indeed perform logical reasoning by understanding the underlying logical semantics in the language. To this end, we propose RobustLR, a suite of evaluation datasets… ▽ More

    Submitted 8 November, 2022; v1 submitted 25 May, 2022; originally announced May 2022.

    Comments: Accpeted at EMNLP 2022, code available at https://github.com/INK-USC/RobustLR

  30. arXiv:2204.11022  [pdf, other

    cs.CR cs.CV

    Towards Data-Free Model Stealing in a Hard Label Setting

    Authors: Sunandini Sanyal, Sravanti Addepalli, R. Venkatesh Babu

    Abstract: Machine learning models deployed as a service (MLaaS) are susceptible to model stealing attacks, where an adversary attempts to steal the model within a restricted access framework. While existing attacks demonstrate near-perfect clone-model performance using softmax predictions of the classification network, most of the APIs allow access to only the top-1 labels. In this work, we show that it is… ▽ More

    Submitted 23 April, 2022; originally announced April 2022.

    Comments: CVPR 2022, Project Page: https://sites.google.com/view/dfms-hl

  31. arXiv:2203.10261  [pdf, other

    cs.CL cs.AI cs.LG

    FaiRR: Faithful and Robust Deductive Reasoning over Natural Language

    Authors: Soumya Sanyal, Harman Singh, Xiang Ren

    Abstract: Transformers have been shown to be able to perform deductive reasoning on a logical rulebase containing rules and statements written in natural language. Recent works show that such models can also produce the reasoning steps (i.e., the proof graph) that emulate the model's logical reasoning process. Currently, these black-box models generate both the proof graph and intermediate inferences within… ▽ More

    Submitted 19 March, 2022; originally announced March 2022.

    Comments: Accepted in ACL 2022

  32. arXiv:2203.00083  [pdf, ps, other

    cs.AI

    Sampling-Based Winner Prediction in District-Based Elections

    Authors: Palash Dey, Debajyoti Kar, Swagato Sanyal

    Abstract: In a district-based election, we apply a voting rule $r$ to decide the winners in each district, and a candidate who wins in a maximum number of districts is the winner of the election. We present efficient sampling-based algorithms to predict the winner of such district-based election systems in this paper. When $r$ is plurality and the margin of victory is known to be at least $\varepsilon$ frac… ▽ More

    Submitted 28 February, 2022; originally announced March 2022.

    Comments: 27 pages

  33. arXiv:2110.05458  [pdf, other

    cs.CV

    Learning Realistic Human Reposing using Cyclic Self-Supervision with 3D Shape, Pose, and Appearance Consistency

    Authors: Soubhik Sanyal, Alex Vorobiov, Timo Bolkart, Matthew Loper, Betty Mohler, Larry Davis, Javier Romero, Michael J. Black

    Abstract: Synthesizing images of a person in novel poses from a single image is a highly ambiguous task. Most existing approaches require paired training images; i.e. images of the same person with the same clothing in different poses. However, obtaining sufficiently large datasets with paired data is challenging and costly. Previous methods that forego paired supervision lack realism. We propose a self-sup… ▽ More

    Submitted 11 October, 2021; originally announced October 2021.

    Comments: International Conference on Computer Vision (ICCV)

  34. arXiv:2109.04554  [pdf, other

    cs.LG cs.CY cs.DS

    Feature-based Individual Fairness in k-Clustering

    Authors: Debajyoti Kar, Mert Kosan, Debmalya Mandal, Sourav Medya, Arlei Silva, Palash Dey, Swagato Sanyal

    Abstract: Ensuring fairness in machine learning algorithms is a challenging and essential task. We consider the problem of clustering a set of points while satisfying fairness constraints. While there have been several attempts to capture group fairness in the $k$-clustering problem, fairness at an individual level is relatively less explored. We introduce a new notion of individual fairness in $k$-clusteri… ▽ More

    Submitted 3 February, 2023; v1 submitted 9 September, 2021; originally announced September 2021.

  35. arXiv:2108.13654  [pdf, other

    cs.CL

    Discretized Integrated Gradients for Explaining Language Models

    Authors: Soumya Sanyal, Xiang Ren

    Abstract: As a prominent attribution-based explanation algorithm, Integrated Gradients (IG) is widely adopted due to its desirable explanation axioms and the ease of gradient computation. It measures feature importance by averaging the model's output gradient interpolated along a straight-line path in the input data space. However, such straight-line interpolated points are not representative of text data d… ▽ More

    Submitted 31 August, 2021; originally announced August 2021.

    Comments: Accepted in EMNLP 2021

  36. arXiv:2108.13195  [pdf, ps, other

    math.NA cs.LG stat.ML

    On the approximation of a matrix

    Authors: Samriddha Sanyal

    Abstract: Let $F^{*}$ be an approximation of a given $(a \times b)$ matrix $F$ derived by methods that are not randomized. We prove that for a given $F$ and $F^{*}$, $H$ and $T$ can be computed by randomized algorithm such that $(HT)$ is an approximation of $F$ better than $F^{*}$.

    Submitted 25 August, 2021; originally announced August 2021.

  37. arXiv:2105.01963  [pdf, ps, other

    cs.CC

    One-way communication complexity and non-adaptive decision trees

    Authors: Nikhil S. Mande, Swagato Sanyal, Suhail Sherif

    Abstract: We study the relationship between various one-way communication complexity measures of a composed function with the analogous decision tree complexity of the outer function. We consider two gadgets: the AND function on 2 inputs, and the Inner Product on a constant number of inputs. Let $IP$ denote Inner Product on $2b$ bits. - If $f$ is a total Boolean function that depends on all of its inputs,… ▽ More

    Submitted 16 January, 2022; v1 submitted 5 May, 2021; originally announced May 2021.

    Comments: 33 pages

  38. arXiv:2104.14785  [pdf, other

    cs.FL

    Methodology for Biasing Random Simulation for Rapid Coverage of Corner Cases in AMS Designs

    Authors: Sayandeep Sanyal, Ayan Chakraborty, Pallab Dasgupta, Aritra Hazra

    Abstract: Exploring the limits of an Analog and Mixed Signal (AMS) circuit by driving appropriate inputs has been a serious challenge to the industry. Doing an exhaustive search of the entire input state space is a time-consuming exercise and the returns to efforts ratio is quite low. In order to meet time-to-market requirements, often suboptimal coverage results of an integrated circuit (IC) are leveraged.… ▽ More

    Submitted 30 April, 2021; originally announced April 2021.

  39. arXiv:2104.08793  [pdf, other

    cs.CL cs.AI cs.LG

    SalKG: Learning From Knowledge Graph Explanations for Commonsense Reasoning

    Authors: Aaron Chan, Jiashu Xu, Boyuan Long, Soumya Sanyal, Tanishq Gupta, Xiang Ren

    Abstract: Augmenting pre-trained language models with knowledge graphs (KGs) has achieved success on various commonsense reasoning tasks. However, for a given task instance, the KG, or certain parts of the KG, may not be useful. Although KG-augmented models often use attention to focus on specific KG components, the KG is still always used, and the attention mechanism is never explicitly taught which KG com… ▽ More

    Submitted 20 March, 2022; v1 submitted 18 April, 2021; originally announced April 2021.

    Comments: NeurIPS 2021

  40. arXiv:2102.06038  [pdf

    cs.SD cs.CL eess.AS

    A Fractal Approach to Characterize Emotions in Audio and Visual Domain: A Study on Cross-Modal Interaction

    Authors: Sayan Nag, Uddalok Sarkar, Shankha Sanyal, Archi Banerjee, Souparno Roy, Samir Karmakar, Ranjan Sengupta, Dipak Ghosh

    Abstract: It is already known that both auditory and visual stimulus is able to convey emotions in human mind to different extent. The strength or intensity of the emotional arousal vary depending on the type of stimulus chosen. In this study, we try to investigate the emotional arousal in a cross-modal scenario involving both auditory and visual stimulus while studying their source characteristics. A robus… ▽ More

    Submitted 11 February, 2021; originally announced February 2021.

  41. arXiv:2102.06003  [pdf

    cs.SD cs.CL eess.AS

    Language Independent Emotion Quantification using Non linear Modelling of Speech

    Authors: Uddalok Sarkar, Sayan Nag, Chirayata Bhattacharya, Shankha Sanyal, Archi Banerjee, Ranjan Sengupta, Dipak Ghosh

    Abstract: At present emotion extraction from speech is a very important issue due to its diverse applications. Hence, it becomes absolutely necessary to obtain models that take into consideration the speaking styles of a person, vocal tract information, timbral qualities and other congenital information regarding his voice. Our speech production system is a nonlinear system like most other real world system… ▽ More

    Submitted 11 February, 2021; originally announced February 2021.

  42. arXiv:2102.04929  [pdf, ps, other

    cs.GT

    A Game Theoretic Framework for Surplus Food Distribution in Smart Cities and Beyond

    Authors: Surja Sanyal, Vikash Kumar Singh, Fatos Xhafa, Banhi Sanyal, Sajal Mukhopadhyay

    Abstract: Food waste is a major challenge for the present world. It is the precursor to several socioeconomic problems that are plaguing the modern society. To counter the same and to, simultaneously, stand by the undernourished, surplus food redistribution has surfaced as a viable solution. Information and Communications Technology (ICT)-mediated food redistribution is a highly scalable approach and it per… ▽ More

    Submitted 9 February, 2021; originally announced February 2021.

    Comments: 31 pages, 8 figures, 5 tables

  43. arXiv:2102.00616  [pdf

    cs.SD cs.LG cs.MM eess.AS

    Neural Network architectures to classify emotions in Indian Classical Music

    Authors: Uddalok Sarkar, Sayan Nag, Medha Basu, Archi Banerjee, Shankha Sanyal, Ranjan Sengupta, Dipak Ghosh

    Abstract: Music is often considered as the language of emotions. It has long been known to elicit emotions in human being and thus categorizing music based on the type of emotions they induce in human being is a very intriguing topic of research. When the task comes to classify emotions elicited by Indian Classical Music (ICM), it becomes much more challenging because of the inherent ambiguity associated wi… ▽ More

    Submitted 31 January, 2021; originally announced February 2021.

  44. arXiv:2012.02335  [pdf, ps, other

    cs.CC

    Tight Chang's-lemma-type bounds for Boolean functions

    Authors: Sourav Chakraborty, Nikhil S. Mande, Rajat Mittal, Tulasimohan Molli, Manaswi Paraashar, Swagato Sanyal

    Abstract: Chang's lemma (Duke Mathematical Journal, 2002) is a classical result with applications across several areas in mathematics and computer science. For a Boolean function $f$ that takes values in {-1,1} let $r(f)$ denote its Fourier rank. For each positive threshold $t$, Chang's lemma provides a lower bound on $wt(f):=\Pr[f(x)=-1]$ in terms of the dimension of the span of its characters with Fourier… ▽ More

    Submitted 22 May, 2021; v1 submitted 3 December, 2020; originally announced December 2020.

  45. arXiv:2011.08805  [pdf, ps, other

    cs.FL

    Recurrence in Dense-time AMS Assertions

    Authors: Sayandeep Sanyal, Antonio Anastasio Bruto da Costa, Pallab Dasgupta

    Abstract: The notion of recurrence over continuous or dense time, as required for expressing Analog and Mixed-Signal (AMS) behaviours, is fundamentally different from what is offered by the recurrence operators of SystemVerilog Assertions (SVA). This article introduces the formal semantics of recurrence over dense time and provides a methodology for the runtime verification of such properties using interval… ▽ More

    Submitted 17 November, 2020; originally announced November 2020.

  46. arXiv:2008.00266  [pdf, ps, other

    cs.CC

    On parity decision trees for Fourier-sparse Boolean functions

    Authors: Nikhil S. Mande, Swagato Sanyal

    Abstract: We study parity decision trees for Boolean functions. The motivation of our study is the log-rank conjecture for XOR functions and its connection to Fourier analysis and parity decision tree complexity. Let f be a Boolean function with Fourier support S and Fourier sparsity k. 1) We prove via the probabilistic method that there exists a parity decision tree of depth O(sqrt k) that computes f. Th… ▽ More

    Submitted 1 August, 2020; originally announced August 2020.

    MSC Class: 68Q11 ACM Class: F.1.1

  47. arXiv:2006.03239  [pdf, other

    cs.LG math.OC stat.ML

    Think out of the package: Recommending package types for e-commerce shipments

    Authors: Karthik S. Gurumoorthy, Subhajit Sanyal, Vineet Chaoji

    Abstract: Multiple product attributes like dimensions, weight, fragility, liquid content etc. determine the package type used by e-commerce companies to ship products. Sub-optimal package types lead to damaged shipments, incurring huge damage related costs and adversely impacting the company's reputation for safe delivery. Items can be shipped in more protective packages to reduce damage costs, however this… ▽ More

    Submitted 5 June, 2020; originally announced June 2020.

    Comments: Accepted in ECML-PKDD 2020

  48. arXiv:2004.08248  [pdf

    eess.AS cs.SD nlin.CD q-bio.NC

    Acoustical classification of different speech acts using nonlinear methods

    Authors: Chirayata Bhattacharyya, Sourya Sengupta, Sayan Nag, Shankha Sanyal, Archi Banerjee, Ranjan Sengupta, Dipak Ghosh

    Abstract: A recitation is a way of combining the words together so that they have a sense of rhythm and thus an emotional content is imbibed within. In this study we envisaged to answer these questions in a scientific manner taking into consideration 5 (five) well known Bengali recitations of different poets conveying a variety of moods ranging from joy to sorrow. The clips were recited as well as read (in… ▽ More

    Submitted 5 August, 2020; v1 submitted 15 April, 2020; originally announced April 2020.

    Comments: 6 pages, 2 figures; Proceedings of WESPAC 2018, New Delhi, India, November 11-15, 2018

  49. arXiv:2004.07820  [pdf

    cs.SD cs.CL eess.AS

    Speaker Recognition in Bengali Language from Nonlinear Features

    Authors: Uddalok Sarkar, Soumyadeep Pal, Sayan Nag, Chirayata Bhattacharya, Shankha Sanyal, Archi Banerjee, Ranjan Sengupta, Dipak Ghosh

    Abstract: At present Automatic Speaker Recognition system is a very important issue due to its diverse applications. Hence, it becomes absolutely necessary to obtain models that take into consideration the speaking style of a person, vocal tract information, timbral qualities of his voice and other congenital information regarding his voice. The study of Bengali speech recognition and speaker identification… ▽ More

    Submitted 15 April, 2020; originally announced April 2020.

    Comments: arXiv admin note: text overlap with arXiv:1612.00171, arXiv:1601.07709

  50. arXiv:1911.07979  [pdf, other

    cs.LG stat.ML

    ASAP: Adaptive Structure Aware Pooling for Learning Hierarchical Graph Representations

    Authors: Ekagra Ranjan, Soumya Sanyal, Partha Pratim Talukdar

    Abstract: Graph Neural Networks (GNN) have been shown to work effectively for modeling graph structured data to solve tasks such as node classification, link prediction and graph classification. There has been some recent progress in defining the notion of pooling in graphs whereby the model tries to generate a graph level representation by downsampling and summarizing the information present in the nodes.… ▽ More

    Submitted 2 February, 2020; v1 submitted 18 November, 2019; originally announced November 2019.

    Comments: The Thirty-Fourth AAAI Conference on Artificial Intelligence (AAAI 2020)