Skip to main content

Showing 1–17 of 17 results for author: Mallick, T

  1. arXiv:2406.11050  [pdf, other

    cs.CL cs.AI

    A Peek into Token Bias: Large Language Models Are Not Yet Genuine Reasoners

    Authors: Bowen Jiang, Yangxinyu Xie, Zhuoqun Hao, Xiaomeng Wang, Tanwi Mallick, Weijie J. Su, Camillo J. Taylor, Dan Roth

    Abstract: This study introduces a hypothesis-testing framework to assess whether large language models (LLMs) possess genuine reasoning abilities or primarily depend on token bias. We go beyond evaluating LLMs on accuracy; rather, we aim to investigate their token bias in solving logical reasoning tasks. Specifically, we develop carefully controlled synthetic datasets, featuring conjunction fallacy and syll… ▽ More

    Submitted 16 June, 2024; originally announced June 2024.

    Comments: Codes are open-sourced at https://github.com/bowen-upenn/llm_token_bias

  2. arXiv:2406.00252  [pdf, other

    cs.AI cs.CL cs.CV cs.MA

    Multi-Modal and Multi-Agent Systems Meet Rationality: A Survey

    Authors: Bowen Jiang, Yangxinyu Xie, Xiaomeng Wang, Weijie J. Su, Camillo J. Taylor, Tanwi Mallick

    Abstract: Rationality is the quality of being guided by reason, characterized by logical thinking and decision-making that align with evidence and logical rules. This quality is essential for effective problem-solving, as it ensures that solutions are well-founded and systematically derived. Despite the advancements of large language models (LLMs) in generating human-like text with remarkable accuracy, they… ▽ More

    Submitted 18 June, 2024; v1 submitted 31 May, 2024; originally announced June 2024.

  3. arXiv:2402.07877  [pdf, other

    cs.AI

    WildfireGPT: Tailored Large Language Model for Wildfire Analysis

    Authors: Yangxinyu Xie, Tanwi Mallick, Joshua David Bergerson, John K. Hutchison, Duane R. Verner, Jordan Branham, M. Ross Alexander, Robert B. Ross, Yan Feng, Leslie-Anne Levy, Weijie Su

    Abstract: The recent advancement of large language models (LLMs) represents a transformational capability at the frontier of artificial intelligence (AI) and machine learning (ML). However, LLMs are generalized models, trained on extensive text corpus, and often struggle to provide context-specific information, particularly in areas requiring specialized knowledge such as wildfire details within the broader… ▽ More

    Submitted 12 February, 2024; originally announced February 2024.

  4. arXiv:2401.06817  [pdf, other

    cs.CL cs.LG

    Analyzing Regional Impacts of Climate Change using Natural Language Processing Techniques

    Authors: Tanwi Mallick, John Murphy, Joshua David Bergerson, Duane R. Verner, John K Hutchison, Leslie-Anne Levy

    Abstract: Understanding the multifaceted effects of climate change across diverse geographic locations is crucial for timely adaptation and the development of effective mitigation strategies. As the volume of scientific literature on this topic continues to grow exponentially, manually reviewing these documents has become an immensely challenging task. Utilizing Natural Language Processing (NLP) techniques… ▽ More

    Submitted 11 January, 2024; originally announced January 2024.

  5. arXiv:2401.06040  [pdf, other

    cs.LG

    Wavelet-Inspired Multiscale Graph Convolutional Recurrent Network for Traffic Forecasting

    Authors: Qipeng Qian, Tanwi Mallick

    Abstract: Traffic forecasting is the foundation for intelligent transportation systems. Spatiotemporal graph neural networks have demonstrated state-of-the-art performance in traffic forecasting. However, these methods do not explicitly model some of the natural characteristics in traffic data, such as the multiscale structure that encompasses spatial and temporal variations at different levels of granulari… ▽ More

    Submitted 4 March, 2024; v1 submitted 11 January, 2024; originally announced January 2024.

  6. arXiv:2308.15464  [pdf, other

    cs.LG cs.AI

    A Comparative Study of Loss Functions: Traffic Predictions in Regular and Congestion Scenarios

    Authors: Yangxinyu Xie, Tanwi Mallick

    Abstract: Spatiotemporal graph neural networks have achieved state-of-the-art performance in traffic forecasting. However, they often struggle to forecast congestion accurately due to the limitations of traditional loss functions. While accurate forecasting of regular traffic conditions is crucial, a reliable AI system must also accurately forecast congestion scenarios to maintain safe and efficient transpo… ▽ More

    Submitted 29 August, 2023; originally announced August 2023.

  7. arXiv:2302.01887  [pdf, other

    cs.LG

    Analyzing the impact of climate change on critical infrastructure from the scientific literature: A weakly supervised NLP approach

    Authors: Tanwi Mallick, Joshua David Bergerson, Duane R. Verner, John K Hutchison, Leslie-Anne Levy, Prasanna Balaprakash

    Abstract: Natural language processing (NLP) is a promising approach for analyzing large volumes of climate-change and infrastructure-related scientific literature. However, best-in-practice NLP techniques require large collections of relevant documents (corpus). Furthermore, NLP techniques using machine learning and deep learning techniques require labels grouping the articles based on user-defined criteria… ▽ More

    Submitted 5 February, 2023; v1 submitted 3 February, 2023; originally announced February 2023.

  8. arXiv:2209.13123  [pdf, other

    cs.LG

    Explainable Graph Pyramid Autoformer for Long-Term Traffic Forecasting

    Authors: Weiheng Zhong, Tanwi Mallick, Hadi Meidani, Jane Macfarlane, Prasanna Balaprakash

    Abstract: Accurate traffic forecasting is vital to an intelligent transportation system. Although many deep learning models have achieved state-of-art performance for short-term traffic forecasting of up to 1 hour, long-term traffic forecasting that spans multiple hours remains a major challenge. Moreover, most of the existing deep learning traffic forecasting models are black box, presenting additional cha… ▽ More

    Submitted 26 September, 2022; originally announced September 2022.

  9. arXiv:2204.01618  [pdf, other

    cs.LG

    Deep-Ensemble-Based Uncertainty Quantification in Spatiotemporal Graph Neural Networks for Traffic Forecasting

    Authors: Tanwi Mallick, Prasanna Balaprakash, Jane Macfarlane

    Abstract: Deep-learning-based data-driven forecasting methods have produced impressive results for traffic forecasting. A major limitation of these methods, however, is that they provide forecasts without estimates of uncertainty, which are critical for real-time deployments. We focus on a diffusion convolutional recurrent neural network (DCRNN), a state-of-the-art method for short-term traffic forecasting.… ▽ More

    Submitted 5 April, 2022; v1 submitted 4 April, 2022; originally announced April 2022.

  10. arXiv:2112.09792  [pdf, other

    cs.LG

    A data-centric weak supervised learning for highway traffic incident detection

    Authors: Yixuan Sun, Tanwi Mallick, Prasanna Balaprakash, Jane Macfarlane

    Abstract: Using the data from loop detector sensors for near-real-time detection of traffic incidents in highways is crucial to averting major traffic congestion. While recent supervised machine learning methods offer solutions to incident detection by leveraging human-labeled incident data, the false alarm rate is often too high to be used in practice. Specifically, the inconsistency in the human labeling… ▽ More

    Submitted 2 August, 2022; v1 submitted 17 December, 2021; originally announced December 2021.

  11. arXiv:2112.02476  [pdf, other

    cs.CR cs.NI

    Provisioning Fog Services to 3GPP Subscribers: Authentication and Application Mobility

    Authors: Asad Ali, Tushin Mallick, Sadman Sakib, Md. Shohrab Hossain, Ying-Dar Lin

    Abstract: Multi-Access Edge computing (MEC) and Fog computing provide services to subscribers at low latency. There is a need to form a federation among 3GPP MEC and fog to provide better coverage to 3GPP subscribers. This federation gives rise to two issues - third-party authentication and application mobility - for continuous service during handover from 3GPP MEC to fog without re-authentication. In this… ▽ More

    Submitted 4 December, 2021; originally announced December 2021.

    Comments: 6 pages, 5 figures, Submitted to IEEE ICC 2022

  12. arXiv:2008.12767  [pdf, other

    cs.LG cs.NI eess.SP stat.ML

    Dynamic Graph Neural Network for Traffic Forecasting in Wide Area Networks

    Authors: Tanwi Mallick, Mariam Kiran, Bashir Mohammed, Prasanna Balaprakash

    Abstract: Wide area networking infrastructures (WANs), particularly science and research WANs, are the backbone for moving large volumes of scientific data between experimental facilities and data centers. With demands growing at exponential rates, these networks are struggling to cope with large data volumes, real-time responses, and overall network performance. Network operators are increasingly looking f… ▽ More

    Submitted 28 August, 2020; originally announced August 2020.

    Comments: 10 Pages, 11 Figures

  13. arXiv:2004.11994  [pdf, other

    cs.MM cs.LG

    Bharatanatyam Dance Transcription using Multimedia Ontology and Machine Learning

    Authors: Tanwi Mallick, Patha Pratim Das, Arun Kumar Majumdar

    Abstract: Indian Classical Dance is an over 5000 years' old multi-modal language for expressing emotions. Preservation of dance through multimedia technology is a challenging task. In this paper, we develop a system to generate a parseable representation of a dance performance. The system will help to preserve intangible heritage, annotate performances for better tutoring, and synthesize dance performances.… ▽ More

    Submitted 24 April, 2020; originally announced April 2020.

  14. arXiv:2004.08269  [pdf, other

    cs.SD eess.AS

    Beat Detection and Automatic Annotation of the Music of Bharatanatyam Dance using Speech Recognition Techniques

    Authors: Tanwi Mallick, Partha Pratim Das, Arun Kumar Majumdar

    Abstract: Bharatanatyam, an Indian Classical Dance form, represents the rich cultural heritage of India. Analysis and recognition of such dance forms are critical for the preservation of cultural heritage. Like in most dance forms, a Bharatanatyam dancer performs in synchronization with structured rhythmic music, called Sollukattu, which comprises instrumental beats and vocalized utterances (bols) to create… ▽ More

    Submitted 17 April, 2020; originally announced April 2020.

  15. arXiv:2004.08038  [pdf, other

    cs.LG stat.ML

    Transfer Learning with Graph Neural Networks for Short-Term Highway Traffic Forecasting

    Authors: Tanwi Mallick, Prasanna Balaprakash, Eric Rask, Jane Macfarlane

    Abstract: Highway traffic modeling and forecasting approaches are critical for intelligent transportation systems. Recently, deep-learning-based traffic forecasting methods have emerged as state of the art for a wide range of traffic forecasting tasks. However, these methods require a large amount of training data, which needs to be collected over a significant period of time. This can present a number of c… ▽ More

    Submitted 20 April, 2020; v1 submitted 16 April, 2020; originally announced April 2020.

  16. Graph-Partitioning-Based Diffusion Convolutional Recurrent Neural Network for Large-Scale Traffic Forecasting

    Authors: Tanwi Mallick, Prasanna Balaprakash, Eric Rask, Jane Macfarlane

    Abstract: Traffic forecasting approaches are critical to developing adaptive strategies for mobility. Traffic patterns have complex spatial and temporal dependencies that make accurate forecasting on large highway networks a challenging task. Recently, diffusion convolutional recurrent neural networks (DCRNNs) have achieved state-of-the-art results in traffic forecasting by capturing the spatiotemporal dyna… ▽ More

    Submitted 20 April, 2020; v1 submitted 24 September, 2019; originally announced September 2019.

    Journal ref: Transportation Research Record (2020)

  17. arXiv:1909.11023  [pdf, other

    cs.CV cs.LG cs.MM

    Posture and sequence recognition for Bharatanatyam dance performances using machine learning approach

    Authors: Tanwi Mallick, Partha Pratim Das, Arun Kumar Majumdar

    Abstract: Understanding the underlying semantics of performing arts like dance is a challenging task. Dance is multimedia in nature and spans over time as well as space. Capturing and analyzing the multimedia content of the dance is useful for the preservation of cultural heritage, to build video recommendation systems, to assist learners to use tutoring systems. To develop an application for dance, three a… ▽ More

    Submitted 24 September, 2019; originally announced September 2019.

    ACM Class: I.4.9