Skip to main content

Showing 1–50 of 124 results for author: Rossi, A

  1. arXiv:2407.11016  [pdf, other

    cs.CL cs.LG

    LongLaMP: A Benchmark for Personalized Long-form Text Generation

    Authors: Ishita Kumar, Snigdha Viswanathan, Sushrita Yerra, Alireza Salemi, Ryan A. Rossi, Franck Dernoncourt, Hanieh Deilamsalehy, Xiang Chen, Ruiyi Zhang, Shubham Agarwal, Nedim Lipka, Hamed Zamani

    Abstract: Long-text generation is seemingly ubiquitous in real-world applications of large language models such as generating an email or writing a review. Despite the fundamental importance and prevalence of long-text generation in many practical applications, existing work on personalized generation has focused on the generation of very short text. To overcome these limitations, we study the problem of pe… ▽ More

    Submitted 26 June, 2024; originally announced July 2024.

    Comments: 9 pages, 4 figures, 20 tables(including appendix) submitted to EMNLP

  2. arXiv:2407.07291  [pdf, other

    cs.LG cs.AI stat.ML

    Causal Discovery in Semi-Stationary Time Series

    Authors: Shanyun Gao, Raghavendra Addanki, Tong Yu, Ryan A. Rossi, Murat Kocaoglu

    Abstract: Discovering causal relations from observational time series without making the stationary assumption is a significant challenge. In practice, this challenge is common in many areas, such as retail sales, transportation systems, and medical science. Here, we consider this problem for a class of non-stationary time series. The structural causal model (SCM) of this type of time series, called the sem… ▽ More

    Submitted 9 July, 2024; originally announced July 2024.

    ACM Class: I.2.6, G.3

  3. arXiv:2407.07290  [pdf, other

    cs.LG cs.AI stat.ML

    Causal Discovery-Driven Change Point Detection in Time Series

    Authors: Shanyun Gao, Raghavendra Addanki, Tong Yu, Ryan A. Rossi, Murat Kocaoglu

    Abstract: Change point detection in time series seeks to identify times when the probability distribution of time series changes. It is widely applied in many areas, such as human-activity sensing and medical science. In the context of multivariate time series, this typically involves examining the joint distribution of high-dimensional data: If any one variable changes, the whole time series is assumed to… ▽ More

    Submitted 9 July, 2024; originally announced July 2024.

    ACM Class: I.2.6, G.3

  4. arXiv:2407.04855  [pdf, other

    cs.CL cs.AI

    Towards Enhancing Coherence in Extractive Summarization: Dataset and Experiments with LLMs

    Authors: Mihir Parmar, Hanieh Deilamsalehy, Franck Dernoncourt, Seunghyun Yoon, Ryan A. Rossi, Trung Bui

    Abstract: Extractive summarization plays a pivotal role in natural language processing due to its wide-range applications in summarizing diverse content efficiently, while also being faithful to the original content. Despite significant advancement achieved in extractive summarization by Large Language Models (LLMs), these summaries frequently exhibit incoherence. An important aspect of the coherent summary… ▽ More

    Submitted 5 July, 2024; originally announced July 2024.

    Comments: 10 pages

  5. arXiv:2407.02750  [pdf, other

    cs.CL

    Learning to Reduce: Towards Improving Performance of Large Language Models on Structured Data

    Authors: Younghun Lee, Sungchul Kim, Ryan A. Rossi, Tong Yu, Xiang Chen

    Abstract: Large Language Models (LLMs) have been achieving competent performance on a wide range of downstream tasks, yet existing work shows that inference on structured data is challenging for LLMs. This is because LLMs need to either understand long structured data or select the most relevant evidence before inference, and both approaches are not trivial. This paper proposes a framework, Learning to Redu… ▽ More

    Submitted 2 July, 2024; originally announced July 2024.

    Comments: ICML 2024 Workshop on Long-Context Foundation Models, Vienna, Austria 2024. arXiv admin note: substantial text overlap with arXiv:2402.14195

  6. arXiv:2406.05109  [pdf, other

    cs.LG

    Large Generative Graph Models

    Authors: Yu Wang, Ryan A. Rossi, Namyong Park, Huiyuan Chen, Nesreen K. Ahmed, Puja Trivedi, Franck Dernoncourt, Danai Koutra, Tyler Derr

    Abstract: Large Generative Models (LGMs) such as GPT, Stable Diffusion, Sora, and Suno are trained on a huge amount of language corpus, images, videos, and audio that are extremely diverse from numerous domains. This training paradigm over diverse well-curated data lies at the heart of generating creative and sensible content. However, all previous graph generative models (e.g., GraphRNN, MDVAE, MoFlow, GDS… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

  7. arXiv:2404.11602  [pdf, other

    cs.HC

    Interaction Techniques for Exploratory Data Visualization on Mobile Devices

    Authors: Luke S. Snyder, Ryan A. Rossi, Eunyee Koh, Jeffrey Heer, Jane Hoffswell

    Abstract: The ubiquity and on-the-go availability of mobile devices makes them central to many tasks such as interpersonal communication and media consumption. However, despite the potential of mobile devices for on-demand exploratory data visualization, existing mobile interactions are difficult, often using highly custom interactions, complex gestures, or multi-modal input. We synthesize limitations from… ▽ More

    Submitted 17 April, 2024; originally announced April 2024.

    Comments: 4 pages, 1 figure, 1 table, EuroVis 2024 Short Papers

  8. arXiv:2404.01588  [pdf, other

    cs.CL cs.AI cs.LG

    Hallucination Diversity-Aware Active Learning for Text Summarization

    Authors: Yu Xia, Xu Liu, Tong Yu, Sungchul Kim, Ryan A. Rossi, Anup Rao, Tung Mai, Shuai Li

    Abstract: Large Language Models (LLMs) have shown propensity to generate hallucinated outputs, i.e., texts that are factually incorrect or unsupported. Existing methods for alleviating hallucinations typically require costly human annotations to identify and correct hallucinations in LLM outputs. Moreover, most of these methods focus on a specific type of hallucination, e.g., entity or token errors, which l… ▽ More

    Submitted 1 April, 2024; originally announced April 2024.

    Comments: Accepted to NAACL 2024

  9. arXiv:2403.07213  [pdf, other

    cs.LG stat.ML

    Which LLM to Play? Convergence-Aware Online Model Selection with Time-Increasing Bandits

    Authors: Yu Xia, Fang Kong, Tong Yu, Liya Guo, Ryan A. Rossi, Sungchul Kim, Shuai Li

    Abstract: Web-based applications such as chatbots, search engines and news recommendations continue to grow in scale and complexity with the recent surge in the adoption of LLMs. Online model selection has thus garnered increasing attention due to the need to choose the best model among a diverse set while balancing task reward and exploration cost. Organizations faces decisions like whether to employ a cos… ▽ More

    Submitted 11 March, 2024; originally announced March 2024.

    Comments: Accepted by WWW'24 (Oral)

  10. arXiv:2402.14195  [pdf, other

    cs.CL

    Learning to Reduce: Optimal Representations of Structured Data in Prompting Large Language Models

    Authors: Younghun Lee, Sungchul Kim, Tong Yu, Ryan A. Rossi, Xiang Chen

    Abstract: Large Language Models (LLMs) have been widely used as general-purpose AI agents showing comparable performance on many downstream tasks. However, existing work shows that it is challenging for LLMs to integrate structured data (e.g. KG, tables, DBs) into their prompts; LLMs need to either understand long text data or select the most relevant evidence prior to inference, and both approaches are not… ▽ More

    Submitted 21 February, 2024; originally announced February 2024.

    Comments: 5 pages

  11. arXiv:2402.03388  [pdf, other

    cs.AI cs.IR cs.LG

    Delivery Optimized Discovery in Behavioral User Segmentation under Budget Constraint

    Authors: Harshita Chopra, Atanu R. Sinha, Sunav Choudhary, Ryan A. Rossi, Paavan Kumar Indela, Veda Pranav Parwatala, Srinjayee Paul, Aurghya Maiti

    Abstract: Users' behavioral footprints online enable firms to discover behavior-based user segments (or, segments) and deliver segment specific messages to users. Following the discovery of segments, delivery of messages to users through preferred media channels like Facebook and Google can be challenging, as only a portion of users in a behavior segment find match in a medium, and only a fraction of those… ▽ More

    Submitted 15 March, 2024; v1 submitted 4 February, 2024; originally announced February 2024.

  12. arXiv:2402.01981  [pdf, other

    cs.CL cs.AI cs.CY cs.LG

    Self-Debiasing Large Language Models: Zero-Shot Recognition and Reduction of Stereotypes

    Authors: Isabel O. Gallegos, Ryan A. Rossi, Joe Barrow, Md Mehrab Tanjim, Tong Yu, Hanieh Deilamsalehy, Ruiyi Zhang, Sungchul Kim, Franck Dernoncourt

    Abstract: Large language models (LLMs) have shown remarkable advances in language generation and understanding but are also prone to exhibiting harmful social biases. While recognition of these behaviors has generated an abundance of bias mitigation techniques, most require modifications to the training data, model parameters, or decoding strategy, which may be infeasible without access to a trainable model… ▽ More

    Submitted 2 February, 2024; originally announced February 2024.

  13. arXiv:2401.14983  [pdf

    cs.DB hep-ex

    Quota management in dCache or making a perfectly normal file system normal

    Authors: Dmitry Litvintsev, Chitrapu Krishnaveni, Svenja Meyer, Paul Millar, Tigran Mkrtchyan, Lea Morschel, Albert Rossi, Marina Sahakyan

    Abstract: dCache (https://dcache.org) is a highly scalable storage system providing location-independent access to data. The data are stored across multiple data servers as complete files presented to the end-user via a single-rooted namespace. From its inception, dCache has been designed as a caching disk buffer to a tertiary tape storage system with the assumption that the latter has virtually unlimited c… ▽ More

    Submitted 26 January, 2024; originally announced January 2024.

    Comments: 26th Intl Conf Computing High Energy & Nuclear Phys (CHEP 2023)

    Report number: FERMILAB-CONF-23-530-CSAID

  14. arXiv:2311.16838  [pdf, other

    cs.RO

    Increasing Transparency of Reinforcement Learning using Shielding for Human Preferences and Explanations

    Authors: Georgios Angelopoulos, Luigi Mangiacapra, Alessandra Rossi, Claudia Di Napoli, Silvia Rossi

    Abstract: The adoption of Reinforcement Learning (RL) in several human-centred applications provides robots with autonomous decision-making capabilities and adaptability based on the observations of the operating environment. In such scenarios, however, the learning process can make robots' behaviours unclear and unpredictable to humans, thus preventing a smooth and effective Human-Robot Interaction (HRI).… ▽ More

    Submitted 28 November, 2023; originally announced November 2023.

  15. arXiv:2311.12182  [pdf, ps, other

    cs.RO cs.AI

    Common (good) practices measuring trust in HRI

    Authors: Patrick Holthaus, Alessandra Rossi

    Abstract: Trust in robots is widely believed to be imperative for the adoption of robots into people's daily lives. It is, therefore, understandable that the literature of the last few decades focuses on measuring how much people trust robots -- and more generally, any agent - to foster such trust in these technologies. Researchers have been exploring how people trust robot in different ways, such as measur… ▽ More

    Submitted 20 November, 2023; originally announced November 2023.

    Comments: In SCRITA 2023 Workshop Proceedings (arXiv:2311.05401) held in conjunction with 32nd IEEE International Conference on Robot & Human Interactive Communication, 28/08 - 31/08 2023, Busan (Korea)

    Report number: SCRITA/2023/10

  16. arXiv:2311.05401   

    cs.RO

    SCRITA 2023: Trust, Acceptance and Social Cues in Human-Robot Interaction

    Authors: Alessandra Rossi, Patrick Holthaus, Gabriella Lakatos, Sílvia Moros, Lewis Riches

    Abstract: This workshop focused on identifying the challenges and dynamics between people and robots to foster short interactions and long-lasting relationships in different fields, from educational, service, collaborative, companion, care-home and medical robotics. For that, this workshop facilitated a discussion about people's trust towards robots "in the field", inviting workshop participants to contribu… ▽ More

    Submitted 9 November, 2023; originally announced November 2023.

    Comments: SCRITA 2023 workshop proceedings including 9 articles

    Report number: SCRITA/2023

  17. Feature Selection and Hyperparameter Fine-tuning in Artificial Neural Networks for Wood Quality Classification

    Authors: Mateus Roder, Leandro Aparecido Passos, João Paulo Papa, André Luis Debiaso Rossi

    Abstract: Quality classification of wood boards is an essential task in the sawmill industry, which is still usually performed by human operators in small to median companies in developing countries. Machine learning algorithms have been successfully employed to investigate the problem, offering a more affordable alternative compared to other solutions. However, such approaches usually present some drawback… ▽ More

    Submitted 20 October, 2023; originally announced October 2023.

  18. arXiv:2310.13227  [pdf, other

    cs.CL cs.AI cs.LG

    ToolChain*: Efficient Action Space Navigation in Large Language Models with A* Search

    Authors: Yuchen Zhuang, Xiang Chen, Tong Yu, Saayan Mitra, Victor Bursztyn, Ryan A. Rossi, Somdeb Sarkhel, Chao Zhang

    Abstract: Large language models (LLMs) have demonstrated powerful decision-making and planning capabilities in solving complicated real-world problems. LLM-based autonomous agents can interact with diverse tools (e.g., functional APIs) and generate solution plans that execute a series of API function calls in a step-by-step manner. The multitude of candidate API function calls significantly expands the acti… ▽ More

    Submitted 19 October, 2023; originally announced October 2023.

  19. arXiv:2310.09626  [pdf, ps, other

    cs.RO

    Current and Future Challenges in Humanoid Robotics -- An Empirical Investigation

    Authors: Maike Paetzel-Prüsmann, Alessandra Rossi, Merel Keijsers

    Abstract: The goal of RoboCup is to make research in the area of robotics measurable over time, and grow a community that works together to solve increasingly difficult challenges over the years. The most ambitious of these challenges it to be able to play against the human world champions in soccer in 2050. To better understand what members of the RoboCup community believes to be the state of the art and t… ▽ More

    Submitted 14 October, 2023; originally announced October 2023.

  20. arXiv:2309.09400  [pdf, other

    cs.CL cs.AI

    CulturaX: A Cleaned, Enormous, and Multilingual Dataset for Large Language Models in 167 Languages

    Authors: Thuat Nguyen, Chien Van Nguyen, Viet Dac Lai, Hieu Man, Nghia Trung Ngo, Franck Dernoncourt, Ryan A. Rossi, Thien Huu Nguyen

    Abstract: The driving factors behind the development of large language models (LLMs) with impressive learning capabilities are their colossal model sizes and extensive training datasets. Along with the progress in natural language processing, LLMs have been frequently made accessible to the public to foster deeper investigation and applications. However, when it comes to training datasets for these LLMs, es… ▽ More

    Submitted 17 September, 2023; originally announced September 2023.

    Comments: Ongoing Work

  21. arXiv:2309.08872  [pdf, other

    cs.CL cs.AI cs.LG

    PDFTriage: Question Answering over Long, Structured Documents

    Authors: Jon Saad-Falcon, Joe Barrow, Alexa Siu, Ani Nenkova, David Seunghyun Yoon, Ryan A. Rossi, Franck Dernoncourt

    Abstract: Large Language Models (LLMs) have issues with document question answering (QA) in situations where the document is unable to fit in the small context length of an LLM. To overcome this issue, most existing works focus on retrieving the relevant context from the document, representing them as plain text. However, documents such as PDFs, web pages, and presentations are naturally structured with dif… ▽ More

    Submitted 8 November, 2023; v1 submitted 16 September, 2023; originally announced September 2023.

  22. arXiv:2309.00770  [pdf, other

    cs.CL cs.AI cs.CY cs.LG

    Bias and Fairness in Large Language Models: A Survey

    Authors: Isabel O. Gallegos, Ryan A. Rossi, Joe Barrow, Md Mehrab Tanjim, Sungchul Kim, Franck Dernoncourt, Tong Yu, Ruiyi Zhang, Nesreen K. Ahmed

    Abstract: Rapid advancements of large language models (LLMs) have enabled the processing, understanding, and generation of human-like text, with increasing integration into systems that touch our social sphere. Despite this success, these models can learn, perpetuate, and amplify harmful social biases. In this paper, we present a comprehensive survey of bias evaluation and mitigation techniques for LLMs. We… ▽ More

    Submitted 12 July, 2024; v1 submitted 1 September, 2023; originally announced September 2023.

    Comments: Accepted at Computational Linguistics, Volume 50, Number 3

  23. arXiv:2308.11730  [pdf, other

    cs.CL cs.AI cs.IR cs.LG

    Knowledge Graph Prompting for Multi-Document Question Answering

    Authors: Yu Wang, Nedim Lipka, Ryan A. Rossi, Alexa Siu, Ruiyi Zhang, Tyler Derr

    Abstract: The `pre-train, prompt, predict' paradigm of large language models (LLMs) has achieved remarkable success in open-domain question answering (OD-QA). However, few works explore this paradigm in the scenario of multi-document question answering (MD-QA), a task demanding a thorough understanding of the logical associations among the contents and structures of different documents. To fill this crucial… ▽ More

    Submitted 25 December, 2023; v1 submitted 22 August, 2023; originally announced August 2023.

  24. arXiv:2307.16039  [pdf, other

    cs.CL cs.LG

    Okapi: Instruction-tuned Large Language Models in Multiple Languages with Reinforcement Learning from Human Feedback

    Authors: Viet Dac Lai, Chien Van Nguyen, Nghia Trung Ngo, Thuat Nguyen, Franck Dernoncourt, Ryan A. Rossi, Thien Huu Nguyen

    Abstract: A key technology for the development of large language models (LLMs) involves instruction tuning that helps align the models' responses with human expectations to realize impressive learning abilities. Two major approaches for instruction tuning characterize supervised fine-tuning (SFT) and reinforcement learning from human feedback (RLHF), which are currently applied to produce the best commercia… ▽ More

    Submitted 1 August, 2023; v1 submitted 29 July, 2023; originally announced July 2023.

  25. arXiv:2307.10867  [pdf, other

    cs.CL cs.CV cs.LG

    FigCaps-HF: A Figure-to-Caption Generative Framework and Benchmark with Human Feedback

    Authors: Ashish Singh, Prateek Agarwal, Zixuan Huang, Arpita Singh, Tong Yu, Sungchul Kim, Victor Bursztyn, Nikos Vlassis, Ryan A. Rossi

    Abstract: Captions are crucial for understanding scientific visualizations and documents. Existing captioning methods for scientific figures rely on figure-caption pairs extracted from documents for training, many of which fall short with respect to metrics like helpfulness, explainability, and visual-descriptiveness [15] leading to generated captions being misaligned with reader preferences. To enable the… ▽ More

    Submitted 20 July, 2023; originally announced July 2023.

    Comments: 19 pages, 4 figures. Benchmark Documentation: https://figcapshf.github.io/

  26. arXiv:2307.03929  [pdf, other

    cs.LG cs.IR cs.SI

    Fairness-Aware Graph Neural Networks: A Survey

    Authors: April Chen, Ryan A. Rossi, Namyong Park, Puja Trivedi, Yu Wang, Tong Yu, Sungchul Kim, Franck Dernoncourt, Nesreen K. Ahmed

    Abstract: Graph Neural Networks (GNNs) have become increasingly important due to their representational power and state-of-the-art predictive performance on many fundamental learning tasks. Despite this success, GNNs suffer from fairness issues that arise as a result of the underlying graph data and the fundamental aggregation mechanism that lies at the heart of the large class of GNN models. In this articl… ▽ More

    Submitted 8 July, 2023; originally announced July 2023.

  27. arXiv:2306.11855  [pdf, other

    cs.LG stat.ME

    A Model-free Closeness-of-influence Test for Features in Supervised Learning

    Authors: Mohammad Mehrabi, Ryan A. Rossi

    Abstract: Understanding the effect of a feature vector $x \in \mathbb{R}^d$ on the response value (label) $y \in \mathbb{R}$ is the cornerstone of many statistical learning problems. Ideally, it is desired to understand how a set of collected features combine together and influence the response value, but this problem is notoriously difficult, due to the high-dimensionality of data and limited number of lab… ▽ More

    Submitted 20 June, 2023; originally announced June 2023.

  28. arXiv:2305.10434  [pdf, other

    cs.CL cs.AI cs.LG

    Learning the Visualness of Text Using Large Vision-Language Models

    Authors: Gaurav Verma, Ryan A. Rossi, Christopher Tensmeyer, Jiuxiang Gu, Ani Nenkova

    Abstract: Visual text evokes an image in a person's mind, while non-visual text fails to do so. A method to automatically detect visualness in text will enable text-to-image retrieval and generation models to augment text with relevant images. This is particularly challenging with long-form text as text-to-image generation and retrieval models are often triggered for text that is designed to be explicitly v… ▽ More

    Submitted 22 October, 2023; v1 submitted 11 May, 2023; originally announced May 2023.

    Comments: Accepted at EMNLP 2023 (Main, long); 9 pages, 5 figures

  29. arXiv:2303.15652  [pdf, other

    cs.LG cs.DS

    Structured Dynamic Pricing: Optimal Regret in a Global Shrinkage Model

    Authors: Rashmi Ranjan Bhuyan, Adel Javanmard, Sungchul Kim, Gourab Mukherjee, Ryan A. Rossi, Tong Yu, Handong Zhao

    Abstract: We consider dynamic pricing strategies in a streamed longitudinal data set-up where the objective is to maximize, over time, the cumulative profit across a large number of customer segments. We consider a dynamic model with the consumers' preferences as well as price sensitivity varying over time. Building on the well-known finding that consumers sharing similar characteristics act in similar ways… ▽ More

    Submitted 13 October, 2023; v1 submitted 27 March, 2023; originally announced March 2023.

    Comments: 43 pages, 10 figures

  30. arXiv:2303.13525  [pdf, other

    cs.DC cs.AI cs.LG

    Forecasting Workload in Cloud Computing: Towards Uncertainty-Aware Predictions and Transfer Learning

    Authors: Andrea Rossi, Andrea Visentin, Diego Carraro, Steven Prestwich, Kenneth N. Brown

    Abstract: Predicting future resource demand in Cloud Computing is essential for optimizing the trade-off between serving customers' requests efficiently and minimizing the provisioning cost. Modelling prediction uncertainty is also desirable to better inform the resource decision-making process, but research in this field is under-investigated. In this paper, we propose univariate and bivariate Bayesian dee… ▽ More

    Submitted 12 November, 2023; v1 submitted 24 February, 2023; originally announced March 2023.

  31. DataPilot: Utilizing Quality and Usage Information for Subset Selection during Visual Data Preparation

    Authors: Arpit Narechania, Fan Du, Atanu R Sinha, Ryan A. Rossi, Jane Hoffswell, Shunan Guo, Eunyee Koh, Shamkant B. Navathe, Alex Endert

    Abstract: Selecting relevant data subsets from large, unfamiliar datasets can be difficult. We address this challenge by modeling and visualizing two kinds of auxiliary information: (1) quality - the validity and appropriateness of data required to perform certain analytical tasks; and (2) usage - the historical utilization characteristics of data across multiple users. Through a design study with 14 data w… ▽ More

    Submitted 2 March, 2023; originally announced March 2023.

    Comments: 18 pages, 5 figures, 1 table, ACM CHI 2023

  32. arXiv:2212.14077  [pdf, other

    cs.LG cs.DM cs.SI

    A Hypergraph Neural Network Framework for Learning Hyperedge-Dependent Node Embeddings

    Authors: Ryan Aponte, Ryan A. Rossi, Shunan Guo, Jane Hoffswell, Nedim Lipka, Chang Xiao, Gromit Chan, Eunyee Koh, Nesreen Ahmed

    Abstract: In this work, we introduce a hypergraph representation learning framework called Hypergraph Neural Networks (HNN) that jointly learns hyperedge embeddings along with a set of hyperedge-dependent embeddings for each node in the hypergraph. HNN derives multiple embeddings per node in the hypergraph where each embedding for a node is dependent on a specific hyperedge of that node. Notably, HNN is acc… ▽ More

    Submitted 28 December, 2022; originally announced December 2022.

  33. arXiv:2212.13709  [pdf, other

    cs.LG cs.SI

    PersonaSAGE: A Multi-Persona Graph Neural Network

    Authors: Gautam Choudhary, Iftikhar Ahamath Burhanuddin, Eunyee Koh, Fan Du, Ryan A. Rossi

    Abstract: Graph Neural Networks (GNNs) have become increasingly important in recent years due to their state-of-the-art performance on many important downstream applications. Existing GNNs have mostly focused on learning a single node representation, despite that a node often exhibits polysemous behavior in different contexts. In this work, we develop a persona-based graph neural network framework called Pe… ▽ More

    Submitted 28 December, 2022; originally announced December 2022.

    Comments: 10 pages, 6 figures, 7 tables

  34. arXiv:2212.13257  [pdf

    physics.med-ph cs.CV eess.IV eess.SY physics.optics

    A portable widefield fundus camera with high dynamic range imaging capability

    Authors: Alfa Rossi, Mojtaba Rahimi, David Le, Taeyoon son, Michael J. Heiferman, R. V. Paul Chan, Xincheng Yao

    Abstract: Fundus photography is indispensable for clinical detection and management of eye diseases. Limited image contrast and field of view (FOV) are common limitations of conventional fundus cameras, making it difficult to detect subtle abnormalities at the early stages of eye diseases. Further improvements of image contrast and FOV coverage are important to improve early disease detection and reliable t… ▽ More

    Submitted 20 December, 2022; originally announced December 2022.

    Comments: 12 pages, 8 figures

  35. From Actions to Events: A Transfer Learning Approach Using Improved Deep Belief Networks

    Authors: Mateus Roder, Jurandy Almeida, Gustavo H. de Rosa, Leandro A. Passos, André L. D. Rossi, João P. Papa

    Abstract: In the last decade, exponential data growth supplied machine learning-based algorithms' capacity and enabled their usage in daily-life activities. Additionally, such an improvement is partially explained due to the advent of deep learning techniques, i.e., stacks of simple architectures that end up in more complex models. Although both factors produce outstanding results, they also pose drawbacks… ▽ More

    Submitted 30 November, 2022; originally announced November 2022.

  36. arXiv:2211.02646  [pdf, other

    cs.LG cs.AI cs.MM

    Robustness of Fusion-based Multimodal Classifiers to Cross-Modal Content Dilutions

    Authors: Gaurav Verma, Vishwa Vinay, Ryan A. Rossi, Srijan Kumar

    Abstract: As multimodal learning finds applications in a wide variety of high-stakes societal tasks, investigating their robustness becomes important. Existing work has focused on understanding the robustness of vision-and-language models to imperceptible variations on benchmark tasks. In this work, we investigate the robustness of multimodal classifiers to cross-modal dilutions - a plausible variation. We… ▽ More

    Submitted 4 November, 2022; originally announced November 2022.

    Comments: Accepted at the 2022 Conference on Empirical Methods in Natural Language Processing (EMNLP); Full Paper (Oral)

  37. arXiv:2210.00032  [pdf, other

    cs.LG cs.SI

    Direct Embedding of Temporal Network Edges via Time-Decayed Line Graphs

    Authors: Sudhanshu Chanpuriya, Ryan A. Rossi, Sungchul Kim, Tong Yu, Jane Hoffswell, Nedim Lipka, Shunan Guo, Cameron Musco

    Abstract: Temporal networks model a variety of important phenomena involving timed interactions between entities. Existing methods for machine learning on temporal networks generally exhibit at least one of two limitations. First, time is assumed to be discretized, so if the time data is continuous, the user must determine the discretization and discard precise time information. Second, edge representations… ▽ More

    Submitted 30 September, 2022; originally announced October 2022.

  38. arXiv:2208.11090   

    cs.RO

    IEEE Trust, Acceptance and Social Cues in Human-Robot Interaction -- SCRITA 2022 Workshop

    Authors: Alessandra Rossi, Patrick Holthaus, Sìlvia Moros, Gabriella Lakatos

    Abstract: The Trust, Acceptance and Social Cues in Human-Robot Interaction - SCRITA is the 5th edition of a series of workshops held in conjunction with the IEEE RO-MAN conference. This workshop focuses on addressing the challenges and development of the dynamics between people and robots in order to foster short interactions and long-lasting relationships in different fields, from educational, service, col… ▽ More

    Submitted 28 August, 2022; v1 submitted 22 August, 2022; originally announced August 2022.

    Comments: SCRITA 2022 workshop proceedings including 8 articles

    Report number: SCRITA/2022

    Journal ref: 31st IEEE International Conference on Robot & Human Interactive Communication, 29 August - 3 September 2022

  39. arXiv:2208.11030  [pdf, ps, other

    quant-ph cs.LG cs.SI q-bio.MN

    Link prediction with continuous-time classical and quantum walks

    Authors: Mark Goldsmith, Guillermo García-Pérez, Joonas Malmi, Matteo A. C. Rossi, Harto Saarinen, Sabrina Maniscalco

    Abstract: Protein-protein interaction (PPI) networks consist of the physical and/or functional interactions between the proteins of an organism. Since the biophysical and high-throughput methods used to form PPI networks are expensive, time-consuming, and often contain inaccuracies, the resulting networks are usually incomplete. In order to infer missing interactions in these networks, we propose a novel cl… ▽ More

    Submitted 23 August, 2022; originally announced August 2022.

    Comments: 9 pages, 5 figures

  40. arXiv:2208.09076  [pdf, other

    cs.IR cs.LG cs.SI

    Implicit Session Contexts for Next-Item Recommendations

    Authors: Sejoon Oh, Ankur Bhardwaj, Jongseok Han, Sungchul Kim, Ryan A. Rossi, Srijan Kumar

    Abstract: Session-based recommender systems capture the short-term interest of a user within a session. Session contexts (i.e., a user's high-level interests or intents within a session) are not explicitly given in most datasets, and implicitly inferring session context as an aggregation of item-level attributes is crude. In this paper, we propose ISCON, which implicitly contextualizes sessions. ISCON first… ▽ More

    Submitted 18 August, 2022; originally announced August 2022.

    Comments: Accepted for publication at: 31st ACM International Conference on Information and Knowledge Management (CIKM 2022) short paper track. Code and data at: https://github.com/srijankr/iscon

  41. arXiv:2207.00389  [pdf, ps, other

    math.AP cs.LG math-ph math.OC

    Analysis of Kinetic Models for Label Switching and Stochastic Gradient Descent

    Authors: Martin Burger, Alex Rossi

    Abstract: In this paper we provide a novel approach to the analysis of kinetic models for label switching, which are used for particle systems that can randomly switch between gradient flows in different energy landscapes. Besides problems in biology and physics, we also demonstrate that stochastic gradient descent, the most popular technique in machine learning, can be understood in this setting, when cons… ▽ More

    Submitted 8 December, 2022; v1 submitted 1 July, 2022; originally announced July 2022.

  42. arXiv:2206.08270   

    cs.RO cs.AI

    The Road to a Successful HRI: AI, Trust and ethicS-TRAITS

    Authors: Alessandra Rossi, Antonio Andriella, Silvia Rossi, Anouk van Maris

    Abstract: The aim of this workshop is to foster the exchange of insights on past and ongoing research towards effective and long-lasting collaborations between humans and robots. This workshop will provide a forum for representatives from academia and industry communities to analyse the different aspects of HRI that impact on its success. We particularly focus on AI techniques required to implement autonomo… ▽ More

    Submitted 7 June, 2022; originally announced June 2022.

    Comments: TRAITS Workshop Proceedings including 7 articles

    Journal ref: Companion of the 2022 ACM/IEEE International Conference on Human-Robot Interaction, March 2022

  43. arXiv:2206.03635  [pdf, other

    cs.SI cs.CY cs.HC cs.LG

    Network Report: A Structured Description for Network Datasets

    Authors: Xinyi Zheng, Ryan A. Rossi, Nesreen Ahmed, Dominik Moritz

    Abstract: The rapid development of network science and technologies depends on shareable datasets. Currently, there is no standard practice for reporting and sharing network datasets. Some network dataset providers only share links, while others provide some contexts or basic statistics. As a result, critical information may be unintentionally dropped, and network dataset consumers may misunderstand or over… ▽ More

    Submitted 7 June, 2022; originally announced June 2022.

  44. arXiv:2205.14459  [pdf, other

    cs.CV cs.LG

    CyCLIP: Cyclic Contrastive Language-Image Pretraining

    Authors: Shashank Goel, Hritik Bansal, Sumit Bhatia, Ryan A. Rossi, Vishwa Vinay, Aditya Grover

    Abstract: Recent advances in contrastive representation learning over paired image-text data have led to models such as CLIP that achieve state-of-the-art performance for zero-shot classification and distributional robustness. Such models typically require joint reasoning in the image and text representation spaces for downstream inference tasks. Contrary to prior beliefs, we demonstrate that the image and… ▽ More

    Submitted 26 October, 2022; v1 submitted 28 May, 2022; originally announced May 2022.

    Comments: 19 pages, 13 tables, 6 figures, Oral at NeuRIPS 2022

  45. arXiv:2203.06631  [pdf, other

    cs.RO cs.AI cs.MA

    A ROS Architecture for Personalised HRI with a Bartender Social Robot

    Authors: Alessandra Rossi, Maria Di Maro, Antonio Origlia, Agostino Palmiero, Silvia Rossi

    Abstract: BRILLO (Bartending Robot for Interactive Long-Lasting Operations) project has the overall goal of creating an autonomous robotic bartender that can interact with customers while accomplishing its bartending tasks. In such a scenario, people's novelty effect connected to the use of an attractive technology is destined to wear off and, consequently, it negatively affects the success of the service r… ▽ More

    Submitted 15 March, 2022; v1 submitted 13 March, 2022; originally announced March 2022.

    Comments: 2022 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2022)

  46. arXiv:2111.14674  [pdf, ps, other

    cs.LG cs.AI cs.DS stat.ML

    Online MAP Inference and Learning for Nonsymmetric Determinantal Point Processes

    Authors: Aravind Reddy, Ryan A. Rossi, Zhao Song, Anup Rao, Tung Mai, Nedim Lipka, Gang Wu, Eunyee Koh, Nesreen Ahmed

    Abstract: In this paper, we introduce the online and streaming MAP inference and learning problems for Non-symmetric Determinantal Point Processes (NDPPs) where data points arrive in an arbitrary order and the algorithms are constrained to use a single-pass over the data as well as sub-linear memory. The online setting has an additional requirement of maintaining a valid solution at any point in time. For s… ▽ More

    Submitted 29 November, 2021; originally announced November 2021.

  47. arXiv:2111.03030  [pdf, other

    cs.LG cs.SI

    Exact Representation of Sparse Networks with Symmetric Nonnegative Embeddings

    Authors: Sudhanshu Chanpuriya, Ryan A. Rossi, Anup Rao, Tung Mai, Nedim Lipka, Zhao Song, Cameron Musco

    Abstract: Many models for undirected graphs are based on factorizing the graph's adjacency matrix; these models find a vector representation of each node such that the predicted probability of a link between two nodes increases with the similarity (dot product) of their associated vectors. Recent work has shown that these models are unable to capture key structures in real-world graphs, particularly heterop… ▽ More

    Submitted 30 September, 2022; v1 submitted 4 November, 2021; originally announced November 2021.

  48. Cookie Banners, What's the Purpose? Analyzing Cookie Banner Text Through a Legal Lens

    Authors: Cristiana Santos, Arianna Rossi, Lorena Sánchez Chamorro, Kerstin Bongard-Blanchy, Ruba Abu-Salma

    Abstract: A cookie banner pops up when a user visits a website for the first time, requesting consent to the use of cookies and other trackers for a variety of purposes. Unlike prior work that has focused on evaluating the user interface (UI) design of cookie banners, this paper presents an in-depth analysis of what cookie banners say to users to get their consent. We took an interdisciplinary approach to d… ▽ More

    Submitted 7 October, 2021; v1 submitted 6 October, 2021; originally announced October 2021.

  49. Influence-guided Data Augmentation for Neural Tensor Completion

    Authors: Sejoon Oh, Sungchul Kim, Ryan A. Rossi, Srijan Kumar

    Abstract: How can we predict missing values in multi-dimensional data (or tensors) more accurately? The task of tensor completion is crucial in many applications such as personalized recommendation, image and video restoration, and link prediction in social networks. Many tensor factorization and neural network-based tensor completion algorithms have been developed to predict missing entries in partially ob… ▽ More

    Submitted 23 August, 2021; originally announced August 2021.

    Comments: Accepted for publication at 30th ACM International Conference on Information and Knowledge Management (ACM CIKM 2021). Code and data: https://github.com/srijankr/DAIN

  50. arXiv:2108.08092   

    cs.RO

    Trust, Acceptance and Social Cues in Human-Robot Interaction -- SCRITA 2021

    Authors: Alessandra Rossi, Patrick Holthaus, Sílvia Moros, Marcus Scheunemann, Gabriella Lakatos

    Abstract: This workshop aimed for a deeper exploration of trust and acceptance in human-robot interaction (HRI) from a multidisciplinary perspective including robots' capabilities of sensing and perceiving other agents, the environment, and human-robot dynamics. The workshop was held online in conjunction with IEEE RO-MAN 2021 (see https://ro-man2021.org/). Three invited speakers and six position papers ana… ▽ More

    Submitted 19 August, 2021; v1 submitted 18 August, 2021; originally announced August 2021.

    Comments: SCRITA 2021 workshop proceedings including 5 articles