Showing 1–50 of 5,941 results for author: Paul

  1. arXiv:2407.11934  [pdf, other]

    cs.SE

    Code Documentation and Analysis to Secure Software Development

    Authors: Paul Attie, Anas Obeidat, Nathaniel Oh, Ian Yelle

    Abstract: We present the Code Documentation and Analysis Tool (CoDAT). CoDAT is a tool designed to maintain consistency between the various levels of code documentation, e.g. if a line in a code sketch is changed, the comment that documents the corresponding code is also changed. That is, comments are linked and updated so as to remain internally consistent and also consistent with the code. By flagging "ou…

    Submitted 16 July, 2024; originally announced July 2024.

    Comments: 31 pages

    ACM Class: D.2.2; D.2.3; D.2.5; D.2.6

  2. arXiv:2407.11366  [pdf]

    cs.CY

    Perceived Importance of ICT Proficiency for Teaching, Learning, and Career Progression among Physical Education Teachers in Pampanga

    Authors: Kristine Joy D. Magallanes, Mark Brianne C. Carreon, Kristalyn C. Miclat, Niña Vina V. Salita, Gino A. Sumilhig, Raymart Christopher C. Guevarra, John Paul P. Miranda

    Abstract: The integration of information and communication technology (ICT) has become increasingly vital across various educational fields, including physical education (PE). This study aimed to evaluate the proficiency levels of PE teachers in using various ICT applications and to examine the relationship between the perceived importance of ICT proficiency for teaching and learning, career advancement, an…

    Submitted 16 July, 2024; originally announced July 2024.

    Comments: 16 pages, 1 figure, 4 tables

    Journal ref: Puissant 5 (2024) 2336-2351

  3. arXiv:2407.10989  [pdf]

    cs.CL cs.AI cs.HC

    Do Large Language Models Understand Verbal Indicators of Romantic Attraction?

    Authors: Sandra C. Matz, Heinrich Peters, Paul W. Eastwick, Moran Cerf, Eli J. Finkel

    Abstract: What makes people 'click' on a first date and become mutually attracted to one another? While understanding and predicting the dynamics of romantic interactions used to be exclusive to human judgment, we show that Large Language Models (LLMs) can detect romantic attraction during brief getting-to-know-you interactions. Examining data from 964 speed dates, we show that ChatGPT (and Claude 3) can pr…

    Submitted 23 June, 2024; originally announced July 2024.

  4. arXiv:2407.10580  [pdf, other]

    cs.AI

    Leveraging Hybrid Intelligence Towards Sustainable and Energy-Efficient Machine Learning

    Authors: Daniel Geissler, Paul Lukowicz

    Abstract: Hybrid intelligence aims to enhance decision-making, problem-solving, and overall system performance by combining the strengths of both, human cognitive abilities and artificial intelligence. With the rise of Large Language Models (LLM), progressively participating as smart agents to accelerate machine learning development, Hybrid Intelligence is becoming an increasingly important topic for effect…

    Submitted 15 July, 2024; originally announced July 2024.

  5. arXiv:2407.10567  [pdf, other]

    cs.CV eess.IV

    PULPo: Probabilistic Unsupervised Laplacian Pyramid Registration

    Authors: Leonard Siegert, Paul Fischer, Mattias P. Heinrich, Christian F. Baumgartner

    Abstract: Deformable image registration is fundamental to many medical imaging applications. Registration is an inherently ambiguous task often admitting many viable solutions. While neural network-based registration techniques enable fast and accurate registration, the majority of existing approaches are not able to estimate uncertainty. Here, we present PULPo, a method for probabilistic deformable registr…

    Submitted 15 July, 2024; originally announced July 2024.

    Comments: Accepted as full paper to MICCAI 2024

  6. arXiv:2407.10316  [pdf, ps, other]

    cs.DS cs.GT

    Online Matroid Embeddings

    Authors: Andrés Cristi, Paul Dütting, Robert Kleinberg, Renato Paes Leme

    Abstract: We introduce the notion of an online matroid embedding, which is an algorithm for mapping an unknown matroid that is revealed in an online fashion to a larger-but-known matroid. The existence of such embedding enables a reduction from the version of the matroid secretary problem where the matroid is unknown to the version where the matroid is known in advance. We show that online matroid embedding…

    Submitted 14 July, 2024; originally announced July 2024.

    Comments: 25 pages, 4 figures

  7. arXiv:2407.09801  [pdf, other]

    cs.LG cs.AI cs.CL cs.CV cs.MM

    IoT-LM: Large Multisensory Language Models for the Internet of Things

    Authors: Shentong Mo, Russ Salakhutdinov, Louis-Philippe Morency, Paul Pu Liang

    Abstract: The Internet of Things (IoT) network integrating billions of smart physical devices embedded with sensors, software, and communication technologies is a critical and rapidly expanding component of our modern world. The IoT ecosystem provides a rich source of real-world modalities such as motion, thermal, geolocation, imaging, depth, sensors, and audio to recognize the states of humans and physical…

    Submitted 13 July, 2024; originally announced July 2024.

    Comments: arXiv admin note: text overlap with arXiv:2311.06217

  8. The Ballad of the Bots: Sonification Using Cognitive Metaphor to Support Immersed Teleoperation of Robot Teams

    Authors: Joe Simmons, Paul Bremner, Thomas J Mitchell, Alison Bown, Verity McIntosh

    Abstract: As an embodied and spatial medium, virtual reality is proving an attractive proposition for robot teleoperation in hazardous environments. This paper examines a nuclear decommissioning scenario in which a simulated team of semi-autonomous robots are used to characterise a chamber within a virtual nuclear facility. This study examines the potential utility and impact of sonification as a means of c…

    Submitted 12 July, 2024; originally announced July 2024.

    Comments: Accepted for publication in Frontiers in Virtual Reality->Technologies for VR under the research topic 'Interactive Audio Systems and Artefacts within Extended Reality: Innovation, Creativity and Accessibility'

  9. arXiv:2407.09671  [pdf, other]

    math.CO cs.DS

    Obstructions to Erdős-Pósa Dualities for Minors

    Authors: Christophe Paul, Evangelos Protopapas, Dimitrios M. Thilikos, Sebastian Wiederrecht

    Abstract: Let ${\cal G}$ and ${\cal H}$ be minor-closed graph classes. The pair $({\cal H},{\cal G})$ is an Erdős-Pósa pair (EP-pair) if there is a function $f$ where, for every $k$ and every $G\in{\cal G},$ either $G$ has $k$ pairwise vertex-disjoint subgraphs not belonging to ${\cal H},$ or there is a set $S\subseteq V(G)$ where $|S|\leq f(k)$ and $G-S\in{\cal H}.$ The classic result of Erdős and Pósa say…

    Submitted 16 July, 2024; v1 submitted 12 July, 2024; originally announced July 2024.

    Comments: Accepted to FOCS 2024

    MSC Class: 05C83; 05C85; 05C10; 05C75; 68R10 ACM Class: G.2.2

  10. arXiv:2407.09516  [pdf, other]

    cs.HC

    An Actionability Assessment Tool for Explainable AI

    Authors: Ronal Singh, Tim Miller, Liz Sonenberg, Eduardo Velloso, Frank Vetere, Piers Howe, Paul Dourish

    Abstract: In this paper, we introduce and evaluate a tool for researchers and practitioners to assess the actionability of information provided to users to support algorithmic recourse. While there are clear benefits of recourse from the user's perspective, the notion of actionability in explainable AI research remains vague, and claims of 'actionable' explainability techniques are based on the researchers'…

    Submitted 18 June, 2024; originally announced July 2024.

    Comments: 10 pages, 4 figures

  11. arXiv:2407.09510  [pdf, other]

    cs.CV

    3DGS.zip: A survey on 3D Gaussian Splatting Compression Methods

    Authors: Milena T. Bagdasarian, Paul Knoll, Florian Barthel, Anna Hilsmann, Peter Eisert, Wieland Morgenstern

    Abstract: We present a work-in-progress survey on 3D Gaussian Splatting compression methods, focusing on their statistical performance across various benchmarks. This survey aims to facilitate comparability by summarizing key statistics of different compression approaches in a tabulated format. The datasets evaluated include TanksAndTemples, MipNeRF360, DeepBlending, and SyntheticNeRF. For each method, we r…

    Submitted 16 July, 2024; v1 submitted 17 June, 2024; originally announced July 2024.

    Comments: Gaussian Splatting compression survey; Added missing authors; Added new compression papers to table

  12. arXiv:2407.09231  [pdf, ps, other]

    cs.CY cs.HC

    Prompts First, Finally

    Authors: Brent N. Reeves, James Prather, Paul Denny, Juho Leinonen, Stephen MacNeil, Brett A. Becker, Andrew Luxton-Reilly

    Abstract: Generative AI (GenAI) and large language models in particular, are disrupting Computer Science Education. They are proving increasingly capable at more and more challenges. Some educators argue that they pose a serious threat to computing education, and that we should ban their use in the classroom. While there are serious GenAI issues that remain unsolved, it may be useful in the present moment t…

    Submitted 12 July, 2024; originally announced July 2024.

    Comments: 4 pages

  13. arXiv:2407.08994  [pdf, other]

    cs.CV

    Global Attention-Guided Dual-Domain Point Cloud Feature Learning for Classification and Segmentation

    Authors: Zihao Li, Pan Gao, Kang You, Chuan Yan, Manoranjan Paul

    Abstract: Previous studies have demonstrated the effectiveness of point-based neural models on the point cloud analysis task. However, there remains a crucial issue on producing the efficient input embedding for raw point coordinates. Moreover, another issue lies in the limited efficiency of neighboring aggregations, which is a critical component in the network stem. In this paper, we propose a Global Atten…

    Submitted 12 July, 2024; originally announced July 2024.

  14. arXiv:2407.08452  [pdf, other]

    cs.FL

    MITL Model Checking via Generalized Timed Automata and a New Liveness Algorithm

    Authors: S. Akshay, Paul Gastin, R. Govind, B. Srivathsan

    Abstract: The translation of Metric Interval Temporal Logic (MITL) to timed automata is a topic that has been extensively studied. A key challenge here is the conversion of future modalities into equivalent automata. Typical conversions equip the automata with a guess-and-check mechanism to ascertain the truth of future modalities. Guess-and-check can be naturally implemented via alternation. However, since…

    Submitted 11 July, 2024; originally announced July 2024.

  15. arXiv:2407.08432  [pdf, other]

    cs.LG

    Subgroup-Specific Risk-Controlled Dose Estimation in Radiotherapy

    Authors: Paul Fischer, Hannah Willms, Moritz Schneider, Daniela Thorwarth, Michael Muehlebach, Christian F. Baumgartner

    Abstract: Cancer remains a leading cause of death, highlighting the importance of effective radiotherapy (RT). Magnetic resonance-guided linear accelerators (MR-Linacs) enable imaging during RT, allowing for inter-fraction, and perhaps even intra-fraction, adjustments of treatment plans. However, achieving this requires fast and accurate dose calculations. While Monte Carlo simulations offer accuracy, they…

    Submitted 11 July, 2024; originally announced July 2024.

    Comments: This work was accepted as a full paper at MICCAI 2024

  16. arXiv:2407.08410  [pdf, other]

    cs.AI

    Specialist vision-language models for clinical ophthalmology

    Authors: Robbie Holland, Thomas R. P. Taylor, Christopher Holmes, Sophie Riedl, Julia Mai, Maria Patsiamanidi, Dimitra Mitsopoulou, Paul Hager, Philip Müller, Hendrik P. N. Scholl, Hrvoje Bogunović, Ursula Schmidt-Erfurth, Daniel Rueckert, Sobha Sivaprasad, Andrew J. Lotery, Martin J. Menten

    Abstract: Clinicians spend a significant amount of time reviewing medical images and transcribing their findings regarding patient diagnosis, referral and treatment in text form. Vision-language models (VLMs), which automatically interpret images and summarize their findings as text, have enormous potential to alleviate clinical workloads and increase patient access to high-quality medical care. While found…

    Submitted 11 July, 2024; originally announced July 2024.

    Comments: Submitted to Nature Medicine

  17. arXiv:2407.08289  [pdf, other]

    cs.AI cs.LG

    Predicting Heart Failure with Attention Learning Techniques Utilizing Cardiovascular Data

    Authors: Ershadul Haque, Manoranjan Paul, Faranak Tohidi

    Abstract: Cardiovascular diseases (CVDs) encompass a group of disorders affecting the heart and blood vessels, including conditions such as coronary artery disease, heart failure, stroke, and hypertension. In cardiovascular diseases, heart failure is one of the main causes of death and also long-term suffering in patients worldwide. Prediction is one of the risk factors that is highly valuable for treatment…

    Submitted 11 July, 2024; originally announced July 2024.

    Comments: 11 pages, 37 figures

  18. arXiv:2407.08286  [pdf]

    cs.RO

    The control architecture of a spherical robot for Minimally Invasive Surgery

    Authors: Gabriela Rus, Nadim Al Hajjar, Paul Tucan, Ionut Zima, Calin Vaida, Corina Radu, Daniel Jucan, Damien Chablat, Doina Pisla

    Abstract: Control systems used in Minimally Invasive Surgery (MIS) play a crucial role in ensuring precision and safety throughout procedures. This paper presents a control architecture developed for a robotic system designed for MIS operations. The modular structure of the control system allows for compatibility with a range of procedures in abdominal and thoracic regions. The proposed control system, emp…

    Submitted 11 July, 2024; originally announced July 2024.

    Journal ref: 6th IFToMM Symposium on Mechanism Design for Robotics, Jun 2024, Timișoara, Romania

  19. arXiv:2407.08280  [pdf, other]

    cs.CV cs.GR cs.RO

    WayveScenes101: A Dataset and Benchmark for Novel View Synthesis in Autonomous Driving

    Authors: Jannik Zürn, Paul Gladkov, Sofía Dudas, Fergal Cotter, Sofi Toteva, Jamie Shotton, Vasiliki Simaiaki, Nikhil Mohan

    Abstract: We present WayveScenes101, a dataset designed to help the community advance the state of the art in novel view synthesis that focuses on challenging driving scenes containing many dynamic and deformable elements with changing geometry and texture. The dataset comprises 101 driving scenes across a wide range of environmental conditions and driving scenarios. The dataset is designed for benchmarking…

    Submitted 11 July, 2024; originally announced July 2024.

    Comments: 7 pages

  20. arXiv:2407.08166  [pdf, other]

    cs.LG cs.AI eess.SP

    Synthetic Electroretinogram Signal Generation Using Conditional Generative Adversarial Network for Enhancing Classification of Autism Spectrum Disorder

    Authors: Mikhail Kulyabin, Paul A. Constable, Aleksei Zhdanov, Irene O. Lee, David H. Skuse, Dorothy A. Thompson, Andreas Maier

    Abstract: The electroretinogram (ERG) is a clinical test that records the retina's electrical response to light. The ERG is a promising way to study different neurodevelopmental and neurodegenerative disorders, including autism spectrum disorder (ASD) - a neurodevelopmental condition that impacts language, communication, and reciprocal social interactions. However, in heterogeneous populations, such as ASD,…

    Submitted 11 July, 2024; originally announced July 2024.

  21. arXiv:2407.07726  [pdf, other]

    cs.CV cs.AI cs.CL cs.LG

    PaliGemma: A versatile 3B VLM for transfer

    Authors: Lucas Beyer, Andreas Steiner, André Susano Pinto, Alexander Kolesnikov, Xiao Wang, Daniel Salz, Maxim Neumann, Ibrahim Alabdulmohsin, Michael Tschannen, Emanuele Bugliarello, Thomas Unterthiner, Daniel Keysers, Skanda Koppula, Fangyu Liu, Adam Grycner, Alexey Gritsenko, Neil Houlsby, Manoj Kumar, Keran Rong, Julian Eisenschlos, Rishabh Kabra, Matthias Bauer, Matko Bošnjak, Xi Chen, Matthias Minderer, et al. (10 additional authors not shown)

    Abstract: PaliGemma is an open Vision-Language Model (VLM) that is based on the SigLIP-So400m vision encoder and the Gemma-2B language model. It is trained to be a versatile and broadly knowledgeable base model that is effective to transfer. It achieves strong performance on a wide variety of open-world tasks. We evaluate PaliGemma on almost 40 diverse tasks including standard VLM benchmarks, but also more…

    Submitted 10 July, 2024; originally announced July 2024.

  22. arXiv:2407.07639  [pdf, other]

    cs.LG cs.AI

    Explaining Graph Neural Networks for Node Similarity on Graphs

    Authors: Daniel Daza, Cuong Xuan Chu, Trung-Kien Tran, Daria Stepanova, Michael Cochez, Paul Groth

    Abstract: Similarity search is a fundamental task for exploiting information in various applications dealing with graph data, such as citation networks or knowledge graphs. While this task has been intensively approached from heuristics to graph embeddings and graph neural networks (GNNs), providing explanations for similarity has received less attention. In this work we are concerned with explainable simil…

    Submitted 10 July, 2024; originally announced July 2024.

  23. arXiv:2407.07606  [pdf, ps, other]

    cs.CL cs.AI

    The Computational Learning of Construction Grammars: State of the Art and Prospective Roadmap

    Authors: Jonas Doumen, Veronica Juliana Schmalz, Katrien Beuls, Paul Van Eecke

    Abstract: This paper documents and reviews the state of the art concerning computational models of construction grammar learning. It brings together prior work on the computational learning of form-meaning pairings, which has so far been studied in several distinct areas of research. The goal of this paper is threefold. First of all, it aims to synthesise the variety of methodologies that have been proposed…

    Submitted 10 July, 2024; originally announced July 2024.

    Comments: Peer-reviewed author's draft of a journal article to appear in Constructions and Frames (2025)

  24. arXiv:2407.07412  [pdf, other]

    cs.CV cs.AI

    Pseudo-RIS: Distinctive Pseudo-supervision Generation for Referring Image Segmentation

    Authors: Seonghoon Yu, Paul Hongsuck Seo, Jeany Son

    Abstract: We propose a new framework that automatically generates high-quality segmentation masks with their referring expressions as pseudo supervisions for referring image segmentation (RIS). These pseudo supervisions allow the training of any supervised RIS methods without the cost of manual labeling. To achieve this, we incorporate existing segmentation and image captioning foundation models, leveraging…

    Submitted 15 July, 2024; v1 submitted 10 July, 2024; originally announced July 2024.

    Comments: Accepted to ECCV 2024

  25. arXiv:2407.07135  [pdf, other]

    stat.ML cs.AI cs.CV cs.LG

    Improving Out-of-Distribution Detection by Combining Existing Post-hoc Methods

    Authors: Paul Novello, Yannick Prudent, Joseba Dalmau, Corentin Friedrich, Yann Pequignot

    Abstract: Since the seminal paper of Hendrycks et al. arXiv:1610.02136, Post-hoc deep Out-of-Distribution (OOD) detection has expanded rapidly. As a result, practitioners working on safety-critical applications and seeking to improve the robustness of a neural network now have a plethora of methods to choose from. However, no method outperforms every other on every dataset arXiv:2210.07242, so the current b…

    Submitted 9 July, 2024; originally announced July 2024.

  26. arXiv:2407.06720  [pdf, ps, other]

    cs.DL cs.HC

    Author Intent: Eliminating Ambiguity in MathML

    Authors: David Carliste, Paul Libbrecht, Moritz Schubotz, Neil Soiffer

    Abstract: MathML has been successful in improving the accessibility of mathematical notation on the web. All major screen readers support MathML to generate speech, allow navigation of the math, and generate braille. A troublesome area remains: handling ambiguous notations such as \( \vert x\vert\). While it is possible to speak this syntactically, anecdotal evidence indicates most people prefer semantic sp…

    Submitted 9 July, 2024; originally announced July 2024.

    Comments: This preprint has not undergone peer review or any post-submission improvements or corrections. The Version of Record of this contribution is published in Int. Conf. on Computers Helping People with Special Needs will be available online at TBD

  27. arXiv:2407.06293  [pdf, other]

    cs.CE physics.app-ph

    A Framework for Simulating the Path-level Residual Stress in the Laser Powder Bed Fusion Process

    Authors: Xin Liu, Xingchen Liu, Paul Witherell

    Abstract: Laser Powder Bed Fusion (LPBF) additive manufacturing has revolutionized industries with its capability to create intricate and customized components. The LPBF process uses moving heat sources to melt and solidify metal powders. The fast melting and cooling leads to residual stress, which critically affects the part quality. Currently, the computational intensity of accurately simulating the resid…

    Submitted 10 April, 2024; originally announced July 2024.

  28. arXiv:2407.06100  [pdf, other]

    physics.ao-ph cs.LG

    Leveraging data-driven weather models for improving numerical weather prediction skill through large-scale spectral nudging

    Authors: Syed Zahid Husain, Leo Separovic, Jean-François Caron, Rabah Aider, Mark Buehner, Stéphane Chamberland, Ervig Lapalme, Ron McTaggart-Cowan, Christopher Subich, Paul Vaillancourt, Jing Yang, Ayrton Zadra

    Abstract: Operational meteorological forecasting has long relied on physics-based numerical weather prediction (NWP) models. Recently, this landscape has been disrupted by the advent of data-driven artificial intelligence (AI)-based weather models, which offer tremendous computational performance and competitive forecasting skill. However, data-driven models for medium-range forecasting generally suffer fro…

    Submitted 8 July, 2024; originally announced July 2024.

  29. arXiv:2407.06096  [pdf, other]

    cs.CV

    Muzzle-Based Cattle Identification System Using Artificial Intelligence (AI)

    Authors: Hasan Zohirul Islam, Safayet Khan, Sanjib Kumar Paul, Sheikh Imtiaz Rahi, Fahim Hossain Sifat, Md. Mahadi Hasan Sany, Md. Shahjahan Ali Sarker, Tareq Anam, Ismail Hossain Polas

    Abstract: Absence of tamper-proof cattle identification technology was a significant problem preventing insurance companies from providing livestock insurance. This lack of technology had devastating financial consequences for marginal farmers as they did not have the opportunity to claim compensation for any unexpected events such as the accidental death of cattle in Bangladesh. Using machine learning and…

    Submitted 8 July, 2024; originally announced July 2024.

  30. arXiv:2407.05528  [pdf, other]

    cs.CV

    An accurate detection is not all you need to combat label noise in web-noisy datasets

    Authors: Paul Albert, Jack Valmadre, Eric Arazo, Tarun Krishna, Noel E. O'Connor, Kevin McGuinness

    Abstract: Training a classifier on web-crawled data demands learning algorithms that are robust to annotation errors and irrelevant examples. This paper builds upon the recent empirical observation that applying unsupervised contrastive learning to noisy, web-crawled datasets yields a feature representation under which the in-distribution (ID) and out-of-distribution (OOD) samples are linearly separable. We…

    Submitted 7 July, 2024; originally announced July 2024.

    Comments: Accepted in the European Conference on Computer Vision (ECCV) 2024

  31. arXiv:2407.05467  [pdf, other]

    cs.DC cs.AI

    The infrastructure powering IBM's Gen AI model development

    Authors: Talia Gershon, Seetharami Seelam, Brian Belgodere, Milton Bonilla, Lan Hoang, Danny Barnett, I-Hsin Chung, Apoorve Mohan, Ming-Hung Chen, Lixiang Luo, Robert Walkup, Constantinos Evangelinos, Shweta Salaria, Marc Dombrowa, Yoonho Park, Apo Kayi, Liran Schour, Alim Alim, Ali Sydney, Pavlos Maniotis, Laurent Schares, Bernard Metzler, Bengi Karacali-Akyamac, Sophia Wen, Tatsuhiro Chiba, et al. (121 additional authors not shown)

    Abstract: AI Infrastructure plays a key role in the speed and cost-competitiveness of developing and deploying advanced AI models. The current demand for powerful AI infrastructure for model training is driven by the emergence of generative AI and foundational models, where on occasion thousands of GPUs must cooperate on a single training job for the model to be trained in a reasonable time. Delivering effi…

    Submitted 7 July, 2024; originally announced July 2024.

    Comments: Corresponding Authors: Talia Gershon, Seetharami Seelam, Brian Belgodere, Milton Bonilla

  32. arXiv:2407.05399  [pdf, other]

    cs.CL cs.AI cs.LG

    IL-TUR: Benchmark for Indian Legal Text Understanding and Reasoning

    Authors: Abhinav Joshi, Shounak Paul, Akshat Sharma, Pawan Goyal, Saptarshi Ghosh, Ashutosh Modi

    Abstract: Legal systems worldwide are inundated with exponential growth in cases and documents. There is an imminent need to develop NLP and ML techniques for automatically processing and understanding legal documents to streamline the legal system. However, evaluating and comparing various NLP models designed specifically for the legal domain is challenging. This paper addresses this challenge by proposing…

    Submitted 7 July, 2024; originally announced July 2024.

    Comments: Accepted at ACL 2024 Main Conference; 40 Pages (9 Pages + References + Appendix)

  33. arXiv:2407.05259  [pdf, other]

    eess.IV cs.AI cs.CV cs.LG

    Multi-scale Conditional Generative Modeling for Microscopic Image Restoration

    Authors: Luzhe Huang, Xiongye Xiao, Shixuan Li, Jiawen Sun, Yi Huang, Aydogan Ozcan, Paul Bogdan

    Abstract: The advance of diffusion-based generative models in recent years has revolutionized state-of-the-art (SOTA) techniques in a wide variety of image analysis and synthesis tasks, whereas their adaptation on image restoration, particularly within computational microscopy remains theoretically and empirically underexplored. In this research, we introduce a multi-scale generative model that enhances con…

    Submitted 7 July, 2024; originally announced July 2024.

  34. arXiv:2407.04888  [pdf, other]

    eess.IV cs.CV

    Unraveling Radiomics Complexity: Strategies for Optimal Simplicity in Predictive Modeling

    Authors: Mahdi Ait Lhaj Loutfi, Teodora Boblea Podasca, Alex Zwanenburg, Taman Upadhaya, Jorge Barrios, David R. Raleigh, William C. Chen, Dante P. I. Capaldi, Hong Zheng, Olivier Gevaert, Jing Wu, Alvin C. Silva, Paul J. Zhang, Harrison X. Bai, Jan Seuntjens, Steffen Löck, Patrick O. Richard, Olivier Morin, Caroline Reinhold, Martin Lepage, Martin Vallières

    Abstract: Background: The high dimensionality of radiomic feature sets, the variability in radiomic feature types and potentially high computational requirements all underscore the need for an effective method to identify the smallest set of predictive features for a given clinical problem. Purpose: Develop a methodology and tools to identify and explain the smallest set of predictive radiomic features. Mat…

    Submitted 5 July, 2024; originally announced July 2024.

  35. arXiv:2407.04873  [pdf, ps, other]

    cs.AI cs.CY

    Evaluating Language Models for Generating and Judging Programming Feedback

    Authors: Charles Koutcheme, Nicola Dainese, Arto Hellas, Sami Sarsa, Juho Leinonen, Syed Ashraf, Paul Denny

    Abstract: The emergence of large language models (LLMs) has transformed research and practice in a wide range of domains. Within the computing education research (CER) domain, LLMs have received plenty of attention especially in the context of learning programming. Much of the work on LLMs in CER has however focused on applying and evaluating proprietary models. In this article, we evaluate the efficiency o…

    Submitted 5 July, 2024; originally announced July 2024.

  36. arXiv:2407.04589  [pdf, other]

    cs.LG

    Remembering Everything Makes You Vulnerable: A Limelight on Machine Unlearning for Personalized Healthcare Sector

    Authors: Ahan Chatterjee, Sai Anirudh Aryasomayajula, Rajat Chaudhari, Subhajit Paul, Vishwa Mohan Singh

    Abstract: As the prevalence of data-driven technologies in healthcare continues to rise, concerns regarding data privacy and security become increasingly paramount. This thesis aims to address the vulnerability of personalized healthcare models, particularly in the context of ECG monitoring, to adversarial attacks that compromise patient privacy. We propose an approach termed "Machine Unlearning" to mitigat…

    Submitted 5 July, 2024; originally announced July 2024.

    Comments: 15 Pages, Exploring unlearning techniques on ECG Classifier

  37. arXiv:2407.04551  [pdf, other]

    cs.CR cs.AI cs.LG

    An AI Architecture with the Capability to Classify and Explain Hardware Trojans

    Authors: Paul Whitten, Francis Wolff, Chris Papachristou

    Abstract: Hardware trojan detection methods, based on machine learning (ML) techniques, mainly identify suspected circuits but lack the ability to explain how the decision was arrived at. An explainable methodology and architecture is introduced based on the existing hardware trojan detection features. Results are provided for explaining digital hardware trojans within a netlist using trust-hub trojan bench…

    Submitted 5 July, 2024; originally announced July 2024.

  38. arXiv:2407.03850  [pdf, other]

    cs.CL cs.AI

    HYBRINFOX at CheckThat! 2024 -- Task 1: Enhancing Language Models with Structured Information for Check-Worthiness Estimation

    Authors: Géraud Faye, Morgane Casanova, Benjamin Icard, Julien Chanson, Guillaume Gadek, Guillaume Gravier, Paul Égré

    Abstract: This paper summarizes the experiments and results of the HYBRINFOX team for the CheckThat! 2024 - Task 1 competition. We propose an approach enriching Language Models such as RoBERTa with embeddings produced by triples (subject ; predicate ; object) extracted from the text sentences. Our analysis of the developmental data shows that this method improves the performance of Language Models alone. On…

    Submitted 4 July, 2024; originally announced July 2024.

    Comments: Paper to appear in the Proceedings of the Conference and Labs of the Evaluation Forum (CLEF 2024 CheckThat!)

  39. arXiv:2407.03831  [pdf, other]

    math.CO cs.DM

    Exploring Algorithmic Solutions for the Independent Roman Domination Problem in Graphs

    Authors: Kaustav Paul, Ankit Sharma, Arti Pandey

    Abstract: Given a graph $G=(V,E)$, a function $f:V\to \{0,1,2\}$ is said to be a \emph{Roman Dominating function} if for every $v\in V$ with $f(v)=0$, there exists a vertex $u\in N(v)$ such that $f(u)=2$. A Roman Dominating function $f$ is said to be an \emph{Independent Roman Dominating function} (or IRDF), if $V_1\cup V_2$ forms an independent set, where $V_i=\{v\in V~\vert~f(v)=i\}$, for…

    Submitted 12 July, 2024; v1 submitted 4 July, 2024; originally announced July 2024.

  40. arXiv:2407.03812  [pdf, other]

    cs.DM cs.DS

    Algorithmic Results for Weak Roman Domination Problem in Graphs

    Authors: Kaustav Paul, Ankit Sharma, Arti Pandey

    Abstract: Consider a graph $G = (V, E)$ and a function $f: V \rightarrow \{0, 1, 2\}$. A vertex $u$ with $f(u)=0$ is defined as \emph{undefended} by $f$ if it lacks adjacency to any vertex with a positive $f$-value. The function $f$ is said to be a \emph{Weak Roman Dominating function} (WRD function) if, for every vertex $u$ with $f(u) = 0$, there exists a neighbour $v$ of $u$ with $f(v) > 0$ and a new func…

    Submitted 4 July, 2024; originally announced July 2024.

  41. arXiv:2407.03770  [pdf, other]

    cs.CL cs.AI

    HYBRINFOX at CheckThat! 2024 -- Task 2: Enriching BERT Models with the Expert System VAGO for Subjectivity Detection

    Authors: Morgane Casanova, Julien Chanson, Benjamin Icard, Géraud Faye, Guillaume Gadek, Guillaume Gravier, Paul Égré

    Abstract: This paper presents the HYBRINFOX method used to solve Task 2 of Subjectivity detection of the CLEF 2024 CheckThat! competition. The specificity of the method is to use a hybrid system, combining a RoBERTa model, fine-tuned for subjectivity detection, a frozen sentence-BERT (sBERT) model to capture semantics, and several scores calculated by the English version of the expert system VAGO, developed…

    Submitted 4 July, 2024; originally announced July 2024.

    Comments: To appear in the Proceedings of the Conference and Labs of the Evaluation Forum (CLEF 2024 CheckThat!)

  42. arXiv:2407.03652  [pdf, other]

    cs.AI cs.CC

    Over the Edge of Chaos? Excess Complexity as a Roadblock to Artificial General Intelligence

    Authors: Teo Susnjak, Timothy R. McIntosh, Andre L. C. Barczak, Napoleon H. Reyes, Tong Liu, Paul Watters, Malka N. Halgamuge

    Abstract: In this study, we explored the progression trajectories of artificial intelligence (AI) systems through the lens of complexity theory. We challenged the conventional linear and exponential projections of AI advancement toward Artificial General Intelligence (AGI) underpinned by transformer-based architectures, and posited the existence of critical points, akin to phase transitions in complex syste…

    Submitted 4 July, 2024; originally announced July 2024.

  43. arXiv:2407.03418  [pdf, other]

    cs.LG cs.AI cs.CL cs.CV

    HEMM: Holistic Evaluation of Multimodal Foundation Models

    Authors: Paul Pu Liang, Akshay Goindani, Talha Chafekar, Leena Mathur, Haofei Yu, Ruslan Salakhutdinov, Louis-Philippe Morency

    Abstract: Multimodal foundation models that can holistically process text alongside images, video, audio, and other sensory modalities are increasingly used in a variety of real-world applications. However, it is challenging to characterize and study progress in multimodal foundation models, given the range of possible modeling decisions, tasks, and domains. In this paper, we introduce Holistic Evaluation o…

    Submitted 3 July, 2024; originally announced July 2024.

    Comments: Code available at https://github.com/pliang279/HEMM

  44. arXiv:2407.03146  [pdf, other]

    cs.CY cs.AI cs.CV cs.GT cs.LG

    Enhancing Class Fairness in Classification with A Two-Player Game Approach

    Authors: Yunpeng Jiang, Paul Weng, Yutong Ban

    Abstract: Data augmentation is widely applied and has shown its benefits in different machine learning tasks. However, as recently observed in some downstream tasks, data augmentation may introduce an unfair impact on classifications. While it can improve the performance of some classes, it can actually be detrimental for other classes, which can be problematic in some application domains. In this paper, to…

    Submitted 8 July, 2024; v1 submitted 30 May, 2024; originally announced July 2024.

  45. arXiv:2407.02880  [pdf, other]

    cs.LG cs.AI cs.CV

    Knowledge Composition using Task Vectors with Learned Anisotropic Scaling

    Authors: Frederic Z. Zhang, Paul Albert, Cristian Rodriguez-Opazo, Anton van den Hengel, Ehsan Abbasnejad

    Abstract: Pre-trained models produce strong generic representations that can be adapted via fine-tuning. The learned weight difference relative to the pre-trained model, known as a task vector, characterises the direction and stride of fine-tuning. The significance of task vectors is such that simple arithmetic operations on them can be used to combine diverse representations from different domains. This pa…

    Submitted 3 July, 2024; originally announced July 2024.

  46. arXiv:2407.02737  [pdf, other]

    q-bio.QM cs.LG

    Development of Machine Learning Classifiers for Blood-based Diagnosis and Prognosis of Suspected Acute Infections and Sepsis

    Authors: Ljubomir Buturovic, Michael Mayhew, Roland Luethy, Kirindi Choi, Uros Midic, Nandita Damaraju, Yehudit Hasin-Brumshtein, Amitesh Pratap, Rhys M. Adams, Joao Fonseca, Ambika Srinath, Paul Fleming, Claudia Pereira, Oliver Liesenfeld, Purvesh Khatri, Timothy Sweeney

    Abstract: We applied machine learning to the unmet medical need of rapid and accurate diagnosis and prognosis of acute infections and sepsis in emergency departments. Our solution consists of a Myrna (TM) Instrument and embedded TriVerity (TM) classifiers. The instrument measures abundances of 29 messenger RNAs in patient's blood, subsequently used as features for machine learning. The classifiers convert t…

    Submitted 2 July, 2024; originally announced July 2024.

    Comments: 16 pages, 6 figures

  47. arXiv:2407.02353  [pdf, other]

    eess.SP cs.AR eess.SY

    Roadmap to Neuromorphic Computing with Emerging Technologies

    Authors: Adnan Mehonic, Daniele Ielmini, Kaushik Roy, Onur Mutlu, Shahar Kvatinsky, Teresa Serrano-Gotarredona, Bernabe Linares-Barranco, Sabina Spiga, Sergey Savelev, Alexander G Balanov, Nitin Chawla, Giuseppe Desoli, Gerardo Malavena, Christian Monzio Compagnoni, Zhongrui Wang, J Joshua Yang, Ghazi Sarwat Syed, Abu Sebastian, Thomas Mikolajick, Beatriz Noheda, Stefan Slesazeck, Bernard Dieny, Tuo-Hung Hou, Akhil Varri, et al. (28 additional authors not shown)

    Abstract: The roadmap is organized into several thematic sections, outlining current computing challenges, discussing the neuromorphic computing approach, analyzing mature and currently utilized technologies, providing an overview of emerging technologies, addressing material challenges, exploring novel computing concepts, and finally examining the maturity level of emerging technologies while determining t…

    Submitted 5 July, 2024; v1 submitted 2 July, 2024; originally announced July 2024.

    Comments: 90 pages, 22 figures, roadmap, neuromorphic

  48. arXiv:2407.01115  [pdf, other]

    cs.LG stat.ML

    Enabling Mixed Effects Neural Networks for Diverse, Clustered Data Using Monte Carlo Methods

    Authors: Andrej Tschalzev, Paul Nitschke, Lukas Kirchdorfer, Stefan Lüdtke, Christian Bartelt, Heiner Stuckenschmidt

    Abstract: Neural networks often assume independence among input data samples, disregarding correlations arising from inherent clustering patterns in real-world datasets (e.g., due to different sites or repeated measurements). Recently, mixed effects neural networks (MENNs) which separate cluster-specific 'random effects' from cluster-invariant 'fixed effects' have been proposed to improve generalization and…

    Submitted 1 July, 2024; originally announced July 2024.

  49. arXiv:2407.01069  [pdf, other]

    cs.IR

    Deep Domain Specialisation for single-model multi-domain learning to rank

    Authors: Paul Missault, Abdelmaseeh Felfel

    Abstract: Information Retrieval (IR) practitioners often train separate ranking models for different domains (geographic regions, languages, stores, websites,...) as it is believed that exclusively training on in-domain data yields the best performance when sufficient data is available. Despite their performance gains, training multiple models comes at a higher cost to train, maintain and update compared to…

    Submitted 1 July, 2024; originally announced July 2024.

  50. arXiv:2407.01032  [pdf, other]

    cs.LG cs.CV stat.ME

    Overcoming Common Flaws in the Evaluation of Selective Classification Systems

    Authors: Jeremias Traub, Till J. Bungert, Carsten T. Lüth, Michael Baumgartner, Klaus H. Maier-Hein, Lena Maier-Hein, Paul F Jaeger

    Abstract: Selective Classification, wherein models can reject low-confidence predictions, promises reliable translation of machine-learning based classification systems to real-world scenarios such as clinical diagnostics. While current evaluation of these systems typically assumes fixed working points based on pre-defined rejection thresholds, methodological progress requires benchmarking the general perfo…

    Submitted 1 July, 2024; originally announced July 2024.