Skip to main content

Showing 1–9 of 9 results for author: Marchi, M

  1. arXiv:2406.07288  [pdf, other

    cs.CL

    Fine-tuning with HED-IT: The impact of human post-editing for dialogical language models

    Authors: Daniela Occhipinti, Michele Marchi, Irene Mondella, Huiyuan Lai, Felice Dell'Orletta, Malvina Nissim, Marco Guerini

    Abstract: Automatic methods for generating and gathering linguistic data have proven effective for fine-tuning Language Models (LMs) in languages less resourced than English. Still, while there has been emphasis on data quantity, less attention has been given to its quality. In this work, we investigate the impact of human intervention on machine-generated data when fine-tuning dialogical models. In particu… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

  2. arXiv:2405.14061  [pdf, other

    cs.AI cs.CL cs.LG

    Meanings and Feelings of Large Language Models: Observability of Latent States in Generative AI

    Authors: Tian Yu Liu, Stefano Soatto, Matteo Marchi, Pratik Chaudhari, Paulo Tabuada

    Abstract: We tackle the question of whether Large Language Models (LLMs), viewed as dynamical systems with state evolving in the embedding space of symbolic tokens, are observable. That is, whether there exist multiple 'mental' state trajectories that yield the same sequence of generated tokens, or sequences that belong to the same Nerode equivalence class ('meaning'). If not observable, mental state trajec… ▽ More

    Submitted 22 May, 2024; originally announced May 2024.

  3. arXiv:2404.02325  [pdf, ps, other

    cs.LG eess.SY math.OC

    Heat Death of Generative Models in Closed-Loop Learning

    Authors: Matteo Marchi, Stefano Soatto, Pratik Chaudhari, Paulo Tabuada

    Abstract: Improvement and adoption of generative machine learning models is rapidly accelerating, as exemplified by the popularity of LLMs (Large Language Models) for text, and diffusion models for image generation.As generative models become widespread, data they generate is incorporated into shared content through the public web. This opens the question of what happens when data generated by a model is fe… ▽ More

    Submitted 2 April, 2024; originally announced April 2024.

  4. arXiv:2309.01612  [pdf, other

    cs.CV cs.LG

    On the Query Strategies for Efficient Online Active Distillation

    Authors: Michele Boldo, Enrico Martini, Mirco De Marchi, Stefano Aldegheri, Nicola Bombieri

    Abstract: Deep Learning (DL) requires lots of time and data, resulting in high computational demands. Recently, researchers employ Active Learning (AL) and online distillation to enhance training efficiency and real-time model adaptation. This paper evaluates a set of query strategies to achieve the best training results. It focuses on Human Pose Estimation (HPE) applications, assessing the impact of select… ▽ More

    Submitted 4 September, 2023; originally announced September 2023.

  5. arXiv:2304.06793  [pdf, other

    cs.NE cs.LG eess.IV

    Speck: A Smart event-based Vision Sensor with a low latency 327K Neuron Convolutional Neuronal Network Processing Pipeline

    Authors: Ole Richter, Yannan Xing, Michele De Marchi, Carsten Nielsen, Merkourios Katsimpris, Roberto Cattaneo, Yudi Ren, Yalun Hu, Qian Liu, Sadique Sheik, Tugba Demirci, Ning Qiao

    Abstract: Edge computing solutions that enable the extraction of high-level information from a variety of sensors is in increasingly high demand. This is due to the increasing number of smart devices that require sensory processing for their application on the edge. To tackle this problem, we present a smart vision sensor System on Chip (SoC), featuring an event-based camera and a low-power asynchronous spi… ▽ More

    Submitted 27 May, 2024; v1 submitted 13 April, 2023; originally announced April 2023.

    Comments: accepted and presented at 28th IEEE International Symposium On Asynchronous Circuits and Systems (ASYNC) 2023

    Journal ref: IEEE ASYNC 2023

  6. Hyperspectral and LiDAR data for the prediction via machine learning of tree species, volume and biomass: a possible contribution for updating forest management plans

    Authors: Daniele Michelini, Michele Dalponte, Angelo Carriero, Erico Kutchart, Salvatore Eugenio Pappalardo, Massimo De Marchi, Francesco Pirotti

    Abstract: This work intends to lay the foundations for identifying the prevailing forest types and the delineation of forest units within private forest inventories in the Autonomous Province of Trento (PAT), using currently available remote sensing solutions. In particular, data from LiDAR and hyperspectral surveys of 2014 made available by PAT were acquired and processed. Such studies are very important i… ▽ More

    Submitted 30 September, 2022; originally announced September 2022.

  7. arXiv:2105.13278  [pdf, other

    cs.LG

    One Step Preference Elicitation in Multi-Objective Bayesian Optimization

    Authors: Juan Ungredda, Mariapia Marchi, Teresa Montrone, Juergen Branke

    Abstract: We consider a multi-objective optimization problem with objective functions that are expensive to evaluate. The decision maker (DM) has unknown preferences, and so the standard approach is to generate an approximation of the Pareto front and let the DM choose from the generated non-dominated designs. However, especially for expensive to evaluate problems where the number of designs that can be eva… ▽ More

    Submitted 27 May, 2021; originally announced May 2021.

  8. arXiv:2011.05547  [pdf, other

    cs.NE

    Identifying Properties of Real-World Optimisation Problems through a Questionnaire

    Authors: Koen van der Blom, Timo M. Deist, Vanessa Volz, Mariapia Marchi, Yusuke Nojima, Boris Naujoks, Akira Oyama, Tea Tušar

    Abstract: Optimisation algorithms are commonly compared on benchmarks to get insight into performance differences. However, it is not clear how closely benchmarks match the properties of real-world problems because these properties are largely unknown. This work investigates the properties of real-world problems through a questionnaire to enable the design of future benchmark problems that more closely rese… ▽ More

    Submitted 14 July, 2021; v1 submitted 11 November, 2020; originally announced November 2020.

    Comments: Book Chapter (Under review, revised version)

  9. Towards Realistic Optimization Benchmarks: A Questionnaire on the Properties of Real-World Problems

    Authors: Koen van der Blom, Timo M. Deist, Tea Tušar, Mariapia Marchi, Yusuke Nojima, Akira Oyama, Vanessa Volz, Boris Naujoks

    Abstract: Benchmarks are a useful tool for empirical performance comparisons. However, one of the main shortcomings of existing benchmarks is that it remains largely unclear how they relate to real-world problems. What does an algorithm's performance on a benchmark say about its potential on a specific real-world problem? This work aims to identify properties of real-world problems through a questionnaire o… ▽ More

    Submitted 14 April, 2020; originally announced April 2020.

    Comments: 2 pages, GECCO2020 Poster Paper