Showing 1–22 of 22 results for author: Manica, M

  1. arXiv:2301.12586  [pdf, other]

    cs.LG cs.CL

    Unifying Molecular and Textual Representations via Multi-task Language Modelling

    Authors: Dimitrios Christofidellis, Giorgio Giannone, Jannis Born, Ole Winther, Teodoro Laino, Matteo Manica

    Abstract: The recent advances in neural language models have also been successfully applied to the field of chemistry, offering generative solutions for classical problems in molecular design and synthesis planning. These new methods have the potential to fuel a new era of data-driven automation in scientific discovery. However, specialized models are still typically required for each task, leading to the n…

    Submitted 17 May, 2023; v1 submitted 29 January, 2023; originally announced January 2023.

    Comments: ICML 2023

  2. arXiv:2301.08750  [pdf, other]

    cs.LG cs.AI cs.IT

    Domain-agnostic and Multi-level Evaluation of Generative Models

    Authors: Girmaw Abebe Tadesse, Jannis Born, Celia Cintas, William Ogallo, Dmitry Zubarev, Matteo Manica, Komminist Weldemariam

    Abstract: While the capabilities of generative models heavily improved in different domains (images, text, graphs, molecules, etc.), their evaluation metrics largely remain based on simplified quantities or manual inspection with limited practicality. To this end, we propose a framework for Multi-level Performance Evaluation of Generative mOdels (MPEGO), which could be employed across different domains. MPE…

    Submitted 20 January, 2023; originally announced January 2023.

  3. arXiv:2211.05100  [pdf, other]

    cs.CL

    BLOOM: A 176B-Parameter Open-Access Multilingual Language Model

    Authors: BigScience Workshop: Teven Le Scao, Angela Fan, Christopher Akiki, Ellie Pavlick, Suzana Ilić, Daniel Hesslow, Roman Castagné, Alexandra Sasha Luccioni, François Yvon, Matthias Gallé, Jonathan Tow, Alexander M. Rush, Stella Biderman, Albert Webson, Pawan Sasanka Ammanamanchi, Thomas Wang, Benoît Sagot, Niklas Muennighoff, Albert Villanova del Moral, Olatunji Ruwase, Rachel Bawden, Stas Bekman, Angelina McMillan-Major, et al. (369 additional authors not shown)

    Abstract: Large language models (LLMs) have been shown to be able to perform new tasks based on a few demonstrations or natural language instructions. While these capabilities have led to widespread adoption, most LLMs are developed by resource-rich organizations and are frequently kept from the public. As a step towards democratizing this powerful technology, we present BLOOM, a 176B-parameter open-access…

    Submitted 27 June, 2023; v1 submitted 9 November, 2022; originally announced November 2022.

  4. arXiv:2208.07084  [pdf, other]

    cs.CL cs.LG

    Z-BERT-A: a zero-shot Pipeline for Unknown Intent detection

    Authors: Daniele Comi, Dimitrios Christofidellis, Pier Francesco Piazza, Matteo Manica

    Abstract: Intent discovery is a crucial task in natural language processing, and it is increasingly relevant for various industrial applications. Identifying novel, unseen intents from user inputs remains one of the biggest challenges in this field. Herein, we propose Zero-Shot-BERT-Adapters, a two-stage method for multilingual intent discovery relying on a Transformer architecture, fine-tuned with Adapt…

    Submitted 8 December, 2023; v1 submitted 15 August, 2022; originally announced August 2022.

    Comments: 14 pages, 4 figures, 14 tables, https://github.com/GT4SD/zberta

  5. Accelerating Material Design with the Generative Toolkit for Scientific Discovery

    Authors: Matteo Manica, Jannis Born, Joris Cadow, Dimitrios Christofidellis, Ashish Dave, Dean Clarke, Yves Gaetan Nana Teukam, Giorgio Giannone, Samuel C. Hoffman, Matthew Buchan, Vijil Chenthamarakshan, Timothy Donovan, Hsiang Han Hsu, Federico Zipoli, Oliver Schilter, Akihiro Kishimoto, Lisa Hamada, Inkit Padhi, Karl Wehden, Lauren McHugh, Alexy Khrabrov, Payel Das, Seiji Takeda, John R. Smith

    Abstract: With the growing availability of data within various scientific domains, generative models hold enormous potential to accelerate scientific discovery. They harness powerful representations learned from datasets to speed up the formulation of novel hypotheses with the potential to impact material discovery broadly. We present the Generative Toolkit for Scientific Discovery (GT4SD). This extensible…

    Submitted 31 January, 2023; v1 submitted 8 July, 2022; originally announced July 2022.

    Comments: 15 pages, 2 figures

    Journal ref: Nature Partner Journals (npj) Computational Materials 9, 69 (2023)

  6. arXiv:2202.01338  [pdf, other]

    cs.LG cs.AI cs.CL q-bio.BM

    Regression Transformer: Concurrent sequence regression and generation for molecular language modeling

    Authors: Jannis Born, Matteo Manica

    Abstract: Despite significant progress of generative models in the natural sciences, their controllability remains challenging. One fundamentally missing aspect of molecular or protein generative models is an inductive bias that can reflect continuous properties of interest. To that end, we propose the Regression Transformer (RT), a novel method that abstracts regression as a conditional sequence modeling p…

    Submitted 11 November, 2022; v1 submitted 1 February, 2022; originally announced February 2022.

    Comments: Updated paper, under review; Preliminary version as spotlight talk at ICLR 2022 workshop on Machine Learning for Drug Discovery

    Journal ref: Nature Machine Intelligence 5, 432-444 (2023)

  7. arXiv:2111.05654  [pdf, other]

    cs.DC

    Utilising urgent computing to tackle the spread of mosquito-borne diseases

    Authors: Nick Brown, Rupert Nash, Piero Poletti, Giorgio Guzzetta, Mattia Manica, Agnese Zardini, Markus Flatken, Jules Vidal, Charles Gueunet, Evgenij Belikov, Julien Tierny, Artur Podobas, Wei Der Chien, Stefano Markidis, Andreas Gerndt

    Abstract: It is estimated that around 80% of the world's population live in areas susceptible to at least one major vector-borne disease, and approximately 20% of global communicable diseases are spread by mosquitoes. Furthermore, the outbreaks of such diseases are becoming more common and widespread, with much of this driven in recent years by socio-demographic and climatic factors. These trends are causi…

    Submitted 10 November, 2021; originally announced November 2021.

    Comments: Preprint of paper in 2021 IEEE/ACM HPC for Urgent Decision Making (UrgentHPC)

  8. arXiv:2110.08207  [pdf, other]

    cs.LG cs.CL

    Multitask Prompted Training Enables Zero-Shot Task Generalization

    Authors: Victor Sanh, Albert Webson, Colin Raffel, Stephen H. Bach, Lintang Sutawika, Zaid Alyafeai, Antoine Chaffin, Arnaud Stiegler, Teven Le Scao, Arun Raja, Manan Dey, M Saiful Bari, Canwen Xu, Urmish Thakker, Shanya Sharma Sharma, Eliza Szczechla, Taewoon Kim, Gunjan Chhablani, Nihal Nayak, Debajyoti Datta, Jonathan Chang, Mike Tian-Jian Jiang, Han Wang, Matteo Manica, Sheng Shen, et al. (16 additional authors not shown)

    Abstract: Large language models have recently been shown to attain reasonable zero-shot generalization on a diverse set of tasks (Brown et al., 2020). It has been hypothesized that this is a consequence of implicit multitask learning in language models' pretraining (Radford et al., 2019). Can zero-shot generalization instead be directly induced by explicit multitask learning? To test this question at scale,…

    Submitted 17 March, 2022; v1 submitted 15 October, 2021; originally announced October 2021.

    Comments: ICLR 2022 Spotlight (with extended discussion)

  9. arXiv:2012.10271  [pdf, other]

    cs.CL cs.LG

    Understood in Translation, Transformers for Domain Understanding

    Authors: Dimitrios Christofidellis, Matteo Manica, Leonidas Georgopoulos, Hans Vandierendonck

    Abstract: Knowledge acquisition is the essential first step of any Knowledge Graph (KG) application. This knowledge can be extracted from a given corpus (KG generation process) or specified from an existing KG (KG specification process). Focusing on domain-specific solutions, knowledge acquisition is a labor-intensive task usually orchestrated and supervised by subject matter experts. Specifically, the doma…

    Submitted 18 December, 2020; originally announced December 2020.

    Comments: 4 figures, 7 tables, main text pages 8, appendix pages 6

  10. arXiv:2012.03084  [pdf, other]

    q-bio.BM cs.CL

    Pre-training Protein Language Models with Label-Agnostic Binding Pairs Enhances Performance in Downstream Tasks

    Authors: Modestas Filipavicius, Matteo Manica, Joris Cadow, Maria Rodriguez Martinez

    Abstract: Less than 1% of protein sequences are structurally and functionally annotated. The Natural Language Processing (NLP) community has recently embraced self-supervised learning as a powerful approach to learn representations from unlabeled text, in large part due to the attention-based context-aware Transformer models. In this work we present a modification to the RoBERTa model by inputting during pre-tr…

    Submitted 5 December, 2020; originally announced December 2020.

    Comments: 20 pages, 12 figures, accepted to Machine Learning for Structural Biology (MLSB) workshop at the 34th Conference on Neural Information Processing Systems (NeurIPS)

  11. arXiv:2009.11152  [pdf, other]

    cs.CL cs.AI

    Hierarchical Pre-training for Sequence Labelling in Spoken Dialog

    Authors: Emile Chapuis, Pierre Colombo, Matteo Manica, Matthieu Labeau, Chloe Clavel

    Abstract: Sequence labelling tasks like Dialog Act and Emotion/Sentiment identification are a key component of spoken dialog systems. In this work, we propose a new approach to learn generic representations adapted to spoken dialog, which we evaluate on a new benchmark we call Sequence labellIng evaLuatIon benChmark fOr spoken laNguagE benchmark (SILICONE). SILICONE is model-agnostic and c…

    Submitted 8 February, 2021; v1 submitted 23 September, 2020; originally announced September 2020.

    Journal ref: EMNLP 2020

  12. arXiv:2005.13285  [pdf, other]

    q-bio.QM cs.LG stat.ML

    PaccMann$^{RL}$ on SARS-CoV-2: Designing antiviral candidates with conditional generative models

    Authors: Jannis Born, Matteo Manica, Joris Cadow, Greta Markert, Nil Adell Mill, Modestas Filipavicius, María Rodríguez Martínez

    Abstract: With the fast development of COVID-19 into a global pandemic, scientists around the globe are desperately searching for effective antiviral therapeutic agents. Bridging systems biology and drug discovery, we propose a deep learning framework for conditional de novo design of antiviral candidate drugs tailored against given protein targets. First, we train a multimodal ligand-protein binding affin…

    Submitted 6 July, 2020; v1 submitted 27 May, 2020; originally announced May 2020.

    Comments: 5 pages, 6 figures

    Journal ref: ICML Workshop on Computational Biology 2020

  13. arXiv:2004.01215  [pdf, other]

    cs.LG q-bio.QM stat.ML

    CogMol: Target-Specific and Selective Drug Design for COVID-19 Using Deep Generative Models

    Authors: Vijil Chenthamarakshan, Payel Das, Samuel C. Hoffman, Hendrik Strobelt, Inkit Padhi, Kar Wai Lim, Benjamin Hoover, Matteo Manica, Jannis Born, Teodoro Laino, Aleksandra Mojsilovic

    Abstract: The novel nature of SARS-CoV-2 calls for the development of efficient de novo drug design approaches. In this study, we propose an end-to-end framework, named CogMol (Controlled Generation of Molecules), for designing new drug-like small molecules targeting novel viral proteins with high affinity and off-target selectivity. CogMol combines adaptive pre-training of a molecular SMILES Variational Au…

    Submitted 23 June, 2020; v1 submitted 2 April, 2020; originally announced April 2020.

  14. arXiv:2002.09419  [pdf]

    cs.CL

    Guider l'attention dans les modeles de sequence a sequence pour la prediction des actes de dialogue

    Authors: Pierre Colombo, Emile Chapuis, Matteo Manica, Emmanuel Vignon, Giovanna Varni, Chloe Clavel

    Abstract: The task of predicting dialog acts (DA) based on conversational dialog is a key component in the development of conversational agents. Accurately predicting DAs requires a precise modeling of both the conversation and the global tag dependencies. We leverage seq2seq approaches widely adopted in Neural Machine Translation (NMT) to improve the modelling of tag sequentiality. Seq2seq models are known…

    Submitted 26 February, 2020; v1 submitted 21 February, 2020; originally announced February 2020.

    Comments: in French

    Journal ref: WACAI 2020

  15. arXiv:2002.08801  [pdf, other]

    cs.CL cs.LG

    Guiding attention in Sequence-to-sequence models for Dialogue Act prediction

    Authors: Pierre Colombo, Emile Chapuis, Matteo Manica, Emmanuel Vignon, Giovanna Varni, Chloe Clavel

    Abstract: The task of predicting dialog acts (DA) based on conversational dialog is a key component in the development of conversational agents. Accurately predicting DAs requires a precise modeling of both the conversation and the global tag dependencies. We leverage seq2seq approaches widely adopted in Neural Machine Translation (NMT) to improve the modelling of tag sequentiality. Seq2seq models are known…

    Submitted 26 February, 2020; v1 submitted 20 February, 2020; originally announced February 2020.

    Journal ref: AAAI 2020

  16. arXiv:1909.05114  [pdf, other]

    q-bio.BM cs.LG stat.ML

    PaccMann$^{RL}$: Designing anticancer drugs from transcriptomic data via reinforcement learning

    Authors: Jannis Born, Matteo Manica, Ali Oskooei, Joris Cadow, Karsten Borgwardt, María Rodríguez Martínez

    Abstract: With the advent of deep generative models in computational chemistry, in silico anticancer drug design has undergone an unprecedented transformation. While state-of-the-art deep learning approaches have shown potential in generating compounds with desired chemical properties, they disregard the genetic profile and properties of the target disease. Here, we introduce the first generative model capa…

    Submitted 16 April, 2020; v1 submitted 29 August, 2019; originally announced September 2019.

    Comments: 18 pages total (12 pages main text, 4 pages references, 11 pages appendix) 8 figures

    Journal ref: International Conference on Research in Computational Molecular Biology 2020

  17. arXiv:1907.08400  [pdf, other]

    cs.IR cs.LG

    An Information Extraction and Knowledge Graph Platform for Accelerating Biochemical Discoveries

    Authors: Matteo Manica, Christoph Auer, Valery Weber, Federico Zipoli, Michele Dolfi, Peter Staar, Teodoro Laino, Costas Bekas, Akihiro Fujita, Hiroki Toda, Shuichi Hirose, Yasumitsu Orii

    Abstract: Information extraction and data mining in biochemical literature is a daunting task that demands resource-intensive computation and appropriate means to scale knowledge ingestion. Being able to leverage this immense source of technical information helps to drastically reduce costs and time to solution in multiple application fields from food safety to pharmaceutics. We present a scalable document…

    Submitted 19 July, 2019; originally announced July 2019.

    Comments: 4 pages, 1 figure, Workshop on Applied Data Science for Healthcare at KDD, Anchorage, AK, 2019

  18. arXiv:1904.11223  [pdf, other]

    cs.LG cs.AI q-bio.QM stat.ML

    Towards Explainable Anticancer Compound Sensitivity Prediction via Multimodal Attention-based Convolutional Encoders

    Authors: Matteo Manica, Ali Oskooei, Jannis Born, Vigneshwari Subramanian, Julio Sáez-Rodríguez, María Rodríguez Martínez

    Abstract: In line with recent advances in neural drug design and sensitivity prediction, we propose a novel architecture for interpretable prediction of anticancer compound sensitivity using a multimodal attention-based convolutional encoder. Our model is based on the three key pillars of drug sensitivity: compounds' structure in the form of a SMILES sequence, gene expression profiles of tumors and prior kn…

    Submitted 14 July, 2019; v1 submitted 25 April, 2019; originally announced April 2019.

    Comments: 11 pages, 5 figures, 1 table, Workshop on Computational Biology at the International Conference on Machine Learning (ICML), Long Beach, CA, 2019

    Journal ref: Mol. Pharmaceutics 2019

  19. arXiv:1901.06261  [pdf, other]

    cs.LG cs.SE stat.ML

    NeuNetS: An Automated Synthesis Engine for Neural Network Design

    Authors: Atin Sood, Benjamin Elder, Benjamin Herta, Chao Xue, Costas Bekas, A. Cristiano I. Malossi, Debashish Saha, Florian Scheidegger, Ganesh Venkataraman, Gegi Thomas, Giovanni Mariani, Hendrik Strobelt, Horst Samulowitz, Martin Wistuba, Matteo Manica, Mihir Choudhury, Rong Yan, Roxana Istrate, Ruchir Puri, Tejaswini Pedapati

    Abstract: Application of neural networks to a vast variety of practical applications is transforming the way AI is applied in practice. Pre-trained neural network models available through APIs or capability to custom train pre-built neural network architectures with customer data has made the consumption of AI by developers much simpler and resulted in broad adoption of these complex AI models. While prebui…

    Submitted 16 January, 2019; originally announced January 2019.

    Comments: 14 pages, 12 figures. arXiv admin note: text overlap with arXiv:1806.00250

  20. arXiv:1811.06802  [pdf, other]

    cs.LG q-bio.MN q-bio.QM

    PaccMann: Prediction of anticancer compound sensitivity with multi-modal attention-based neural networks

    Authors: Ali Oskooei, Jannis Born, Matteo Manica, Vigneshwari Subramanian, Julio Sáez-Rodríguez, María Rodríguez Martínez

    Abstract: We present a novel approach for the prediction of anticancer compound sensitivity by means of multi-modal attention-based neural networks (PaccMann). In our approach, we integrate three key pillars of drug sensitivity, namely, the molecular structure of compounds, transcriptomic profiles of cancer cells as well as prior knowledge about interactions among proteins within cells. Our models ingest a…

    Submitted 14 July, 2019; v1 submitted 16 November, 2018; originally announced November 2018.

    Comments: 10 pages, 5 figures, 2 tables. NIPS MLMM 2018

    Journal ref: NeurIPS 2018 Workshop on Machine Learning for Molecules & Materials

  21. arXiv:1808.06603  [pdf]

    q-bio.QM cs.LG stat.ML

    Network-based Biased Tree Ensembles (NetBiTE) for Drug Sensitivity Prediction and Drug Sensitivity Biomarker Identification in Cancer

    Authors: Ali Oskooei, Matteo Manica, Roland Mathis, Maria Rodriguez Martinez

    Abstract: We present the Network-based Biased Tree Ensembles (NetBiTE) method for drug sensitivity prediction and drug sensitivity biomarker identification in cancer using a combination of prior knowledge and gene expression data. Our devised method consists of a biased tree ensemble that is built according to a probabilistic bias weight distribution. The bias weight distribution is obtained from the assign…

    Submitted 26 April, 2019; v1 submitted 18 August, 2018; originally announced August 2018.

    Comments: 36 pages, 5 figures, 3 supplementary figures

  22. Mixed-Precision In-Memory Computing

    Authors: Manuel Le Gallo, Abu Sebastian, Roland Mathis, Matteo Manica, Heiner Giefers, Tomas Tuma, Costas Bekas, Alessandro Curioni, Evangelos Eleftheriou

    Abstract: As CMOS scaling reaches its technological limits, a radical departure from traditional von Neumann systems, which involve separate processing and memory units, is needed in order to significantly extend the performance of today's computers. In-memory computing is a promising approach in which nanoscale resistive memory devices, organized in a computational memory unit, are used for both processing…

    Submitted 4 October, 2018; v1 submitted 16 January, 2017; originally announced January 2017.

    Journal ref: Nature Electronics volume 1, pages 246-253 (2018)