Showing 101–120 of 120 results for author: Haffari, G

  1. arXiv:1902.08888  [pdf, other]

    cs.LG cs.CV stat.ML

    Medical Multimodal Classifiers Under Scarce Data Condition

    Authors: Faik Aydin, Maggie Zhang, Michelle Ananda-Rajah, Gholamreza Haffari

    Abstract: Data is one of the essential ingredients to power deep learning research. Small datasets, especially specific to medical institutes, bring challenges to deep learning training stage. This work aims to develop a practical deep multimodal that can classify patients into abnormal and normal categories accurately as well as assist radiologists to detect visual and textual anomalies by locating areas o…

    Submitted 23 February, 2019; originally announced February 2019.

  2. arXiv:1902.03402  [pdf, ps, other]

    cs.IR

    A new simple and effective measure for bag-of-word inter-document similarity measurement

    Authors: Sunil Aryal, Kai Ming Ting, Takashi Washio, Gholamreza Haffari

    Abstract: To measure the similarity of two documents in the bag-of-words (BoW) vector representation, different term weighting schemes are used to improve the performance of cosine similarity---the most widely used inter-document similarity measure in text mining. In this paper, we identify the shortcomings of the underlying assumptions of term weighting in the inter-document similarity measurement task; an…

    Submitted 9 February, 2019; originally announced February 2019.

  3. arXiv:1810.07455  [pdf, other]

    cs.CL

    Exploring Textual and Speech information in Dialogue Act Classification with Speaker Domain Adaptation

    Authors: Xuanli He, Quan Hung Tran, William Havard, Laurent Besacier, Ingrid Zukerman, Gholamreza Haffari

    Abstract: In spite of the recent success of Dialogue Act (DA) classification, the majority of prior works focus on text-based classification with oracle transcriptions, i.e. human transcriptions, instead of Automatic Speech Recognition (ASR)'s transcriptions. In spoken dialog systems, however, the agent would only have access to noisy ASR transcriptions, which may further suffer performance degradation due…

    Submitted 17 October, 2018; originally announced October 2018.

    Comments: 5 pages, 2 figures

  4. arXiv:1810.07391  [pdf, other]

    cs.CL

    Sequence to Sequence Mixture Model for Diverse Machine Translation

    Authors: Xuanli He, Gholamreza Haffari, Mohammad Norouzi

    Abstract: Sequence to sequence (SEQ2SEQ) models often lack diversity in their generated translations. This can be attributed to the limitation of SEQ2SEQ models in capturing lexical and syntactic variations in a parallel corpus resulting from different styles, genres, topics, or ambiguity of the translation process. In this paper, we develop a novel sequence to sequence mixture (S2SMIX) model that improves…

    Submitted 17 October, 2018; originally announced October 2018.

    Comments: 11 pages, 5 figures, accepted to CoNLL2018

  5. arXiv:1809.00344  [pdf, other]

    cs.CL

    Contextual Neural Model for Translating Bilingual Multi-Speaker Conversations

    Authors: Sameen Maruf, André F. T. Martins, Gholamreza Haffari

    Abstract: Recent works in neural machine translation have begun to explore document translation. However, translating online multi-speaker conversations is still an open problem. In this work, we propose the task of translating Bilingual Multi-Speaker Conversations, and explore neural architectures which exploit both source and target-side conversation histories for this task. To initiate an evaluation for…

    Submitted 2 September, 2018; originally announced September 2018.

    Comments: WMT 2018

  6. arXiv:1806.09835  [pdf, other]

    cs.CL cs.LG

    Graph-to-Sequence Learning using Gated Graph Neural Networks

    Authors: Daniel Beck, Gholamreza Haffari, Trevor Cohn

    Abstract: Many NLP applications can be framed as a graph-to-sequence learning problem. Previous work proposing neural architectures on this setting obtained promising results compared to grammar-based approaches but still rely on linearisation heuristics and/or standard recurrent networks to achieve the best performance. In this work, we propose a new model that encodes the full structural information conta…

    Submitted 26 June, 2018; originally announced June 2018.

    Comments: ACL 2018

  7. arXiv:1805.04237  [pdf, other]

    cs.CL

    Neural Machine Translation for Bilingually Scarce Scenarios: A Deep Multi-task Learning Approach

    Authors: Poorya Zaremoodi, Gholamreza Haffari

    Abstract: Neural machine translation requires large amounts of parallel training text to learn a reasonable-quality translation model. This is particularly inconvenient for language pairs for which enough parallel text is not available. In this paper, we use monolingual linguistic resources in the source side to address this challenging problem based on a multi-task learning approach. More specifically, we…

    Submitted 10 May, 2018; originally announced May 2018.

    Comments: NAACL 2018 (long paper)

  8. arXiv:1711.07019  [pdf, other]

    cs.CL

    Incorporating Syntactic Uncertainty in Neural Machine Translation with Forest-to-Sequence Model

    Authors: Poorya Zaremoodi, Gholamreza Haffari

    Abstract: Incorporating syntactic information in Neural Machine Translation models is a method to compensate their requirement for a large amount of parallel training text, especially for low-resource language pairs. Previous works on using syntactic information provided by (inevitably error-prone) parsers has been promising. In this paper, we propose a forest-to-sequence Attentional Neural Machine Translat…

    Submitted 23 November, 2017; v1 submitted 19 November, 2017; originally announced November 2017.

  9. arXiv:1711.03688  [pdf, other]

    cs.CL

    Document Context Neural Machine Translation with Memory Networks

    Authors: Sameen Maruf, Gholamreza Haffari

    Abstract: We present a document-level neural machine translation model which takes both source and target document context into account using memory networks. We model the problem as a structured prediction problem with interdependencies among the observed and hidden variables, i.e., the source sentences and their unobserved target translations in the document. The resulting structured prediction problem is…

    Submitted 16 May, 2018; v1 submitted 9 November, 2017; originally announced November 2017.

    Comments: Accepted by ACL 2018

  10. arXiv:1701.02854  [pdf, other]

    cs.CL cs.AI

    Towards Decoding as Continuous Optimization in Neural Machine Translation

    Authors: Cong Duy Vu Hoang, Gholamreza Haffari, Trevor Cohn

    Abstract: We propose a novel decoding approach for neural machine translation (NMT) based on continuous optimisation. We convert decoding - basically a discrete optimization problem - into a continuous optimization problem. The resulting constrained continuous optimisation problem is then tackled using gradient-based methods. Our powerful decoding framework enables decoding intractable models such as the in…

    Submitted 22 July, 2017; v1 submitted 11 January, 2017; originally announced January 2017.

    Comments: EMNLP 2017 Camera Ready Paper

  11. arXiv:1608.04465  [pdf, other]

    cs.CL

    Fast, Small and Exact: Infinite-order Language Modelling with Compressed Suffix Trees

    Authors: Ehsan Shareghi, Matthias Petri, Gholamreza Haffari, Trevor Cohn

    Abstract: Efficient methods for storing and querying are critical for scaling high-order n-gram language models to large corpora. We propose a language model based on compressed suffix trees, a representation that is highly compact and can be easily held in memory, while supporting queries needed in computing language model probabilities on-the-fly. We present several optimisations which improve query runti…

    Submitted 15 August, 2016; originally announced August 2016.

    Comments: 14 pages in Transactions of the Association for Computational Linguistics (TACL) 2016

  12. arXiv:1606.04217  [pdf, other]

    cs.NE cs.CL

    Word Representation Models for Morphologically Rich Languages in Neural Machine Translation

    Authors: Ekaterina Vylomova, Trevor Cohn, Xuanli He, Gholamreza Haffari

    Abstract: Dealing with the complex word forms in morphologically rich languages is an open problem in language processing, and is particularly important in translation. In contrast to most modern neural systems of translation, which discard the identity for rare words, in this paper we propose several architectures for learning word representations from character and morpheme level word decompositions. We i…

    Submitted 14 June, 2016; originally announced June 2016.

  13. arXiv:1603.08594  [pdf, other]

    cs.CL

    Prepositional Attachment Disambiguation Using Bilingual Parsing and Alignments

    Authors: Geetanjali Rakshit, Sagar Sontakke, Pushpak Bhattacharyya, Gholamreza Haffari

    Abstract: In this paper, we attempt to solve the problem of Prepositional Phrase (PP) attachments in English. The motivation for the work comes from NLP applications like Machine Translation, for which, getting the correct attachment of prepositions is very crucial. The idea is to correct the PP-attachments for a sentence with the help of alignments from parallel data in another language. The novelty of our…

    Submitted 28 March, 2016; originally announced March 2016.

  14. arXiv:1603.01913  [pdf, other]

    cs.CL cs.LG cs.NE stat.ML

    A Latent Variable Recurrent Neural Network for Discourse Relation Language Models

    Authors: Yangfeng Ji, Gholamreza Haffari, Jacob Eisenstein

    Abstract: This paper presents a novel latent variable recurrent neural network architecture for jointly modeling sequences of words and (possibly latent) discourse relations between adjacent sentences. A recurrent neural network generates individual words, thus reaping the benefits of discriminatively-trained vector representations. The discourse relations are represented with a latent variable, which can b…

    Submitted 5 April, 2016; v1 submitted 6 March, 2016; originally announced March 2016.

    Comments: NAACL 2016 camera ready, 11 pages

  15. arXiv:1601.01085  [pdf, other]

    cs.CL

    Incorporating Structural Alignment Biases into an Attentional Neural Translation Model

    Authors: Trevor Cohn, Cong Duy Vu Hoang, Ekaterina Vylomova, Kaisheng Yao, Chris Dyer, Gholamreza Haffari

    Abstract: Neural encoder-decoder models of machine translation have achieved impressive results, rivalling traditional translation models. However their modelling formulation is overly simplistic, and omits several key inductive biases built into traditional models. In this paper we extend the attentional neural translation model to include structural biases from word based alignment models, including posit…

    Submitted 6 January, 2016; originally announced January 2016.

    Comments: 10 pages

  16. arXiv:1503.02768  [pdf, ps, other]

    stat.ML

    Novel Bernstein-like Concentration Inequalities for the Missing Mass

    Authors: Bahman Yari Saeed Khanloo, Gholamreza Haffari

    Abstract: We are concerned with obtaining novel concentration inequalities for the missing mass, i.e. the total probability mass of the outcomes not observed in the sample. We not only derive - for the first time - distribution-free Bernstein-like deviation bounds with sublinear exponents in deviation size for missing mass, but also improve the results of McAllester and Ortiz (2003) and Berend and Kontorovic…

    Submitted 19 June, 2015; v1 submitted 10 March, 2015; originally announced March 2015.

    Comments: arXiv admin note: text overlap with arXiv:1402.6262. Appears in 31st Conference on Uncertainty in Artificial Intelligence (UAI), 2015

  17. arXiv:1503.02417  [pdf, other]

    cs.LG cs.CL

    Structured Prediction of Sequences and Trees using Infinite Contexts

    Authors: Ehsan Shareghi, Gholamreza Haffari, Trevor Cohn, Ann Nicholson

    Abstract: Linguistic structures exhibit a rich array of global phenomena, however commonly used Markov models are unable to adequately describe these phenomena due to their strong locality assumptions. We propose a novel hierarchical model for structured prediction over sequences and trees which exploits global context by conditioning each generation decision on an unbounded context of prior decisions. This…

    Submitted 9 March, 2015; originally announced March 2015.

  18. arXiv:1503.00486  [pdf, other]

    q-bio.GN

    HetFHMM: A novel approach to infer tumor heterogeneity using factorial Hidden Markov model

    Authors: Gholamreza Haffari, Zhaoxiang Cai, Mohammad S. Rahman, Ann E. Nicholson

    Abstract: Cancer arises from successive rounds of mutations which generate tumor cells with different genomic variation i.e. clones. For drug responsiveness and therapeutics, it is necessary to identify the clones in tumor sample accurately. Many methods are developed to infer tumor heterogeneity by either computing cellular prevalence and tumor phylogeny or predicting genotype of mutations. All methods suf…

    Submitted 2 March, 2015; originally announced March 2015.

    Comments: 9 pages

  19. arXiv:1301.1590  [pdf, ps, other]

    q-bio.BM cs.LG

    An Efficient Algorithm for Upper Bound on the Partition Function of Nucleic Acids

    Authors: Hamidreza Chitsaz, Elmirasadat Forouzmand, Gholamreza Haffari

    Abstract: It has been shown that minimum free energy structure for RNAs and RNA-RNA interaction is often incorrect due to inaccuracies in the energy parameters and inherent limitations of the energy model. In contrast, ensemble based quantities such as melting temperature and equilibrium concentrations can be more reliably predicted. Even structure prediction by sampling from the ensemble and clustering tho…

    Submitted 8 January, 2013; originally announced January 2013.

  20. arXiv:1206.5240  [pdf]

    cs.LG stat.ML

    Analysis of Semi-Supervised Learning with the Yarowsky Algorithm

    Authors: Gholam Reza Haffari, Anoop Sarkar

    Abstract: The Yarowsky algorithm is a rule-based semi-supervised learning algorithm that has been successfully applied to some problems in computational linguistics. The algorithm was not mathematically well understood until (Abney 2004) which analyzed some specific variants of the algorithm, and also proposed some new algorithms for bootstrapping. In this paper, we extend Abney's work and show that some of…

    Submitted 20 June, 2012; originally announced June 2012.

    Comments: Appears in Proceedings of the Twenty-Third Conference on Uncertainty in Artificial Intelligence (UAI2007)

    Report number: UAI-P-2007-PG-159-166