Showing 1–38 of 38 results for author: Kamigaito, H

  1. arXiv:2407.09548  [pdf, other]

    cs.CV cs.AI

    Towards Temporal Change Explanations from Bi-Temporal Satellite Images

    Authors: Ryo Tsujimoto, Hiroki Ouchi, Hidetaka Kamigaito, Taro Watanabe

    Abstract: Explaining temporal changes between satellite images taken at different times is important for urban planning and environmental monitoring. However, manual dataset construction for the task is costly, so human-AI collaboration is promising. Toward the direction, in this paper, we investigate the ability of Large-scale Vision-Language Models (LVLMs) to explain temporal changes between satellite im…

    Submitted 27 June, 2024; originally announced July 2024.

    Comments: 11 pages, 6 figures

  2. arXiv:2407.05656  [pdf, other]

    cs.LG cs.CL

    Multi-label Learning with Random Circular Vectors

    Authors: Ken Nishida, Kojiro Machi, Kazuma Onishi, Katsuhiko Hayashi, Hidetaka Kamigaito

    Abstract: The extreme multi-label classification (XMC) task involves learning a classifier that can predict from a large label set the most relevant subset of labels for a data instance. While deep neural networks (DNNs) have demonstrated remarkable success in XMC problems, the task is still challenging because it must deal with a large number of output labels, which make the DNN training computationally ex…

    Submitted 8 July, 2024; originally announced July 2024.

    Comments: 11 pages, 6 figures, 3 tables; accepted to workshop RepL4NLP held in conjunction with ACL 2024

  3. arXiv:2407.04251  [pdf, other]

    cs.CL cs.LG

    Unified Interpretation of Smoothing Methods for Negative Sampling Loss Functions in Knowledge Graph Embedding

    Authors: Xincan Feng, Hidetaka Kamigaito, Katsuhiko Hayashi, Taro Watanabe

    Abstract: Knowledge Graphs (KGs) are fundamental resources in knowledge-intensive tasks in NLP. Due to the limitation of manually creating KGs, KG Completion (KGC) has an important role in automatically completing KGs by scoring their links with KG Embedding (KGE). To handle many entities in training, KGE relies on Negative Sampling (NS) loss that can reduce the computational cost by sampling. Since the app…

    Submitted 5 July, 2024; originally announced July 2024.

    Comments: 9 pages, 4 figures, 2 tables; accepted to workshop RepL4NLP held in conjunction with ACL 2024

  4. arXiv:2407.02138  [pdf, other]

    cs.CL cs.AI cs.LG

    Efficient Nearest Neighbor based Uncertainty Estimation for Natural Language Processing Tasks

    Authors: Wataru Hashimoto, Hidetaka Kamigaito, Taro Watanabe

    Abstract: Trustworthy prediction in Deep Neural Networks (DNNs), including Pre-trained Language Models (PLMs), is important for safety-critical applications in the real world. However, DNNs often suffer from uncertainty estimation, such as miscalibration. In particular, approaches that require multiple stochastic inference can mitigate this problem, but the expensive cost of inference makes them impractical.…

    Submitted 2 July, 2024; originally announced July 2024.

  5. arXiv:2407.02062  [pdf, other]

    cs.CL cs.AI cs.LG

    Are Data Augmentation Methods in Named Entity Recognition Applicable for Uncertainty Estimation?

    Authors: Wataru Hashimoto, Hidetaka Kamigaito, Taro Watanabe

    Abstract: This work investigates the impact of data augmentation on confidence calibration and uncertainty estimation in Named Entity Recognition (NER) tasks. For the future advance of NER in safety-critical fields like healthcare and finance, it is essential to achieve accurate predictions with calibrated confidence when applying Deep Neural Networks (DNNs), including Pre-trained Language Models (PLMs), as…

    Submitted 2 July, 2024; originally announced July 2024.

  6. arXiv:2406.15359  [pdf, other]

    cs.CL cs.CV

    Constructing Multilingual Visual-Text Datasets Revealing Visual Multilingual Ability of Vision Language Models

    Authors: Jesse Atuhurra, Iqra Ali, Tatsuya Hiraoka, Hidetaka Kamigaito, Tomoya Iwakura, Taro Watanabe

    Abstract: Large language models (LLMs) have increased interest in vision language models (VLMs), which process image-text pairs as input. Studies investigating the visual understanding ability of VLMs have been proposed, but such studies are still preliminary because existing datasets do not permit a comprehensive evaluation of the fine-grained visual linguistic abilities of VLMs across multiple languages.…

    Submitted 29 March, 2024; originally announced June 2024.

  7. arXiv:2406.15358  [pdf, other]

    cs.CL

    Introducing Syllable Tokenization for Low-resource Languages: A Case Study with Swahili

    Authors: Jesse Atuhurra, Hiroyuki Shindo, Hidetaka Kamigaito, Taro Watanabe

    Abstract: Many attempts have been made in multilingual NLP to ensure that pre-trained language models, such as mBERT or GPT2, get better and become applicable to low-resource languages. To achieve multilingualism for pre-trained language models (PLMs), we need techniques to create word embeddings that capture the linguistic characteristics of any language. Tokenization is one such technique because it allows…

    Submitted 26 March, 2024; originally announced June 2024.

  8. arXiv:2406.12335  [pdf, other]

    cs.CL cs.LG

    Attention Score is not All You Need for Token Importance Indicator in KV Cache Reduction: Value Also Matters

    Authors: Zhiyu Guo, Hidetaka Kamigaito, Taro Watanabe

    Abstract: Scaling the context size of large language models (LLMs) enables them to perform various new tasks, e.g., book summarization. However, the memory cost of the Key and Value (KV) cache in attention significantly limits the practical applications of LLMs. Recent works have explored token pruning for KV cache reduction in LLMs, relying solely on attention scores as a token importance indicator. Howeve…

    Submitted 18 June, 2024; originally announced June 2024.

  9. arXiv:2406.11632  [pdf, other]

    cs.CL cs.AI

    Unveiling the Power of Source: Source-based Minimum Bayes Risk Decoding for Neural Machine Translation

    Authors: Boxuan Lyu, Hidetaka Kamigaito, Kotaro Funakoshi, Manabu Okumura

    Abstract: Maximum a posteriori decoding, a commonly used method for neural machine translation (NMT), aims to maximize the estimated posterior probability. However, high estimated probability does not always lead to high translation quality. Minimum Bayes Risk (MBR) decoding offers an alternative by seeking hypotheses with the highest expected utility. In this work, we show that Quality Estimation (QE) re…

    Submitted 17 June, 2024; originally announced June 2024.

  10. arXiv:2406.11097  [pdf, other]

    cs.CL cs.AI

    InstructCMP: Length Control in Sentence Compression through Instruction-based Large Language Models

    Authors: Juseon-Do, Jingun Kwon, Hidetaka Kamigaito, Manabu Okumura

    Abstract: Extractive summarization can produce faithful summaries but often requires additional constraints such as a desired summary length. Traditional sentence compression models do not typically consider the constraints because of their restricted model abilities, which require model modifications for coping with them. To bridge this gap, we propose Instruction-based Compression (InstructCMP), an approa…

    Submitted 18 June, 2024; v1 submitted 16 June, 2024; originally announced June 2024.

    Comments: 8 pages, 3 figures, accepted to ACL 2024 Findings (Long Paper)

    ACM Class: I.2.7

  11. arXiv:2406.04215  [pdf, other]

    cs.CL cs.AI cs.LG

    mCSQA: Multilingual Commonsense Reasoning Dataset with Unified Creation Strategy by Language Models and Humans

    Authors: Yusuke Sakai, Hidetaka Kamigaito, Taro Watanabe

    Abstract: It is very challenging to curate a dataset for language-specific knowledge and common sense in order to evaluate natural language understanding capabilities of language models. Due to the limitation in the availability of annotators, most current multilingual datasets are created through translation, which cannot evaluate such language-specific aspects. Therefore, we propose Multilingual Commonsen…

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: Accepted at Findings of ACL 2024

  12. arXiv:2405.01943  [pdf, other]

    cs.CL cs.AI cs.LG

    Dependency-Aware Semi-Structured Sparsity: Declining Roles of Outliers in Pruning GLU-based LLMs

    Authors: Zhiyu Guo, Hidetaka Kamigaito, Taro Watanabe

    Abstract: The rapid growth in the scale of Large Language Models (LLMs) has led to significant computational and memory costs, making model compression techniques such as network pruning increasingly crucial for their efficient deployment. Recent LLMs such as LLaMA2 and Mistral have adopted GLU-based MLP architectures. However, current LLM pruning strategies are primarily based on insights from older LLM ar…

    Submitted 20 June, 2024; v1 submitted 3 May, 2024; originally announced May 2024.

  13. arXiv:2404.19505  [pdf, other]

    cs.CL

    Context-Aware Machine Translation with Source Coreference Explanation

    Authors: Huy Hien Vu, Hidetaka Kamigaito, Taro Watanabe

    Abstract: Despite significant improvements in enhancing the quality of translation, context-aware machine translation (MT) models underperform in many cases. One of the main reasons is that they fail to utilize the correct features from context when the context is too long or their models are overly complex. This can lead to the explain-away effect, wherein the models only consider features easier to explai…

    Submitted 30 April, 2024; originally announced April 2024.

    Comments: Accepted to TACL. This is a pre-MIT Press publication version

  14. arXiv:2404.14415  [pdf, other]

    cs.CL

    Domain Adaptation in Intent Classification Systems: A Review

    Authors: Jesse Atuhurra, Hidetaka Kamigaito, Taro Watanabe, Eric Nichols

    Abstract: Dialogue agents, which perform specific tasks, are part of the long-term goal of NLP researchers to build intelligent agents that communicate with humans in natural language. Such systems should adapt easily from one domain to another to assist users in completing tasks. Researchers have developed a broad range of techniques, objectives, and datasets for intent classification to achieve such syste…

    Submitted 26 March, 2024; originally announced April 2024.

  15. arXiv:2404.12299  [pdf, other]

    cs.CL cs.AI cs.LG cs.SD eess.AS

    Simultaneous Interpretation Corpus Construction by Large Language Models in Distant Language Pair

    Authors: Yusuke Sakai, Mana Makinae, Hidetaka Kamigaito, Taro Watanabe

    Abstract: In Simultaneous Machine Translation (SiMT) systems, training with a simultaneous interpretation (SI) corpus is an effective method for achieving high-quality yet low-latency systems. However, it is very challenging to curate such a corpus due to limitations in the abilities of annotators, and hence, existing SI corpora are limited. Therefore, we propose a method to convert existing speech translat…

    Submitted 18 April, 2024; originally announced April 2024.

    Comments: 23 pages, 9 figures

  16. arXiv:2404.08666  [pdf, other]

    cs.CL cs.LG

    Revealing Trends in Datasets from the 2022 ACL and EMNLP Conferences

    Authors: Jesse Atuhurra, Hidetaka Kamigaito

    Abstract: Natural language processing (NLP) has grown significantly since the advent of the Transformer architecture. Transformers have given birth to pre-trained large language models (PLMs). There has been tremendous improvement in the performance of NLP systems across several tasks. NLP systems are on par or, in some cases, better than humans at accomplishing specific tasks. However, it remains the norm…

    Submitted 15 July, 2024; v1 submitted 31 March, 2024; originally announced April 2024.

  17. arXiv:2403.16820  [pdf, other]

    cs.CL

    Cross-lingual Contextualized Phrase Retrieval

    Authors: Huayang Li, Deng Cai, Zhi Qu, Qu Cui, Hidetaka Kamigaito, Lemao Liu, Taro Watanabe

    Abstract: Phrase-level dense retrieval has shown many appealing characteristics in downstream NLP tasks by leveraging the fine-grained information that phrases offer. In our work, we propose a new task formulation of dense retrieval, cross-lingual contextualized phrase retrieval, which aims to augment cross-lingual applications by addressing polysemy using context information. However, the lack of specific…

    Submitted 25 March, 2024; originally announced March 2024.

    Comments: preprint

  18. arXiv:2403.15430  [pdf, other]

    cs.CL

    Distilling Named Entity Recognition Models for Endangered Species from Large Language Models

    Authors: Jesse Atuhurra, Seiveright Cargill Dujohn, Hidetaka Kamigaito, Hiroyuki Shindo, Taro Watanabe

    Abstract: Natural language processing (NLP) practitioners are leveraging large language models (LLMs) to create structured datasets from semi-structured and unstructured data sources such as patents, papers, and theses, without having domain-specific knowledge. At the same time, ecological experts are searching for a variety of means to preserve biodiversity. To contribute to these efforts, we focused on end…

    Submitted 13 March, 2024; originally announced March 2024.

  19. arXiv:2403.05065  [pdf, other]

    cs.CL

    Can we obtain significant success in RST discourse parsing by using Large Language Models?

    Authors: Aru Maekawa, Tsutomu Hirao, Hidetaka Kamigaito, Manabu Okumura

    Abstract: Recently, decoder-only pre-trained large language models (LLMs), with several tens of billion parameters, have significantly impacted a wide range of natural language processing (NLP) tasks. While encoder-only or encoder-decoder pre-trained language models have already proved to be effective in discourse parsing, the extent to which LLMs can perform this task remains an open research question. The…

    Submitted 8 March, 2024; originally announced March 2024.

    Comments: Accepted in the main conference of EACL 2024

  20. arXiv:2403.00068  [pdf, other]

    cs.CV

    Artwork Explanation in Large-scale Vision Language Models

    Authors: Kazuki Hayashi, Yusuke Sakai, Hidetaka Kamigaito, Katsuhiko Hayashi, Taro Watanabe

    Abstract: Large-scale vision-language models (LVLMs) output text from images and instructions, demonstrating advanced capabilities in text generation and comprehension. However, it has not been clarified to what extent LVLMs understand the knowledge necessary for explaining images, the complex relationships between various pieces of knowledge, and how they integrate these understandings into their explanati…

    Submitted 29 February, 2024; originally announced March 2024.

  21. arXiv:2402.14453  [pdf, other]

    cs.CL

    Do LLMs Implicitly Determine the Suitable Text Difficulty for Users?

    Authors: Seiji Gobara, Hidetaka Kamigaito, Taro Watanabe

    Abstract: Education that suits the individual learning level is necessary to improve students' understanding. The first step in achieving this purpose by using large language models (LLMs) is to adjust the textual difficulty of the response to students. This work analyzes how LLMs can implicitly adjust text difficulty between user input and its generated text. To conduct the experiments, we created a new da…

    Submitted 22 February, 2024; originally announced February 2024.

    Comments: 17 pages

  22. arXiv:2402.12121  [pdf, other]

    cs.CL cs.AI cs.CV cs.MM

    Evaluating Image Review Ability of Vision Language Models

    Authors: Shigeki Saito, Kazuki Hayashi, Yusuke Ide, Yusuke Sakai, Kazuma Onishi, Toma Suzuki, Seiji Gobara, Hidetaka Kamigaito, Katsuhiko Hayashi, Taro Watanabe

    Abstract: Large-scale vision language models (LVLMs) are language models that are capable of processing images and text inputs by a single model. This paper explores the use of LVLMs to generate review texts for images. The ability of LVLMs to review images is not fully understood, highlighting the need for a methodical evaluation of their review abilities. Unlike image captions, review texts can be written…

    Submitted 19 February, 2024; originally announced February 2024.

    Comments: 9 pages, under review

  23. arXiv:2402.11197  [pdf, other]

    cs.CL

    Centroid-Based Efficient Minimum Bayes Risk Decoding

    Authors: Hiroyuki Deguchi, Yusuke Sakai, Hidetaka Kamigaito, Taro Watanabe, Hideki Tanaka, Masao Utiyama

    Abstract: Minimum Bayes risk (MBR) decoding achieved state-of-the-art translation performance by using COMET, a neural metric that has a high correlation with human evaluation. However, MBR decoding requires quadratic time since it computes the expected score between a translation hypothesis and all reference translations. We propose centroid-based MBR (CBMBR) decoding to improve the speed of MBR decoding.…

    Submitted 11 June, 2024; v1 submitted 17 February, 2024; originally announced February 2024.

    Comments: Accepted at Findings of ACL 2024

  24. arXiv:2402.09344  [pdf, other]

    cs.CL

    Generating Diverse Translation with Perturbed kNN-MT

    Authors: Yuto Nishida, Makoto Morishita, Hidetaka Kamigaito, Taro Watanabe

    Abstract: Generating multiple translation candidates would enable users to choose the one that satisfies their needs. Although there has been work on diversified generation, there exists room for improving the diversity mainly because the previous methods do not address the overcorrection problem -- the model underestimates a prediction that is largely different from the training data, even if that predicti…

    Submitted 14 February, 2024; originally announced February 2024.

    Comments: Accepted to EACL 2024 SRW

  25. arXiv:2311.09109  [pdf, other]

    cs.CL cs.AI cs.LG

    Does Pre-trained Language Model Actually Infer Unseen Links in Knowledge Graph Completion?

    Authors: Yusuke Sakai, Hidetaka Kamigaito, Katsuhiko Hayashi, Taro Watanabe

    Abstract: Knowledge graphs (KGs) consist of links that describe relationships between entities. Due to the difficulty of manually enumerating all relationships between entities, automatically completing them is essential for KGs. Knowledge Graph Completion (KGC) is a task that infers unseen relationships between entities in a KG. Traditional embedding-based KGC methods, such as RESCAL, TransE, DistMult, Com…

    Submitted 6 June, 2024; v1 submitted 15 November, 2023; originally announced November 2023.

    Comments: Accepted at NAACL 2024 main oral, 15 pages, 10 figures

  26. arXiv:2309.09296  [pdf, other]

    cs.CL cs.AI cs.LG

    Model-based Subsampling for Knowledge Graph Completion

    Authors: Xincan Feng, Hidetaka Kamigaito, Katsuhiko Hayashi, Taro Watanabe

    Abstract: Subsampling is effective in Knowledge Graph Embedding (KGE) for reducing overfitting caused by the sparsity in Knowledge Graph (KG) datasets. However, current subsampling approaches consider only frequencies of queries that consist of entities and their relations. Thus, the existing subsampling potentially underestimates the appearance probabilities of infrequent queries even if the frequencies of…

    Submitted 17 September, 2023; originally announced September 2023.

    Comments: Accepted by AACL 2023; 9 pages, 3 figures, 5 tables

  27. arXiv:2306.02115  [pdf, other]

    cs.CL cs.CV cs.LG

    Table and Image Generation for Investigating Knowledge of Entities in Pre-trained Vision and Language Models

    Authors: Hidetaka Kamigaito, Katsuhiko Hayashi, Taro Watanabe

    Abstract: In this paper, we propose a table and image generation task to verify how the knowledge about entities acquired from natural language is retained in Vision & Language (V&L) models. This task consists of two parts: the first is to generate a table containing knowledge about an entity and its related image, and the second is to generate an image from an entity with a caption and a table containing r…

    Submitted 25 July, 2023; v1 submitted 3 June, 2023; originally announced June 2023.

    Comments: Accepted at ACL 2023

  28. arXiv:2305.13000  [pdf, other]

    cs.CL

    Bidirectional Transformer Reranker for Grammatical Error Correction

    Authors: Ying Zhang, Hidetaka Kamigaito, Manabu Okumura

    Abstract: Pre-trained seq2seq models have achieved state-of-the-art results in the grammatical error correction task. However, these models still suffer from a prediction bias due to their unidirectional decoding. Thus, we propose a bidirectional Transformer reranker (BTR), that re-estimates the probability of each candidate sentence generated by the pre-trained seq2seq model. The BTR preserves the seq2seq-…

    Submitted 22 May, 2023; originally announced May 2023.

    Comments: Accepted to the Findings of ACL 2023

  29. arXiv:2210.08355  [pdf, other]

    cs.CL

    A Simple and Strong Baseline for End-to-End Neural RST-style Discourse Parsing

    Authors: Naoki Kobayashi, Tsutomu Hirao, Hidetaka Kamigaito, Manabu Okumura, Masaaki Nagata

    Abstract: To promote and further develop RST-style discourse parsing models, we need a strong baseline that can be regarded as a reference for reporting reliable experimental results. This paper explores a strong baseline by integrating existing simple parsing strategies, top-down and bottom-up, with various transformer-based pre-trained language models. The experimental results obtained from two benchmark…

    Submitted 1 November, 2022; v1 submitted 15 October, 2022; originally announced October 2022.

    Comments: Accepted in Findings of EMNLP 2022

  30. arXiv:2209.12801  [pdf, ps, other]

    cs.AI cs.CL cs.LG

    Subsampling for Knowledge Graph Embedding Explained

    Authors: Hidetaka Kamigaito, Katsuhiko Hayashi

    Abstract: In this article, we explain the recent advance of subsampling methods in knowledge graph embedding (KGE) starting from the original one used in word2vec.

    Submitted 13 September, 2022; originally announced September 2022.

    Comments: Notes for subsampling methods in Knowledge Graph Embedding

  31. arXiv:2207.00929  [pdf, other]

    cs.CL

    Generating Repetitions with Appropriate Repeated Words

    Authors: Toshiki Kawamoto, Hidetaka Kamigaito, Kotaro Funakoshi, Manabu Okumura

    Abstract: A repetition is a response that repeats words in the previous speaker's utterance in a dialogue. Repetitions are essential in communication to build trust with others, as investigated in linguistic studies. In this work, we focus on repetition generation. To the best of our knowledge, this is the first neural approach to address repetition generation. We propose Weighted Label Smoothing, a smoothi…

    Submitted 2 July, 2022; originally announced July 2022.

  32. arXiv:2206.10140  [pdf, other]

    cs.LG cs.AI cs.CL cs.SI

    Comprehensive Analysis of Negative Sampling in Knowledge Graph Representation Learning

    Authors: Hidetaka Kamigaito, Katsuhiko Hayashi

    Abstract: Negative sampling (NS) loss plays an important role in learning knowledge graph embedding (KGE) to handle a huge number of entities. However, the performance of KGE degrades without hyperparameters such as the margin term and number of negative samples in NS loss being appropriately selected. Currently, empirical hyperparameter tuning addresses this problem at the cost of computational time. To so…

    Submitted 6 July, 2022; v1 submitted 21 June, 2022; originally announced June 2022.

    Comments: Accepted at ICML2022

  33. arXiv:2204.11445  [pdf, other]

    cs.CL

    Aspect-based Analysis of Advertising Appeals for Search Engine Advertising

    Authors: Soichiro Murakami, Peinan Zhang, Sho Hoshino, Hidetaka Kamigaito, Hiroya Takamura, Manabu Okumura

    Abstract: Writing an ad text that attracts people and persuades them to click or act is essential for the success of search engine advertising. Therefore, ad creators must consider various aspects of advertising appeals (A$^3$) such as the price, product features, and quality. However, products and services exhibit unique effective A$^3$ for different industries. In this work, we focus on exploring the effe…

    Submitted 25 April, 2022; originally announced April 2022.

    Comments: Accepted by NAACL-HLT2022 Industry track

  34. Unified Interpretation of Softmax Cross-Entropy and Negative Sampling: With Case Study for Knowledge Graph Embedding

    Authors: Hidetaka Kamigaito, Katsuhiko Hayashi

    Abstract: In knowledge graph embedding, the theoretical relationship between the softmax cross-entropy and negative sampling loss functions has not been investigated. This makes it difficult to fairly compare the results of the two different loss functions. We attempted to solve this problem by using the Bregman divergence to provide a unified interpretation of the softmax cross-entropy and negative samplin…

    Submitted 16 March, 2022; v1 submitted 14 June, 2021; originally announced June 2021.

    Comments: Accepted at ACL-IJCNLP 2021

  35. arXiv:2102.00819  [pdf, other]

    cs.CL

    Metric-Type Identification for Multi-Level Header Numerical Tables in Scientific Papers

    Authors: Lya Hulliyyatus Suadaa, Hidetaka Kamigaito, Manabu Okumura, Hiroya Takamura

    Abstract: Numerical tables are widely used to present experimental results in scientific papers. For table understanding, a metric-type is essential to discriminate numbers in the tables. We introduce a new information extraction task, metric-type identification from multi-level header numerical tables, and provide a dataset extracted from scientific papers consisting of header tables, captions, and metric-…

    Submitted 1 February, 2021; originally announced February 2021.

    Comments: To appear at EACL 2021

  36. arXiv:2011.04241  [pdf, other]

    cs.CL

    Pointing to Subwords for Generating Function Names in Source Code

    Authors: Shogo Fujita, Hidetaka Kamigaito, Hiroya Takamura, Manabu Okumura

    Abstract: We tackle the task of automatically generating a function name from source code. Existing generators face difficulties in generating low-frequency or out-of-vocabulary subwords. In this paper, we propose two strategies for copying low-frequency or out-of-vocabulary subwords in inputs. Our best performing model showed an improvement over the conventional method in terms of our modified F1 and accur…

    Submitted 9 November, 2020; originally announced November 2020.

    Comments: 12 pages, accepted to COLING2020

  37. arXiv:2011.02173  [pdf, other]

    cs.CL

    Neural text normalization leveraging similarities of strings and sounds

    Authors: Riku Kawamura, Tatsuya Aoki, Hidetaka Kamigaito, Hiroya Takamura, Manabu Okumura

    Abstract: We propose neural models that can normalize text by considering the similarities of word strings and sounds. We experimentally compared a model that considers the similarities of both word strings and sounds, a model that considers only the similarity of word strings or of sounds, and a model without the similarities as a baseline. Results showed that leveraging the word string similarity succeede…

    Submitted 4 November, 2020; originally announced November 2020.

    Comments: 6 pages, accepted to COLING2020

  38. arXiv:2002.01145  [pdf, other]

    cs.CL

    Syntactically Look-Ahead Attention Network for Sentence Compression

    Authors: Hidetaka Kamigaito, Manabu Okumura

    Abstract: Sentence compression is the task of compressing a long sentence into a short one by deleting redundant words. In sequence-to-sequence (Seq2Seq) based models, the decoder unidirectionally decides to retain or delete words. Thus, it cannot usually explicitly capture the relationships between decoded words and unseen words that will be decoded in the future time steps. Therefore, to avoid generating…

    Submitted 17 May, 2020; v1 submitted 4 February, 2020; originally announced February 2020.

    Comments: AAAI 2020