Showing 1–50 of 66 results for author: Lai, M

  1. arXiv:2407.11414  [pdf, other]

    cs.CV

    SDPT: Synchronous Dual Prompt Tuning for Fusion-based Visual-Language Pre-trained Models

    Authors: Yang Zhou, Yongjian Wu, Jiya Saiyin, Bingzheng Wei, Maode Lai, Eric Chang, Yan Xu

    Abstract: Prompt tuning methods have achieved remarkable success in parameter-efficient fine-tuning on large pre-trained models. However, their application to dual-modal fusion-based visual-language pre-trained models (VLPMs), such as GLIP, has encountered issues. Existing prompt tuning methods have not effectively addressed the modal mapping and aligning problem for tokens in different modalities, leading…

    Submitted 16 July, 2024; originally announced July 2024.

    Comments: Accepted by ECCV 2024

  2. arXiv:2407.10021  [pdf, other]

    cs.CL cs.AI

    Document-level Clinical Entity and Relation Extraction via Knowledge Base-Guided Generation

    Authors: Kriti Bhattarai, Inez Y. Oh, Zachary B. Abrams, Albert M. Lai

    Abstract: Generative pre-trained transformer (GPT) models have shown promise in clinical entity and relation extraction tasks because of their precise extraction and contextual understanding capability. In this work, we further leverage the Unified Medical Language System (UMLS) knowledge base to accurately identify medical concepts and improve clinical entity and relation extraction at the document level.…

    Submitted 13 July, 2024; originally announced July 2024.

    Comments: Accepted at Association for Computational Linguistics BioNLP 2024

  3. arXiv:2407.08800  [pdf, other]

    cs.CV cs.LG

    Local Clustering for Lung Cancer Image Classification via Sparse Solution Technique

    Authors: Jackson Hamel, Ming-Jun Lai, Zhaiming Shen, Ye Tian

    Abstract: In this work, we propose to use a local clustering approach based on the sparse solution technique to study the medical image, especially the lung cancer image classification task. We view images as the vertices in a weighted graph and the similarity between a pair of images as the edges in the graph. The vertices within the same cluster can be assumed to share similar features and properties, thu…

    Submitted 11 July, 2024; originally announced July 2024.

  4. arXiv:2406.18556  [pdf]

    eess.IV cs.CV cs.LG

    Renal digital pathology visual knowledge search platform based on language large model and book knowledge

    Authors: Xiaomin Lv, Chong Lai, Liya Ding, Maode Lai, Qingrong Sun

    Abstract: Large models have become mainstream, yet their applications in digital pathology still require exploration. Meanwhile renal pathology images play an important role in the diagnosis of renal diseases. We conducted image segmentation and paired corresponding text descriptions based on 60 books for renal pathology, clustering analysis for all image and text description features based on large models,…

    Submitted 26 May, 2024; originally announced June 2024.

    Comments: 9 pages, 6 figures

  5. arXiv:2405.03060  [pdf, other]

    cs.LG

    Tree-based Ensemble Learning for Out-of-distribution Detection

    Authors: Zhaiming Shen, Menglun Wang, Guang Cheng, Ming-Jun Lai, Lin Mu, Ruihao Huang, Qi Liu, Hao Zhu

    Abstract: Being able to successfully determine whether the testing samples have a similar distribution to the training samples is a fundamental question to address before we can safely deploy most of the machine learning models into practice. In this paper, we propose TOOD detection, a simple yet effective tree-based out-of-distribution (TOOD) detection mechanism to determine if a set of unseen samples will ha…

    Submitted 5 May, 2024; originally announced May 2024.

  6. arXiv:2403.11211  [pdf]

    cs.CV

    RCdpia: A Renal Carcinoma Digital Pathology Image Annotation dataset based on pathologists

    Authors: Qingrong Sun, Weixiang Zhong, Jie Zhou, Chong Lai, Xiaodong Teng, Maode Lai

    Abstract: The annotation of digital pathological slide data for renal cell carcinoma is of paramount importance for correct diagnosis of artificial intelligence models due to the heterogeneous nature of the tumor. This process not only facilitates a deeper understanding of renal cell cancer heterogeneity but also aims to minimize noise in the data for more accurate studies. To enhance the applicability of t…

    Submitted 17 March, 2024; originally announced March 2024.

    Comments: 8 pages, 3 figures, 1 table

  7. arXiv:2403.00473  [pdf, other]

    cs.GR cs.RO eess.SY

    Computer-Controlled 3D Freeform Surface Weaving

    Authors: Xiangjia Chen, Lip M. Lai, Zishun Liu, Chengkai Dai, Isaac C. W. Leung, Charlie C. L. Wang, Yeung Yam

    Abstract: In this paper, we present a new computer-controlled weaving technology that enables the fabrication of woven structures in the shape of given 3D surfaces by using threads in non-traditional materials with high bending-stiffness, allowing for multiple applications with the resultant woven fabrics. A new weaving machine and a new manufacturing process are developed to realize the function of 3D surf…

    Submitted 8 May, 2024; v1 submitted 1 March, 2024; originally announced March 2024.

  8. arXiv:2402.15391  [pdf, other]

    cs.LG cs.AI cs.CV

    Genie: Generative Interactive Environments

    Authors: Jake Bruce, Michael Dennis, Ashley Edwards, Jack Parker-Holder, Yuge Shi, Edward Hughes, Matthew Lai, Aditi Mavalankar, Richie Steigerwald, Chris Apps, Yusuf Aytar, Sarah Bechtle, Feryal Behbahani, Stephanie Chan, Nicolas Heess, Lucy Gonzalez, Simon Osindero, Sherjil Ozair, Scott Reed, Jingwei Zhang, Konrad Zolna, Jeff Clune, Nando de Freitas, Satinder Singh, Tim Rocktäschel

    Abstract: We introduce Genie, the first generative interactive environment trained in an unsupervised manner from unlabelled Internet videos. The model can be prompted to generate an endless variety of action-controllable virtual worlds described through text, synthetic images, photographs, and even sketches. At 11B parameters, Genie can be considered a foundation world model. It is comprised of a spatiotem…

    Submitted 23 February, 2024; originally announced February 2024.

    Comments: https://sites.google.com/corp/view/genie-2024/

  9. One-Stop Automated Diagnostic System for Carpal Tunnel Syndrome in Ultrasound Images Using Deep Learning

    Authors: Jiayu Peng, Jiajun Zeng, Manlin Lai, Ruobing Huang, Dong Ni, Zhenzhou Li

    Abstract: Objective: Ultrasound (US) examination has unique advantages in diagnosing carpal tunnel syndrome (CTS) while identifying the median nerve (MN) and diagnosing CTS depends heavily on the expertise of examiners. To alleviate this problem, we aimed to develop a one-stop automated CTS diagnosis system (OSA-CTSD) and evaluate its effectiveness as a computer-aided diagnostic tool. Methods: We combined r…

    Submitted 8 February, 2024; originally announced February 2024.

    Comments: Accepted by Ultrasound in Medicine & Biology

    Journal ref: Ultrasound in Medicine & Biology, Volume 50, Issue 2, February 2024, Pages 304-314

  10. arXiv:2309.17403  [pdf, other]

    math.NA cs.LG

    Maximal Volume Matrix Cross Approximation for Image Compression and Least Squares Solution

    Authors: Kenneth Allen, Ming-Jun Lai, Zhaiming Shen

    Abstract: We study the classic cross approximation of matrices based on the maximal volume submatrices. Our main results consist of an improvement of a classic estimate for matrix cross approximation and a greedy approach for finding the maximal volume submatrices. Indeed, we present a new proof of a classic estimate of the inequality with an improved constant. Also, we present a family of greedy maximal vo…

    Submitted 29 September, 2023; originally announced September 2023.

  11. Nucleus-aware Self-supervised Pretraining Using Unpaired Image-to-image Translation for Histopathology Images

    Authors: Zhiyun Song, Penghui Du, Junpeng Yan, Kailu Li, Jianzhong Shou, Maode Lai, Yubo Fan, Yan Xu

    Abstract: Self-supervised pretraining attempts to enhance model performance by obtaining effective features from unlabeled data, and has demonstrated its effectiveness in the field of histopathology images. Despite its success, few works concentrate on the extraction of nucleus-level information, which is essential for pathologic analysis. In this work, we propose a novel nucleus-aware self-supervised pretr…

    Submitted 13 September, 2023; originally announced September 2023.

  12. arXiv:2308.09175  [pdf, other]

    cs.AI cs.LG

    Diversifying AI: Towards Creative Chess with AlphaZero

    Authors: Tom Zahavy, Vivek Veeriah, Shaobo Hou, Kevin Waugh, Matthew Lai, Edouard Leurent, Nenad Tomasev, Lisa Schut, Demis Hassabis, Satinder Singh

    Abstract: In recent years, Artificial Intelligence (AI) systems have surpassed human intelligence in a variety of computational tasks. However, AI systems, like humans, make mistakes, have blind spots, hallucinate, and struggle to generalize to new situations. This work explores whether AI can benefit from creative decision-making mechanisms when pushed to the limits of its computational rationality. In par…

    Submitted 29 August, 2023; v1 submitted 17 August, 2023; originally announced August 2023.

  13. arXiv:2308.06709  [pdf, other]

    math.OC cs.LG

    The Hard-Constraint PINNs for Interface Optimal Control Problems

    Authors: Ming-Chih Lai, Yongcun Song, Xiaoming Yuan, Hangrui Yue, Tianyou Zeng

    Abstract: We show that the physics-informed neural networks (PINNs), in combination with some recently developed discontinuity capturing neural networks, can be applied to solve optimal control problems subject to partial differential equations (PDEs) with interfaces and some control constraints. The resulting algorithm is mesh-free and scalable to different PDEs, and it ensures the control constraints rigo…

    Submitted 13 August, 2023; originally announced August 2023.

  14. arXiv:2306.17659  [pdf, other]

    cs.CV

    Zero-shot Nuclei Detection via Visual-Language Pre-trained Models

    Authors: Yongjian Wu, Yang Zhou, Jiya Saiyin, Bingzheng Wei, Maode Lai, Jianzhong Shou, Yubo Fan, Yan Xu

    Abstract: Large-scale visual-language pre-trained models (VLPM) have proven their excellent performance in downstream object detection for natural scenes. However, zero-shot nuclei detection on H&E images via VLPMs remains underexplored. The large gap between medical images and the web-originated text-image pairs used for pre-training makes it a challenging task. In this paper, we attempt to explore the po…

    Submitted 30 June, 2023; originally announced June 2023.

    Comments: This article has been accepted by MICCAI 2023, but has not been fully edited. Content may change prior to final publication

  15. arXiv:2306.05537  [pdf, other]

    cs.CL

    AaKOS: Aspect-adaptive Knowledge-based Opinion Summarization

    Authors: Guan Wang, Weihua Li, Edmund M-K. Lai, Quan Bai

    Abstract: The rapid growth of information on the Internet has led to an overwhelming amount of opinions and comments on various activities, products, and services. This makes it difficult and time-consuming for users to process all the available information when making decisions. Text summarization, a Natural Language Processing (NLP) task, has been widely explored to help users quickly retrieve relevant in…

    Submitted 25 May, 2023; originally announced June 2023.

    Comments: 21 pages, 4 figures, 7 tables

  16. Cyclic Learning: Bridging Image-level Labels and Nuclei Instance Segmentation

    Authors: Yang Zhou, Yongjian Wu, Zihua Wang, Bingzheng Wei, Maode Lai, Jianzhong Shou, Yubo Fan, Yan Xu

    Abstract: Nuclei instance segmentation on histopathology images is of great clinical value for disease analysis. Generally, fully-supervised algorithms for this task require pixel-wise manual annotations, which is especially time-consuming and laborious for the high nuclei density. To alleviate the annotation burden, we seek to solve the problem through image-level weakly supervised learning, which is under…

    Submitted 5 June, 2023; originally announced June 2023.

    Comments: This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI https://doi.org/10.1109/TMI.2023.3275609, IEEE Transactions on Medical Imaging. Code: https://github.com/wuyongjianCODE/Cyclic

  17. arXiv:2304.14339  [pdf, other]

    cs.CL cs.AI cs.LG cs.NE

    MarsEclipse at SemEval-2023 Task 3: Multi-Lingual and Multi-Label Framing Detection with Contrastive Learning

    Authors: Qisheng Liao, Meiting Lai, Preslav Nakov

    Abstract: This paper describes our system for SemEval-2023 Task 3 Subtask 2 on Framing Detection. We used a multi-label contrastive loss for fine-tuning large pre-trained language models in a multi-lingual setting, achieving very competitive results: our system was ranked first on the official test set and on the official shared task leaderboard for five of the six languages for which we had training data a…

    Submitted 20 April, 2023; originally announced April 2023.

    Comments: framing, contrastive learning, SemEval-2023 task 3

    MSC Class: 68T50 ACM Class: F.2.2; I.2.7

    Journal ref: SemEval-2023

  18. arXiv:2301.09175  [pdf, other]

    cs.CL

    Ensemble Transfer Learning for Multilingual Coreference Resolution

    Authors: Tuan Manh Lai, Heng Ji

    Abstract: Entity coreference resolution is an important research problem with many applications, including information extraction and question answering. Coreference resolution for English has been studied extensively. However, there is relatively little work for other languages. A problem that frequently occurs when working with a non-English language is the scarcity of annotated training data. To overcome…

    Submitted 22 January, 2023; originally announced January 2023.

  19. arXiv:2211.11114  [pdf, other]

    cs.LG math.NA

    Semi-supervised Local Cluster Extraction by Compressive Sensing

    Authors: Zhaiming Shen, Ming-Jun Lai, Sheng Li

    Abstract: Local clustering problem aims at extracting a small local structure inside a graph without the necessity of knowing the entire graph structure. As the local structure is usually small in size compared to the entire graph, one can think of it as a compressive sensing problem where the indices of target cluster can be thought as a sparse solution to a linear system. In this paper, we propose a new s…

    Submitted 20 November, 2022; originally announced November 2022.

  20. A cusp-capturing PINN for elliptic interface problems

    Authors: Yu-Hau Tseng, Te-Sheng Lin, Wei-Fan Hu, Ming-Chih Lai

    Abstract: In this paper, we propose a cusp-capturing physics-informed neural network (PINN) to solve discontinuous-coefficient elliptic interface problems whose solution is continuous but has discontinuous first derivatives on the interface. To find such a solution using neural network representation, we introduce a cusp-enforced level set function as an additional feature input to the network to retain the…

    Submitted 16 April, 2023; v1 submitted 15 October, 2022; originally announced October 2022.

  21. An efficient neural-network and finite-difference hybrid method for elliptic interface problems with applications

    Authors: Wei-Fan Hu, Te-Sheng Lin, Yu-Hau Tseng, Ming-Chih Lai

    Abstract: A new and efficient neural-network and finite-difference hybrid method is developed for solving Poisson equation in a regular domain with jump discontinuities on embedded irregular interfaces. Since the solution has low regularity across the interface, when applying finite difference discretization to this problem, an additional treatment accounting for the jump discontinuities must be employed. H…

    Submitted 2 March, 2023; v1 submitted 11 October, 2022; originally announced October 2022.

    Journal ref: Commun. Comput. Phys., Vol. 33, pp.1090-1105 (2023)

  22. arXiv:2207.10652  [pdf, other]

    cs.CL

    O-Dang! The Ontology of Dangerous Speech Messages

    Authors: Marco A. Stranisci, Simona Frenda, Mirko Lai, Oscar Araque, Alessandra T. Cignarella, Valerio Basile, Viviana Patti, Cristina Bosco

    Abstract: Inside the NLP community there is a considerable amount of language resources created, annotated and released every day with the aim of studying specific linguistic phenomena. Although a variety of attempts to organize such resources have been carried out, a lack of systematic methods and of possible interoperability between resources is still present. Furthermore, when storing linguistic i…

    Submitted 13 July, 2022; originally announced July 2022.

  23. arXiv:2206.10945  [pdf, ps, other]

    cs.NI

    Improve Sensing and Communication Performance of UAV via Integrated Sensing and Communication

    Authors: Wangjun Jiang, Ailing Wang, Zhiqing Wei, Meichen Lai, Zhiyong Feng, Jianjun Liu

    Abstract: The unmanned aerial vehicle (UAV) needs to sense the environment to ensure safe flight, and the sensing accuracy and communication delay performance are two important indicators of safe flight. The strategy of using integrated sensing and communication (ISAC) technology to improve the sensing and communication performance is proposed in this paper. On the one hand, the extended Kalman filter (EKF)…

    Submitted 22 June, 2022; originally announced June 2022.

  24. arXiv:2205.08878  [pdf, other]

    cs.CV

    Transformer based multiple instance learning for weakly supervised histopathology image segmentation

    Authors: Ziniu Qian, Kailu Li, Maode Lai, Eric I-Chao Chang, Bingzheng Wei, Yubo Fan, Yan Xu

    Abstract: Histopathological image segmentation algorithms play a critical role in computer aided diagnosis technology. The development of weakly supervised segmentation algorithm alleviates the problem of medical image annotation that it is time-consuming and labor-intensive. As a subset of weakly supervised learning, Multiple Instance Learning (MIL) has been proven to be effective in segmentation. However, t…

    Submitted 18 May, 2022; originally announced May 2022.

    Comments: Provisionally accepted for MICCAI 2022

  25. arXiv:2203.03546  [pdf, other]

    cs.CL

    LMN at SemEval-2022 Task 11: A Transformer-based System for English Named Entity Recognition

    Authors: Ngoc Minh Lai

    Abstract: Processing complex and ambiguous named entities is a challenging research problem, but it has not received sufficient attention from the natural language processing community. In this short paper, we present our participation in the English track of SemEval-2022 Task 11: Multilingual Complex Named Entity Recognition. Inspired by the recent advances in pretrained Transformer language models, we pro…

    Submitted 13 February, 2022; originally announced March 2022.

    Comments: SemEval 2022 (co-located with NAACL)

  26. arXiv:2203.01581  [pdf, other]

    math.NA cs.LG

    A shallow physics-informed neural network for solving partial differential equations on surfaces

    Authors: Wei-Fan Hu, Yi-Jun Shih, Te-Sheng Lin, Ming-Chih Lai

    Abstract: In this paper, we introduce a shallow (one-hidden-layer) physics-informed neural network for solving partial differential equations on static and evolving surfaces. For the static surface case, with the aid of level set function, the surface normal and mean curvature used in the surface differential expressions can be computed easily. So instead of imposing the normal extension constraints used in…

    Submitted 20 January, 2023; v1 submitted 3 March, 2022; originally announced March 2022.

  27. arXiv:2202.13404  [pdf, other]

    cs.CL

    Improving Candidate Retrieval with Entity Profile Generation for Wikidata Entity Linking

    Authors: Tuan Manh Lai, Heng Ji, ChengXiang Zhai

    Abstract: Entity linking (EL) is the task of linking entity mentions in a document to referent entities in a knowledge base (KB). Many previous studies focus on Wikipedia-derived KBs. There is little work on EL over Wikidata, even though it is the most extensive crowdsourced KB. The scale of Wikidata can open up many new real-world applications, but its massive number of entities also makes EL challenging.…

    Submitted 14 March, 2022; v1 submitted 27 February, 2022; originally announced February 2022.

    Comments: ACL 2022 (Findings)

  28. arXiv:2202.02904  [pdf, other]

    cs.LG math.NA

    A Compressed Sensing Based Least Squares Approach to Semi-supervised Local Cluster Extraction

    Authors: Ming-Jun Lai, Zhaiming Shen

    Abstract: A least squares semi-supervised local clustering algorithm based on the idea of compressed sensing is proposed to extract clusters from a graph with known adjacency matrix. The algorithm is based on a two-stage approach similar to the one in \cite{LaiMckenzie2020}. However, under a weaker assumption and with less computational complexity than the one in \cite{LaiMckenzie2020}, the algorithm is sho…

    Submitted 31 October, 2022; v1 submitted 6 February, 2022; originally announced February 2022.

  29. AGMI: Attention-Guided Multi-omics Integration for Drug Response Prediction with Graph Neural Networks

    Authors: Ruiwei Feng, Yufeng Xie, Minshan Lai, Danny Z. Chen, Ji Cao, Jian Wu

    Abstract: Accurate drug response prediction (DRP) is a crucial yet challenging task in precision medicine. This paper presents a novel Attention-Guided Multi-omics Integration (AGMI) approach for DRP, which first constructs a Multi-edge Graph (MeG) for each cell line, and then aggregates multi-omics features to predict drug response using a novel structure, called Graph edge-aware Network (GeNet). For the f…

    Submitted 9 January, 2022; v1 submitted 15 December, 2021; originally announced December 2021.

  30. arXiv:2109.12832  [pdf, ps, other]

    cs.RO

    Anti-collision Technologies for Unmanned Aerial Vehicles: Recent Advances and Future Trends

    Authors: Zhiqing Wei, Zeyang Meng, Meichen Lai, Huici Wu, Jiarong Han, Zhiyong Feng

    Abstract: Unmanned aerial vehicles (UAVs) are widely applied in civil applications, such as disaster relief, agriculture and cargo transportation, etc. With the massive number of UAV flight activities, the anti-collision technologies aiming to avoid the collisions between UAVs and other objects have attracted much attention. The anti-collision technologies are of vital importance to guarantee the survivabil…

    Submitted 1 March, 2022; v1 submitted 27 September, 2021; originally announced September 2021.

    Comments: 32 pages, 7 figures and 9 tables

    MSC Class: 93-02 ACM Class: A.1

  31. arXiv:2108.09889  [pdf, other]

    cs.CL

    A Unified Transformer-based Framework for Duplex Text Normalization

    Authors: Tuan Manh Lai, Yang Zhang, Evelina Bakhturina, Boris Ginsburg, Heng Ji

    Abstract: Text normalization (TN) and inverse text normalization (ITN) are essential preprocessing and postprocessing steps for text-to-speech synthesis and automatic speech recognition, respectively. Many methods have been proposed for either TN or ITN, ranging from weighted finite-state transducers to neural networks. Despite their impressive performance, these methods aim to tackle only one of the two ta…

    Submitted 22 August, 2021; originally announced August 2021.

    Comments: Under Review

  32. Machine learning for modeling the progression of Alzheimer disease dementia using clinical data: a systematic literature review

    Authors: Sayantan Kumar, Inez Oh, Suzanne Schindler, Albert M Lai, Philip R O Payne, Aditi Gupta

    Abstract: Objective Alzheimer disease (AD) is the most common cause of dementia, a syndrome characterized by cognitive impairment severe enough to interfere with activities of daily life. We aimed to conduct a systematic literature review (SLR) of studies that applied machine learning (ML) methods to clinical data derived from electronic health records in order to model risk for progression of AD dementia.…

    Submitted 5 August, 2021; originally announced August 2021.

    Comments: 10 pages, 4 figures, 3 tables

    Journal ref: JAMIA Open, Volume 4, Issue 3, July 2021, ooab052

  33. A Shallow Ritz Method for Elliptic Problems with Singular Sources

    Authors: Ming-Chih Lai, Che-Chia Chang, Wei-Syuan Lin, Wei-Fan Hu, Te-Sheng Lin

    Abstract: In this paper, a shallow Ritz-type neural network for solving elliptic equations with delta function singular sources on an interface is developed. There are three novel features in the present work; namely, (i) the delta function singularity is naturally removed, (ii) level set function is introduced as a feature input, (iii) it is completely shallow, comprising only one hidden layer. We first in…

    Submitted 1 July, 2022; v1 submitted 26 July, 2021; originally announced July 2021.

    Journal ref: J. Comput. Phys., Vol.469 (2022) 111547

  34. arXiv:2107.01700  [pdf, other]

    cs.CL

    End-to-end Neural Coreference Resolution Revisited: A Simple yet Effective Baseline

    Authors: Tuan Manh Lai, Trung Bui, Doo Soon Kim

    Abstract: Since the first end-to-end neural coreference resolution model was introduced, many extensions to the model have been proposed, ranging from using higher-order inference to directly optimizing evaluation metrics using reinforcement learning. Despite improving the coreference resolution performance by a large margin, these extensions add substantial extra complexity to the original model. Motivated…

    Submitted 8 February, 2022; v1 submitted 4 July, 2021; originally announced July 2021.

    Comments: Accepted by ICASSP 2022

  35. A Discontinuity Capturing Shallow Neural Network for Elliptic Interface Problems

    Authors: Wei-Fan Hu, Te-Sheng Lin, Ming-Chih Lai

    Abstract: In this paper, a new Discontinuity Capturing Shallow Neural Network (DCSNN) for approximating $d$-dimensional piecewise continuous functions and for solving elliptic interface problems is developed. There are three novel features in the present network; namely, (i) jump discontinuities are accurately captured, (ii) it is completely shallow, comprising only one hidden layer, (iii) it is completely…

    Submitted 30 August, 2022; v1 submitted 10 June, 2021; originally announced June 2021.

    Journal ref: J. Comput. Phys., Vol. 469 (2022) 111576

  36. arXiv:2010.11980  [pdf, other]

    cs.CL cs.LG

    A Joint Learning Approach based on Self-Distillation for Keyphrase Extraction from Scientific Documents

    Authors: Tuan Manh Lai, Trung Bui, Doo Soon Kim, Quan Hung Tran

    Abstract: Keyphrase extraction is the task of extracting a small set of phrases that best describe a document. Most existing benchmark datasets for the task typically have limited numbers of annotated documents, making it challenging to train increasingly complex neural networks. In contrast, digital libraries store millions of scientific articles online, covering a wide range of topics. While a significant…

    Submitted 22 October, 2020; originally announced October 2020.

    Comments: Accepted to COLING 2020

  37. #Brexit: Leave or Remain? The Role of User's Community and Diachronic Evolution on Stance Detection

    Authors: Mirko Lai, Viviana Patti, Giancarlo Ruffo, Paolo Rosso

    Abstract: Interest has grown around the classification of stance that users assume within online debates in recent years. Stance has been usually addressed by considering users' posts in isolation, while social studies highlight that social communities may contribute to influence users' opinion. Furthermore, stance should be studied in a diachronic perspective, since it could help to shed light on users' opi…

    Submitted 29 July, 2020; originally announced July 2020.

    Comments: To appear in Journal of Intelligent & Fuzzy Systems

  38. arXiv:2007.09534  [pdf, other]

    math.NA cs.IT

    A Quasi-Orthogonal Matching Pursuit Algorithm for Compressive Sensing

    Authors: Ming-Jun Lai, Zhaiming Shen

    Abstract: In this paper, we propose a new orthogonal matching pursuit algorithm called quasi-OMP algorithm which greatly enhances the performance of classical orthogonal matching pursuit (OMP) algorithm, at some cost of computational complexity. We are able to show that under some sufficient conditions of mutual coherence of the sensing matrix, the QOMP Algorithm succeeds in recovering the s-sparse signal v…

    Submitted 18 July, 2020; originally announced July 2020.

  39. arXiv:2007.07161  [pdf, ps, other]

    cs.DS

    Graph Sparsification by Universal Greedy Algorithms

    Authors: Ming-Jun Lai, Jiaxin Xie, Zhiqiang Xu

    Abstract: Graph sparsification is to approximate an arbitrary graph by a sparse graph and is useful in many applications, such as simplification of social networks, least squares problems, numerical solution of symmetric positive definite linear systems, etc. In this paper, inspired by the well-known sparse signal recovery algorithm called orthogonal matching pursuit (OMP), we introduce a deterministic,…

    Submitted 21 February, 2021; v1 submitted 14 July, 2020; originally announced July 2020.

  40. arXiv:2007.03805  [pdf, other]

    cs.CL cs.AI cs.IR

    ISA: An Intelligent Shopping Assistant

    Authors: Tuan Manh Lai, Trung Bui, Nedim Lipka

    Abstract: Despite the growth of e-commerce, brick-and-mortar stores are still the preferred destinations for many people. In this paper, we present ISA, a mobile-based intelligent shopping assistant that is designed to improve shopping experience in physical stores. ISA assists users by leveraging advanced techniques in computer vision, speech processing, and natural language processing. An in-store user on…

    Submitted 23 September, 2020; v1 submitted 7 July, 2020; originally announced July 2020.

    Comments: Accepted by AACL 2020 (Demo)

  41. arXiv:1910.12995  [pdf, other]

    cs.CL cs.LG

    A Simple but Effective BERT Model for Dialog State Tracking on Resource-Limited Systems

    Authors: Tuan Manh Lai, Quan Hung Tran, Trung Bui, Daisuke Kihara

    Abstract: In a task-oriented dialog system, the goal of dialog state tracking (DST) is to monitor the state of the conversation from the dialog history. Recently, many deep learning based methods have been proposed for the task. Despite their impressive performance, current neural architectures for DST are typically heavily-engineered and conceptually complex, making it difficult to implement, debug, and ma…

    Submitted 8 February, 2020; v1 submitted 28 October, 2019; originally announced October 2019.

    Comments: Accepted to ICASSP 2020

  42. arXiv:1908.09453  [pdf, other]

    cs.LG cs.AI cs.GT cs.MA

    OpenSpiel: A Framework for Reinforcement Learning in Games

    Authors: Marc Lanctot, Edward Lockhart, Jean-Baptiste Lespiau, Vinicius Zambaldi, Satyaki Upadhyay, Julien Pérolat, Sriram Srinivasan, Finbarr Timbers, Karl Tuyls, Shayegan Omidshafiei, Daniel Hennes, Dustin Morrill, Paul Muller, Timo Ewalds, Ryan Faulkner, János Kramár, Bart De Vylder, Brennan Saeta, James Bradbury, David Ding, Sebastian Borgeaud, Matthew Lai, Julian Schrittwieser, Thomas Anthony, Edward Hughes, et al. (2 additional authors not shown)

    Abstract: OpenSpiel is a collection of environments and algorithms for research in general reinforcement learning and search/planning in games. OpenSpiel supports n-player (single- and multi- agent) zero-sum, cooperative and general-sum, one-shot and sequential, strictly turn-taking and simultaneous-move, perfect and imperfect information games, as well as traditional multiagent environments such as (partia…

    Submitted 26 September, 2020; v1 submitted 25 August, 2019; originally announced August 2019.

  43. Supervised Transfer Learning for Product Information Question Answering

    Authors: Tuan Manh Lai, Trung Bui, Nedim Lipka, Sheng Li

    Abstract: Popular e-commerce websites such as Amazon offer community question answering systems where users pose product-related questions and experienced customers may voluntarily provide answers. In this paper, we show that the large volume of existing community question answering data can be beneficial when building a system for answering questions related to product facts and specifications. Our experi…

    Submitted 8 January, 2019; originally announced January 2019.

    Comments: 2018 17th IEEE International Conference on Machine Learning and Applications

  44. arXiv:1810.09061  [pdf, ps, other]

    cs.IT

    On DC based Methods for Phase Retrieval

    Authors: Meng Huang, Ming-Jun Lai, Abraham Varghese, Zhiqiang Xu

    Abstract: In this paper, we develop a new computational approach which is based on minimizing the difference of two convex functionals (DC) to solve a broader class of phase retrieval problems. The approach splits a standard nonlinear least squares minimizing function associated with the phase retrieval problem into the difference of two convex functions and then solves a sequence of convex minimization sub…

    Submitted 21 October, 2018; originally announced October 2018.

    Comments: 28 pages

  45. arXiv:1808.05780  [pdf, other]

    cs.IT cs.SI math.NA

    Compressive Sensing for cut improvement and local clustering

    Authors: Ming-Jun Lai, Daniel Mckenzie

    Abstract: We show how one can phrase the cut improvement problem for graphs as a sparse recovery problem, whence one can use algorithms originally developed for use in compressive sensing (such as SubspacePursuit or CoSaMP) to solve it. We show that this approach to cut improvement is fast, both in theory and in practice, and moreover enjoys statistical guarantees of success when applied to graphs drawn from pr…

    Submitted 25 February, 2020; v1 submitted 17 August, 2018; originally announced August 2018.

    Comments: 25 pages. Generalizes and improves upon the earlier versions arXiv:1808.05780 and arXiv:1708.09477. To appear in SIMODS

    MSC Class: 68Q25; 68R10; 68U05; 94A12

  46. arXiv:1807.03399  [pdf, other]

    cs.CL cs.AI

    Jointly Embedding Entities and Text with Distant Supervision

    Authors: Denis Newman-Griffis, Albert M. Lai, Eric Fosler-Lussier

    Abstract: Learning representations for knowledge base entities and concepts is becoming increasingly important for NLP applications. However, recent entity embedding methods have relied on structured resources that are expensive to create for new domains and corpora. We present a distantly-supervised method for jointly learning embeddings of entities and text from an unannotated corpus, using only a list of…

    Submitted 9 July, 2018; originally announced July 2018.

    Comments: 12 pages; Accepted to 3rd Workshop on Representation Learning for NLP (Repl4NLP 2018). Code at https://github.com/OSU-slatelab/JET

  47. arXiv:1712.01815  [pdf, other]

    cs.AI cs.LG

    Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm

    Authors: David Silver, Thomas Hubert, Julian Schrittwieser, Ioannis Antonoglou, Matthew Lai, Arthur Guez, Marc Lanctot, Laurent Sifre, Dharshan Kumaran, Thore Graepel, Timothy Lillicrap, Karen Simonyan, Demis Hassabis

    Abstract: The game of chess is the most widely studied domain in the history of artificial intelligence. The strongest programs are based on a combination of sophisticated search techniques, domain-specific adaptations, and handcrafted evaluation functions that have been refined by human experts over several decades. In contrast, the AlphaGo Zero program recently achieved superhuman performance in the game…

    Submitted 5 December, 2017; originally announced December 2017.

  48. Unsupervised Learning for Cell-level Visual Representation in Histopathology Images with Generative Adversarial Networks

    Authors: Bo Hu, Ye Tang, Eric I-Chao Chang, Yubo Fan, Maode Lai, Yan Xu

    Abstract: The visual attributes of cells, such as the nuclear morphology and chromatin openness, are critical for histopathology image analysis. By learning cell-level visual representation, we can obtain a rich mix of features that are highly reusable for various tasks, such as cell-level classification, nuclei segmentation, and cell counting. In this paper, we propose a unified generative adversarial netw…

    Submitted 7 July, 2018; v1 submitted 30 November, 2017; originally announced November 2017.

    Comments: Accepted for publication in IEEE Journal of Biomedical and Health Informatics

  49. arXiv:1709.04319  [pdf, ps, other]

    cs.NE eess.SY

    Enhanced Particle Swarm Optimization Algorithms for Multiple-Input Multiple-Output System Modelling using Convolved Gaussian Process Models

    Authors: Gang Cao, Edmund M-K Lai, Fakhrul Alam

    Abstract: A Convolved Gaussian Process (CGP) can capture correlations not only between inputs and outputs but also among the outputs. This allows CGP to outperform the standard Gaussian Process (GP) in modelling Multiple-Input Multiple-Output (MIMO) systems when observations are missing for some of the outputs. Similar to standard GP, a key issue of CGP is the learning of hype…

    Submitted 12 July, 2017; originally announced September 2017.

  50. arXiv:1708.09477  [pdf, other]

    cs.IT cs.LG stat.ML

    A Compressive Sensing Approach to Community Detection with Applications

    Authors: Ming-Jun Lai, Daniel Mckenzie

    Abstract: The community detection problem for graphs asks one to partition the n vertices V of a graph G into k communities, or clusters, such that there are many intracluster edges and few intercluster edges. Of course this is equivalent to finding a permutation matrix P such that, if A denotes the adjacency matrix of G, then PAP^T is approximately block diagonal. As there are k^n possible partitions of n…

    Submitted 20 August, 2018; v1 submitted 30 August, 2017; originally announced August 2017.

    Comments: 39 pages, 10 figures. Version 2: disabled the 'showkeys' package. Note that there is an error in the proof of Lemma 5.1. A correct version of this lemma, as well as a greatly improved version of the central algorithm of this paper, is available at arXiv:1808.05780