Showing 1–38 of 38 results for author: Lai, P

  1. arXiv:2406.02502  [pdf, ps, other]

    math.ST cs.DS math.NA math.PR

    Singular Subspace Perturbation Bounds via Rectangular Random Matrix Diffusions

    Authors: Peiyao Lai, Oren Mangoubi

    Abstract: Given a matrix $A \in \mathbb{R}^{m\times d}$ with singular values $\sigma_1\geq \cdots \geq \sigma_d$, and a random matrix $G \in \mathbb{R}^{m\times d}$ with iid $N(0,T)$ entries for some $T>0$, we derive new bounds on the Frobenius distance between subspaces spanned by the top-$k$ (right) singular vectors of $A$ and $A+G$. This problem arises in numerous applications in statistics where a data matrix may…

    Submitted 4 June, 2024; originally announced June 2024.

  2. arXiv:2405.16205  [pdf]

    cs.AI cs.CL

    GeneAgent: Self-verification Language Agent for Gene Set Knowledge Discovery using Domain Databases

    Authors: Zhizheng Wang, Qiao Jin, Chih-Hsuan Wei, Shubo Tian, Po-Ting Lai, Qingqing Zhu, Chi-Ping Day, Christina Ross, Zhiyong Lu

    Abstract: Gene set knowledge discovery is essential for advancing human functional genomics. Recent studies have shown promising performance by harnessing the power of Large Language Models (LLMs) on this task. Nonetheless, their results are subject to several limitations common in LLMs such as hallucinations. In response, we present GeneAgent, a first-of-its-kind language agent featuring self-verification…

    Submitted 25 May, 2024; originally announced May 2024.

    Comments: 30 pages with 10 figures and/or tables

  3. arXiv:2404.14209  [pdf]

    cs.CL

    EnzChemRED, a rich enzyme chemistry relation extraction dataset

    Authors: Po-Ting Lai, Elisabeth Coudert, Lucila Aimo, Kristian Axelsen, Lionel Breuza, Edouard de Castro, Marc Feuermann, Anne Morgat, Lucille Pourcel, Ivo Pedruzzi, Sylvain Poux, Nicole Redaschi, Catherine Rivoire, Anastasia Sveshnikova, Chih-Hsuan Wei, Robert Leaman, Ling Luo, Zhiyong Lu, Alan Bridge

    Abstract: Expert curation is essential to capture knowledge of enzyme functions from the scientific literature in FAIR open knowledgebases but cannot keep pace with the rate of new discoveries and new publications. In this work we present EnzChemRED, for Enzyme Chemistry Relation Extraction Dataset, a new training and benchmarking dataset to support the development of Natural Language Processing (NLP) metho…

    Submitted 22 April, 2024; originally announced April 2024.

  4. arXiv:2402.05067  [pdf, other]

    physics.flu-dyn cs.LG physics.comp-ph

    A Novel Paradigm in Solving Multiscale Problems

    Authors: Jing Wang, Zheng Li, Pengyu Lai, Rui Wang, Di Yang, Dewu Yang, Hui Xu, Wen-Quan Tao

    Abstract: Multiscale phenomena manifest across various scientific domains, presenting a ubiquitous challenge in accurately and effectively simulating multiscale dynamics in complex systems. In this paper, a novel decoupling solving paradigm is proposed through modelling large-scale dynamics independently and treating small-scale dynamics as a slaved system. A Spectral Physics-informed Neural Network (PINN)…

    Submitted 30 April, 2024; v1 submitted 7 February, 2024; originally announced February 2024.

  5. arXiv:2401.12666  [pdf, other]

    cs.AI

    EL-VIT: Probing Vision Transformer with Interactive Visualization

    Authors: Hong Zhou, Rui Zhang, Peifeng Lai, Chaoran Guo, Yong Wang, Zhida Sun, Junjie Li

    Abstract: Nowadays, Vision Transformer (ViT) is widely utilized in various computer vision tasks, owing to its unique self-attention mechanism. However, the model architecture of ViT is complex and often challenging to comprehend, leading to a steep learning curve. ViT developers and users frequently encounter difficulties in interpreting its inner workings. Therefore, a visualization system is needed to as…

    Submitted 23 January, 2024; originally announced January 2024.

    Comments: 10 pages, 7 figures, conference

  6. arXiv:2401.11048  [pdf]

    cs.CL q-bio.QM

    PubTator 3.0: an AI-powered Literature Resource for Unlocking Biomedical Knowledge

    Authors: Chih-Hsuan Wei, Alexis Allot, Po-Ting Lai, Robert Leaman, Shubo Tian, Ling Luo, Qiao Jin, Zhizheng Wang, Qingyu Chen, Zhiyong Lu

    Abstract: PubTator 3.0 (https://www.ncbi.nlm.nih.gov/research/pubtator3/) is a biomedical literature resource using state-of-the-art AI techniques to offer semantic and relation searches for key concepts like proteins, genetic variants, diseases, and chemicals. It currently provides over one billion entity and relation annotations across approximately 36 million PubMed abstracts and 6 million full-text arti…

    Submitted 19 January, 2024; originally announced January 2024.

  7. arXiv:2306.11189  [pdf]

    cs.CL

    BioREx: Improving Biomedical Relation Extraction by Leveraging Heterogeneous Datasets

    Authors: Po-Ting Lai, Chih-Hsuan Wei, Ling Luo, Qingyu Chen, Zhiyong Lu

    Abstract: Biomedical relation extraction (RE) is the task of automatically identifying and characterizing relations between biomedical concepts from free text. RE is a central task in biomedical natural language processing (NLP) research and plays a critical role in many downstream applications, such as literature-based discovery and knowledge graph construction. State-of-the-art methods were used primarily…

    Submitted 19 June, 2023; originally announced June 2023.

  8. arXiv:2306.10070  [pdf]

    cs.CY cs.AI cs.CL q-bio.QM

    Opportunities and Challenges for ChatGPT and Large Language Models in Biomedicine and Health

    Authors: Shubo Tian, Qiao Jin, Lana Yeganova, Po-Ting Lai, Qingqing Zhu, Xiuying Chen, Yifan Yang, Qingyu Chen, Won Kim, Donald C. Comeau, Rezarta Islamaj, Aadit Kapoor, Xin Gao, Zhiyong Lu

    Abstract: ChatGPT has drawn considerable attention from both the general public and domain experts with its remarkable text generation capabilities. This has subsequently led to the emergence of diverse applications in the field of biomedicine and health. In this work, we examine the diverse applications of large language models (LLMs), such as ChatGPT, in biomedicine and health. Specifically we explore the…

    Submitted 16 October, 2023; v1 submitted 15 June, 2023; originally announced June 2023.

  9. arXiv:2306.07579  [pdf, other]

    cs.CV

    Parametric Implicit Face Representation for Audio-Driven Facial Reenactment

    Authors: Ricong Huang, Peiwen Lai, Yipeng Qin, Guanbin Li

    Abstract: Audio-driven facial reenactment is a crucial technique that has a range of applications in film-making, virtual avatars and video conferences. Existing works either employ explicit intermediate face representations (e.g., 2D facial landmarks or 3D face models) or implicit ones (e.g., Neural Radiance Fields), thus suffering from the trade-offs between interpretability and expressive power, hence be…

    Submitted 13 June, 2023; originally announced June 2023.

    Comments: CVPR 2023

  10. arXiv:2304.11674  [pdf, ps, other]

    cs.CV cs.LG eess.SP

    A Lightweight Recurrent Learning Network for Sustainable Compressed Sensing

    Authors: Yu Zhou, Yu Chen, Xiao Zhang, Pan Lai, Lei Huang, Jianmin Jiang

    Abstract: Recently, deep learning-based compressed sensing (CS) has achieved great success in reducing the sampling and computational cost of sensing systems and improving the reconstruction quality. These approaches, however, largely overlook the issue of the computational cost; they rely on complex structures and task-specific operator designs, resulting in extensive storage and high energy consumption in…

    Submitted 23 April, 2023; originally announced April 2023.

    Comments: has been accepted to IEEE TETCI

  11. arXiv:2302.12685  [pdf, other]

    cs.LG cs.AI cs.CR

    Active Membership Inference Attack under Local Differential Privacy in Federated Learning

    Authors: Truc Nguyen, Phung Lai, Khang Tran, NhatHai Phan, My T. Thai

    Abstract: Federated learning (FL) was originally regarded as a framework for collaborative learning among clients with data privacy protection through a coordinating server. In this paper, we propose a new active membership inference (AMI) attack carried out by a dishonest server in FL. In AMI attacks, the server crafts and embeds malicious parameters into global models to effectively infer whether a target…

    Submitted 24 July, 2023; v1 submitted 24 February, 2023; originally announced February 2023.

    Comments: Published at AISTATS 2023

    Journal ref: Proceedings of The 26th International Conference on Artificial Intelligence and Statistics, PMLR 206:5714-5730, 2023

  12. arXiv:2301.04904  [pdf, other]

    eess.IV cs.CV

    Lesion-aware Dynamic Kernel for Polyp Segmentation

    Authors: Ruifei Zhang, Peiwen Lai, Xiang Wan, De-Jun Fan, Feng Gao, Xiao-Jian Wu, Guanbin Li

    Abstract: Automatic and accurate polyp segmentation plays an essential role in early colorectal cancer diagnosis. However, it has always been a challenging task due to 1) the diverse shape, size, brightness and other appearance characteristics of polyps, 2) the tiny contrast between concealed polyps and their surrounding regions. To address these problems, we propose a lesion-aware dynamic network (LDNet) f…

    Submitted 12 January, 2023; originally announced January 2023.

    Comments: Accepted by MICCAI2022

  13. arXiv:2212.04454  [pdf, other]

    cs.LG cs.CR

    XRand: Differentially Private Defense against Explanation-Guided Attacks

    Authors: Truc Nguyen, Phung Lai, NhatHai Phan, My T. Thai

    Abstract: Recent development in the field of explainable artificial intelligence (XAI) has helped improve trust in Machine-Learning-as-a-Service (MLaaS) systems, in which an explanation is provided together with the model prediction in response to each query. However, XAI also opens a door for adversaries to gain insights into the black-box models in MLaaS, thereby making the models more vulnerable to sever…

    Submitted 14 December, 2022; v1 submitted 8 December, 2022; originally announced December 2022.

    Comments: To be published at AAAI 2023

  14. AIONER: All-in-one scheme-based biomedical named entity recognition using deep learning

    Authors: Ling Luo, Chih-Hsuan Wei, Po-Ting Lai, Robert Leaman, Qingyu Chen, Zhiyong Lu

    Abstract: Biomedical named entity recognition (BioNER) seeks to automatically recognize biomedical entities in natural language text, serving as a necessary foundation for downstream text mining tasks and applications such as information extraction and question answering. Manually labeling training data for the BioNER task is costly, however, due to the significant domain expertise required for accurate ann…

    Submitted 15 May, 2023; v1 submitted 30 November, 2022; originally announced November 2022.

    Comments: Accepted by Bioinformatics

  15. arXiv:2211.05766  [pdf, other]

    cs.LG cs.CR

    Heterogeneous Randomized Response for Differential Privacy in Graph Neural Networks

    Authors: Khang Tran, Phung Lai, NhatHai Phan, Issa Khalil, Yao Ma, Abdallah Khreishah, My Thai, Xintao Wu

    Abstract: Graph neural networks (GNNs) are susceptible to privacy inference attacks (PIAs), given their ability to learn joint representation from features and edges among nodes in graph data. To prevent privacy leakages in GNNs, we propose a novel heterogeneous randomized response (HeteroRR) mechanism to protect nodes' features and edges against PIAs under differential privacy (DP) guarantees without an un…

    Submitted 10 November, 2022; originally announced November 2022.

    Comments: Accepted in IEEE BigData 2022 (short paper)

  16. arXiv:2211.01141  [pdf, other]

    cs.CR cs.CL cs.LG

    User-Entity Differential Privacy in Learning Natural Language Models

    Authors: Phung Lai, NhatHai Phan, Tong Sun, Rajiv Jain, Franck Dernoncourt, Jiuxiang Gu, Nikolaos Barmpalios

    Abstract: In this paper, we introduce a novel concept of user-entity differential privacy (UeDP) to provide formal privacy protection simultaneously to both sensitive entities in textual data and data owners in learning natural language models (NLMs). To preserve UeDP, we developed a novel algorithm, called UeDP-Alg, optimizing the trade-off between privacy loss and model utility with a tight sensitivity bo…

    Submitted 8 November, 2022; v1 submitted 1 November, 2022; originally announced November 2022.

    Comments: Accepted at IEEE BigData 2022

  17. arXiv:2210.05988  [pdf, other]

    eess.SP cs.LG q-bio.NC

    CLEEGN: A Convolutional Neural Network for Plug-and-Play Automatic EEG Reconstruction

    Authors: Pin-Hua Lai, Bo-Shan Wang, Wei-Chun Yang, Hsiang-Chieh Tsou, Chun-Shu Wei

    Abstract: Human electroencephalography (EEG) is a brain monitoring modality that senses cortical neuroelectrophysiological activity in high-temporal resolution. One of the greatest challenges posed in applications of EEG is the unstable signal quality susceptible to inevitable artifacts during recordings. To date, most existing techniques for EEG artifact removal and reconstruction are applicable to offline…

    Submitted 20 February, 2024; v1 submitted 12 October, 2022; originally announced October 2022.

  18. arXiv:2210.01803  [pdf, other]

    cs.LG cs.AI

    Federated Graph-based Networks with Shared Embedding

    Authors: Tianyi Yu, Pei Lai, Fei Teng

    Abstract: Nowadays, user privacy is becoming an issue that cannot be bypassed for system developers, especially for that of web applications where data can be easily transferred through internet. Thankfully, federated learning proposes an innovative method to train models with distributed devices while data are kept in local storage. However, unlike general neural networks, although graph-based networks hav…

    Submitted 3 October, 2022; originally announced October 2022.

    Comments: 10 pages, 5 figures

  19. arXiv:2207.12831  [pdf, other]

    cs.LG cs.AI cs.CR

    Lifelong DP: Consistently Bounded Differential Privacy in Lifelong Machine Learning

    Authors: Phung Lai, Han Hu, NhatHai Phan, Ruoming Jin, My T. Thai, An M. Chen

    Abstract: In this paper, we show that the process of continually learning new tasks and memorizing previous tasks introduces unknown privacy risks and challenges to bound the privacy loss. Based upon this, we introduce a formal definition of Lifelong DP, in which the participation of any data tuples in the training set of any tasks is protected, under a consistently bounded DP protection, given a growing st…

    Submitted 26 July, 2022; originally announced July 2022.

  20. arXiv:2205.03853  [pdf]

    cs.CL cs.IR cs.LG q-bio.GN

    Assigning Species Information to Corresponding Genes by a Sequence Labeling Framework

    Authors: Ling Luo, Chih-Hsuan Wei, Po-Ting Lai, Qingyu Chen, Rezarta Islamaj Doğan, Zhiyong Lu

    Abstract: The automatic assignment of species information to the corresponding genes in a research article is a critically important step in the gene normalization task, whereby a gene mention is normalized and linked to a database record or identifier by a text-mining algorithm. Existing methods typically rely on heuristic rules based on gene and species co-occurrence in the article, but their accuracy is…

    Submitted 8 May, 2022; originally announced May 2022.

    Journal ref: Database, Volume 2022, 2022, baac090

  21. BioRED: A Rich Biomedical Relation Extraction Dataset

    Authors: Ling Luo, Po-Ting Lai, Chih-Hsuan Wei, Cecilia N Arighi, Zhiyong Lu

    Abstract: Automated relation extraction (RE) from biomedical literature is critical for many downstream text mining applications in both research and real-world settings. However, most existing benchmarking datasets for bio-medical RE only focus on relations of a single type (e.g., protein-protein interactions) at the sentence level, greatly limiting the development of RE systems in biomedicine. In this wor…

    Submitted 19 June, 2022; v1 submitted 8 April, 2022; originally announced April 2022.

    Comments: Accepted by Briefings in Bioinformatics

  22. arXiv:2201.11844  [pdf]

    cs.CR cs.CV physics.optics

    Speckle-based optical cryptosystem and its application for human face recognition via deep learning

    Authors: Qi Zhao, Huanhao Li, Zhipeng Yu, Chi Man Woo, Tianting Zhong, Shengfu Cheng, Yuanjin Zheng, Honglin Liu, Jie Tian, Puxiang Lai

    Abstract: Face recognition has recently become ubiquitous in many scenes for authentication or security purposes. Meanwhile, there are increasing concerns about the privacy of face images, which are sensitive biometric data that should be carefully protected. Software-based cryptosystems are widely adopted nowadays to encrypt face images, but the security level is limited by insufficient digital secret key…

    Submitted 26 January, 2022; originally announced January 2022.

  23. arXiv:2201.07063  [pdf, other]

    cs.LG cs.CR

    How to Backdoor HyperNetwork in Personalized Federated Learning?

    Authors: Phung Lai, NhatHai Phan, Issa Khalil, Abdallah Khreishah, Xintao Wu

    Abstract: This paper explores previously unknown backdoor risks in HyperNet-based personalized federated learning (HyperNetFL) through poisoning attacks. Based upon that, we propose a novel model transferring attack (called HNTroj), i.e., the first of its kind, to transfer a local backdoor infected model to all legitimate and personalized local models, which are generated by the HyperNetFL model, through co…

    Submitted 11 December, 2023; v1 submitted 18 January, 2022; originally announced January 2022.

  24. arXiv:2110.05223  [pdf, other]

    cs.LG cs.CR

    Continual Learning with Differential Privacy

    Authors: Pradnya Desai, Phung Lai, NhatHai Phan, My T. Thai

    Abstract: In this paper, we focus on preserving differential privacy (DP) in continual learning (CL), in which we train ML models to learn a sequence of new tasks while memorizing previous tasks. We first introduce a notion of continual adjacent databases to bound the sensitivity of any data record participating in the training process of CL. Based upon that, we develop a new DP-preserving algorithm for CL…

    Submitted 11 October, 2021; originally announced October 2021.

    Comments: The paper will appear at ICONIP21

  25. arXiv:2101.04158  [pdf]

    cs.CL

    BERT-GT: Cross-sentence n-ary relation extraction with BERT and Graph Transformer

    Authors: Po-Ting Lai, Zhiyong Lu

    Abstract: A biomedical relation statement is commonly expressed in multiple sentences and consists of many concepts, including gene, disease, chemical, and mutation. To automatically extract information from biomedical literature, existing biomedical text-mining approaches typically formulate the problem as a cross-sentence n-ary relation-extraction task that detects relations among n entities across multip…

    Submitted 11 January, 2021; originally announced January 2021.

    Comments: 24 pages, 6 figures

  26. arXiv:2011.12566  [pdf]

    cs.AI

    ColdGAN: Resolving Cold Start User Recommendation by using Generative Adversarial Networks

    Authors: Po-Lin Lai, Chih-Yun Chen, Liang-Wei Lo, Chien-Chin Chen

    Abstract: Mitigating the new user cold-start problem has been critical in the recommendation system for online service providers to influence user experience in decision making which can ultimately affect the intention of users to use a particular service. Previous studies leveraged various side information from users and items; however, it may be impractical due to privacy concerns. In this paper, we prese…

    Submitted 25 November, 2020; originally announced November 2020.

  27. PhenoTagger: A Hybrid Method for Phenotype Concept Recognition using Human Phenotype Ontology

    Authors: Ling Luo, Shankai Yan, Po-Ting Lai, Daniel Veltri, Andrew Oler, Sandhya Xirasagar, Rajarshi Ghosh, Morgan Similuk, Peter N. Robinson, Zhiyong Lu

    Abstract: Automatic phenotype concept recognition from unstructured text remains a challenging task in biomedical text mining research. Previous works that address the task typically use dictionary-based matching methods, which can achieve high precision but suffer from lower recall. Recently, machine learning-based methods have been proposed to identify biomedical concepts, which can recognize more unseen…

    Submitted 25 January, 2021; v1 submitted 17 September, 2020; originally announced September 2020.

    Comments: Accepted by Bioinformatics

  28. arXiv:2006.02518  [pdf, other]

    cs.RO

    Autonomous Vehicle Benchmarking using Unbiased Metrics

    Authors: David Paz, Po-jung Lai, Nathan Chan, Yuqing Jiang, Henrik I. Christensen

    Abstract: With the recent development of autonomous vehicle technology, there have been active efforts on the deployment of this technology at different scales that include urban and highway driving. While many of the prototypes showcased have been shown to operate under specific cases, little effort has been made to better understand their shortcomings and generalizability to new areas. Distance, uptime an…

    Submitted 11 September, 2020; v1 submitted 3 June, 2020; originally announced June 2020.

    Comments: 6 pages, 7 figures, IROS 2020

  29. arXiv:2004.00204  [pdf, other]

    cs.LG cs.AI stat.ML

    Ontology-based Interpretable Machine Learning for Textual Data

    Authors: Phung Lai, NhatHai Phan, Han Hu, Anuja Badeti, David Newman, Dejing Dou

    Abstract: In this paper, we introduce a novel interpreting framework that learns an interpretable model based on an ontology-based sampling technique to explain agnostic prediction models. Different from existing approaches, our algorithm considers contextual correlation among words, described in domain knowledge ontologies, to generate semantic explanations. To narrow down the search space for explanations…

    Submitted 31 March, 2020; originally announced April 2020.

    Comments: Accepted by IJCNN 2020

  30. arXiv:1911.05845  [pdf, other]

    eess.SP cs.CV

    Accelerating cardiac cine MRI using a deep learning-based ESPIRiT reconstruction

    Authors: Christopher M. Sandino, Peng Lai, Shreyas S. Vasanawala, Joseph Y. Cheng

    Abstract: A novel neural network architecture, known as DL-ESPIRiT, is proposed to reconstruct rapidly acquired cardiac MRI data without field-of-view limitations which are present in previously proposed deep learning-based reconstruction frameworks. Additionally, a novel convolutional neural network based on separable 3D convolutions is integrated into DL-ESPIRiT to more efficiently learn spatiotemporal pr…

    Submitted 18 May, 2020; v1 submitted 13 November, 2019; originally announced November 2019.

    Comments: 29 pages, 9 figures, 1 table, 7 supplementary videos, Submitted to Magnetic Resonance in Medicine

  31. arXiv:1909.11359  [pdf, other]

    cs.CL

    Tackling Long-Tailed Relations and Uncommon Entities in Knowledge Graph Completion

    Authors: Zihao Wang, Kwun Ping Lai, Piji Li, Lidong Bing, Wai Lam

    Abstract: For large-scale knowledge graphs (KGs), recent research has been focusing on the large proportion of infrequent relations which have been ignored by previous studies. For example few-shot learning paradigm for relations has been investigated. In this work, we further advocate that handling uncommon entities is inevitable when dealing with infrequent relations. Therefore, we propose a meta-learning…

    Submitted 25 September, 2019; originally announced September 2019.

  32. arXiv:1907.11580  [pdf, other]

    cs.DC

    Edge User Allocation with Dynamic Quality of Service

    Authors: Phu Lai, Qiang He, Guangming Cui, Xiaoyu Xia, Mohamed Abdelrazek, Feifei Chen, John Hosking, John Grundy, Yun Yang

    Abstract: In edge computing, edge servers are placed in close proximity to end-users. App vendors can deploy their services on edge servers to reduce network latency experienced by their app users. The edge user allocation (EUA) problem challenges service providers with the objective to maximize the number of allocated app users with hired computing resources on edge servers while ensuring their fixed quali…

    Submitted 26 July, 2019; originally announced July 2019.

    Comments: This manuscript has been accepted for publication at the 17th International Conference on Service-Oriented Computing and may be published in the book series Lecture Notes in Computer Science. All copyrights reserved to Springer Nature Switzerland AG, Gewerbestrasse 11, 6330 Cham, Switzerland

  33. Optimal Edge User Allocation in Edge Computing with Variable Sized Vector Bin Packing

    Authors: Phu Lai, Qiang He, Mohamed Abdelrazek, Feifei Chen, John Hosking, John Grundy, Yun Yang

    Abstract: In mobile edge computing, edge servers are geographically distributed around base stations placed near end-users to provide highly accessible and efficient computing capacities and services. In the mobile edge computing environment, a service provider can deploy its service on hired edge servers to reduce end-to-end service delays experienced by its end-users allocated to those edge servers. An op…

    Submitted 11 April, 2019; originally announced April 2019.

  34. Revised JNLPBA Corpus: A Revised Version of Biomedical NER Corpus for Relation Extraction Task

    Authors: Ming-Siang Huang, Po-Ting Lai, Richard Tzong-Han Tsai, Wen-Lian Hsu

    Abstract: The advancement of biomedical named entity recognition (BNER) and biomedical relation extraction (BRE) researches promotes the development of text mining in biological domains. As a cornerstone of BRE, robust BNER system is required to identify the mentioned NEs in plain texts for further relation extraction stage. However, the current BNER corpora, which play important roles in these tasks, paid…

    Submitted 29 January, 2019; originally announced January 2019.

    Comments: 17 pages

    Journal ref: Briefings in Bioinformatics, 2020, bbaa054

  35. arXiv:1901.09169  [pdf, other]

    cs.GT

    Learning Large Electrical Loads via Flexible Contracts with Commitment

    Authors: Pan Lai, Lingjie Duan, Xiaojun Lin

    Abstract: Large electricity customers (e.g., large data centers) can exhibit huge and variable electricity demands, which poses significant challenges for the electricity suppliers to plan for sufficient capacity. Thus, it is desirable to design incentive and coordination mechanisms between the customers and the supplier to lower the capacity cost. This paper proposes a novel scheme based on flexible contra…

    Submitted 14 September, 2020; v1 submitted 26 January, 2019; originally announced January 2019.

  36. arXiv:1706.07276  [pdf, other]

    cs.CL cs.IR cs.LG

    Jointly Learning Word Embeddings and Latent Topics

    Authors: Bei Shi, Wai Lam, Shoaib Jameel, Steven Schockaert, Kwun Ping Lai

    Abstract: Word embedding models such as Skip-gram learn a vector-space representation for each word, based on the local word collocation patterns that are observed in a text corpus. Latent topic models, on the other hand, take a more global view, looking at the word distributions across the corpus to assign a topic to each word occurrence. These two paradigms are complementary in how they represent the mean…

    Submitted 21 June, 2017; originally announced June 2017.

    Comments: 10 pages, 2 figures, full paper. To appear in the proceedings of The 40th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR '17)

  37. arXiv:1507.01101  [pdf, other]

    cs.DC

    Utility Optimal Thread Assignment and Resource Allocation in Multi-Server Systems

    Authors: Pan Lai, Rui Fan, Xiao Zhang, Wei Zhang, Fang Liu, Joey Tianyi Zhou

    Abstract: Achieving high performance in many multi-server systems requires finding a good assignment of worker threads to servers and also effectively allocating each server's resources to its assigned threads. The assignment and allocation components of this problem have been studied extensively but largely separately in the literature. In this paper, we introduce the assign and allocate (AA) problem, whic…

    Submitted 9 June, 2021; v1 submitted 4 July, 2015; originally announced July 2015.

    Comments: 17 pages

    ACM Class: C.1.4; D.4.2; F.2.1

  38. arXiv:0712.3587  [pdf, ps, other]

    cs.IT cs.CV

    Pattern Recognition System Design with Linear Encoding for Discrete Patterns

    Authors: Po-Hsiang Lai, Joseph A. O'Sullivan

    Abstract: In this paper, designs and analyses of compressive recognition systems are discussed, and also a method of establishing a dual connection between designs of good communication codes and designs of recognition systems is presented. Pattern recognition systems based on compressed patterns and compressed sensor measurements can be designed using low-density matrices. We examine truncation encoding…

    Submitted 20 December, 2007; originally announced December 2007.

    Comments: Submitted and accepted to ISIT 2007