Showing 1–50 of 95 results for author: Bi, Z

  1. arXiv:2407.05213  [pdf, other]

    cs.CL cs.AI

    BadCLM: Backdoor Attack in Clinical Language Models for Electronic Health Records

    Authors: Weimin Lyu, Zexin Bi, Fusheng Wang, Chao Chen

    Abstract: The advent of clinical language models integrated into electronic health records (EHR) for clinical decision support has marked a significant advancement, leveraging the depth of clinical notes for improved decision-making. Despite their success, the potential vulnerabilities of these models remain largely unexplored. This paper delves into the realm of backdoor attacks on clinical language models…

    Submitted 6 July, 2024; originally announced July 2024.

    Comments: AMIA 2024

  2. arXiv:2407.03687  [pdf, other]

    cs.CL cs.AI

    STOC-TOT: Stochastic Tree-of-Thought with Constrained Decoding for Complex Reasoning in Multi-Hop Question Answering

    Authors: Zhenyu Bi, Daniel Hajialigol, Zhongkai Sun, Jie Hao, Xuan Wang

    Abstract: Multi-hop question answering (MHQA) requires a model to retrieve and integrate information from multiple passages to answer a complex question. Recent systems leverage the power of large language models and integrate evidence retrieval with reasoning prompts (e.g., chain-of-thought reasoning) for the MHQA task. However, the complexities in the question types (bridge vs. comparison questions) and…

    Submitted 4 July, 2024; originally announced July 2024.

    Comments: 10 pages, 5 figures

  3. arXiv:2406.12324  [pdf, other]

    cs.RO

    AutoDSL: Automated domain-specific language design for structural representation of procedures with constraints

    Authors: Yu-Zhe Shi, Haofei Hou, Zhangqian Bi, Fanxu Meng, Xiang Wei, Lecheng Ruan, Qining Wang

    Abstract: Accurate representation of procedures in restricted scenarios, such as non-standardized scientific experiments, requires precise depiction of constraints. Unfortunately, Domain-specific Language (DSL), as an effective tool to express constraints structurally, often requires case-by-case hand-crafting, necessitating customized, labor-intensive efforts. To overcome this challenge, we introduce the A…

    Submitted 18 June, 2024; originally announced June 2024.

    Comments: In Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) (ACL'24)

  4. arXiv:2405.03639  [pdf, other]

    quant-ph cond-mat.mes-hall cond-mat.str-el

    Strong-to-Weak Spontaneous Symmetry Breaking in Mixed Quantum States

    Authors: Leonardo A. Lessa, Ruochen Ma, Jian-Hao Zhang, Zhen Bi, Meng Cheng, Chong Wang

    Abstract: Symmetry in mixed quantum states can manifest in two distinct forms: \textit{strong symmetry}, where each individual pure state in the quantum ensemble is symmetric with the same charge, and \textit{weak symmetry}, which applies only to the entire ensemble. This paper explores a novel type of spontaneous symmetry breaking (SSB) where a strong symmetry is broken to a weak one. While the SSB of a we…

    Submitted 3 July, 2024; v1 submitted 6 May, 2024; originally announced May 2024.

    Comments: 17+6 pages, 4 figures

  5. arXiv:2404.15639  [pdf, other]

    cs.CL

    CodeIP: A Grammar-Guided Multi-Bit Watermark for Large Language Models of Code

    Authors: Batu Guan, Yao Wan, Zhangqian Bi, Zheng Wang, Hongyu Zhang, Yulei Sui, Pan Zhou, Lichao Sun

    Abstract: As Large Language Models (LLMs) are increasingly used to automate code generation, it is often desired to know if the code is AI-generated and by which model, especially for purposes like protecting intellectual property (IP) in industry and preventing academic misconduct in education. Incorporating watermarks into machine-generated content is one way to provide code provenance, but existing solut…

    Submitted 24 April, 2024; originally announced April 2024.

    Comments: 13 pages, 7 figures

  6. arXiv:2404.00726  [pdf, other]

    eess.IV cs.CV cs.LG

    MugenNet: A Novel Combined Convolution Neural Network and Transformer Network with its Application for Colonic Polyp Image Segmentation

    Authors: Chen Peng, Zhiqin Qian, Kunyu Wang, Qi Luo, Zhuming Bi, Wenjun Zhang

    Abstract: Biomedical image segmentation is a very important part in disease diagnosis. The term "colonic polyps" refers to polypoid lesions that occur on the surface of the colonic mucosa within the intestinal lumen. In clinical practice, early detection of polyps is conducted through colonoscopy examinations and biomedical image processing. Therefore, the accurate polyp image segmentation is of great signi…

    Submitted 31 March, 2024; originally announced April 2024.

  7. arXiv:2403.19601  [pdf, other]

    cond-mat.str-el cond-mat.mes-hall

    Topological Phases and Phase Transitions with Dipolar Symmetry Breaking

    Authors: Amogh Anakru, Zhen Bi

    Abstract: Systems with dipole moment conservation have been of recent interest, as they realize both novel quantum dynamics and exotic ground state phases. In this work, we study some generic properties of 1-D and 2-D dipole-conserving fermionic models at integer fillings. We find that a dipolar symmetry-breaking phase can result in a mean-field band insulator whose topological indices can strongly affect t…

    Submitted 28 March, 2024; originally announced March 2024.

    Comments: 11 pages, 8 figures, 7 page appendix

  8. arXiv:2403.16978  [pdf, ps, other]

    cond-mat.str-el

    Locally Purified Density Operators for Symmetry-Protected Topological Phases in Mixed States

    Authors: Yuchen Guo, Jian-Hao Zhang, Hao-Ran Zhang, Shuo Yang, Zhen Bi

    Abstract: We propose a tensor network approach known as the locally purified density operator (LPDO) to investigate the classification and characterization of symmetry-protected topological (SPT) phases in open quantum systems. We extend the concept of injectivity, originally associated with matrix product states and projected entangled pair states, to LPDOs in $(1+1)D$ and $(2+1)D$ systems, unveiling two d…

    Submitted 16 May, 2024; v1 submitted 25 March, 2024; originally announced March 2024.

    Comments: 27 pages, 5 figures

  9. arXiv:2403.16792  [pdf, other]

    cs.CL cs.SE

    Iterative Refinement of Project-Level Code Context for Precise Code Generation with Compiler Feedback

    Authors: Zhangqian Bi, Yao Wan, Zheng Wang, Hongyu Zhang, Batu Guan, Fangxin Lu, Zili Zhang, Yulei Sui, Hai Jin, Xuanhua Shi

    Abstract: Large Language Models (LLMs) have shown remarkable progress in automated code generation. Yet, LLM-generated code may contain errors in API usage, class, data structure, or missing project-specific information. As much of this project-specific context cannot fit into the prompts of LLMs, we must find ways to allow the model to explore the project-level code context. We present CoCoGen, a new code…

    Submitted 10 June, 2024; v1 submitted 25 March, 2024; originally announced March 2024.

  10. arXiv:2403.15673  [pdf, other]

    cs.CL

    AI for Biomedicine in the Era of Large Language Models

    Authors: Zhenyu Bi, Sajib Acharjee Dip, Daniel Hajialigol, Sindhura Kommu, Hanwen Liu, Meng Lu, Xuan Wang

    Abstract: The capabilities of AI for biomedicine span a wide spectrum, from the atomic level, where it solves partial differential equations for quantum systems, to the molecular level, predicting chemical or protein structures, and further extending to societal predictions like infectious disease outbreaks. Recent advancements in large language models, exemplified by models like ChatGPT, have showcased sig…

    Submitted 22 March, 2024; originally announced March 2024.

    Comments: 8 pages, 3 figures

  11. arXiv:2402.03049  [pdf, other]

    cs.CL cs.AI cs.HC cs.IR cs.LG

    EasyInstruct: An Easy-to-use Instruction Processing Framework for Large Language Models

    Authors: Yixin Ou, Ningyu Zhang, Honghao Gui, Ziwen Xu, Shuofei Qiao, Yida Xue, Runnan Fang, Kangwei Liu, Lei Li, Zhen Bi, Guozhou Zheng, Huajun Chen

    Abstract: In recent years, instruction tuning has gained increasing attention and emerged as a crucial technique to enhance the capabilities of Large Language Models (LLMs). To construct high-quality instruction datasets, many instruction processing approaches have been proposed, aiming to achieve a delicate balance between data quantity and data quality. Nevertheless, due to inconsistencies that persist am…

    Submitted 23 June, 2024; v1 submitted 5 February, 2024; originally announced February 2024.

    Comments: ACL 2024 System Demonstrations; Project website: https://zjunlp.github.io/project/EasyInstruct Code: https://github.com/zjunlp/EasyInstruct Video: https://youtu.be/rfQOWYfziFo Demo: https://huggingface.co/spaces/zjunlp/EasyInstruct

  12. arXiv:2401.00288  [pdf, other]

    cs.SE cs.AI

    Deep Learning for Code Intelligence: Survey, Benchmark and Toolkit

    Authors: Yao Wan, Yang He, Zhangqian Bi, Jianguo Zhang, Hongyu Zhang, Yulei Sui, Guandong Xu, Hai Jin, Philip S. Yu

    Abstract: Code intelligence leverages machine learning techniques to extract knowledge from extensive code corpora, with the aim of developing intelligent tools to improve the quality and productivity of computer programming. Currently, there is already a thriving research community focusing on code intelligence, with efforts ranging from software engineering, machine learning, data mining, natural language…

    Submitted 30 December, 2023; originally announced January 2024.

  13. arXiv:2401.00028  [pdf, other]

    cs.CV

    An Empirical Study of Scaling Law for OCR

    Authors: Miao Rang, Zhenni Bi, Chuanjian Liu, Yunhe Wang, Kai Han

    Abstract: The laws of model size, data volume, computation and model performance have been extensively studied in the field of Natural Language Processing (NLP). However, the scaling laws in Optical Character Recognition (OCR) have not yet been investigated. To address this, we conducted comprehensive studies that involved examining the correlation between performance and the scale of models, data volume an…

    Submitted 31 January, 2024; v1 submitted 28 December, 2023; originally announced January 2024.

  14. arXiv:2310.02031  [pdf, other]

    cs.CL cs.AI cs.CE cs.LG cs.RO

    OceanGPT: A Large Language Model for Ocean Science Tasks

    Authors: Zhen Bi, Ningyu Zhang, Yida Xue, Yixin Ou, Daxiong Ji, Guozhou Zheng, Huajun Chen

    Abstract: Ocean science, which delves into the oceans that are reservoirs of life and biodiversity, is of great significance given that oceans cover over 70% of our planet's surface. Recently, advances in Large Language Models (LLMs) have transformed the paradigm in science. Despite the success in other domains, current LLMs often fall short in catering to the needs of domain experts like oceanographers, an…

    Submitted 23 May, 2024; v1 submitted 3 October, 2023; originally announced October 2023.

    Comments: ACL2024. Project Website: https://oceangpt.zjukg.cn/

  15. arXiv:2308.15452  [pdf, other]

    cs.CL cs.AI cs.LG cs.SE

    When Do Program-of-Thoughts Work for Reasoning?

    Authors: Zhen Bi, Ningyu Zhang, Yinuo Jiang, Shumin Deng, Guozhou Zheng, Huajun Chen

    Abstract: In the realm of embodied artificial intelligence, the reasoning capabilities of Large Language Models (LLMs) play a pivotal role. Although there are effective methods like program-of-thought prompting for LLMs which uses programming language to tackle complex reasoning tasks, the specific impact of code data on the improvement of reasoning capabilities remains under-explored. To address this gap,…

    Submitted 18 December, 2023; v1 submitted 29 August, 2023; originally announced August 2023.

    Comments: AAAI 2024

  16. arXiv:2308.11104  [pdf]

    cond-mat.mtrl-sci

    Revealing unusual bandgap shifts with temperature and bandgap renormalization effect in phase-stabilized metal halide perovskite thin films

    Authors: Haochen Zhang, Zhixuan Bi, Zehua Zhai, Han Gao, Yuwei Liu, Meiling Jin, Meng Ye, Xuanzhang Li, Haowen Liu, Yuegang Zhang, Xiang Li, Hairen Tan, Yong Xu, Luyi Yang

    Abstract: Hybrid organic-inorganic metal halide perovskites are emerging materials in photovoltaics, whose bandgap is one of the most crucial parameters governing their light harvesting performance. Here we present the temperature and photocarrier density dependence of the bandgap in two phase-stabilized perovskite thin films (MA0.3FA0.7PbI3 and MA0.3FA0.7Pb0.5Sn0.5I3) using photoluminescence and absorption…

    Submitted 28 November, 2023; v1 submitted 21 August, 2023; originally announced August 2023.

  17. arXiv:2308.09697  [pdf, other]

    cond-mat.str-el cond-mat.mtrl-sci

    Maximally Localized Wannier Orbitals, Interaction Models and Fractional Quantum Anomalous Hall Effect in Twisted Bilayer MoTe2

    Authors: Cheng Xu, Jiangxu Li, Yong Xu, Zhen Bi, Yang Zhang

    Abstract: We investigate the moiré band structures and the strong correlation effects in twisted bilayer MoTe$_2$ for a wide range of twist angles, employing a combination of various techniques. Using large-scale first principles calculations, we pinpoint realistic continuum modeling parameters, subsequently deriving the maximally localized Wannier functions for the top three moiré bands. Simplifying our mo…

    Submitted 14 January, 2024; v1 submitted 18 August, 2023; originally announced August 2023.

    Comments: 12 pages + 8 figures, PNAS in press

  18. Spin Coherence and Spin Relaxation in Hybrid Organic-Inorganic Lead and Mixed Lead-Tin Perovskites

    Authors: Haochen Zhang, Zehua Zhai, Zhixuan Bi, Han Gao, Meng Ye, Yong Xu, Hairen Tan, Luyi Yang

    Abstract: Metal halide perovskites make up a promising class of materials for semiconductor spintronics. Here we report a systematic investigation of coherent spin precession, spin dephasing and spin relaxation of electrons and holes in two hybrid organic-inorganic perovskites MA0.3FA0.7PbI3 and MA0.3FA0.7Pb0.5Sn0.5I3 using time-resolved Faraday rotation spectroscopy. With applied in-plane magnetic fields,…

    Submitted 1 September, 2023; v1 submitted 6 August, 2023; originally announced August 2023.

    Journal ref: Nano Letters 23, 7917-7920 (2023)

  19. arXiv:2307.05474  [pdf, other]

    cond-mat.str-el math-ph quant-ph

    Fractonic Higher-Order Topological Phases in Open Quantum Systems

    Authors: Jian-Hao Zhang, Ke Ding, Shuo Yang, Zhen Bi

    Abstract: In this work, we study the generalization of decohered average symmetry-protected topological phases to open quantum systems with a combination of subsystem symmetries and global symmetries. In particular, we provide examples of two types of intrinsic average higher-order topological phases with average subsystem symmetries. A classification scheme for these phases based on generalized anomaly can…

    Submitted 24 October, 2023; v1 submitted 11 July, 2023; originally announced July 2023.

    Comments: 17 pages, 5 figures

    Journal ref: Phys. Rev. B 108, 155123 (2023)

  20. arXiv:2305.16399  [pdf, other]

    cond-mat.str-el cond-mat.mes-hall math-ph quant-ph

    Topological Phases with Average Symmetries: the Decohered, the Disordered, and the Intrinsic

    Authors: Ruochen Ma, Jian-Hao Zhang, Zhen Bi, Meng Cheng, Chong Wang

    Abstract: Global symmetries greatly enrich the landscape of topological quantum phases, playing an essential role from topological insulators to fractional quantum Hall effect. Topological phases in mixed quantum states, originating from \textit{decoherence} in open quantum systems or \textit{disorders} in imperfect crystalline solids, have recently garnered significant interest. Unlike pure states, mixed q…

    Submitted 19 May, 2024; v1 submitted 25 May, 2023; originally announced May 2023.

    Comments: 25 pages, 7 figures

  21. arXiv:2305.14895  [pdf, other]

    astro-ph.IM hep-ex physics.ins-det

    The Lobster Eye Imager for Astronomy Onboard the SATech-01 Satellite

    Authors: Z. X. Ling, X. J. Sun, C. Zhang, S. L. Sun, G. Jin, S. N. Zhang, X. F. Zhang, J. B. Chang, F. S. Chen, Y. F. Chen, Z. W. Cheng, W. Fu, Y. X. Han, H. Li, J. F. Li, Y. Li, Z. D. Li, P. R. Liu, Y. H. Lv, X. H. Ma, Y. J. Tang, C. B. Wang, R. J. Xie, Y. L. Xue, A. L. Yan, et al. (101 additional authors not shown)

    Abstract: The Lobster Eye Imager for Astronomy (LEIA), a pathfinder of the Wide-field X-ray Telescope of the Einstein Probe (EP) mission, was successfully launched onboard the SATech-01 satellite of the Chinese Academy of Sciences on 27 July 2022. In this paper, we introduce the design and on-ground test results of the LEIA instrument. Using state-of-the-art Micro-Pore Optics (MPO), a wide field-of-view (Fo…

    Submitted 24 May, 2023; originally announced May 2023.

    Comments: Accepted by RAA

  22. arXiv:2305.14231  [pdf, other]

    quant-ph cond-mat.str-el

    Triggering Boundary Phase Transitions through Bulk Measurements in 2D Cluster States

    Authors: Yuchen Guo, Jian-Hao Zhang, Zhen Bi, Shuo Yang

    Abstract: We investigate the phase diagram at the boundary of an infinite two-dimensional cluster state subject to bulk measurements using tensor network methods. The state is subjected to uniform measurements $M = \cos\theta\, Z + \sin\theta\, X$ on the lower boundary qubits and in all bulk qubits. Our results show that the boundary of the system exhibits volume-law entanglement at the measurement angle $\theta = \pi/2$ and area-la…

    Submitted 24 October, 2023; v1 submitted 23 May, 2023; originally announced May 2023.

    Comments: 7 pages, 6 figures

    Journal ref: Phys. Rev. Research 5, 043069 (2023)

  23. arXiv:2304.09907  [pdf, other]

    cond-mat.mes-hall

    $\mathbb Z_2$-Nontrivial Moiré Minibands and Interaction-Driven Quantum Anomalous Hall Insulators in Topological Insulator Based Moiré Heterostructures

    Authors: Kaijie Yang, Zian Xu, Yanjie Feng, Frank Schindler, Yuanfeng Xu, Zhen Bi, B. Andrei Bernevig, Peizhe Tang, Chao-Xing Liu

    Abstract: We studied electronic band structure and topological property of a topological insulator thin film under a moiré superlattice potential to search for two-dimensional (2D) $\mathbb Z_2$ non-trivial isolated mini-bands. To model this system, we assume the Fermi energy inside the bulk band gap and thus consider an effective model Hamiltonian with only two surface states that are located at the top an…

    Submitted 19 April, 2023; originally announced April 2023.

  24. arXiv:2304.09048  [pdf, other]

    cs.CL cs.AI cs.IR cs.LG cs.SE

    CodeKGC: Code Language Model for Generative Knowledge Graph Construction

    Authors: Zhen Bi, Jing Chen, Yinuo Jiang, Feiyu Xiong, Wei Guo, Huajun Chen, Ningyu Zhang

    Abstract: Current generative knowledge graph construction approaches usually fail to capture structural knowledge by simply flattening natural language into serialized texts or a specification language. However, large generative language model trained on structured data such as code has demonstrated impressive capability in understanding natural language for structural prediction and reasoning tasks. Intuit…

    Submitted 18 January, 2024; v1 submitted 18 April, 2023; originally announced April 2023.

    Comments: ACM Transactions on Asian and Low-Resource Language Information Processing

  25. arXiv:2304.01181  [pdf, other]

    cond-mat.str-el quant-ph

    Non-Fermi Liquids from Dipolar Symmetry Breaking

    Authors: Amogh Anakru, Zhen Bi

    Abstract: The emergence of fractonic topological phases and novel universality classes for quantum dynamics highlights the importance of dipolar symmetry in condensed matter systems. In this work, we study the properties of symmetry-breaking phases of the dipolar symmetries in fermionic models in various spatial dimensions. In such systems, fermions obtain energy dispersion through dipole condensation. Due…

    Submitted 30 October, 2023; v1 submitted 3 April, 2023; originally announced April 2023.

    Comments: 11 pages + 12 page appendix, 8 figures

    Journal ref: Phys. Rev. B, 108:165112, Oct 2023

  26. arXiv:2303.11846  [pdf]

    cs.RO physics.app-ph

    Dynamic models for Planar Peristaltic Locomotion of a Metameric Earthworm-like Robot

    Authors: Qinyan Zhou, Hongbin Fang, Zhihai Bi, Jian Xu

    Abstract: The development of versatile robots capable of traversing challenging and irregular environments is of increasing interest in the field of robotics, and metameric robots have been identified as a promising solution due to their slender, deformable bodies. Inspired by the effective locomotion of earthworms, earthworm-like robots capable of both rectilinear and planar locomotion have been designed a…

    Submitted 21 March, 2023; originally announced March 2023.

    Comments: 12 pages, 4 figures

  27. arXiv:2303.06076  [pdf, other]

    q-bio.NC

    Cognition of time and thinkings beyond

    Authors: Zedong Bi

    Abstract: A pervasive research protocol of cognitive neuroscience is to train subjects to perform deliberately designed experiments and record brain activity simultaneously, aiming to understand the brain mechanism underlying cognition. However, how the results of this protocol can be applied in technology is seldom discussed. Here, I review the studies on time processing of the brain as examples of this pr…

    Submitted 7 March, 2023; originally announced March 2023.

  28. arXiv:2211.10007  [pdf, other]

    astro-ph.HE astro-ph.IM

    First wide field-of-view X-ray observations by a lobster eye focusing telescope in orbit

    Authors: C. Zhang, Z. X. Ling, X. J. Sun, S. L. Sun, Y. Liu, Z. D. Li, Y. L. Xue, Y. F. Chen, Y. F. Dai, Z. Q. Jia, H. Y. Liu, X. F. Zhang, Y. H. Zhang, S. N. Zhang, F. S. Chen, Z. W. Cheng, W. Fu, Y. X. Han, H. Li, J. F. Li, Y. Li, P. R. Liu, X. H. Ma, Y. J. Tang, C. B. Wang, et al. (53 additional authors not shown)

    Abstract: As a novel X-ray focusing technology, lobster eye micro-pore optics (MPO) feature both a wide observing field of view and true imaging capability, promising sky monitoring with significantly improved sensitivity and spatial resolution in soft X-rays. Since first proposed by Angel (1979), the optics have been extensively studied, developed and trialed over the past decades. In this Letter, we repor…

    Submitted 17 November, 2022; originally announced November 2022.

    Comments: 11 pages, 4 figures. Accepted for publication in Astrophysical Journal Letter

  29. arXiv:2210.17485  [pdf, other]

    cond-mat.str-el cond-mat.stat-mech quant-ph

    Strange Correlation Function for Average Symmetry-Protected Topological Phases

    Authors: Jian-Hao Zhang, Yang Qi, Zhen Bi

    Abstract: Average symmetry-protected topological (ASPT) phase is a generalization of symmetry-protected topological phases to disordered systems or open quantum systems. We devise a "strange correlator" in one and two dimensions to detect nontrivial ASPT states. We demonstrate that for a nontrivial ASPT phase this strange correlator exhibits long-range or power-law behavior. We explore the connection betwee…

    Submitted 9 April, 2024; v1 submitted 31 October, 2022; originally announced October 2022.

    Comments: 10 pages, 5 figures. A basis-independent form of strange correlator has been updated

  30. arXiv:2210.15596  [pdf, other]

    cond-mat.str-el quant-ph

    Classification and construction of interacting fractonic higher-order topological phases

    Authors: Jian-Hao Zhang, Meng Cheng, Zhen Bi

    Abstract: The notion of higher-order topological phases can have interesting generalizations to systems with subsystem symmetries that exhibit fractonic dynamics for charged excitations. In this work, we systematically study the higher-order topological phases protected by a combination of subsystem symmetries and ordinary global symmetries in two and three-dimensional interacting boson systems, with some i…

    Submitted 27 October, 2022; originally announced October 2022.

    Comments: 17 pages, 4 figures with references, comments are welcome!

  31. arXiv:2210.11298  [pdf, other]

    cs.AI

    Tele-Knowledge Pre-training for Fault Analysis

    Authors: Zhuo Chen, Wen Zhang, Yufeng Huang, Mingyang Chen, Yuxia Geng, Hongtao Yu, Zhen Bi, Yichi Zhang, Zhen Yao, Wenting Song, Xinliang Wu, Yi Yang, Mingyi Chen, Zhaoyang Lian, Yingying Li, Lei Cheng, Huajun Chen

    Abstract: In this work, we share our experience on tele-knowledge pre-training for fault analysis, a crucial task in telecommunication applications that requires a wide range of knowledge normally found in both machine log data and product documents. To organize this knowledge from experts uniformly, we propose to create a Tele-KG (tele-knowledge graph). Using this valuable data, we further propose a tele-d…

    Submitted 17 February, 2023; v1 submitted 20 October, 2022; originally announced October 2022.

    Comments: ICDE 2023 https://github.com/hackerchenzhuo/KTeleBERT

  32. arXiv:2208.11699  [pdf, other]

    cond-mat.str-el cond-mat.stat-mech quant-ph

    Decoding Measurement-Prepared Quantum Phases and Transitions: from Ising model to gauge theory, and beyond

    Authors: Jong Yeon Lee, Wenjie Ji, Zhen Bi, Matthew P. A. Fisher

    Abstract: Measurements allow efficient preparation of interesting quantum many-body states with long-range entanglement, conditioned on additional transformations based on measurement outcomes. Here, we demonstrate that the so-called conformal quantum critical points (CQCP) can be obtained by performing general single-site measurements in an appropriate basis on the cluster states in $d\geq2$. The equal-tim…

    Submitted 6 September, 2022; v1 submitted 24 August, 2022; originally announced August 2022.

    Comments: 37 pages, 11 figures

  33. arXiv:2207.10080  [pdf, other]

    q-bio.QM cs.AI cs.CL cs.IR cs.LG

    Multi-modal Protein Knowledge Graph Construction and Applications

    Authors: Siyuan Cheng, Xiaozhuan Liang, Zhen Bi, Huajun Chen, Ningyu Zhang

    Abstract: Existing data-centric methods for protein science generally cannot sufficiently capture and leverage biology knowledge, which may be crucial for many protein tasks. To facilitate research in this field, we create ProteinKG65, a knowledge graph for protein science. Using gene ontology and Uniprot knowledge base as a basis, we transform and integrate various kinds of knowledge with aligned descripti…

    Submitted 14 November, 2022; v1 submitted 27 May, 2022; originally announced July 2022.

    Comments: Accepted by AAAI 2023 (Student Abstract). Dataset available in https://zjunlp.github.io/project/ProteinKG65/

  34. arXiv:2205.10852  [pdf, other]

    cs.CL cs.AI cs.IR cs.LG

    Relphormer: Relational Graph Transformer for Knowledge Graph Representations

    Authors: Zhen Bi, Siyuan Cheng, Jing Chen, Xiaozhuan Liang, Feiyu Xiong, Ningyu Zhang

    Abstract: Transformers have achieved remarkable performance in widespread fields, including natural language processing, computer vision and graph mining. However, vanilla Transformer architectures have not yielded promising improvements in the Knowledge Graph (KG) representations, where the translational distance paradigm dominates this area. Note that vanilla Transformer architectures struggle to capture…

    Submitted 21 November, 2023; v1 submitted 22 May, 2022; originally announced May 2022.

    Comments: Neurocomputing 2023

  35. arXiv:2202.04361  [pdf]

    cs.ET cond-mat.mtrl-sci cond-mat.soft

    Molecular-scale Integration of Multi-modal Sensing and Neuromorphic Computing with Organic Electrochemical Transistors

    Authors: Shijie Wang, Xi Chen, Chao Zhao, Yuxin Kong, Baojun Lin, Yongyi Wu, Zhaozhao Bi, Ziyi Xuan, Tao Li, Yuxiang Li, Wei Zhang, En Ma, Zhongrui Wang, Wei Ma

    Abstract: Bionic learning with fused sensing, memory and processing functions outperforms artificial neural networks running on silicon chips in terms of efficiency and footprint. However, digital hardware implementation of bionic learning suffers from device heterogeneity in sensors and processing cores, which incurs large hardware, energy and time overheads. Here, we present a universal solution…

    Submitted 19 February, 2022; v1 submitted 9 February, 2022; originally announced February 2022.

    Comments: 17 pages, 4 figures

  36. arXiv:2202.01231  [pdf, other]

    cond-mat.str-el cond-mat.supr-con

    Interaction Enabled Fractonic Higher-Order Topological Phases

    Authors: Julian May-Mann, Yizhi You, Taylor L. Hughes, Zhen Bi

    Abstract: In this work, we present a collection of three-dimensional higher-order symmetry protected topological phases (HOSPTs) with gapless hinge modes that exist only in strongly interacting systems subject to subsystem symmetry constraints. We use a coupled wire construction to generate three families of microscopic lattice models: insulators with helical hinge modes, superconductors with chiral Majoran…

    Submitted 2 February, 2022; originally announced February 2022.

    Comments: 16 pages, 9 figures

  37. arXiv:2201.11147  [pdf, other]

    q-bio.BM cs.AI cs.CL cs.IR cs.LG

    OntoProtein: Protein Pretraining With Gene Ontology Embedding

    Authors: Ningyu Zhang, Zhen Bi, Xiaozhuan Liang, Siyuan Cheng, Haosen Hong, Shumin Deng, Jiazhang Lian, Qiang Zhang, Huajun Chen

    Abstract: Self-supervised protein language models have proved their effectiveness in learning the proteins representations. With the increasing computational power, current protein language models pre-trained with millions of diverse sequences can advance the parameter scale from million-level to billion-level and achieve remarkable improvement. However, those prevailing approaches rarely consider incorpora…

    Submitted 3 June, 2022; v1 submitted 23 January, 2022; originally announced January 2022.

    Comments: Accepted by ICLR 2022

  38. arXiv:2201.10514  [pdf, ps, other]

    math.PR

    Benfordness of the Generalized Gamma Distribution

    Authors: Zelong Bi, Irfan Durmić, Steven J. Miller

    Abstract: The generalized gamma distribution shows up in many problems related to engineering, hydrology as well as survival analysis. Earlier work has been done that estimated the deviation of the exponential and the Weibull distribution from Benford's Law. We give a mathematical explanation for the Benfordness of the generalized gamma distribution and present a measure for the deviation of the generalized…

    Submitted 25 January, 2022; originally announced January 2022.

    Journal ref: The PUMP Journal of Undergraduate Research (2022), Volume 5, 89-104

  39. arXiv:2112.04087  [pdf, other]

    cs.AI

    Improving Knowledge Graph Representation Learning by Structure Contextual Pre-training

    Authors: Ganqiang Ye, Wen Zhang, Zhen Bi, Chi Man Wong, Chen Hui, Huajun Chen

    Abstract: Representation learning models for Knowledge Graphs (KG) have proven to be effective in encoding structural information and performing reasoning over KGs. In this paper, we propose a novel pre-training-then-fine-tuning framework for knowledge graph representation learning, in which a KG model is firstly pre-trained with triple classification task, followed by discriminative fine-tuning on specific…

    Submitted 7 December, 2021; originally announced December 2021.

    Comments: Accepted to IJCKG 2021

  40. arXiv:2110.00479  [pdf, other]

    cs.CL cs.AI cs.IR cs.LG

    Learning to Ask for Data-Efficient Event Argument Extraction

    Authors: Hongbin Ye, Ningyu Zhang, Zhen Bi, Shumin Deng, Chuanqi Tan, Hui Chen, Fei Huang, Huajun Chen

    Abstract: Event argument extraction (EAE) is an important task for information extraction to discover specific argument roles. In this study, we cast EAE as a question-based cloze task and empirically analyze fixed discrete token template performance. As generating human-annotated question templates is often time-consuming and labor-intensive, we further propose a novel approach called "Learning to Ask," wh…

    Submitted 1 October, 2021; originally announced October 2021.

    Comments: work in progress

  41. arXiv:2108.13161  [pdf, other]

    cs.CL cs.AI cs.CV cs.IR cs.LG

    Differentiable Prompt Makes Pre-trained Language Models Better Few-shot Learners

    Authors: Ningyu Zhang, Luoqiu Li, Xiang Chen, Shumin Deng, Zhen Bi, Chuanqi Tan, Fei Huang, Huajun Chen

    Abstract: Large-scale pre-trained language models have contributed significantly to natural language processing by demonstrating remarkable abilities as few-shot learners. However, their effectiveness depends mainly on scaling the model parameters and prompt design, hindering their implementation in most real-world applications. This study proposes a novel pluggable, extensible, and efficient approach named…

    Submitted 4 May, 2022; v1 submitted 30 August, 2021; originally announced August 2021.

    Comments: Accepted by ICLR 2022

  42. arXiv:2106.08087  [pdf, other]

    cs.CL cs.AI cs.IR cs.LG

    CBLUE: A Chinese Biomedical Language Understanding Evaluation Benchmark

    Authors: Ningyu Zhang, Mosha Chen, Zhen Bi, Xiaozhuan Liang, Lei Li, Xin Shang, Kangping Yin, Chuanqi Tan, Jian Xu, Fei Huang, Luo Si, Yuan Ni, Guotong Xie, Zhifang Sui, Baobao Chang, Hui Zong, Zheng Yuan, Linfeng Li, Jun Yan, Hongying Zan, Kunli Zhang, Buzhou Tang, Qingcai Chen

    Abstract: Artificial Intelligence (AI), along with the recent progress in biomedical language understanding, is gradually changing medical practice. With the development of biomedical language understanding benchmarks, AI applications are widely used in the medical field. However, most benchmarks are limited to English, which makes it challenging to replicate many of the successes in English for other langu…

    Submitted 7 March, 2022; v1 submitted 15 June, 2021; originally announced June 2021.

    Comments: Accepted by ACL 2022

  43. UCPhrase: Unsupervised Context-aware Quality Phrase Tagging

    Authors: Xiaotao Gu, Zihan Wang, Zhenyu Bi, Yu Meng, Liyuan Liu, Jiawei Han, Jingbo Shang

    Abstract: Identifying and understanding quality phrases from context is a fundamental task in text mining. The most challenging part of this task arguably lies in uncommon, emerging, and domain-specific phrases. The infrequent nature of these phrases significantly hurts the performance of phrase mining methods that rely on sufficient phrase occurrences in the input corpus. Context-aware tagging models, thou…

    Submitted 28 May, 2021; originally announced May 2021.

    Comments: KDD 2021

  44. arXiv:2104.11681  [pdf, other]

    cs.CL cs.AI

    Interventional Aspect-Based Sentiment Analysis

    Authors: Zhen Bi, Ningyu Zhang, Ganqiang Ye, Haiyang Yu, Xi Chen, Huajun Chen

    Abstract: Recent neural-based aspect-based sentiment analysis approaches, though achieving promising improvement on benchmark datasets, have reported suffering from poor robustness when encountering confounder such as non-target aspects. In this paper, we take a causal view to addressing this issue. We propose a simple yet effective method, namely, Sentiment Adjustment (SENTA), by applying a backdoor adjust…

    Submitted 20 April, 2021; originally announced April 2021.

    Comments: Work in progress

  45. arXiv:2104.04907  [pdf, other]

    cs.CL cs.AI cs.IR cs.LG

    Disentangled Contrastive Learning for Learning Robust Textual Representations

    Authors: Xiang Chen, Xin Xie, Zhen Bi, Hongbin Ye, Shumin Deng, Ningyu Zhang, Huajun Chen

    Abstract: Although the self-supervised pre-training of transformer models has resulted in the revolutionizing of natural language processing (NLP) applications and the achievement of state-of-the-art results with regard to various benchmarks, this process is still vulnerable to small and imperceptible permutations originating from legitimate inputs. Intuitively, the representations should be similar in the…

    Submitted 22 August, 2021; v1 submitted 10 April, 2021; originally announced April 2021.

    Comments: Accepted by CICAI 2021

  46. arXiv:2104.02284  [pdf, other]

    cs.AI cs.CL

    Text-guided Legal Knowledge Graph Reasoning

    Authors: Luoqiu Li, Zhen Bi, Hongbin Ye, Shumin Deng, Hui Chen, Huaixiao Tou

    Abstract: Recent years have witnessed the prosperity of legal artificial intelligence with the development of technologies. In this paper, we propose a novel legal application of legal provision prediction (LPP), which aims to predict the related legal provisions of affairs. We formulate this task as a challenging knowledge graph completion problem, which requires not only text understanding but also graph…

    Submitted 22 August, 2021; v1 submitted 6 April, 2021; originally announced April 2021.

  47. arXiv:2104.00312  [pdf, other]

    cs.CL cs.AI cs.DB cs.IR cs.LG

    Normal vs. Adversarial: Salience-based Analysis of Adversarial Samples for Relation Extraction

    Authors: Luoqiu Li, Xiang Chen, Zhen Bi, Xin Xie, Shumin Deng, Ningyu Zhang, Chuanqi Tan, Mosha Chen, Huajun Chen

    Abstract: Recent neural-based relation extraction approaches, though achieving promising improvement on benchmark datasets, have reported their vulnerability towards adversarial attacks. Thus far, efforts mostly focused on generating adversarial samples or defending adversarial attacks, but little is known about the difference between normal and adversarial samples. In this work, we take the first step to l…

    Submitted 25 November, 2021; v1 submitted 1 April, 2021; originally announced April 2021.

    Comments: IJCKG 2021

  48. arXiv:2101.04115  [pdf, other]

    cond-mat.str-el cond-mat.stat-mech quant-ph

    Yang-Lee edge singularity triggered entanglement transition

    Authors: Shao-Kai Jian, Zhi-Cheng Yang, Zhen Bi, Xiao Chen

    Abstract: We show that a class of $\mathcal{PT}$ symmetric non-Hermitian Hamiltonians realizing the Yang-Lee edge singularity exhibits an entanglement transition in the long-time steady state evolved under the Hamiltonian. Such a transition is induced by a level crossing triggered by the critical point associated with the Yang-Lee singularity and hence is first-order in nature. At the transition, the entang…

    Submitted 11 October, 2021; v1 submitted 11 January, 2021; originally announced January 2021.

    Comments: Updated to published version

    Journal ref: Phys. Rev. B 104, L161107 (2021)

  49. arXiv:2009.06206  [pdf, other]

    cs.CL cs.AI cs.DB cs.IR cs.LG

    On Robustness and Bias Analysis of BERT-based Relation Extraction

    Authors: Luoqiu Li, Xiang Chen, Hongbin Ye, Zhen Bi, Shumin Deng, Ningyu Zhang, Huajun Chen

    Abstract: Fine-tuning pre-trained models have achieved impressive performance on standard natural language processing benchmarks. However, the resultant model generalizability remains poorly understood. We do not know, for example, how excellent performance can lead to the perfection of generalization models. In this study, we analyze a fine-tuned BERT model from different perspectives using relation extrac…

    Submitted 25 December, 2021; v1 submitted 14 September, 2020; originally announced September 2020.

    Comments: work in progress

  50. arXiv:1912.11723  [pdf, other]

    hep-lat cond-mat.str-el

    Lattice Analysis of $SU(2)$ with 1 Adjoint Dirac Flavor

    Authors: Zhen Bi, Anthony Grebe, Gurtej Kanwar, Patrick Ledwith, David Murphy, Michael L. Wagman

    Abstract: Recently $SU(2)$ Yang-Mills theory with one massless adjoint Dirac quark flavor emerges as a novel critical theory that can describe the evolution between a trivial insulator and a topological insulator in AIII class in $3+1$ dimensions. There are several classes of conjectured infrared dynamics for this theory. One possibility is that the theory undergoes spontaneous chiral symmetry breaking, wit…

    Submitted 25 December, 2019; originally announced December 2019.

    Comments: presented at the 37th Annual International Symposium on Lattice Field Theory (LATTICE2019)

    Report number: MIT-CTP/5163