subscribe to arXiv mailings

LLMs Assist NLP Researchers: Critique Paper (Meta-)Reviewing

Authors: Jiangshu Du, Yibo Wang, Wenting Zhao, Zhongfen Deng, Shuaiqi Liu, Renze Lou, Henry Peng Zou, Pranav Narayanan Venkit, Nan Zhang, Mukund Srinath, Haoran Ranran Zhang, Vipul Gupta, Yinghui Li, Tao Li, Fei Wang, Qin Liu, Tianlin Liu, Pengzhi Gao, Congying Xia, Chen Xing, Jiayang Cheng, Zhaowei Wang, Ying Su, Raj Sanjay Shah, Ruohao Guo , et al. (15 additional authors not shown)

Abstract: This work is motivated by two key trends. On one hand, large language models (LLMs) have shown remarkable versatility in various generative tasks such as writing, drawing, and question answering, significantly reducing the time required for many routine tasks. On the other hand, researchers, whose work is not only time-consuming but also highly expertise-demanding, face increasing challenges as th… ▽ More This work is motivated by two key trends. On one hand, large language models (LLMs) have shown remarkable versatility in various generative tasks such as writing, drawing, and question answering, significantly reducing the time required for many routine tasks. On the other hand, researchers, whose work is not only time-consuming but also highly expertise-demanding, face increasing challenges as they have to spend more time reading, writing, and reviewing papers. This raises the question: how can LLMs potentially assist researchers in alleviating their heavy workload? This study focuses on the topic of LLMs assist NLP Researchers, particularly examining the effectiveness of LLM in assisting paper (meta-)reviewing and its recognizability. To address this, we constructed the ReviewCritique dataset, which includes two types of information: (i) NLP papers (initial submissions rather than camera-ready) with both human-written and LLM-generated reviews, and (ii) each review comes with "deficiency" labels and corresponding explanations for individual segments, annotated by experts. Using ReviewCritique, this study explores two threads of research questions: (i) "LLMs as Reviewers", how do reviews generated by LLMs compare with those written by humans in terms of quality and distinguishability? (ii) "LLMs as Metareviewers", how effectively can LLMs identify potential issues, such as Deficient or unprofessional review segments, within individual paper reviews? To our knowledge, this is the first work to provide such a comprehensive analysis. △ Less

Submitted 25 June, 2024; v1 submitted 23 June, 2024; originally announced June 2024.

arXiv:2406.16203 [pdf, other]

LLMs' Classification Performance is Overclaimed

Authors: Hanzi Xu, Renze Lou, Jiangshu Du, Vahid Mahzoon, Elmira Talebianaraki, Zhuoan Zhou, Elizabeth Garrison, Slobodan Vucetic, Wenpeng Yin

Abstract: In many classification tasks designed for AI or human to solve, gold labels are typically included within the label space by default, often posed as "which of the following is correct?" This standard setup has traditionally highlighted the strong performance of advanced AI, particularly top-performing Large Language Models (LLMs), in routine classification tasks. However, when the gold label is in… ▽ More In many classification tasks designed for AI or human to solve, gold labels are typically included within the label space by default, often posed as "which of the following is correct?" This standard setup has traditionally highlighted the strong performance of advanced AI, particularly top-performing Large Language Models (LLMs), in routine classification tasks. However, when the gold label is intentionally excluded from the label space, it becomes evident that LLMs still attempt to select from the available label candidates, even when none are correct. This raises a pivotal question: Do LLMs truly demonstrate their intelligence in understanding the essence of classification tasks? In this study, we evaluate both closed-source and open-source LLMs across representative classification tasks, arguing that the perceived performance of LLMs is overstated due to their inability to exhibit the expected comprehension of the task. This paper makes a threefold contribution: i) To our knowledge, this is the first work to identify the limitations of LLMs in classification tasks when gold labels are absent. We define this task as Classify-w/o-Gold and propose it as a new testbed for LLMs. ii) We introduce a benchmark, Know-No, comprising two existing classification tasks and one new task, to evaluate Classify-w/o-Gold. iii) This work defines and advocates for a new evaluation metric, OmniAccuracy, which assesses LLMs' performance in classification tasks both when gold labels are present and absent. △ Less

Submitted 3 July, 2024; v1 submitted 23 June, 2024; originally announced June 2024.

arXiv:2406.05948 [pdf, other]

Chain-of-Scrutiny: Detecting Backdoor Attacks for Large Language Models

Authors: Xi Li, Yusen Zhang, Renze Lou, Chen Wu, Jiaqi Wang

Abstract: Backdoor attacks present significant threats to Large Language Models (LLMs), particularly with the rise of third-party services that offer API integration and prompt engineering. Untrustworthy third parties can plant backdoors into LLMs and pose risks to users by embedding malicious instructions into user queries. The backdoor-compromised LLM will generate malicious output when and input is embed… ▽ More Backdoor attacks present significant threats to Large Language Models (LLMs), particularly with the rise of third-party services that offer API integration and prompt engineering. Untrustworthy third parties can plant backdoors into LLMs and pose risks to users by embedding malicious instructions into user queries. The backdoor-compromised LLM will generate malicious output when and input is embedded with a specific trigger predetermined by an attacker. Traditional defense strategies, which primarily involve model parameter fine-tuning and gradient calculation, are inadequate for LLMs due to their extensive computational and clean data requirements. In this paper, we propose a novel solution, Chain-of-Scrutiny (CoS), to address these challenges. Backdoor attacks fundamentally create a shortcut from the trigger to the target output, thus lack reasoning support. Accordingly, CoS guides the LLMs to generate detailed reasoning steps for the input, then scrutinizes the reasoning process to ensure consistency with the final answer. Any inconsistency may indicate an attack. CoS only requires black-box access to LLM, offering a practical defense, particularly for API-accessible LLMs. It is user-friendly, enabling users to conduct the defense themselves. Driven by natural language, the entire defense process is transparent to users. We validate the effectiveness of CoS through extensive experiments across various tasks and LLMs. Additionally, experiments results shows CoS proves more beneficial for more powerful LLMs. △ Less

Submitted 9 June, 2024; originally announced June 2024.

arXiv:2404.03602 [pdf, other]

Evaluating LLMs at Detecting Errors in LLM Responses

Authors: Ryo Kamoi, Sarkar Snigdha Sarathi Das, Renze Lou, Jihyun Janice Ahn, Yilun Zhao, Xiaoxin Lu, Nan Zhang, Yusen Zhang, Ranran Haoran Zhang, Sujeeth Reddy Vummanthala, Salika Dave, Shaobo Qin, Arman Cohan, Wenpeng Yin, Rui Zhang

Abstract: With Large Language Models (LLMs) being widely used across various tasks, detecting errors in their responses is increasingly crucial. However, little research has been conducted on error detection of LLM responses. Collecting error annotations on LLM responses is challenging due to the subjective nature of many NLP tasks, and thus previous research focuses on tasks of little practical value (e.g.… ▽ More With Large Language Models (LLMs) being widely used across various tasks, detecting errors in their responses is increasingly crucial. However, little research has been conducted on error detection of LLM responses. Collecting error annotations on LLM responses is challenging due to the subjective nature of many NLP tasks, and thus previous research focuses on tasks of little practical value (e.g., word sorting) or limited error types (e.g., faithfulness in summarization). This work introduces ReaLMistake, the first error detection benchmark consisting of objective, realistic, and diverse errors made by LLMs. ReaLMistake contains three challenging and meaningful tasks that introduce objectively assessable errors in four categories (reasoning correctness, instruction-following, context-faithfulness, and parameterized knowledge), eliciting naturally observed and diverse errors in responses of GPT-4 and Llama 2 70B annotated by experts. We use ReaLMistake to evaluate error detectors based on 12 LLMs. Our findings show: 1) Top LLMs like GPT-4 and Claude 3 detect errors made by LLMs at very low recall, and all LLM-based error detectors perform much worse than humans. 2) Explanations by LLM-based error detectors lack reliability. 3) LLMs-based error detection is sensitive to small changes in prompts but remains challenging to improve. 4) Popular approaches to improving LLMs, including self-consistency and majority vote, do not improve the error detection performance. Our benchmark and code are provided at https://github.com/psunlpgroup/ReaLMistake. △ Less

Submitted 4 April, 2024; originally announced April 2024.

Comments: Benchmark and code: https://github.com/psunlpgroup/ReaLMistake

arXiv:2402.01622 [pdf, other]

TravelPlanner: A Benchmark for Real-World Planning with Language Agents

Authors: Jian Xie, Kai Zhang, Jiangjie Chen, Tinghui Zhu, Renze Lou, Yuandong Tian, Yanghua Xiao, Yu Su

Abstract: Planning has been part of the core pursuit for artificial intelligence since its conception, but earlier AI agents mostly focused on constrained settings because many of the cognitive substrates necessary for human-level planning have been lacking. Recently, language agents powered by large language models (LLMs) have shown interesting capabilities such as tool use and reasoning. Are these languag… ▽ More Planning has been part of the core pursuit for artificial intelligence since its conception, but earlier AI agents mostly focused on constrained settings because many of the cognitive substrates necessary for human-level planning have been lacking. Recently, language agents powered by large language models (LLMs) have shown interesting capabilities such as tool use and reasoning. Are these language agents capable of planning in more complex settings that are out of the reach of prior AI agents? To advance this investigation, we propose TravelPlanner, a new planning benchmark that focuses on travel planning, a common real-world planning scenario. It provides a rich sandbox environment, various tools for accessing nearly four million data records, and 1,225 meticulously curated planning intents and reference plans. Comprehensive evaluations show that the current language agents are not yet capable of handling such complex planning tasks-even GPT-4 only achieves a success rate of 0.6%. Language agents struggle to stay on task, use the right tools to collect information, or keep track of multiple constraints. However, we note that the mere possibility for language agents to tackle such a complex problem is in itself non-trivial progress. TravelPlanner provides a challenging yet meaningful testbed for future language agents. △ Less

Submitted 23 June, 2024; v1 submitted 2 February, 2024; originally announced February 2024.

Comments: ICML 2024 (Spotlight)

arXiv:2402.00157 [pdf, other]

Large Language Models for Mathematical Reasoning: Progresses and Challenges

Authors: Janice Ahn, Rishu Verma, Renze Lou, Di Liu, Rui Zhang, Wenpeng Yin

Abstract: Mathematical reasoning serves as a cornerstone for assessing the fundamental cognitive capabilities of human intelligence. In recent times, there has been a notable surge in the development of Large Language Models (LLMs) geared towards the automated resolution of mathematical problems. However, the landscape of mathematical problem types is vast and varied, with LLM-oriented techniques undergoing… ▽ More Mathematical reasoning serves as a cornerstone for assessing the fundamental cognitive capabilities of human intelligence. In recent times, there has been a notable surge in the development of Large Language Models (LLMs) geared towards the automated resolution of mathematical problems. However, the landscape of mathematical problem types is vast and varied, with LLM-oriented techniques undergoing evaluation across diverse datasets and settings. This diversity makes it challenging to discern the true advancements and obstacles within this burgeoning field. This survey endeavors to address four pivotal dimensions: i) a comprehensive exploration of the various mathematical problems and their corresponding datasets that have been investigated; ii) an examination of the spectrum of LLM-oriented techniques that have been proposed for mathematical problem-solving; iii) an overview of factors and concerns affecting LLMs in solving math; and iv) an elucidation of the persisting challenges within this domain. To the best of our knowledge, this survey stands as one of the first extensive examinations of the landscape of LLMs in the realm of mathematics, providing a holistic perspective on the current state, accomplishments, and future challenges in this rapidly evolving field. △ Less

Submitted 5 April, 2024; v1 submitted 31 January, 2024; originally announced February 2024.

Comments: EACL 2024 Student Research Workshop, 8 pages

arXiv:2401.03082 [pdf, other]

UMIE: Unified Multimodal Information Extraction with Instruction Tuning

Authors: Lin Sun, Kai Zhang, Qingyuan Li, Renze Lou

Abstract: Multimodal information extraction (MIE) gains significant attention as the popularity of multimedia content increases. However, current MIE methods often resort to using task-specific model structures, which results in limited generalizability across tasks and underutilizes shared knowledge across MIE tasks. To address these issues, we propose UMIE, a unified multimodal information extractor to un… ▽ More Multimodal information extraction (MIE) gains significant attention as the popularity of multimedia content increases. However, current MIE methods often resort to using task-specific model structures, which results in limited generalizability across tasks and underutilizes shared knowledge across MIE tasks. To address these issues, we propose UMIE, a unified multimodal information extractor to unify three MIE tasks as a generation problem using instruction tuning, being able to effectively extract both textual and visual mentions. Extensive experiments show that our single UMIE outperforms various state-of-the-art (SoTA) methods across six MIE datasets on three tasks. Furthermore, in-depth analysis demonstrates UMIE's strong generalization in the zero-shot setting, robustness to instruction variants, and interpretability. Our research serves as an initial step towards a unified MIE model and initiates the exploration into both instruction tuning and large language models within the MIE domain. Our code, data, and model are available at https://github.com/ZUCC-AI/UMIE △ Less

Submitted 5 January, 2024; originally announced January 2024.

arXiv:2312.02436 [pdf, other]

MUFFIN: Curating Multi-Faceted Instructions for Improving Instruction-Following

Authors: Renze Lou, Kai Zhang, Jian Xie, Yuxuan Sun, Janice Ahn, Hanzi Xu, Yu Su, Wenpeng Yin

Abstract: In the realm of large language models (LLMs), enhancing instruction-following capability often involves curating expansive training data. This is achieved through two primary schemes: i) Scaling-Inputs: Amplifying (input, output) pairs per task instruction, aiming for better instruction adherence. ii) Scaling Input-Free Tasks: Enlarging tasks, each composed of an (instruction, output) pair (withou… ▽ More In the realm of large language models (LLMs), enhancing instruction-following capability often involves curating expansive training data. This is achieved through two primary schemes: i) Scaling-Inputs: Amplifying (input, output) pairs per task instruction, aiming for better instruction adherence. ii) Scaling Input-Free Tasks: Enlarging tasks, each composed of an (instruction, output) pair (without requiring a separate input anymore). However, LLMs under Scaling-Inputs tend to be overly sensitive to inputs, leading to misinterpretation or non-compliance with instructions. Conversely, Scaling Input-Free Tasks demands a substantial number of tasks but is less effective in instruction following when dealing with instances in Scaling-Inputs. This work introduces MUFFIN, a new scheme of instruction-following dataset curation. Specifically, we automatically Scale Tasks per Input by diversifying these tasks with various input facets. Experimental results across four zero-shot benchmarks, spanning both Scaling-Inputs and Scaling Input-Free Tasks schemes, reveal that LLMs, at various scales, trained on MUFFIN generally demonstrate superior instruction-following capabilities compared to those trained on the two aforementioned schemes. △ Less

Submitted 14 March, 2024; v1 submitted 4 December, 2023; originally announced December 2023.

Comments: ICLR 2024. Data, model, and code are available at: https://renzelou.github.io/Muffin/

arXiv:2309.12998 [pdf, other]

Audience-specific Explanations for Machine Translation

Authors: Renhan Lou, Jan Niehues

Abstract: In machine translation, a common problem is that the translation of certain words even if translated can cause incomprehension of the target language audience due to different cultural backgrounds. A solution to solve this problem is to add explanations for these words. In a first step, we therefore need to identify these words or phrases. In this work we explore techniques to extract example expl… ▽ More In machine translation, a common problem is that the translation of certain words even if translated can cause incomprehension of the target language audience due to different cultural backgrounds. A solution to solve this problem is to add explanations for these words. In a first step, we therefore need to identify these words or phrases. In this work we explore techniques to extract example explanations from a parallel corpus. However, the sparsity of sentences containing words that need to be explained makes building the training dataset extremely difficult. In this work, we propose a semi-automatic technique to extract these explanations from a large parallel corpus. Experiments on English->German language pair show that our method is able to extract sentence so that more than 10% of the sentences contain explanation, while only 1.9% of the original sentences contain explanations. In addition, experiments on English->French and English->Chinese language pairs also show similar conclusions. This is therefore an essential first automatic step to create a explanation dataset. Furthermore we show that the technique is robust for all three language pairs. △ Less

Submitted 22 September, 2023; originally announced September 2023.

arXiv:2309.06399 [pdf, other]

Orbital-selective effect of spin reorientation on the Dirac fermions in a three-dimensional kagome ferromagnet Fe$_3$Ge

Authors: Rui Lou, Liqin Zhou, Wenhua Song, Alexander Fedorov, Zhijun Tu, Bei Jiang, Qi Wang, Man Li, Zhonghao Liu, Xuezhi Chen, Oliver Rader, Bernd Büchner, Yujie Sun, Hongming Weng, Hechang Lei, Shancai Wang

Abstract: Kagome magnets provide a fascinating platform for the realization of correlated topological quantum phases under various magnetic ground states. However, the intricate effect of the magnetic spin configurations on the characteristic electronic structure directly from the kagome lattice layer remains still elusive. Here, utilizing angle-resolved photoemission spectroscopy and density functional the… ▽ More Kagome magnets provide a fascinating platform for the realization of correlated topological quantum phases under various magnetic ground states. However, the intricate effect of the magnetic spin configurations on the characteristic electronic structure directly from the kagome lattice layer remains still elusive. Here, utilizing angle-resolved photoemission spectroscopy and density functional theory calculations, we report the spectroscopic evidence for the spin-reorientation effect of a kagome ferromagnet Fe$_3$Ge, which is composed only of the kagome planes. There are two kinds of kagome-derived Dirac fermions due to the structural three-dimensionality -- one is less dispersive ($k_z$ $\sim$ 0) and the other disperses linearly ($k_z$ $\sim$ $π$). As the Fe moments cant from the $c$ axis into the $ab$ plane upon cooling, the Dirac fermion in $k_z$ $\sim$ 0 plane with a mixture of the Fe-$3d_{xy}$ and Fe-$3d_{x^2-y^2}$ components evolves from gapped into nearly gapless, while the Dirac cone in $k_z$ $\sim$ $π$ plane mainly of the Fe-$3d_{x^2-y^2}$ orbital character remains intact, suggesting that the effect of spin reorientation on the Dirac fermions has an orbital selectivity. Our unambiguous observations provide a feasible route to design and manipulate the mass of Dirac fermions for realizing the novel quantum phases. We also perform comparative studies between the non-charge-ordered Fe$_3$Ge and its sibling compound FeGe, a newly established charge-density-wave kagome magnet, the results suggest that the orbital-selective van Hove singularities near the Fermi level play an indispensable part in driving the charge order on a magnetic kagome lattice. △ Less

Submitted 12 September, 2023; originally announced September 2023.

Comments: 36 pages, 4 figures

arXiv:2308.03795 [pdf, other]

Toward Zero-Shot Instruction Following

Authors: Renze Lou, Wenpeng Yin

Abstract: This work proposes a challenging yet more realistic setting for zero-shot cross-task generalization: zero-shot instruction following, presuming the existence of a paragraph-style task definition while no demonstrations exist. To better learn the task supervision from the definition, we propose two strategies: first, to automatically find out the critical sentences in the definition; second, a rank… ▽ More This work proposes a challenging yet more realistic setting for zero-shot cross-task generalization: zero-shot instruction following, presuming the existence of a paragraph-style task definition while no demonstrations exist. To better learn the task supervision from the definition, we propose two strategies: first, to automatically find out the critical sentences in the definition; second, a ranking objective to force the model to generate the gold outputs with higher probabilities when those critical parts are highlighted in the definition. The joint efforts of the two strategies yield state-of-the-art performance on the Super-NaturalInstructions. Our code is available on GitHub. △ Less

Submitted 25 January, 2024; v1 submitted 4 August, 2023; originally announced August 2023.

Comments: EACL 2024 Student Research Workshop

arXiv:2305.13300 [pdf, other]

Adaptive Chameleon or Stubborn Sloth: Revealing the Behavior of Large Language Models in Knowledge Conflicts

Authors: Jian Xie, Kai Zhang, Jiangjie Chen, Renze Lou, Yu Su

Abstract: By providing external information to large language models (LLMs), tool augmentation (including retrieval augmentation) has emerged as a promising solution for addressing the limitations of LLMs' static parametric memory. However, how receptive are LLMs to such external evidence, especially when the evidence conflicts with their parametric memory? We present the first comprehensive and controlled… ▽ More By providing external information to large language models (LLMs), tool augmentation (including retrieval augmentation) has emerged as a promising solution for addressing the limitations of LLMs' static parametric memory. However, how receptive are LLMs to such external evidence, especially when the evidence conflicts with their parametric memory? We present the first comprehensive and controlled investigation into the behavior of LLMs when encountering knowledge conflicts. We propose a systematic framework to elicit high-quality parametric memory from LLMs and construct the corresponding counter-memory, which enables us to conduct a series of controlled experiments. Our investigation reveals seemingly contradicting behaviors of LLMs. On the one hand, different from prior wisdom, we find that LLMs can be highly receptive to external evidence even when that conflicts with their parametric memory, given that the external evidence is coherent and convincing. On the other hand, LLMs also demonstrate a strong confirmation bias when the external evidence contains some information that is consistent with their parametric memory, despite being presented with conflicting evidence at the same time. These results pose important implications that are worth careful consideration for the further development and deployment of tool- and retrieval-augmented LLMs. Resources are available at https://github.com/OSU-NLP-Group/LLM-Knowledge-Conflict. △ Less

Submitted 27 February, 2024; v1 submitted 22 May, 2023; originally announced May 2023.

Comments: ICLR 2024 (Spotlight)

arXiv:2305.02900 [pdf, other]

doi 10.1038/s41586-023-06977-7

Superconducting Arcs

Authors: Andrii Kuibarov, Oleksandr Suvorov, Riccardo Vocaturo, Alexander Fedorov, Rui Lou, Luise Merkwitz, Vladimir Voroshnin, Jorge I. Facio, Klaus Koepernik, Alexander Yaresko, Grigoriy Shipunov, Saicharan Aswartham, Jeroen van den Brink, Bernd Büchner, Sergey Borisenko

Abstract: An essential ingredient for the production of Majorana fermions that can be used for quantum computing is the presence of topological superconductivity. As bulk topological superconductors remain elusive, the most promising approaches exploit proximity-induced superconductivity making systems fragile and difficult to realize. Weyl semimetals due to their intrinsic topology belong to potential cand… ▽ More An essential ingredient for the production of Majorana fermions that can be used for quantum computing is the presence of topological superconductivity. As bulk topological superconductors remain elusive, the most promising approaches exploit proximity-induced superconductivity making systems fragile and difficult to realize. Weyl semimetals due to their intrinsic topology belong to potential candidates too, but search for Majorana fermions has always been connected with the superconductivity in the bulk, leaving the possibility of intrinsic superconductivity of the Fermi surface arcs themselves practically without attention, even from the theory side.Here, by means of angle-resolved photoemission spectroscopy and ab-initio calculations, we unambiguously identify topological Fermi arcs on two opposing surfaces of the non-centrosymmetric Weyl material PtBi2. We show that these states become superconducting at different temperatures around 10K. Remarkably, the corresponding coherencepeaks appear as the strongest and sharpest excitations ever detected by photoemission from solids, suggesting significant technological relevance. Our findings indicate that topological superconductivity in PtBi2 occurs exclusively at the surface, which not only makes it an ideal platform to host Majorana fermions, but may also lead to a unique quantum phase - an intrinsic topological SNS Josephson junction. △ Less

Submitted 4 May, 2023; originally announced May 2023.

Comments: 8 pages, 4 figures and Supplementary Information

Journal ref: Nature 626, 294 (2024)

arXiv:2303.10475 [pdf, other]

Large Language Model Instruction Following: A Survey of Progresses and Challenges

Authors: Renze Lou, Kai Zhang, Wenpeng Yin

Abstract: Task semantics can be expressed by a set of input-output examples or a piece of textual instruction. Conventional machine learning approaches for natural language processing (NLP) mainly rely on the availability of large-scale sets of task-specific examples. Two issues arise: first, collecting task-specific labeled examples does not apply to scenarios where tasks may be too complicated or costly t… ▽ More Task semantics can be expressed by a set of input-output examples or a piece of textual instruction. Conventional machine learning approaches for natural language processing (NLP) mainly rely on the availability of large-scale sets of task-specific examples. Two issues arise: first, collecting task-specific labeled examples does not apply to scenarios where tasks may be too complicated or costly to annotate, or the system is required to handle a new task immediately; second, this is not user-friendly since end-users are probably more willing to provide task description rather than a set of examples before using the system. Therefore, the community is paying increasing interest in a new supervision-seeking paradigm for NLP: learning to follow task instructions, i.e., instruction following. Despite its impressive progress, there are some common issues that the community struggles with. This survey paper tries to summarize and provide insights to the current research on instruction following, particularly, by answering the following questions: (i) What is task instruction, and what instruction types exist? (ii) How to model instructions? (iii) What are popular instruction following datasets and evaluation metrics? (iv) What factors influence and explain the instructions' performance? (v) What challenges remain in instruction following? To our knowledge, this is the first comprehensive survey about instruction following. △ Less

Submitted 24 May, 2024; v1 submitted 18 March, 2023; originally announced March 2023.

Comments: Accepted by Computational Linguistics Journal. The paper list is available at https://github.com/RenzeLou/awesome-instruction-learning

arXiv:2303.01795 [pdf, other]

PAGE: A Position-Aware Graph-Based Model for Emotion Cause Entailment in Conversation

Authors: Xiaojie Gu, Renze Lou, Lin Sun, Shangxin Li

Abstract: Conversational Causal Emotion Entailment (C2E2) is a task that aims at recognizing the causes corresponding to a target emotion in a conversation. The order of utterances in the conversation affects the causal inference. However, most current position encoding strategies ignore the order relation among utterances and speakers. To address the issue, we devise a novel position-aware graph to encode… ▽ More Conversational Causal Emotion Entailment (C2E2) is a task that aims at recognizing the causes corresponding to a target emotion in a conversation. The order of utterances in the conversation affects the causal inference. However, most current position encoding strategies ignore the order relation among utterances and speakers. To address the issue, we devise a novel position-aware graph to encode the entire conversation, fully modeling causal relations among utterances. The comprehensive experiments show that our method consistently achieves state-of-the-art performance on two challenging test sets, proving the effectiveness of our model. Our source code is available on Github: https://github.com/XiaojieGu/PAGE. △ Less

Submitted 3 March, 2023; originally announced March 2023.

Comments: ICASSP 2023

arXiv:2301.13768 [pdf, other]

doi 10.1103/PhysRevB.107.035158

Signature of weakly coupled $f$ electrons and conduction electrons in magnetic Weyl semimetal candidates PrAlSi and SmAlSi

Authors: Rui Lou, Alexander Fedorov, Lingxiao Zhao, Alexander Yaresko, Bernd Büchner, Sergey Borisenko

Abstract: Magnetic topological materials are a class of compounds with the underlying interplay of nontrivial band topology and magnetic spin configuration. Extensive interests have been aroused due to their application potential involved with an array of exotic quantum states. With angle-resolved photoemission spectroscopy and first-principles calculations, here we study the electronic properties of two ma… ▽ More Magnetic topological materials are a class of compounds with the underlying interplay of nontrivial band topology and magnetic spin configuration. Extensive interests have been aroused due to their application potential involved with an array of exotic quantum states. With angle-resolved photoemission spectroscopy and first-principles calculations, here we study the electronic properties of two magnetic Weyl semimetal candidates PrAlSi and SmAlSi. Though the two compounds harbor distinct magnetic ground states (ferromagnetic and antiferromagnetic for PrAlSi and SmAlSi, respectively) and 4$f$ shell fillings, we find that they share quite analogous low-energy band structure. By the measurements across the magnetic transitions, we further reveal that there is no evident evolution of the band structure in both compounds and the experimental spectra can be well reproduced by the nonmagnetic calculations, together suggesting a negligible effect of the magnetism on their electronic structures and a possibly weak coupling between the localized 4$f$ electrons and the itinerant conduction electrons. Our results offer essential insights into the interactions between magnetism, electron correlations, and topological orders in the $R$Al$X$ ($R$ = light rare earth and $X$ = Si or Ge) family. △ Less

Submitted 31 January, 2023; originally announced January 2023.

Comments: 7 pages, 3 figures

Journal ref: Phys. Rev. B 107, 035158 (2023)

arXiv:2301.03800 [pdf]

Tunable positions of Weyl nodes via magnetism and pressure in the ferromagnetic Weyl semimetal CeAlSi

Authors: Erjian Cheng, Limin Yan, Xianbiao Shi, Rui Lou, Alexander Fedorov, Mahdi Behnami, Jian Yuan, Yuanji Xu, Yang Xu, Wei Xia, Nikolai Pavlovskii, Darren C. Peets, Weiwei Zhao, Yimin Wan, Yanfeng Guo, Shiyan Li, Wenge Yang, Bernd Büchner

Abstract: The noncentrosymmetric ferromagnetic Weyl semimetal CeAlSi with simultaneous space-inversion (SI) and time-reversal (TR) symmetry breaking provides a unique platform for the exploration of novel topological states. Here, by employing electrical and thermoelectrical transport, angle-resolved photoemission spectroscopy (ARPES), high-pressure techniques, and band calculations, we demonstrate that mag… ▽ More The noncentrosymmetric ferromagnetic Weyl semimetal CeAlSi with simultaneous space-inversion (SI) and time-reversal (TR) symmetry breaking provides a unique platform for the exploration of novel topological states. Here, by employing electrical and thermoelectrical transport, angle-resolved photoemission spectroscopy (ARPES), high-pressure techniques, and band calculations, we demonstrate that magnetism and pressure can serve as efficient parameters to tune the positions of Weyl nodes in CeAlSi. At ambient pressure, an anomalous Hall effect (AHE) and an anomalous Nernst effect (ANE) arise in the paramagnetic state, and then are enhanced when temperature approaches the ferromagnetic ordering temperature, evidencing magnetism facilitates the AHE/ANE. Such an enhancement of AHE/ANE can be ascribed to the tuning of the positions of Weyl nodes via magnetism. The ARPES measurements reveal that the ferromagnetism serves as a pivotal knob to tune the band structure of CeAlSi both in the bulk and on the surface. Such magnetism-tunable electronic structure has hitherto not been reported in other magnetic $R$Al$Pn$ ($R$ = rare earth elements, $Pn$ = Si, Ge) siblings, suggesting the great potential of controlling Weyl node positions in CeAlSi. Under pressure, an enhancement and a sign change of AHE are discovered. Based on band calculations, the evolution of AHE may root in the tuning of Weyl nodes via pressure. Moreover, multiple pressure-induced phase transitions are uncovered. These findings indicate that CeAlSi provides a unique and tunable platform for exploring exotic topological physics and electron correlations, as well as catering to an array of potential applications, such as spintronics and thermoelectrics. △ Less

Submitted 19 March, 2023; v1 submitted 10 January, 2023; originally announced January 2023.

arXiv:2207.14416 [pdf, other]

doi 10.1103/PhysRevResearch.5.043011

Suppression of nematicity by tensile strain in multilayer FeSe/SrTiO$_3$ films

Authors: Rui Lou, Oleksandr Suvorov, Hans-Joachim Grafe, Andrii Kuibarov, Maxim Krivenkov, Oliver Rader, Bernd Büchner, Sergey Borisenko, Alexander Fedorov

Abstract: The nematicity in multilayer FeSe/SrTiO$_3$ films has been previously suggested to be enhanced with decreasing film thickness. Motivated by this, there have been many discussions about the competing relation between nematicity and superconductivity. However, the criterion for determining the nematicity strength in FeSe remains highly debated. The understanding of nematicity and its relation to sup… ▽ More The nematicity in multilayer FeSe/SrTiO$_3$ films has been previously suggested to be enhanced with decreasing film thickness. Motivated by this, there have been many discussions about the competing relation between nematicity and superconductivity. However, the criterion for determining the nematicity strength in FeSe remains highly debated. The understanding of nematicity and its relation to superconductivity in FeSe films is therefore still controversial. Here, we fabricate multilayer FeSe/SrTiO$_3$ films using molecular beam epitaxy and study the nematic properties by combining angle-resolved photoemission spectroscopy, nuclear magnetic resonance, and scanning tunneling microscopy experiments. We unambiguously demonstrate that, near the interface, the nematicity is suppressed by the SrTiO$_3$-induced tensile strain; in the bulk region further away from the interface, the strength of nematicity recovers to the bulk value. Our results not only solve the controversy about the nematicity in multilayer FeSe films, but also offer valuable insights into the relationship between nematicity and superconductivity. △ Less

Submitted 29 June, 2023; v1 submitted 28 July, 2022; originally announced July 2022.

Comments: 23 pages, 4 figures

Journal ref: Phys. Rev. Research 5, 043011 (2023)

arXiv:2206.00289 [pdf, other]

MORE: A Metric Learning Based Framework for Open-domain Relation Extraction

Authors: Yutong Wang, Renze Lou, Kai Zhang, MaoYan Chen, Yujiu Yang

Abstract: Open relation extraction (OpenRE) is the task of extracting relation schemes from open-domain corpora. Most existing OpenRE methods either do not fully benefit from high-quality labeled corpora or can not learn semantic representation directly, affecting downstream clustering efficiency. To address these problems, in this work, we propose a novel learning framework named MORE (Metric learning-base… ▽ More Open relation extraction (OpenRE) is the task of extracting relation schemes from open-domain corpora. Most existing OpenRE methods either do not fully benefit from high-quality labeled corpora or can not learn semantic representation directly, affecting downstream clustering efficiency. To address these problems, in this work, we propose a novel learning framework named MORE (Metric learning-based Open Relation Extraction). The framework utilizes deep metric learning to obtain rich supervision signals from labeled data and drive the neural model to learn semantic relational representation directly. Experiments result in two real-world datasets show that our method outperforms other state-of-the-art baselines. Our source code is available on Github. △ Less

Submitted 1 June, 2022; originally announced June 2022.

Comments: 5 pages, 3 figures, accepted by ICASSP 2021

arXiv:2203.12511 [pdf, other]

doi 10.1038/s41586-022-04412-x

Emergence of Fermi arcs and novel magnetic splitting in an antiferromagnet

Authors: Benjamin Schrunk, Yevhen Kushnirenko, Brinda Kuthanazhi, Junyeong Ahn, Lin-Lin Wang, Evan O`Leary, Kyungchan Lee, Andrew Eaton, Alexander Fedorov, Rui Lou, Vladimir Voroshnin, Oliver J. Clark, Jaime Sanchez-Barriga, Sergey L. Bud`ko, Robert-Jan Slager, Paul C. Canfield, Adam Kaminski

Abstract: The Fermi arcs are signatures of exotic states in solids because they defy conventional concept of Fermi surfaces as closed contours in momentum space. Fermi arcs were first discovered in cuprates, and caused by the pseudogap. Weyl semimetals provided another way to generate Fermi arcs by breaking either the time reversal symmetry (TRS) or inversion symmetry of a 3D Dirac semimetal, which can resu… ▽ More The Fermi arcs are signatures of exotic states in solids because they defy conventional concept of Fermi surfaces as closed contours in momentum space. Fermi arcs were first discovered in cuprates, and caused by the pseudogap. Weyl semimetals provided another way to generate Fermi arcs by breaking either the time reversal symmetry (TRS) or inversion symmetry of a 3D Dirac semimetal, which can result in a Weyl semimetal with pairs of Weyl nodes that have opposite chirality. The bulk-boundary correspondence associated with the Chern number leads to the emergence of Fermi arcs on the boundary. Here, we present experimental evidence that pairs of magnetically split hole- and electron-like Fermi arcs emerge below the Neel temperature, in the antiferromagnetic (AFM) state of cubic NdBi due to a novel band splitting effect. Whereas TRS is broken by the AFM order, both inversion and nonsymmorphic TRS are preserved in the bulk, precluding the possibility of a Weyl semimetal. The observed magnetic splitting is highly unusual, as it creates bands of opposing curvature, that changes with temperature and follows the antiferromagnetic order parameter. This is completely different from previously reported cases of magnetic splittings such as traditional Zeeman and Rashba, where the curvature of the bands is preserved. Therefore, our finding represents a new Fermionic state created by new type of magnetic band splitting in the presence of a long-range AFM order that are not readily explained by existing theoretical ideas. △ Less

Submitted 23 March, 2022; originally announced March 2022.

Comments: 16 pages, 4 figures main text and 20 pages, 12 figures supplement

Journal ref: The version of record of this article, first published in Nature, is available online at Publisher`s website: https://www.nature.com/articles/s41586-022-04412-x (2022)

arXiv:2203.11848 [pdf, other]

doi 10.1063/5.0087141

Electronic structure and open-orbit Fermi surface topology in isostructural semimetals NbAs$_2$ and W$_2$As$_3$ with extremely large magnetoresistance

Authors: Rui Lou, Yiyan Wang, Lingxiao Zhao, Chenchao Xu, Man Li, Xiaoyang Chen, Anmin Zhang, Yaobo Huang, Chao Cao, Genfu Chen, Tianlong Xia, Qingming Zhang, Hong Ding, Shancai Wang

Abstract: In transition-metal dipnictides $TmPn_2$ ($Tm$ = Ta, Nb; $Pn$ = P, As, Sb), the origin of extremely large magnetoresistance (XMR) is yet to be studied by the direct visualization of the experimental band structures. Here, using angle-resolved photoemission spectroscopy, we map out the three-dimensional electronic structure of NbAs$_2$. The open-orbit topology contributes to a non-negligible part o… ▽ More In transition-metal dipnictides $TmPn_2$ ($Tm$ = Ta, Nb; $Pn$ = P, As, Sb), the origin of extremely large magnetoresistance (XMR) is yet to be studied by the direct visualization of the experimental band structures. Here, using angle-resolved photoemission spectroscopy, we map out the three-dimensional electronic structure of NbAs$_2$. The open-orbit topology contributes to a non-negligible part of the Fermi surfaces (FSs), like that of the isostructural compound MoAs$_2$, where the open FS is proposed to likely explain the origin of XMR. We further demonstrate the observation of open characters in the overall FSs of W$_2$As$_3$, which is also a XMR semimetal with the same space group of $C$12/$m$1 as $TmPn_2$ family and MoAs$_2$. Our results suggest that the open-orbit FS topology may be a shared feature between XMR materials with the space group of $C$12/$m$1, and thus could possibly play a role in determining the corresponding XMR effect together with the electron-hole compensation. △ Less

Submitted 22 March, 2022; originally announced March 2022.

Comments: 7 pages, 4 figures, Editor's pick

Journal ref: Appl. Phys. Lett. 120, 123101 (2022)

arXiv:2109.10066 [pdf, other]

doi 10.1038/s41535-021-00381-y

Electronic structure and signature of Tomonaga-Luttinger liquid state in epitaxial CoSb$_{1-x}$ nanoribbons

Authors: Rui Lou, Minyinan Lei, Wenjun Ding, Wentao Yang, Xiaoyang Chen, Ran Tao, Shuyue Ding, Xiaoping Shen, Yajun Yan, Ping Cui, Haichao Xu, Rui Peng, Tong Zhang, Zhenyu Zhang, Donglai Feng

Abstract: Recently, monolayer CoSb/SrTiO$_3$ has been proposed as a candidate harboring interfacial superconductivity in analogy with monolayer FeSe/SrTiO$_3$. Experimentally, while the CoSb-based compounds manifesting as nanowires and thin films have been realized on SrTiO$_3$ substrates, serving as a rich playground, their electronic structures are still unknown and yet to be resolved. Here, we have fabri… ▽ More Recently, monolayer CoSb/SrTiO$_3$ has been proposed as a candidate harboring interfacial superconductivity in analogy with monolayer FeSe/SrTiO$_3$. Experimentally, while the CoSb-based compounds manifesting as nanowires and thin films have been realized on SrTiO$_3$ substrates, serving as a rich playground, their electronic structures are still unknown and yet to be resolved. Here, we have fabricated CoSb$_{1-x}$ nanoribbons with quasi-one-dimensional stripes on SrTiO$_3$(001) substrates using molecular beam epitaxy, and investigated the electronic structure by in situ angle-resolved photoemission spectroscopy. Straight Fermi surfaces without lateral dispersions are observed. CoSb$_{1-x}$/SrTiO$_3$ is slightly hole doped, where the interfacial charge transfer is opposite to that in monolayer FeSe/SrTiO$_3$. The spectral weight near Fermi level exhibits power-law-like suppression and obeys a universal temperature scaling, serving as the signature of Tomonaga-Luttinger liquid (TLL) state. The obtained TLL parameter of $\sim$0.21 shows the underlying strong correlations. Our results not only suggest CoSb$_{1-x}$ nanoribbon as a representative TLL system, but also provide clues for further investigations on the CoSb-related interface. △ Less

Submitted 21 September, 2021; originally announced September 2021.

Comments: 27 pages, 3 figures

Journal ref: npj Quantum Materials 6, 79 (2021)

arXiv:2109.05748 [pdf, other]

GradTS: A Gradient-Based Automatic Auxiliary Task Selection Method Based on Transformer Networks

Authors: Weicheng Ma, Renze Lou, Kai Zhang, Lili Wang, Soroush Vosoughi

Abstract: A key problem in multi-task learning (MTL) research is how to select high-quality auxiliary tasks automatically. This paper presents GradTS, an automatic auxiliary task selection method based on gradient calculation in Transformer-based models. Compared to AUTOSEM, a strong baseline method, GradTS improves the performance of MT-DNN with a bert-base-cased backend model, from 0.33% to 17.93% on 8 na… ▽ More A key problem in multi-task learning (MTL) research is how to select high-quality auxiliary tasks automatically. This paper presents GradTS, an automatic auxiliary task selection method based on gradient calculation in Transformer-based models. Compared to AUTOSEM, a strong baseline method, GradTS improves the performance of MT-DNN with a bert-base-cased backend model, from 0.33% to 17.93% on 8 natural language understanding (NLU) tasks in the GLUE benchmarks. GradTS is also time-saving since (1) its gradient calculations are based on single-task experiments and (2) the gradients are re-used without additional experiments when the candidate task set changes. On the 8 GLUE classification tasks, for example, GradTS costs on average 21.32% less time than AUTOSEM with comparable GPU consumption. Further, we show the robustness of GradTS across various task settings and model selections, e.g. mixed objectives among candidate tasks. The efficiency and efficacy of GradTS in these case studies illustrate its general applicability in MTL research without requiring manual task filtering or costly parameter tuning. △ Less

Submitted 13 September, 2021; originally announced September 2021.

Comments: In EMNLP 2021

arXiv:2108.08375 [pdf, other]

doi 10.18653/v1/2021.acl-long.152

Contributions of Transformer Attention Heads in Multi- and Cross-lingual Tasks

Authors: Weicheng Ma, Kai Zhang, Renze Lou, Lili Wang, Soroush Vosoughi

Abstract: This paper studies the relative importance of attention heads in Transformer-based models to aid their interpretability in cross-lingual and multi-lingual tasks. Prior research has found that only a few attention heads are important in each mono-lingual Natural Language Processing (NLP) task and pruning the remaining heads leads to comparable or improved performance of the model. However, the impa… ▽ More This paper studies the relative importance of attention heads in Transformer-based models to aid their interpretability in cross-lingual and multi-lingual tasks. Prior research has found that only a few attention heads are important in each mono-lingual Natural Language Processing (NLP) task and pruning the remaining heads leads to comparable or improved performance of the model. However, the impact of pruning attention heads is not yet clear in cross-lingual and multi-lingual tasks. Through extensive experiments, we show that (1) pruning a number of attention heads in a multi-lingual Transformer-based model has, in general, positive effects on its performance in cross-lingual and multi-lingual tasks and (2) the attention heads to be pruned can be ranked using gradients and identified with a few trial experiments. Our experiments focus on sequence labeling tasks, with potential applicability on other cross-lingual and multi-lingual tasks. For comprehensiveness, we examine two pre-trained multi-lingual models, namely multi-lingual BERT (mBERT) and XLM-R, on three tasks across 9 languages each. We also discuss the validity of our findings and their extensibility to truly resource-scarce languages and other task settings. △ Less

Submitted 18 August, 2021; originally announced August 2021.

Comments: In ACL 2021

arXiv:2106.06497 [pdf, other]

doi 10.1103/PhysRevLett.128.036402

Charge-Density-Wave-Induced Peak-Dip-Hump Structure and the Multiband Superconductivity in a Kagome Superconductor CsV$_{3}$Sb$_{5}$

Authors: Rui Lou, Alexander Fedorov, Qiangwei Yin, Andrii Kuibarov, Zhijun Tu, Chunsheng Gong, Eike F. Schwier, Bernd Büchner, Hechang Lei, Sergey Borisenko

Abstract: The entanglement of charge density wave (CDW), superconductivity, and topologically nontrivial electronic structure has recently been discovered in the kagome metal $A$V$_3$Sb$_5$ ($A$ = K, Rb, Cs) family. With high-resolution angle-resolved photoemission spectroscopy, we study the electronic properties of CDW and superconductivity in CsV$_3$Sb$_5$. The spectra around $\bar{K}$ is found to exhibit… ▽ More The entanglement of charge density wave (CDW), superconductivity, and topologically nontrivial electronic structure has recently been discovered in the kagome metal $A$V$_3$Sb$_5$ ($A$ = K, Rb, Cs) family. With high-resolution angle-resolved photoemission spectroscopy, we study the electronic properties of CDW and superconductivity in CsV$_3$Sb$_5$. The spectra around $\bar{K}$ is found to exhibit a peak-dip-hump structure associated with two separate branches of dispersion, demonstrating the isotropic CDW gap opening below $E_{\rm F}$. The peak-dip-hump lineshape is contributed by linearly dispersive Dirac bands in the lower branch and a dispersionless flat band close to $E_{\rm F}$ in the upper branch. The electronic instability via Fermi surface nesting could play a role in determining these CDW-related features. The superconducting gap of $\sim$0.4 meV is observed on both the electron band around $\barΓ$ and the flat band around $\bar{K}$, implying the multiband superconductivity. The finite density of states (DOS) at $E_{\rm F}$ in the CDW phase are most likely in favor of the emergence of multiband superconductivity, particularly the enhanced DOS associated with the flat band. Our results not only shed light on the controversial origin of the CDW, but also offer insights into the relationship between CDW and superconductivity. △ Less

Submitted 8 January, 2022; v1 submitted 11 June, 2021; originally announced June 2021.

Comments: 6 pages, 4 figures, PRL in press

Journal ref: Phys. Rev. Lett. 128, 036402 (2022)

arXiv:1911.11474 [pdf, ps, other]

doi 10.1103/PhysRevB.100.201107

Topological phase transition between distinctWeyl semimetal states in MoTe2

Authors: Anmin Zhang, Xiaoli Ma, Changle Liu, Rui Lou, Yimeng Wang, Qiaohe Yu, Yiyan Wang, Tian-long Xia, Shancai Wang, Lei Zhang, Xiaoqun Wang, Changfeng Chen, Qingming Zhang

Abstract: We present experimental evidence of an intriguing phase transition between distinct topological states in the type-II Weyl semimetal MoTe2. We observe anomalies in the Raman phonon frequencies and linewidths as well as electronic quasielastic peaks around 70 K, which, together with structural, thermodynamic measurements, and electron-phonon coupling calculations, demonstrate a temperature-induced… ▽ More We present experimental evidence of an intriguing phase transition between distinct topological states in the type-II Weyl semimetal MoTe2. We observe anomalies in the Raman phonon frequencies and linewidths as well as electronic quasielastic peaks around 70 K, which, together with structural, thermodynamic measurements, and electron-phonon coupling calculations, demonstrate a temperature-induced transition between two topological phases previously identified by contrasting spectroscopic measurements. An analysis of experimental data suggests electron-phonon coupling as the main driving mechanism for the change of key topological characters in the electronic structure of MoTe2.We also find the phase transition to be sensitive to sample conditions distinguished by synthesis methods. These discoveries of temperature and material condition-dependent topological phase evolutions and transitions in MoTe2 advance the fundamental understanding of the underlying physics and enable an effective approach to tuning Weyl semimetal states for technological applications. △ Less

Submitted 26 November, 2019; originally announced November 2019.

Comments: 6 pages, 4 figures

Journal ref: Physical Review B 100, 201107(R) (2019)

arXiv:1805.00827 [pdf, other]

doi 10.1038/s41535-018-0121-4

Experimental observation of bulk nodal lines and electronic surface states in ZrB$_2$

Authors: Rui Lou, Pengjie Guo, Man Li, Qi Wang, Zhonghao Liu, Shanshan Sun, Chenghe Li, Xuchuan Wu, Zilu Wang, Zhe Sun, Dawei Shen, Yaobo Huang, Kai Liu, Zhong-Yi Lu, Hechang Lei, Hong Ding, Shancai Wang

Abstract: Topological nodal-line semimetals are characterized by the line-contact bulk band crossings and the topological surface states. Breaking certain protecting symmetry turns this system into a Dirac semimetal or Weyl semimetal that hosts zero-dimensional isolated nodal points. Recent advances in band theory predicted a topological nodal-line semimetal state possessing a new type of nodal line in AlB… ▽ More Topological nodal-line semimetals are characterized by the line-contact bulk band crossings and the topological surface states. Breaking certain protecting symmetry turns this system into a Dirac semimetal or Weyl semimetal that hosts zero-dimensional isolated nodal points. Recent advances in band theory predicted a topological nodal-line semimetal state possessing a new type of nodal line in AlB$_2$-type diborides. Here, we report an experimental realization of nodal-line fermions and associated surface states near the Fermi energy in ZrB$_2$ by angle-resolved photoemission spectroscopy combined with first-principles calculations. The Dirac nodal lines in ZrB$_2$ wind into two groups of nodal rings, which are linked together along the $Γ$-$K$ direction. We further observe a distinct surface state connecting to each nodal line, indicative of the nontrivial topological nature of the bulk nodal lines. Therefore, our results provide convincing experimental evidence of the nodal-line semimetal states in ZrB$_2$ both in the bulk and on the surface, suggesting ZrB$_2$ as a remarkable platform for discovering unique phenomena induced by nodal-line fermions. △ Less

Submitted 13 September, 2018; v1 submitted 2 May, 2018; originally announced May 2018.

Comments: 24 pages, 4 figures

Journal ref: npj Quantum Materials 3, 43 (2018)

arXiv:1712.09947 [pdf, ps, other]

doi 10.1038/s41467-018-06088-2

Large intrinsic anomalous Hall effect in half-metallic ferromagnet Co3Sn2S2 with magnetic Weyl fermions

Authors: Qi Wang, Yuanfeng Xu, Rui Lou, Zhonghao Liu, Man Li, Yaobo Huang, Dawei Shen, Hongming Weng, Shancai Wang, Hechang Lei

Abstract: The origin of anomalous Hall effect (AHE) in magnetic materials is one of the most intriguing aspect in condensed matter physics and has been controversial for a long time. Recent studies indicate that the intrinsic AHE is closely related to the Berry curvature of occupied electronic states. In a magnetic Weyl semimetal with broken time-reversal symmetry, there are significant contributions on Ber… ▽ More The origin of anomalous Hall effect (AHE) in magnetic materials is one of the most intriguing aspect in condensed matter physics and has been controversial for a long time. Recent studies indicate that the intrinsic AHE is closely related to the Berry curvature of occupied electronic states. In a magnetic Weyl semimetal with broken time-reversal symmetry, there are significant contributions on Berry curvature around Weyl nodes, which would lead to a large intrinsic AHE. Here, we report the large intrinsic AHE in the half-metallic ferromagnet Co3Sn2S2 single crystal. By systematically mapping out the electronic structure of Co3Sn2S2 theoretically and experimentally, the large intrinsic AHE should originate from the Weyl fermions near the Fermi energy. Furthermore, the intrinsic anomalous Hall conductivity depends linearly on the magnetization and this can be attributed to the sharp decrease of magnetization and the change of topological characteristics. △ Less

Submitted 28 December, 2017; originally announced December 2017.

Comments: 24 pages, 4 figures

Journal ref: Nature Communications 9, 3681 (2018)

arXiv:1712.03048 [pdf]

doi 10.1103/PhysRevX.8.031044

Experimental Observation of Dirac Nodal Links in Centrosymmetric Semimetal TiB$_2$

Authors: Zhonghao Liu, Rui Lou, Pengjie Guo, Qi Wang, Shanshan Sun, Chenghe Li, Setti Thirupathaiah, Alexander Fedorov, Dawei Shen, Kai Liu, Hechang Lei, Shancai Wang

Abstract: The topological nodal-line semimetal state, serving as a fertile ground for various topological quantum phases, where a topological insulator, Dirac semimetal, or Weyl semimetal can be realized when the certain protecting symmetry is broken, has only been experimentally studied in very few materials. In contrast to discrete nodes, nodal lines with rich topological configurations can lead to more u… ▽ More The topological nodal-line semimetal state, serving as a fertile ground for various topological quantum phases, where a topological insulator, Dirac semimetal, or Weyl semimetal can be realized when the certain protecting symmetry is broken, has only been experimentally studied in very few materials. In contrast to discrete nodes, nodal lines with rich topological configurations can lead to more unusual transport phenomena. Utilizing angle-resolved photoemission spectroscopy and first-principles calculations, here, we provide compelling evidence of nodal-line fermions in centrosymmetric semimetal TiB$_2$ with a negligible spin-orbit coupling effect. With the band crossings just below the Fermi energy, two groups of Dirac nodal rings are clearly observed without any interference from other bands, one surrounding the Brillouin zone (BZ) corner in the horizontal mirror plane $σ_h$ and the other surrounding the BZ center in the vertical mirror plane $σ_v$. The linear dispersions forming Dirac nodal rings are as wide as 2 eV. We further observe that the two groups of nodal rings link together along the $Γ$-$K$ direction, composing a nodal-link configuration. The simple electronic structure with Dirac nodal links mainly constituting the Fermi surfaces suggests TiB$_2$ as a remarkable platform for studying and applying the novel physical properties related to nodal-line fermions. △ Less

Submitted 17 August, 2018; v1 submitted 8 December, 2017; originally announced December 2017.

Comments: 17 pages, 4 figures

Journal ref: Phys. Rev. X 8, 031044 (2018)

arXiv:1707.05025 [pdf, other]

doi 10.1103/PhysRevB.96.241106

Observation of Open-Orbit Fermi Surface Topology in Extremely Large Magnetoresistance Semimetal MoAs$_2$

Authors: R. Lou, Y. F. Xu, L. -X. Zhao, Z. -Q. Han, P. -J. Guo, M. Li, J. -C. Wang, B. -B. Fu, Z. -H. Liu, Y. -B. Huang, P. Richard, T. Qian, K. Liu, G. -F. Chen, H. M. Weng, H. Ding, S. -C. Wang

Abstract: While recent advances in band theory and sample growth have expanded the series of extremely large magnetoresistance (XMR) semimetals in transition metal dipnictides $TmPn_2$ ($Tm$ = Ta, Nb; $Pn$ = P, As, Sb), the experimental study on their electronic structure and the origin of XMR is still absent. Here, using angle-resolved photoemission spectroscopy combined with first-principles calculations… ▽ More While recent advances in band theory and sample growth have expanded the series of extremely large magnetoresistance (XMR) semimetals in transition metal dipnictides $TmPn_2$ ($Tm$ = Ta, Nb; $Pn$ = P, As, Sb), the experimental study on their electronic structure and the origin of XMR is still absent. Here, using angle-resolved photoemission spectroscopy combined with first-principles calculations and magnetotransport measurements, we performed a comprehensive investigation on MoAs$_2$, which is isostructural to the $TmPn_2$ family and also exhibits quadratic XMR. We resolve a clear band structure well agreeing with the predictions. Intriguingly, the unambiguously observed Fermi surfaces (FSs) are dominated by an open-orbit topology extending along both the [100] and [001] directions in the three-dimensional Brillouin zone. We further reveal the trivial topological nature of MoAs$_2$ by bulk parity analysis. Based on these results, we examine the proposed XMR mechanisms in other semimetals, and conclusively ascribe the origin of quadratic XMR in MoAs$_2$ to the carriers motion on the FSs with dominant open-orbit topology, innovating in the understanding of quadratic XMR in semimetals. △ Less

Submitted 22 November, 2017; v1 submitted 17 July, 2017; originally announced July 2017.

Comments: 6 pages, 4 figures

Journal ref: Phys. Rev. B 96, 241106(R) (2017)

arXiv:1704.02928 [pdf, other]

doi 10.1103/PhysRevX.7.041020

Observation of oscillatory relaxation in the Sn-terminated surface of epitaxial rock-salt SnSe $\{111\}$ topological crystalline insulator

Authors: Wencan Jin, Suresh Vishwanath, Jianpeng Liu, Lingyuan Kong, Rui Lou, Zhongwei Dai, Jerzy T. Sadowski, Xinyu Liu, Huai-Hsun Lien, Alexander Chaney, Yimo Han, Micheal Cao, Junzhang Ma, Tian Qian, Jerry I. Dadap, Shancai Wang, Malgorzata Dobrowolska, Jacek Furdyna, David A. Muller, Karsten Pohl, Hong Ding, Huili Grace Xing, Richard M. Osgood, Jr

Abstract: Topological crystalline insulators have been recently predicted and observed in rock-salt structure SnSe $\{111\}$ thin films. Previous studies have suggested that the Se-terminated surface of this thin film with hydrogen passivation, has a reduced surface energy and is thus a preferred configuration. In this paper, synchrotron-based angle-resolved photoemission spectroscopy, along with density fu… ▽ More Topological crystalline insulators have been recently predicted and observed in rock-salt structure SnSe $\{111\}$ thin films. Previous studies have suggested that the Se-terminated surface of this thin film with hydrogen passivation, has a reduced surface energy and is thus a preferred configuration. In this paper, synchrotron-based angle-resolved photoemission spectroscopy, along with density functional theory calculations, are used to demonstrate conclusively that a rock-salt SnSe $\{111\}$ thin film epitaxially-grown on \ce{Bi2Se3} has a stable Sn-terminated surface. These observations are supported by low energy electron diffraction (LEED) intensity-voltage measurements and dynamical LEED calculations, which further show that the Sn-terminated SnSe $\{111\}$ thin film has undergone a surface structural relaxation of the interlayer spacing between the Sn and Se atomic planes. In sharp contrast to the Se-terminated counterpart, the observed Dirac surface state in the Sn-terminated SnSe $\{111\}$ thin film is shown to yield a high Fermi velocity, $0.50\times10^6$m/s, which suggests a potential mechanism of engineering the Dirac surface state of topological materials by tuning the surface configuration. △ Less

Submitted 10 April, 2017; originally announced April 2017.

Comments: 12 pages, 13 figures, supplementary materials included

Journal ref: Phys. Rev. X 7, 041020 (2017)

arXiv:1612.03589 [pdf, other]

doi 10.1103/PhysRevB.95.115140

Evidence of topological insulator state in the semimetal LaBi

Authors: R. Lou, B. -B. Fu, Q. N. Xu, P. -J. Guo, L. -Y. Kong, L. -K. Zeng, J. -Z. Ma, P. Richard, C. Fang, Y. -B. Huang, S. -S. Sun, Q. Wang, L. Wang, Y. -G. Shi, H. C. Lei, K. Liu, H. M. Weng, T. Qian, H. Ding, S. -C. Wang

Abstract: By employing angle-resolved photoemission spectroscopy combined with first-principles calculations, we performed a systematic investigation on the electronic structure of LaBi, which exhibits extremely large magnetoresistance (XMR), and is theoretically predicted to possess band anticrossing with nontrivial topological properties. Here, the observations of the Fermi-surface topology and band dispe… ▽ More By employing angle-resolved photoemission spectroscopy combined with first-principles calculations, we performed a systematic investigation on the electronic structure of LaBi, which exhibits extremely large magnetoresistance (XMR), and is theoretically predicted to possess band anticrossing with nontrivial topological properties. Here, the observations of the Fermi-surface topology and band dispersions are similar to previous studies on LaSb [Phys. Rev. Lett. 117, 127204 (2016)], a topologically trivial XMR semimetal, except the existence of a band inversion along the $Γ$-$X$ direction, with one massless and one gapped Dirac-like surface state at the $X$ and $Γ$ points, respectively. The odd number of massless Dirac cones suggests that LaBi is analogous to the time-reversal $Z_2$ nontrivial topological insulator. These findings open up a new series for exploring novel topological states and investigating their evolution from the perspective of topological phase transition within the family of rare-earth monopnictides. △ Less

Submitted 23 March, 2017; v1 submitted 12 December, 2016; originally announced December 2016.

Comments: 6 pages, 4 figures

Journal ref: Phys. Rev. B 95, 115140 (2017)

arXiv:1610.02480 [pdf, ps, other]

doi 10.1021/acs.nanolett.6b04814

Engineering the structural and electronic phases of MoTe2 through W substitution

Authors: D. Rhodes, D. A. Chenet, B. E. Janicek, C. Nyby, Y. Lin, W. Jin, D. Edelberg, E. Mannebach, N. Finney, A. Antony, T. Schiros, T. Klarr, A. Mazzoni, M. Chin, Y. -c Chiu, W. Zheng, Q. R. Zhang, F. Ernst, J. I. Dadap, X. Tong, J. Ma, R. Lou, S. Wang, T. Qian, H. Ding , et al. (8 additional authors not shown)

Abstract: MoTe$_2$ is an exfoliable transition metal dichalcogenide (TMD) which crystallizes in three symmetries, the semiconducting trigonal-prismatic $2H-$phase, the semimetallic $1T^{\prime}$ monoclinic phase, and the semimetallic orthorhombic $T_d$ structure. The $2H-$phase displays a band gap of $\sim 1$ eV making it appealing for flexible and transparent optoelectronics. The $T_d-$phase is predicted t… ▽ More MoTe$_2$ is an exfoliable transition metal dichalcogenide (TMD) which crystallizes in three symmetries, the semiconducting trigonal-prismatic $2H-$phase, the semimetallic $1T^{\prime}$ monoclinic phase, and the semimetallic orthorhombic $T_d$ structure. The $2H-$phase displays a band gap of $\sim 1$ eV making it appealing for flexible and transparent optoelectronics. The $T_d-$phase is predicted to possess unique topological properties which might lead to topologically protected non-dissipative transport channels. Recently, it was argued that it is possible to locally induce phase-transformations in TMDs, through chemical doping, local heating, or electric-field to achieve ohmic contacts or to induce useful functionalities such as electronic phase-change memory elements. The combination of semiconducting and topological elements based upon the same compound, might produce a new generation of high performance, low dissipation optoelectronic elements. Here, we show that it is possible to engineer the phases of MoTe$_2$ through W substitution by unveiling the phase-diagram of the Mo$_{1-x}$W$_x$Te$_2$ solid solution which displays a semiconducting to semimetallic transition as a function of $x$. We find that only $\sim 8$ \% of W stabilizes the $T_d-$phase at room temperature. Photoemission spectroscopy, indicates that this phase possesses a Fermi surface akin to that of WTe$_2$. △ Less

Submitted 8 October, 2016; originally announced October 2016.

Comments: 10 paged, 5 pages, supplementary information not included

Journal ref: Nano Letters, 17, 1616 (2017)

arXiv:1604.08142 [pdf, other]

doi 10.1103/PhysRevLett.117.127204

Compensated semimetal LaSb with unsaturated magnetoresistance

Authors: L. -K. Zeng, R. Lou, D. -S. Wu, Q. N. Xu, P. -J. Guo, L. -Y. Kong, Y. -G. Zhong, J. -Z. Ma, B. -B. Fu, P. Richard, P. Wang, G. T. Liu, L. Lu, Y. -B. Huang, C. Fang, S. -S. Sun, Q. Wang, L. Wang, Y. -G. Shi, H. M. Weng, H. -C. Lei, K. Liu, S. -C. Wang, T. Qian, J. -L. Luo , et al. (1 additional authors not shown)

Abstract: By combining angle-resolved photoemission spectroscopy and quantum oscillation measurements, we performed a comprehensive investigation on the electronic structure of LaSb, which exhibits near-quadratic extremely large magnetoresistance (XMR) without any sign of saturation at magnetic fields as high as 40 T. We clearly resolve one spherical and one intersecting-ellipsoidal hole Fermi surfaces (FSs… ▽ More By combining angle-resolved photoemission spectroscopy and quantum oscillation measurements, we performed a comprehensive investigation on the electronic structure of LaSb, which exhibits near-quadratic extremely large magnetoresistance (XMR) without any sign of saturation at magnetic fields as high as 40 T. We clearly resolve one spherical and one intersecting-ellipsoidal hole Fermi surfaces (FSs) at the Brillouin zone (BZ) center $Γ$ and one ellipsoidal electron FS at the BZ boundary $X$. The hole and electron carriers calculated from the enclosed FS volumes are perfectly compensated, and the carrier compensation is unaffected by temperature. We further reveal that LaSb is topologically trivial but share many similarities with the Weyl semimetal TaAs family in the bulk electronic structure. Based on these results, we have examined the mechanisms that have been proposed so far to explain the near-quadratic XMR in semimetals. △ Less

Submitted 19 September, 2016; v1 submitted 27 April, 2016; originally announced April 2016.

Comments: 6 pages, 3 figures

Journal ref: Phys. Rev. Lett. 117, 127204 (2016)

arXiv:1604.05912 [pdf, ps, other]

doi 10.1209/0295-5075/119/17002

Magnetoresistance and Shubnikov-de Hass oscillation in YSb

Authors: Qiao-He Yu, Yi-Yan Wang, Rui Lou, Peng-Jie Guo, Sheng Xu, Kai Liu, Shancai Wang, Tian-Long Xia

Abstract: YSb crystals are grown and the transport properties under magnetic field are measured. The resistivity exhibits metallic behavior under zero magnetic field and the low temperature resistivity shows a clear upturn once a moderate magnetic field is applied. The upturn is greatly enhanced by increasing magnetic field, finally resulting in a metal-to-insulator-like transition. With temperature further… ▽ More YSb crystals are grown and the transport properties under magnetic field are measured. The resistivity exhibits metallic behavior under zero magnetic field and the low temperature resistivity shows a clear upturn once a moderate magnetic field is applied. The upturn is greatly enhanced by increasing magnetic field, finally resulting in a metal-to-insulator-like transition. With temperature further decreased, a resistivity plateau emerges after the insulator-like regime. At low temperature (2.5 K) and high field (14 T), the transverse magnetoresistance (MR) is quite large (3.47 $\times 10^4\%$ ). In addition, Shubnikov-de Haas (SdH) oscillation has also been observed in YSb. Periodic behavior of the oscillation amplitude reveals the related information about Fermi surface and two major oscillation frequencies can be obtained from the FFT spectra of the oscillations. The trivial Berry phase extracted from SdH oscillation, band structure revealed by angle-resolved photoemission spectroscopy (ARPES) and first-principles calculations demonstrate that YSb is a topologically trivial material. △ Less

Submitted 5 March, 2017; v1 submitted 20 April, 2016; originally announced April 2016.

Comments: 6 pages, 7 figures

Journal ref: EPL 119, 17002 (2017)

arXiv:1601.07294 [pdf, other]

doi 10.1103/PhysRevB.93.241104

Emergence of topological bands on the surface of ZrSnTe crystal

Authors: R. Lou, J. -Z. Ma, Q. -N. Xu, B. -B. Fu, L. -Y. Kong, Y. -G. Shi, P. Richard, H. -M. Weng, Z. Fang, S. -S. Sun, Q. Wang, H. -C. Lei, T. Qian, H. Ding, S. -C. Wang

Abstract: By using angle-resolved photoemission spectroscopy combined with first-principles calculations, we reveal that the topmost unit cell of ZrSnTe crystal hosts two-dimensional (2D) electronic bands of topological insulator (TI) state, though such a TI state is defined with a curved Fermi level instead of a global band gap. Furthermore, we find that by modifying the dangling bonds on the surface throu… ▽ More By using angle-resolved photoemission spectroscopy combined with first-principles calculations, we reveal that the topmost unit cell of ZrSnTe crystal hosts two-dimensional (2D) electronic bands of topological insulator (TI) state, though such a TI state is defined with a curved Fermi level instead of a global band gap. Furthermore, we find that by modifying the dangling bonds on the surface through hydrogenation, this 2D band structure can be manipulated so that the expected global energy gap is most likely to be realized. This facilitates the practical applications of 2D TI in heterostructural devices and those with surface decoration and coverage. Since ZrSnTe belongs to a large family of compounds having the similar crystal and band structures, our findings shed light on identifying more 2D TI candidates and superconductor-TI heterojunctions supporting topological superconductors. △ Less

Submitted 19 September, 2016; v1 submitted 27 January, 2016; originally announced January 2016.

Comments: 5 pages, 4 figures

Journal ref: Phys. Rev. B 93, 241104(R) (2016)

arXiv:1601.01564 [pdf, ps, other]

doi 10.1103/PhysRevB.93.115133

Interplay between multiple charge-density waves and the relationship with superconductivity in Pd$_x$HoTe$_{3}$

Authors: Rui Lou, Yipeng Cai, Zhonghao Liu, Tian Qian, Lingxiao Zhao, Yu Li, Kai Liu, Zhiqing Han, Dandan Zhang, Junbao He, Genfu Chen, Hong Ding, Shancai Wang

Abstract: HoTe$_{3}$, a member of the rare-earth tritelluride ($R$Te$_{3}$) family, and its Pd-intercalated compounds, Pd$_x$HoTe$_{3}$, where superconductivity (SC) sets in as the charge-density wave (CDW) transition is suppressed by the intercalation of a small amount of Pd, are investigated using angle-resolved photoemission spectroscopy (ARPES) and electrical resistivity. Two incommensurate CDWs with pe… ▽ More HoTe$_{3}$, a member of the rare-earth tritelluride ($R$Te$_{3}$) family, and its Pd-intercalated compounds, Pd$_x$HoTe$_{3}$, where superconductivity (SC) sets in as the charge-density wave (CDW) transition is suppressed by the intercalation of a small amount of Pd, are investigated using angle-resolved photoemission spectroscopy (ARPES) and electrical resistivity. Two incommensurate CDWs with perpendicular nesting vectors are observed in HoTe$_{3}$ at low temperatures. With a slight Pd intercalation ($x$ = 0.01), the large CDW gap decreases and the small one increases. The momentum dependence of the gaps along the inner Fermi surface (FS) evolves from orthorhombicity to near tetragonality, manifesting the competition between two CDW orders. At $x$ = 0.02, both CDW gaps decreases with the emergence of SC. Further increasing the content of Pd for $x$ = 0.04 will completely suppress the CDW instabilities and give rise to the maximal SC order. The evolution of the electronic structures and electron-phonon couplings (EPCs) of the multiple CDWs upon Pd intercalation are carefully scrutinized. We discuss the interplay between multiple CDW orders, and the competition between CDW and SC in detail. △ Less

Submitted 19 September, 2016; v1 submitted 7 January, 2016; originally announced January 2016.

Comments: 6 pages, 5 figures

Journal ref: Phys. Rev. B 93, 115133 (2016)

arXiv:1503.07674 [pdf, ps, other]

doi 10.1103/PhysRevB.92.115150

Sudden gap-closure across the topological phase transition in Bi$_{2-x}$In$_{x}$Se$_{3}$

Authors: Rui Lou, Zhonghao Liu, Wencan Jin, Haifeng Wang, Zhiqing Han, Kai Liu, Xueyun Wang, Tian Qian, Yevhen Kushnirenko, Sang-Wook Cheong, Richard M. Osgood, Jr., Hong Ding, Shancai Wang

Abstract: The phase transition from a topological insulator to a trivial band insulator is studied by angle-resoled photoemission spectroscopy on Bi$_{2-x}$In$_{x}$Se$_{3}$ single crystals. We first report the complete evolution of the bulk band structures throughout the transition. The robust surface state and the bulk gap size ($\sim$ 0.50 eV) show no significant change upon doping for $x$ = 0.05, 0.10 an… ▽ More The phase transition from a topological insulator to a trivial band insulator is studied by angle-resoled photoemission spectroscopy on Bi$_{2-x}$In$_{x}$Se$_{3}$ single crystals. We first report the complete evolution of the bulk band structures throughout the transition. The robust surface state and the bulk gap size ($\sim$ 0.50 eV) show no significant change upon doping for $x$ = 0.05, 0.10 and 0.175. At $x$ $\geq$ 0.225, the surface state completely disappears and the bulk gap size increases, suggesting a sudden gap-closure and topological phase transition around $x \sim$ 0.175$-$0.225. We discuss the underlying mechanism of the phase transition, proposing that it is governed by the combined effect of spin-orbit coupling and interactions upon band hybridization. Our study provides a new venue to investigate the mechanism of the topological phase transition induced by non-magnetic impurities. △ Less

Submitted 19 September, 2016; v1 submitted 26 March, 2015; originally announced March 2015.

Comments: 5 pages, 4 figures

Journal ref: Phys. Rev. B 92, 115150 (2015)

Showing 1–38 of 38 results for author: Lou, R