-
LLMs Assist NLP Researchers: Critique Paper (Meta-)Reviewing
Authors:
Jiangshu Du,
Yibo Wang,
Wenting Zhao,
Zhongfen Deng,
Shuaiqi Liu,
Renze Lou,
Henry Peng Zou,
Pranav Narayanan Venkit,
Nan Zhang,
Mukund Srinath,
Haoran Ranran Zhang,
Vipul Gupta,
Yinghui Li,
Tao Li,
Fei Wang,
Qin Liu,
Tianlin Liu,
Pengzhi Gao,
Congying Xia,
Chen Xing,
Jiayang Cheng,
Zhaowei Wang,
Ying Su,
Raj Sanjay Shah,
Ruohao Guo
, et al. (15 additional authors not shown)
Abstract:
This work is motivated by two key trends. On one hand, large language models (LLMs) have shown remarkable versatility in various generative tasks such as writing, drawing, and question answering, significantly reducing the time required for many routine tasks. On the other hand, researchers, whose work is not only time-consuming but also highly expertise-demanding, face increasing challenges as th…
▽ More
This work is motivated by two key trends. On one hand, large language models (LLMs) have shown remarkable versatility in various generative tasks such as writing, drawing, and question answering, significantly reducing the time required for many routine tasks. On the other hand, researchers, whose work is not only time-consuming but also highly expertise-demanding, face increasing challenges as they have to spend more time reading, writing, and reviewing papers. This raises the question: how can LLMs potentially assist researchers in alleviating their heavy workload?
This study focuses on the topic of LLMs assist NLP Researchers, particularly examining the effectiveness of LLM in assisting paper (meta-)reviewing and its recognizability. To address this, we constructed the ReviewCritique dataset, which includes two types of information: (i) NLP papers (initial submissions rather than camera-ready) with both human-written and LLM-generated reviews, and (ii) each review comes with "deficiency" labels and corresponding explanations for individual segments, annotated by experts. Using ReviewCritique, this study explores two threads of research questions: (i) "LLMs as Reviewers", how do reviews generated by LLMs compare with those written by humans in terms of quality and distinguishability? (ii) "LLMs as Metareviewers", how effectively can LLMs identify potential issues, such as Deficient or unprofessional review segments, within individual paper reviews? To our knowledge, this is the first work to provide such a comprehensive analysis.
△ Less
Submitted 25 June, 2024; v1 submitted 23 June, 2024;
originally announced June 2024.
-
LLMs' Classification Performance is Overclaimed
Authors:
Hanzi Xu,
Renze Lou,
Jiangshu Du,
Vahid Mahzoon,
Elmira Talebianaraki,
Zhuoan Zhou,
Elizabeth Garrison,
Slobodan Vucetic,
Wenpeng Yin
Abstract:
In many classification tasks designed for AI or human to solve, gold labels are typically included within the label space by default, often posed as "which of the following is correct?" This standard setup has traditionally highlighted the strong performance of advanced AI, particularly top-performing Large Language Models (LLMs), in routine classification tasks. However, when the gold label is in…
▽ More
In many classification tasks designed for AI or human to solve, gold labels are typically included within the label space by default, often posed as "which of the following is correct?" This standard setup has traditionally highlighted the strong performance of advanced AI, particularly top-performing Large Language Models (LLMs), in routine classification tasks. However, when the gold label is intentionally excluded from the label space, it becomes evident that LLMs still attempt to select from the available label candidates, even when none are correct. This raises a pivotal question: Do LLMs truly demonstrate their intelligence in understanding the essence of classification tasks?
In this study, we evaluate both closed-source and open-source LLMs across representative classification tasks, arguing that the perceived performance of LLMs is overstated due to their inability to exhibit the expected comprehension of the task. This paper makes a threefold contribution: i) To our knowledge, this is the first work to identify the limitations of LLMs in classification tasks when gold labels are absent. We define this task as Classify-w/o-Gold and propose it as a new testbed for LLMs. ii) We introduce a benchmark, Know-No, comprising two existing classification tasks and one new task, to evaluate Classify-w/o-Gold. iii) This work defines and advocates for a new evaluation metric, OmniAccuracy, which assesses LLMs' performance in classification tasks both when gold labels are present and absent.
△ Less
Submitted 3 July, 2024; v1 submitted 23 June, 2024;
originally announced June 2024.
-
Chain-of-Scrutiny: Detecting Backdoor Attacks for Large Language Models
Authors:
Xi Li,
Yusen Zhang,
Renze Lou,
Chen Wu,
Jiaqi Wang
Abstract:
Backdoor attacks present significant threats to Large Language Models (LLMs), particularly with the rise of third-party services that offer API integration and prompt engineering. Untrustworthy third parties can plant backdoors into LLMs and pose risks to users by embedding malicious instructions into user queries. The backdoor-compromised LLM will generate malicious output when and input is embed…
▽ More
Backdoor attacks present significant threats to Large Language Models (LLMs), particularly with the rise of third-party services that offer API integration and prompt engineering. Untrustworthy third parties can plant backdoors into LLMs and pose risks to users by embedding malicious instructions into user queries. The backdoor-compromised LLM will generate malicious output when and input is embedded with a specific trigger predetermined by an attacker. Traditional defense strategies, which primarily involve model parameter fine-tuning and gradient calculation, are inadequate for LLMs due to their extensive computational and clean data requirements. In this paper, we propose a novel solution, Chain-of-Scrutiny (CoS), to address these challenges. Backdoor attacks fundamentally create a shortcut from the trigger to the target output, thus lack reasoning support. Accordingly, CoS guides the LLMs to generate detailed reasoning steps for the input, then scrutinizes the reasoning process to ensure consistency with the final answer. Any inconsistency may indicate an attack. CoS only requires black-box access to LLM, offering a practical defense, particularly for API-accessible LLMs. It is user-friendly, enabling users to conduct the defense themselves. Driven by natural language, the entire defense process is transparent to users. We validate the effectiveness of CoS through extensive experiments across various tasks and LLMs. Additionally, experiments results shows CoS proves more beneficial for more powerful LLMs.
△ Less
Submitted 9 June, 2024;
originally announced June 2024.
-
Evaluating LLMs at Detecting Errors in LLM Responses
Authors:
Ryo Kamoi,
Sarkar Snigdha Sarathi Das,
Renze Lou,
Jihyun Janice Ahn,
Yilun Zhao,
Xiaoxin Lu,
Nan Zhang,
Yusen Zhang,
Ranran Haoran Zhang,
Sujeeth Reddy Vummanthala,
Salika Dave,
Shaobo Qin,
Arman Cohan,
Wenpeng Yin,
Rui Zhang
Abstract:
With Large Language Models (LLMs) being widely used across various tasks, detecting errors in their responses is increasingly crucial. However, little research has been conducted on error detection of LLM responses. Collecting error annotations on LLM responses is challenging due to the subjective nature of many NLP tasks, and thus previous research focuses on tasks of little practical value (e.g.…
▽ More
With Large Language Models (LLMs) being widely used across various tasks, detecting errors in their responses is increasingly crucial. However, little research has been conducted on error detection of LLM responses. Collecting error annotations on LLM responses is challenging due to the subjective nature of many NLP tasks, and thus previous research focuses on tasks of little practical value (e.g., word sorting) or limited error types (e.g., faithfulness in summarization). This work introduces ReaLMistake, the first error detection benchmark consisting of objective, realistic, and diverse errors made by LLMs. ReaLMistake contains three challenging and meaningful tasks that introduce objectively assessable errors in four categories (reasoning correctness, instruction-following, context-faithfulness, and parameterized knowledge), eliciting naturally observed and diverse errors in responses of GPT-4 and Llama 2 70B annotated by experts. We use ReaLMistake to evaluate error detectors based on 12 LLMs. Our findings show: 1) Top LLMs like GPT-4 and Claude 3 detect errors made by LLMs at very low recall, and all LLM-based error detectors perform much worse than humans. 2) Explanations by LLM-based error detectors lack reliability. 3) LLMs-based error detection is sensitive to small changes in prompts but remains challenging to improve. 4) Popular approaches to improving LLMs, including self-consistency and majority vote, do not improve the error detection performance. Our benchmark and code are provided at https://github.com/psunlpgroup/ReaLMistake.
△ Less
Submitted 4 April, 2024;
originally announced April 2024.
-
TravelPlanner: A Benchmark for Real-World Planning with Language Agents
Authors:
Jian Xie,
Kai Zhang,
Jiangjie Chen,
Tinghui Zhu,
Renze Lou,
Yuandong Tian,
Yanghua Xiao,
Yu Su
Abstract:
Planning has been part of the core pursuit for artificial intelligence since its conception, but earlier AI agents mostly focused on constrained settings because many of the cognitive substrates necessary for human-level planning have been lacking. Recently, language agents powered by large language models (LLMs) have shown interesting capabilities such as tool use and reasoning. Are these languag…
▽ More
Planning has been part of the core pursuit for artificial intelligence since its conception, but earlier AI agents mostly focused on constrained settings because many of the cognitive substrates necessary for human-level planning have been lacking. Recently, language agents powered by large language models (LLMs) have shown interesting capabilities such as tool use and reasoning. Are these language agents capable of planning in more complex settings that are out of the reach of prior AI agents? To advance this investigation, we propose TravelPlanner, a new planning benchmark that focuses on travel planning, a common real-world planning scenario. It provides a rich sandbox environment, various tools for accessing nearly four million data records, and 1,225 meticulously curated planning intents and reference plans. Comprehensive evaluations show that the current language agents are not yet capable of handling such complex planning tasks-even GPT-4 only achieves a success rate of 0.6%. Language agents struggle to stay on task, use the right tools to collect information, or keep track of multiple constraints. However, we note that the mere possibility for language agents to tackle such a complex problem is in itself non-trivial progress. TravelPlanner provides a challenging yet meaningful testbed for future language agents.
△ Less
Submitted 23 June, 2024; v1 submitted 2 February, 2024;
originally announced February 2024.
-
Large Language Models for Mathematical Reasoning: Progresses and Challenges
Authors:
Janice Ahn,
Rishu Verma,
Renze Lou,
Di Liu,
Rui Zhang,
Wenpeng Yin
Abstract:
Mathematical reasoning serves as a cornerstone for assessing the fundamental cognitive capabilities of human intelligence. In recent times, there has been a notable surge in the development of Large Language Models (LLMs) geared towards the automated resolution of mathematical problems. However, the landscape of mathematical problem types is vast and varied, with LLM-oriented techniques undergoing…
▽ More
Mathematical reasoning serves as a cornerstone for assessing the fundamental cognitive capabilities of human intelligence. In recent times, there has been a notable surge in the development of Large Language Models (LLMs) geared towards the automated resolution of mathematical problems. However, the landscape of mathematical problem types is vast and varied, with LLM-oriented techniques undergoing evaluation across diverse datasets and settings. This diversity makes it challenging to discern the true advancements and obstacles within this burgeoning field. This survey endeavors to address four pivotal dimensions: i) a comprehensive exploration of the various mathematical problems and their corresponding datasets that have been investigated; ii) an examination of the spectrum of LLM-oriented techniques that have been proposed for mathematical problem-solving; iii) an overview of factors and concerns affecting LLMs in solving math; and iv) an elucidation of the persisting challenges within this domain. To the best of our knowledge, this survey stands as one of the first extensive examinations of the landscape of LLMs in the realm of mathematics, providing a holistic perspective on the current state, accomplishments, and future challenges in this rapidly evolving field.
△ Less
Submitted 5 April, 2024; v1 submitted 31 January, 2024;
originally announced February 2024.
-
UMIE: Unified Multimodal Information Extraction with Instruction Tuning
Authors:
Lin Sun,
Kai Zhang,
Qingyuan Li,
Renze Lou
Abstract:
Multimodal information extraction (MIE) gains significant attention as the popularity of multimedia content increases. However, current MIE methods often resort to using task-specific model structures, which results in limited generalizability across tasks and underutilizes shared knowledge across MIE tasks. To address these issues, we propose UMIE, a unified multimodal information extractor to un…
▽ More
Multimodal information extraction (MIE) gains significant attention as the popularity of multimedia content increases. However, current MIE methods often resort to using task-specific model structures, which results in limited generalizability across tasks and underutilizes shared knowledge across MIE tasks. To address these issues, we propose UMIE, a unified multimodal information extractor to unify three MIE tasks as a generation problem using instruction tuning, being able to effectively extract both textual and visual mentions. Extensive experiments show that our single UMIE outperforms various state-of-the-art (SoTA) methods across six MIE datasets on three tasks. Furthermore, in-depth analysis demonstrates UMIE's strong generalization in the zero-shot setting, robustness to instruction variants, and interpretability. Our research serves as an initial step towards a unified MIE model and initiates the exploration into both instruction tuning and large language models within the MIE domain. Our code, data, and model are available at https://github.com/ZUCC-AI/UMIE
△ Less
Submitted 5 January, 2024;
originally announced January 2024.
-
MUFFIN: Curating Multi-Faceted Instructions for Improving Instruction-Following
Authors:
Renze Lou,
Kai Zhang,
Jian Xie,
Yuxuan Sun,
Janice Ahn,
Hanzi Xu,
Yu Su,
Wenpeng Yin
Abstract:
In the realm of large language models (LLMs), enhancing instruction-following capability often involves curating expansive training data. This is achieved through two primary schemes: i) Scaling-Inputs: Amplifying (input, output) pairs per task instruction, aiming for better instruction adherence. ii) Scaling Input-Free Tasks: Enlarging tasks, each composed of an (instruction, output) pair (withou…
▽ More
In the realm of large language models (LLMs), enhancing instruction-following capability often involves curating expansive training data. This is achieved through two primary schemes: i) Scaling-Inputs: Amplifying (input, output) pairs per task instruction, aiming for better instruction adherence. ii) Scaling Input-Free Tasks: Enlarging tasks, each composed of an (instruction, output) pair (without requiring a separate input anymore). However, LLMs under Scaling-Inputs tend to be overly sensitive to inputs, leading to misinterpretation or non-compliance with instructions. Conversely, Scaling Input-Free Tasks demands a substantial number of tasks but is less effective in instruction following when dealing with instances in Scaling-Inputs. This work introduces MUFFIN, a new scheme of instruction-following dataset curation. Specifically, we automatically Scale Tasks per Input by diversifying these tasks with various input facets. Experimental results across four zero-shot benchmarks, spanning both Scaling-Inputs and Scaling Input-Free Tasks schemes, reveal that LLMs, at various scales, trained on MUFFIN generally demonstrate superior instruction-following capabilities compared to those trained on the two aforementioned schemes.
△ Less
Submitted 14 March, 2024; v1 submitted 4 December, 2023;
originally announced December 2023.
-
Audience-specific Explanations for Machine Translation
Authors:
Renhan Lou,
Jan Niehues
Abstract:
In machine translation, a common problem is that the translation of certain words even if translated can cause incomprehension of the target language audience due to different cultural backgrounds. A solution to solve this problem is to add explanations for these words. In a first step, we therefore need to identify these words or phrases. In this work we explore techniques to extract example expl…
▽ More
In machine translation, a common problem is that the translation of certain words even if translated can cause incomprehension of the target language audience due to different cultural backgrounds. A solution to solve this problem is to add explanations for these words. In a first step, we therefore need to identify these words or phrases. In this work we explore techniques to extract example explanations from a parallel corpus. However, the sparsity of sentences containing words that need to be explained makes building the training dataset extremely difficult. In this work, we propose a semi-automatic technique to extract these explanations from a large parallel corpus. Experiments on English->German language pair show that our method is able to extract sentence so that more than 10% of the sentences contain explanation, while only 1.9% of the original sentences contain explanations. In addition, experiments on English->French and English->Chinese language pairs also show similar conclusions. This is therefore an essential first automatic step to create a explanation dataset. Furthermore we show that the technique is robust for all three language pairs.
△ Less
Submitted 22 September, 2023;
originally announced September 2023.
-
Orbital-selective effect of spin reorientation on the Dirac fermions in a three-dimensional kagome ferromagnet Fe$_3$Ge
Authors:
Rui Lou,
Liqin Zhou,
Wenhua Song,
Alexander Fedorov,
Zhijun Tu,
Bei Jiang,
Qi Wang,
Man Li,
Zhonghao Liu,
Xuezhi Chen,
Oliver Rader,
Bernd Büchner,
Yujie Sun,
Hongming Weng,
Hechang Lei,
Shancai Wang
Abstract:
Kagome magnets provide a fascinating platform for the realization of correlated topological quantum phases under various magnetic ground states. However, the intricate effect of the magnetic spin configurations on the characteristic electronic structure directly from the kagome lattice layer remains still elusive. Here, utilizing angle-resolved photoemission spectroscopy and density functional the…
▽ More
Kagome magnets provide a fascinating platform for the realization of correlated topological quantum phases under various magnetic ground states. However, the intricate effect of the magnetic spin configurations on the characteristic electronic structure directly from the kagome lattice layer remains still elusive. Here, utilizing angle-resolved photoemission spectroscopy and density functional theory calculations, we report the spectroscopic evidence for the spin-reorientation effect of a kagome ferromagnet Fe$_3$Ge, which is composed only of the kagome planes. There are two kinds of kagome-derived Dirac fermions due to the structural three-dimensionality -- one is less dispersive ($k_z$ $\sim$ 0) and the other disperses linearly ($k_z$ $\sim$ $π$). As the Fe moments cant from the $c$ axis into the $ab$ plane upon cooling, the Dirac fermion in $k_z$ $\sim$ 0 plane with a mixture of the Fe-$3d_{xy}$ and Fe-$3d_{x^2-y^2}$ components evolves from gapped into nearly gapless, while the Dirac cone in $k_z$ $\sim$ $π$ plane mainly of the Fe-$3d_{x^2-y^2}$ orbital character remains intact, suggesting that the effect of spin reorientation on the Dirac fermions has an orbital selectivity. Our unambiguous observations provide a feasible route to design and manipulate the mass of Dirac fermions for realizing the novel quantum phases. We also perform comparative studies between the non-charge-ordered Fe$_3$Ge and its sibling compound FeGe, a newly established charge-density-wave kagome magnet, the results suggest that the orbital-selective van Hove singularities near the Fermi level play an indispensable part in driving the charge order on a magnetic kagome lattice.
△ Less
Submitted 12 September, 2023;
originally announced September 2023.
-
Toward Zero-Shot Instruction Following
Authors:
Renze Lou,
Wenpeng Yin
Abstract:
This work proposes a challenging yet more realistic setting for zero-shot cross-task generalization: zero-shot instruction following, presuming the existence of a paragraph-style task definition while no demonstrations exist. To better learn the task supervision from the definition, we propose two strategies: first, to automatically find out the critical sentences in the definition; second, a rank…
▽ More
This work proposes a challenging yet more realistic setting for zero-shot cross-task generalization: zero-shot instruction following, presuming the existence of a paragraph-style task definition while no demonstrations exist. To better learn the task supervision from the definition, we propose two strategies: first, to automatically find out the critical sentences in the definition; second, a ranking objective to force the model to generate the gold outputs with higher probabilities when those critical parts are highlighted in the definition. The joint efforts of the two strategies yield state-of-the-art performance on the Super-NaturalInstructions. Our code is available on GitHub.
△ Less
Submitted 25 January, 2024; v1 submitted 4 August, 2023;
originally announced August 2023.
-
Adaptive Chameleon or Stubborn Sloth: Revealing the Behavior of Large Language Models in Knowledge Conflicts
Authors:
Jian Xie,
Kai Zhang,
Jiangjie Chen,
Renze Lou,
Yu Su
Abstract:
By providing external information to large language models (LLMs), tool augmentation (including retrieval augmentation) has emerged as a promising solution for addressing the limitations of LLMs' static parametric memory. However, how receptive are LLMs to such external evidence, especially when the evidence conflicts with their parametric memory? We present the first comprehensive and controlled…
▽ More
By providing external information to large language models (LLMs), tool augmentation (including retrieval augmentation) has emerged as a promising solution for addressing the limitations of LLMs' static parametric memory. However, how receptive are LLMs to such external evidence, especially when the evidence conflicts with their parametric memory? We present the first comprehensive and controlled investigation into the behavior of LLMs when encountering knowledge conflicts. We propose a systematic framework to elicit high-quality parametric memory from LLMs and construct the corresponding counter-memory, which enables us to conduct a series of controlled experiments. Our investigation reveals seemingly contradicting behaviors of LLMs. On the one hand, different from prior wisdom, we find that LLMs can be highly receptive to external evidence even when that conflicts with their parametric memory, given that the external evidence is coherent and convincing. On the other hand, LLMs also demonstrate a strong confirmation bias when the external evidence contains some information that is consistent with their parametric memory, despite being presented with conflicting evidence at the same time. These results pose important implications that are worth careful consideration for the further development and deployment of tool- and retrieval-augmented LLMs. Resources are available at https://github.com/OSU-NLP-Group/LLM-Knowledge-Conflict.
△ Less
Submitted 27 February, 2024; v1 submitted 22 May, 2023;
originally announced May 2023.
-
Superconducting Arcs
Authors:
Andrii Kuibarov,
Oleksandr Suvorov,
Riccardo Vocaturo,
Alexander Fedorov,
Rui Lou,
Luise Merkwitz,
Vladimir Voroshnin,
Jorge I. Facio,
Klaus Koepernik,
Alexander Yaresko,
Grigoriy Shipunov,
Saicharan Aswartham,
Jeroen van den Brink,
Bernd Büchner,
Sergey Borisenko
Abstract:
An essential ingredient for the production of Majorana fermions that can be used for quantum computing is the presence of topological superconductivity. As bulk topological superconductors remain elusive, the most promising approaches exploit proximity-induced superconductivity making systems fragile and difficult to realize. Weyl semimetals due to their intrinsic topology belong to potential cand…
▽ More
An essential ingredient for the production of Majorana fermions that can be used for quantum computing is the presence of topological superconductivity. As bulk topological superconductors remain elusive, the most promising approaches exploit proximity-induced superconductivity making systems fragile and difficult to realize. Weyl semimetals due to their intrinsic topology belong to potential candidates too, but search for Majorana fermions has always been connected with the superconductivity in the bulk, leaving the possibility of intrinsic superconductivity of the Fermi surface arcs themselves practically without attention, even from the theory side.Here, by means of angle-resolved photoemission spectroscopy and ab-initio calculations, we unambiguously identify topological Fermi arcs on two opposing surfaces of the non-centrosymmetric Weyl material PtBi2. We show that these states become superconducting at different temperatures around 10K. Remarkably, the corresponding coherencepeaks appear as the strongest and sharpest excitations ever detected by photoemission from solids, suggesting significant technological relevance. Our findings indicate that topological superconductivity in PtBi2 occurs exclusively at the surface, which not only makes it an ideal platform to host Majorana fermions, but may also lead to a unique quantum phase - an intrinsic topological SNS Josephson junction.
△ Less
Submitted 4 May, 2023;
originally announced May 2023.
-
Large Language Model Instruction Following: A Survey of Progresses and Challenges
Authors:
Renze Lou,
Kai Zhang,
Wenpeng Yin
Abstract:
Task semantics can be expressed by a set of input-output examples or a piece of textual instruction. Conventional machine learning approaches for natural language processing (NLP) mainly rely on the availability of large-scale sets of task-specific examples. Two issues arise: first, collecting task-specific labeled examples does not apply to scenarios where tasks may be too complicated or costly t…
▽ More
Task semantics can be expressed by a set of input-output examples or a piece of textual instruction. Conventional machine learning approaches for natural language processing (NLP) mainly rely on the availability of large-scale sets of task-specific examples. Two issues arise: first, collecting task-specific labeled examples does not apply to scenarios where tasks may be too complicated or costly to annotate, or the system is required to handle a new task immediately; second, this is not user-friendly since end-users are probably more willing to provide task description rather than a set of examples before using the system. Therefore, the community is paying increasing interest in a new supervision-seeking paradigm for NLP: learning to follow task instructions, i.e., instruction following. Despite its impressive progress, there are some common issues that the community struggles with. This survey paper tries to summarize and provide insights to the current research on instruction following, particularly, by answering the following questions: (i) What is task instruction, and what instruction types exist? (ii) How to model instructions? (iii) What are popular instruction following datasets and evaluation metrics? (iv) What factors influence and explain the instructions' performance? (v) What challenges remain in instruction following? To our knowledge, this is the first comprehensive survey about instruction following.
△ Less
Submitted 24 May, 2024; v1 submitted 18 March, 2023;
originally announced March 2023.
-
PAGE: A Position-Aware Graph-Based Model for Emotion Cause Entailment in Conversation
Authors:
Xiaojie Gu,
Renze Lou,
Lin Sun,
Shangxin Li
Abstract:
Conversational Causal Emotion Entailment (C2E2) is a task that aims at recognizing the causes corresponding to a target emotion in a conversation. The order of utterances in the conversation affects the causal inference. However, most current position encoding strategies ignore the order relation among utterances and speakers. To address the issue, we devise a novel position-aware graph to encode…
▽ More
Conversational Causal Emotion Entailment (C2E2) is a task that aims at recognizing the causes corresponding to a target emotion in a conversation. The order of utterances in the conversation affects the causal inference. However, most current position encoding strategies ignore the order relation among utterances and speakers. To address the issue, we devise a novel position-aware graph to encode the entire conversation, fully modeling causal relations among utterances. The comprehensive experiments show that our method consistently achieves state-of-the-art performance on two challenging test sets, proving the effectiveness of our model. Our source code is available on Github: https://github.com/XiaojieGu/PAGE.
△ Less
Submitted 3 March, 2023;
originally announced March 2023.
-
Signature of weakly coupled $f$ electrons and conduction electrons in magnetic Weyl semimetal candidates PrAlSi and SmAlSi
Authors:
Rui Lou,
Alexander Fedorov,
Lingxiao Zhao,
Alexander Yaresko,
Bernd Büchner,
Sergey Borisenko
Abstract:
Magnetic topological materials are a class of compounds with the underlying interplay of nontrivial band topology and magnetic spin configuration. Extensive interests have been aroused due to their application potential involved with an array of exotic quantum states. With angle-resolved photoemission spectroscopy and first-principles calculations, here we study the electronic properties of two ma…
▽ More
Magnetic topological materials are a class of compounds with the underlying interplay of nontrivial band topology and magnetic spin configuration. Extensive interests have been aroused due to their application potential involved with an array of exotic quantum states. With angle-resolved photoemission spectroscopy and first-principles calculations, here we study the electronic properties of two magnetic Weyl semimetal candidates PrAlSi and SmAlSi. Though the two compounds harbor distinct magnetic ground states (ferromagnetic and antiferromagnetic for PrAlSi and SmAlSi, respectively) and 4$f$ shell fillings, we find that they share quite analogous low-energy band structure. By the measurements across the magnetic transitions, we further reveal that there is no evident evolution of the band structure in both compounds and the experimental spectra can be well reproduced by the nonmagnetic calculations, together suggesting a negligible effect of the magnetism on their electronic structures and a possibly weak coupling between the localized 4$f$ electrons and the itinerant conduction electrons. Our results offer essential insights into the interactions between magnetism, electron correlations, and topological orders in the $R$Al$X$ ($R$ = light rare earth and $X$ = Si or Ge) family.
△ Less
Submitted 31 January, 2023;
originally announced January 2023.
-
Tunable positions of Weyl nodes via magnetism and pressure in the ferromagnetic Weyl semimetal CeAlSi
Authors:
Erjian Cheng,
Limin Yan,
Xianbiao Shi,
Rui Lou,
Alexander Fedorov,
Mahdi Behnami,
Jian Yuan,
Yuanji Xu,
Yang Xu,
Wei Xia,
Nikolai Pavlovskii,
Darren C. Peets,
Weiwei Zhao,
Yimin Wan,
Yanfeng Guo,
Shiyan Li,
Wenge Yang,
Bernd Büchner
Abstract:
The noncentrosymmetric ferromagnetic Weyl semimetal CeAlSi with simultaneous space-inversion (SI) and time-reversal (TR) symmetry breaking provides a unique platform for the exploration of novel topological states. Here, by employing electrical and thermoelectrical transport, angle-resolved photoemission spectroscopy (ARPES), high-pressure techniques, and band calculations, we demonstrate that mag…
▽ More
The noncentrosymmetric ferromagnetic Weyl semimetal CeAlSi with simultaneous space-inversion (SI) and time-reversal (TR) symmetry breaking provides a unique platform for the exploration of novel topological states. Here, by employing electrical and thermoelectrical transport, angle-resolved photoemission spectroscopy (ARPES), high-pressure techniques, and band calculations, we demonstrate that magnetism and pressure can serve as efficient parameters to tune the positions of Weyl nodes in CeAlSi. At ambient pressure, an anomalous Hall effect (AHE) and an anomalous Nernst effect (ANE) arise in the paramagnetic state, and then are enhanced when temperature approaches the ferromagnetic ordering temperature, evidencing magnetism facilitates the AHE/ANE. Such an enhancement of AHE/ANE can be ascribed to the tuning of the positions of Weyl nodes via magnetism. The ARPES measurements reveal that the ferromagnetism serves as a pivotal knob to tune the band structure of CeAlSi both in the bulk and on the surface. Such magnetism-tunable electronic structure has hitherto not been reported in other magnetic $R$Al$Pn$ ($R$ = rare earth elements, $Pn$ = Si, Ge) siblings, suggesting the great potential of controlling Weyl node positions in CeAlSi. Under pressure, an enhancement and a sign change of AHE are discovered. Based on band calculations, the evolution of AHE may root in the tuning of Weyl nodes via pressure. Moreover, multiple pressure-induced phase transitions are uncovered. These findings indicate that CeAlSi provides a unique and tunable platform for exploring exotic topological physics and electron correlations, as well as catering to an array of potential applications, such as spintronics and thermoelectrics.
△ Less
Submitted 19 March, 2023; v1 submitted 10 January, 2023;
originally announced January 2023.
-
Suppression of nematicity by tensile strain in multilayer FeSe/SrTiO$_3$ films
Authors:
Rui Lou,
Oleksandr Suvorov,
Hans-Joachim Grafe,
Andrii Kuibarov,
Maxim Krivenkov,
Oliver Rader,
Bernd Büchner,
Sergey Borisenko,
Alexander Fedorov
Abstract:
The nematicity in multilayer FeSe/SrTiO$_3$ films has been previously suggested to be enhanced with decreasing film thickness. Motivated by this, there have been many discussions about the competing relation between nematicity and superconductivity. However, the criterion for determining the nematicity strength in FeSe remains highly debated. The understanding of nematicity and its relation to sup…
▽ More
The nematicity in multilayer FeSe/SrTiO$_3$ films has been previously suggested to be enhanced with decreasing film thickness. Motivated by this, there have been many discussions about the competing relation between nematicity and superconductivity. However, the criterion for determining the nematicity strength in FeSe remains highly debated. The understanding of nematicity and its relation to superconductivity in FeSe films is therefore still controversial. Here, we fabricate multilayer FeSe/SrTiO$_3$ films using molecular beam epitaxy and study the nematic properties by combining angle-resolved photoemission spectroscopy, nuclear magnetic resonance, and scanning tunneling microscopy experiments. We unambiguously demonstrate that, near the interface, the nematicity is suppressed by the SrTiO$_3$-induced tensile strain; in the bulk region further away from the interface, the strength of nematicity recovers to the bulk value. Our results not only solve the controversy about the nematicity in multilayer FeSe films, but also offer valuable insights into the relationship between nematicity and superconductivity.
△ Less
Submitted 29 June, 2023; v1 submitted 28 July, 2022;
originally announced July 2022.
-
MORE: A Metric Learning Based Framework for Open-domain Relation Extraction
Authors:
Yutong Wang,
Renze Lou,
Kai Zhang,
MaoYan Chen,
Yujiu Yang
Abstract:
Open relation extraction (OpenRE) is the task of extracting relation schemes from open-domain corpora. Most existing OpenRE methods either do not fully benefit from high-quality labeled corpora or can not learn semantic representation directly, affecting downstream clustering efficiency. To address these problems, in this work, we propose a novel learning framework named MORE (Metric learning-base…
▽ More
Open relation extraction (OpenRE) is the task of extracting relation schemes from open-domain corpora. Most existing OpenRE methods either do not fully benefit from high-quality labeled corpora or can not learn semantic representation directly, affecting downstream clustering efficiency. To address these problems, in this work, we propose a novel learning framework named MORE (Metric learning-based Open Relation Extraction). The framework utilizes deep metric learning to obtain rich supervision signals from labeled data and drive the neural model to learn semantic relational representation directly. Experiments result in two real-world datasets show that our method outperforms other state-of-the-art baselines. Our source code is available on Github.
△ Less
Submitted 1 June, 2022;
originally announced June 2022.
-
Emergence of Fermi arcs and novel magnetic splitting in an antiferromagnet
Authors:
Benjamin Schrunk,
Yevhen Kushnirenko,
Brinda Kuthanazhi,
Junyeong Ahn,
Lin-Lin Wang,
Evan O`Leary,
Kyungchan Lee,
Andrew Eaton,
Alexander Fedorov,
Rui Lou,
Vladimir Voroshnin,
Oliver J. Clark,
Jaime Sanchez-Barriga,
Sergey L. Bud`ko,
Robert-Jan Slager,
Paul C. Canfield,
Adam Kaminski
Abstract:
The Fermi arcs are signatures of exotic states in solids because they defy conventional concept of Fermi surfaces as closed contours in momentum space. Fermi arcs were first discovered in cuprates, and caused by the pseudogap. Weyl semimetals provided another way to generate Fermi arcs by breaking either the time reversal symmetry (TRS) or inversion symmetry of a 3D Dirac semimetal, which can resu…
▽ More
The Fermi arcs are signatures of exotic states in solids because they defy conventional concept of Fermi surfaces as closed contours in momentum space. Fermi arcs were first discovered in cuprates, and caused by the pseudogap. Weyl semimetals provided another way to generate Fermi arcs by breaking either the time reversal symmetry (TRS) or inversion symmetry of a 3D Dirac semimetal, which can result in a Weyl semimetal with pairs of Weyl nodes that have opposite chirality. The bulk-boundary correspondence associated with the Chern number leads to the emergence of Fermi arcs on the boundary. Here, we present experimental evidence that pairs of magnetically split hole- and electron-like Fermi arcs emerge below the Neel temperature, in the antiferromagnetic (AFM) state of cubic NdBi due to a novel band splitting effect. Whereas TRS is broken by the AFM order, both inversion and nonsymmorphic TRS are preserved in the bulk, precluding the possibility of a Weyl semimetal. The observed magnetic splitting is highly unusual, as it creates bands of opposing curvature, that changes with temperature and follows the antiferromagnetic order parameter. This is completely different from previously reported cases of magnetic splittings such as traditional Zeeman and Rashba, where the curvature of the bands is preserved. Therefore, our finding represents a new Fermionic state created by new type of magnetic band splitting in the presence of a long-range AFM order that are not readily explained by existing theoretical ideas.
△ Less
Submitted 23 March, 2022;
originally announced March 2022.
-
Electronic structure and open-orbit Fermi surface topology in isostructural semimetals NbAs$_2$ and W$_2$As$_3$ with extremely large magnetoresistance
Authors:
Rui Lou,
Yiyan Wang,
Lingxiao Zhao,
Chenchao Xu,
Man Li,
Xiaoyang Chen,
Anmin Zhang,
Yaobo Huang,
Chao Cao,
Genfu Chen,
Tianlong Xia,
Qingming Zhang,
Hong Ding,
Shancai Wang
Abstract:
In transition-metal dipnictides $TmPn_2$ ($Tm$ = Ta, Nb; $Pn$ = P, As, Sb), the origin of extremely large magnetoresistance (XMR) is yet to be studied by the direct visualization of the experimental band structures. Here, using angle-resolved photoemission spectroscopy, we map out the three-dimensional electronic structure of NbAs$_2$. The open-orbit topology contributes to a non-negligible part o…
▽ More
In transition-metal dipnictides $TmPn_2$ ($Tm$ = Ta, Nb; $Pn$ = P, As, Sb), the origin of extremely large magnetoresistance (XMR) is yet to be studied by the direct visualization of the experimental band structures. Here, using angle-resolved photoemission spectroscopy, we map out the three-dimensional electronic structure of NbAs$_2$. The open-orbit topology contributes to a non-negligible part of the Fermi surfaces (FSs), like that of the isostructural compound MoAs$_2$, where the open FS is proposed to likely explain the origin of XMR. We further demonstrate the observation of open characters in the overall FSs of W$_2$As$_3$, which is also a XMR semimetal with the same space group of $C$12/$m$1 as $TmPn_2$ family and MoAs$_2$. Our results suggest that the open-orbit FS topology may be a shared feature between XMR materials with the space group of $C$12/$m$1, and thus could possibly play a role in determining the corresponding XMR effect together with the electron-hole compensation.
△ Less
Submitted 22 March, 2022;
originally announced March 2022.
-
Electronic structure and signature of Tomonaga-Luttinger liquid state in epitaxial CoSb$_{1-x}$ nanoribbons
Authors:
Rui Lou,
Minyinan Lei,
Wenjun Ding,
Wentao Yang,
Xiaoyang Chen,
Ran Tao,
Shuyue Ding,
Xiaoping Shen,
Yajun Yan,
Ping Cui,
Haichao Xu,
Rui Peng,
Tong Zhang,
Zhenyu Zhang,
Donglai Feng
Abstract:
Recently, monolayer CoSb/SrTiO$_3$ has been proposed as a candidate harboring interfacial superconductivity in analogy with monolayer FeSe/SrTiO$_3$. Experimentally, while the CoSb-based compounds manifesting as nanowires and thin films have been realized on SrTiO$_3$ substrates, serving as a rich playground, their electronic structures are still unknown and yet to be resolved. Here, we have fabri…
▽ More
Recently, monolayer CoSb/SrTiO$_3$ has been proposed as a candidate harboring interfacial superconductivity in analogy with monolayer FeSe/SrTiO$_3$. Experimentally, while the CoSb-based compounds manifesting as nanowires and thin films have been realized on SrTiO$_3$ substrates, serving as a rich playground, their electronic structures are still unknown and yet to be resolved. Here, we have fabricated CoSb$_{1-x}$ nanoribbons with quasi-one-dimensional stripes on SrTiO$_3$(001) substrates using molecular beam epitaxy, and investigated the electronic structure by in situ angle-resolved photoemission spectroscopy. Straight Fermi surfaces without lateral dispersions are observed. CoSb$_{1-x}$/SrTiO$_3$ is slightly hole doped, where the interfacial charge transfer is opposite to that in monolayer FeSe/SrTiO$_3$. The spectral weight near Fermi level exhibits power-law-like suppression and obeys a universal temperature scaling, serving as the signature of Tomonaga-Luttinger liquid (TLL) state. The obtained TLL parameter of $\sim$0.21 shows the underlying strong correlations. Our results not only suggest CoSb$_{1-x}$ nanoribbon as a representative TLL system, but also provide clues for further investigations on the CoSb-related interface.
△ Less
Submitted 21 September, 2021;
originally announced September 2021.
-
GradTS: A Gradient-Based Automatic Auxiliary Task Selection Method Based on Transformer Networks
Authors:
Weicheng Ma,
Renze Lou,
Kai Zhang,
Lili Wang,
Soroush Vosoughi
Abstract:
A key problem in multi-task learning (MTL) research is how to select high-quality auxiliary tasks automatically. This paper presents GradTS, an automatic auxiliary task selection method based on gradient calculation in Transformer-based models. Compared to AUTOSEM, a strong baseline method, GradTS improves the performance of MT-DNN with a bert-base-cased backend model, from 0.33% to 17.93% on 8 na…
▽ More
A key problem in multi-task learning (MTL) research is how to select high-quality auxiliary tasks automatically. This paper presents GradTS, an automatic auxiliary task selection method based on gradient calculation in Transformer-based models. Compared to AUTOSEM, a strong baseline method, GradTS improves the performance of MT-DNN with a bert-base-cased backend model, from 0.33% to 17.93% on 8 natural language understanding (NLU) tasks in the GLUE benchmarks. GradTS is also time-saving since (1) its gradient calculations are based on single-task experiments and (2) the gradients are re-used without additional experiments when the candidate task set changes. On the 8 GLUE classification tasks, for example, GradTS costs on average 21.32% less time than AUTOSEM with comparable GPU consumption. Further, we show the robustness of GradTS across various task settings and model selections, e.g. mixed objectives among candidate tasks. The efficiency and efficacy of GradTS in these case studies illustrate its general applicability in MTL research without requiring manual task filtering or costly parameter tuning.
△ Less
Submitted 13 September, 2021;
originally announced September 2021.
-
Contributions of Transformer Attention Heads in Multi- and Cross-lingual Tasks
Authors:
Weicheng Ma,
Kai Zhang,
Renze Lou,
Lili Wang,
Soroush Vosoughi
Abstract:
This paper studies the relative importance of attention heads in Transformer-based models to aid their interpretability in cross-lingual and multi-lingual tasks. Prior research has found that only a few attention heads are important in each mono-lingual Natural Language Processing (NLP) task and pruning the remaining heads leads to comparable or improved performance of the model. However, the impa…
▽ More
This paper studies the relative importance of attention heads in Transformer-based models to aid their interpretability in cross-lingual and multi-lingual tasks. Prior research has found that only a few attention heads are important in each mono-lingual Natural Language Processing (NLP) task and pruning the remaining heads leads to comparable or improved performance of the model. However, the impact of pruning attention heads is not yet clear in cross-lingual and multi-lingual tasks. Through extensive experiments, we show that (1) pruning a number of attention heads in a multi-lingual Transformer-based model has, in general, positive effects on its performance in cross-lingual and multi-lingual tasks and (2) the attention heads to be pruned can be ranked using gradients and identified with a few trial experiments. Our experiments focus on sequence labeling tasks, with potential applicability on other cross-lingual and multi-lingual tasks. For comprehensiveness, we examine two pre-trained multi-lingual models, namely multi-lingual BERT (mBERT) and XLM-R, on three tasks across 9 languages each. We also discuss the validity of our findings and their extensibility to truly resource-scarce languages and other task settings.
△ Less
Submitted 18 August, 2021;
originally announced August 2021.
-
Charge-Density-Wave-Induced Peak-Dip-Hump Structure and the Multiband Superconductivity in a Kagome Superconductor CsV$_{3}$Sb$_{5}$
Authors:
Rui Lou,
Alexander Fedorov,
Qiangwei Yin,
Andrii Kuibarov,
Zhijun Tu,
Chunsheng Gong,
Eike F. Schwier,
Bernd Büchner,
Hechang Lei,
Sergey Borisenko
Abstract:
The entanglement of charge density wave (CDW), superconductivity, and topologically nontrivial electronic structure has recently been discovered in the kagome metal $A$V$_3$Sb$_5$ ($A$ = K, Rb, Cs) family. With high-resolution angle-resolved photoemission spectroscopy, we study the electronic properties of CDW and superconductivity in CsV$_3$Sb$_5$. The spectra around $\bar{K}$ is found to exhibit…
▽ More
The entanglement of charge density wave (CDW), superconductivity, and topologically nontrivial electronic structure has recently been discovered in the kagome metal $A$V$_3$Sb$_5$ ($A$ = K, Rb, Cs) family. With high-resolution angle-resolved photoemission spectroscopy, we study the electronic properties of CDW and superconductivity in CsV$_3$Sb$_5$. The spectra around $\bar{K}$ is found to exhibit a peak-dip-hump structure associated with two separate branches of dispersion, demonstrating the isotropic CDW gap opening below $E_{\rm F}$. The peak-dip-hump lineshape is contributed by linearly dispersive Dirac bands in the lower branch and a dispersionless flat band close to $E_{\rm F}$ in the upper branch. The electronic instability via Fermi surface nesting could play a role in determining these CDW-related features. The superconducting gap of $\sim$0.4 meV is observed on both the electron band around $\barΓ$ and the flat band around $\bar{K}$, implying the multiband superconductivity. The finite density of states (DOS) at $E_{\rm F}$ in the CDW phase are most likely in favor of the emergence of multiband superconductivity, particularly the enhanced DOS associated with the flat band. Our results not only shed light on the controversial origin of the CDW, but also offer insights into the relationship between CDW and superconductivity.
△ Less
Submitted 8 January, 2022; v1 submitted 11 June, 2021;
originally announced June 2021.
-
Topological phase transition between distinctWeyl semimetal states in MoTe2
Authors:
Anmin Zhang,
Xiaoli Ma,
Changle Liu,
Rui Lou,
Yimeng Wang,
Qiaohe Yu,
Yiyan Wang,
Tian-long Xia,
Shancai Wang,
Lei Zhang,
Xiaoqun Wang,
Changfeng Chen,
Qingming Zhang
Abstract:
We present experimental evidence of an intriguing phase transition between distinct topological states in the type-II Weyl semimetal MoTe2. We observe anomalies in the Raman phonon frequencies and linewidths as well as electronic quasielastic peaks around 70 K, which, together with structural, thermodynamic measurements, and electron-phonon coupling calculations, demonstrate a temperature-induced…
▽ More
We present experimental evidence of an intriguing phase transition between distinct topological states in the type-II Weyl semimetal MoTe2. We observe anomalies in the Raman phonon frequencies and linewidths as well as electronic quasielastic peaks around 70 K, which, together with structural, thermodynamic measurements, and electron-phonon coupling calculations, demonstrate a temperature-induced transition between two topological phases previously identified by contrasting spectroscopic measurements. An analysis of experimental data suggests electron-phonon coupling as the main driving mechanism for the change of key topological characters in the electronic structure of MoTe2.We also find the phase transition to be sensitive to sample conditions distinguished by synthesis methods. These discoveries of temperature and material condition-dependent topological phase evolutions and transitions in MoTe2 advance the fundamental understanding of the underlying physics and enable an effective approach to tuning Weyl semimetal states for technological applications.
△ Less
Submitted 26 November, 2019;
originally announced November 2019.
-
Experimental observation of bulk nodal lines and electronic surface states in ZrB$_2$
Authors:
Rui Lou,
Pengjie Guo,
Man Li,
Qi Wang,
Zhonghao Liu,
Shanshan Sun,
Chenghe Li,
Xuchuan Wu,
Zilu Wang,
Zhe Sun,
Dawei Shen,
Yaobo Huang,
Kai Liu,
Zhong-Yi Lu,
Hechang Lei,
Hong Ding,
Shancai Wang
Abstract:
Topological nodal-line semimetals are characterized by the line-contact bulk band crossings and the topological surface states. Breaking certain protecting symmetry turns this system into a Dirac semimetal or Weyl semimetal that hosts zero-dimensional isolated nodal points. Recent advances in band theory predicted a topological nodal-line semimetal state possessing a new type of nodal line in AlB…
▽ More
Topological nodal-line semimetals are characterized by the line-contact bulk band crossings and the topological surface states. Breaking certain protecting symmetry turns this system into a Dirac semimetal or Weyl semimetal that hosts zero-dimensional isolated nodal points. Recent advances in band theory predicted a topological nodal-line semimetal state possessing a new type of nodal line in AlB$_2$-type diborides. Here, we report an experimental realization of nodal-line fermions and associated surface states near the Fermi energy in ZrB$_2$ by angle-resolved photoemission spectroscopy combined with first-principles calculations. The Dirac nodal lines in ZrB$_2$ wind into two groups of nodal rings, which are linked together along the $Γ$-$K$ direction. We further observe a distinct surface state connecting to each nodal line, indicative of the nontrivial topological nature of the bulk nodal lines. Therefore, our results provide convincing experimental evidence of the nodal-line semimetal states in ZrB$_2$ both in the bulk and on the surface, suggesting ZrB$_2$ as a remarkable platform for discovering unique phenomena induced by nodal-line fermions.
△ Less
Submitted 13 September, 2018; v1 submitted 2 May, 2018;
originally announced May 2018.
-
Large intrinsic anomalous Hall effect in half-metallic ferromagnet Co3Sn2S2 with magnetic Weyl fermions
Authors:
Qi Wang,
Yuanfeng Xu,
Rui Lou,
Zhonghao Liu,
Man Li,
Yaobo Huang,
Dawei Shen,
Hongming Weng,
Shancai Wang,
Hechang Lei
Abstract:
The origin of anomalous Hall effect (AHE) in magnetic materials is one of the most intriguing aspect in condensed matter physics and has been controversial for a long time. Recent studies indicate that the intrinsic AHE is closely related to the Berry curvature of occupied electronic states. In a magnetic Weyl semimetal with broken time-reversal symmetry, there are significant contributions on Ber…
▽ More
The origin of anomalous Hall effect (AHE) in magnetic materials is one of the most intriguing aspect in condensed matter physics and has been controversial for a long time. Recent studies indicate that the intrinsic AHE is closely related to the Berry curvature of occupied electronic states. In a magnetic Weyl semimetal with broken time-reversal symmetry, there are significant contributions on Berry curvature around Weyl nodes, which would lead to a large intrinsic AHE. Here, we report the large intrinsic AHE in the half-metallic ferromagnet Co3Sn2S2 single crystal. By systematically mapping out the electronic structure of Co3Sn2S2 theoretically and experimentally, the large intrinsic AHE should originate from the Weyl fermions near the Fermi energy. Furthermore, the intrinsic anomalous Hall conductivity depends linearly on the magnetization and this can be attributed to the sharp decrease of magnetization and the change of topological characteristics.
△ Less
Submitted 28 December, 2017;
originally announced December 2017.
-
Experimental Observation of Dirac Nodal Links in Centrosymmetric Semimetal TiB$_2$
Authors:
Zhonghao Liu,
Rui Lou,
Pengjie Guo,
Qi Wang,
Shanshan Sun,
Chenghe Li,
Setti Thirupathaiah,
Alexander Fedorov,
Dawei Shen,
Kai Liu,
Hechang Lei,
Shancai Wang
Abstract:
The topological nodal-line semimetal state, serving as a fertile ground for various topological quantum phases, where a topological insulator, Dirac semimetal, or Weyl semimetal can be realized when the certain protecting symmetry is broken, has only been experimentally studied in very few materials. In contrast to discrete nodes, nodal lines with rich topological configurations can lead to more u…
▽ More
The topological nodal-line semimetal state, serving as a fertile ground for various topological quantum phases, where a topological insulator, Dirac semimetal, or Weyl semimetal can be realized when the certain protecting symmetry is broken, has only been experimentally studied in very few materials. In contrast to discrete nodes, nodal lines with rich topological configurations can lead to more unusual transport phenomena. Utilizing angle-resolved photoemission spectroscopy and first-principles calculations, here, we provide compelling evidence of nodal-line fermions in centrosymmetric semimetal TiB$_2$ with a negligible spin-orbit coupling effect. With the band crossings just below the Fermi energy, two groups of Dirac nodal rings are clearly observed without any interference from other bands, one surrounding the Brillouin zone (BZ) corner in the horizontal mirror plane $σ_h$ and the other surrounding the BZ center in the vertical mirror plane $σ_v$. The linear dispersions forming Dirac nodal rings are as wide as 2 eV. We further observe that the two groups of nodal rings link together along the $Γ$-$K$ direction, composing a nodal-link configuration. The simple electronic structure with Dirac nodal links mainly constituting the Fermi surfaces suggests TiB$_2$ as a remarkable platform for studying and applying the novel physical properties related to nodal-line fermions.
△ Less
Submitted 17 August, 2018; v1 submitted 8 December, 2017;
originally announced December 2017.
-
Observation of Open-Orbit Fermi Surface Topology in Extremely Large Magnetoresistance Semimetal MoAs$_2$
Authors:
R. Lou,
Y. F. Xu,
L. -X. Zhao,
Z. -Q. Han,
P. -J. Guo,
M. Li,
J. -C. Wang,
B. -B. Fu,
Z. -H. Liu,
Y. -B. Huang,
P. Richard,
T. Qian,
K. Liu,
G. -F. Chen,
H. M. Weng,
H. Ding,
S. -C. Wang
Abstract:
While recent advances in band theory and sample growth have expanded the series of extremely large magnetoresistance (XMR) semimetals in transition metal dipnictides $TmPn_2$ ($Tm$ = Ta, Nb; $Pn$ = P, As, Sb), the experimental study on their electronic structure and the origin of XMR is still absent. Here, using angle-resolved photoemission spectroscopy combined with first-principles calculations…
▽ More
While recent advances in band theory and sample growth have expanded the series of extremely large magnetoresistance (XMR) semimetals in transition metal dipnictides $TmPn_2$ ($Tm$ = Ta, Nb; $Pn$ = P, As, Sb), the experimental study on their electronic structure and the origin of XMR is still absent. Here, using angle-resolved photoemission spectroscopy combined with first-principles calculations and magnetotransport measurements, we performed a comprehensive investigation on MoAs$_2$, which is isostructural to the $TmPn_2$ family and also exhibits quadratic XMR. We resolve a clear band structure well agreeing with the predictions. Intriguingly, the unambiguously observed Fermi surfaces (FSs) are dominated by an open-orbit topology extending along both the [100] and [001] directions in the three-dimensional Brillouin zone. We further reveal the trivial topological nature of MoAs$_2$ by bulk parity analysis. Based on these results, we examine the proposed XMR mechanisms in other semimetals, and conclusively ascribe the origin of quadratic XMR in MoAs$_2$ to the carriers motion on the FSs with dominant open-orbit topology, innovating in the understanding of quadratic XMR in semimetals.
△ Less
Submitted 22 November, 2017; v1 submitted 17 July, 2017;
originally announced July 2017.
-
Observation of oscillatory relaxation in the Sn-terminated surface of epitaxial rock-salt SnSe $\{111\}$ topological crystalline insulator
Authors:
Wencan Jin,
Suresh Vishwanath,
Jianpeng Liu,
Lingyuan Kong,
Rui Lou,
Zhongwei Dai,
Jerzy T. Sadowski,
Xinyu Liu,
Huai-Hsun Lien,
Alexander Chaney,
Yimo Han,
Micheal Cao,
Junzhang Ma,
Tian Qian,
Jerry I. Dadap,
Shancai Wang,
Malgorzata Dobrowolska,
Jacek Furdyna,
David A. Muller,
Karsten Pohl,
Hong Ding,
Huili Grace Xing,
Richard M. Osgood, Jr
Abstract:
Topological crystalline insulators have been recently predicted and observed in rock-salt structure SnSe $\{111\}$ thin films. Previous studies have suggested that the Se-terminated surface of this thin film with hydrogen passivation, has a reduced surface energy and is thus a preferred configuration. In this paper, synchrotron-based angle-resolved photoemission spectroscopy, along with density fu…
▽ More
Topological crystalline insulators have been recently predicted and observed in rock-salt structure SnSe $\{111\}$ thin films. Previous studies have suggested that the Se-terminated surface of this thin film with hydrogen passivation, has a reduced surface energy and is thus a preferred configuration. In this paper, synchrotron-based angle-resolved photoemission spectroscopy, along with density functional theory calculations, are used to demonstrate conclusively that a rock-salt SnSe $\{111\}$ thin film epitaxially-grown on \ce{Bi2Se3} has a stable Sn-terminated surface. These observations are supported by low energy electron diffraction (LEED) intensity-voltage measurements and dynamical LEED calculations, which further show that the Sn-terminated SnSe $\{111\}$ thin film has undergone a surface structural relaxation of the interlayer spacing between the Sn and Se atomic planes. In sharp contrast to the Se-terminated counterpart, the observed Dirac surface state in the Sn-terminated SnSe $\{111\}$ thin film is shown to yield a high Fermi velocity, $0.50\times10^6$m/s, which suggests a potential mechanism of engineering the Dirac surface state of topological materials by tuning the surface configuration.
△ Less
Submitted 10 April, 2017;
originally announced April 2017.
-
Evidence of topological insulator state in the semimetal LaBi
Authors:
R. Lou,
B. -B. Fu,
Q. N. Xu,
P. -J. Guo,
L. -Y. Kong,
L. -K. Zeng,
J. -Z. Ma,
P. Richard,
C. Fang,
Y. -B. Huang,
S. -S. Sun,
Q. Wang,
L. Wang,
Y. -G. Shi,
H. C. Lei,
K. Liu,
H. M. Weng,
T. Qian,
H. Ding,
S. -C. Wang
Abstract:
By employing angle-resolved photoemission spectroscopy combined with first-principles calculations, we performed a systematic investigation on the electronic structure of LaBi, which exhibits extremely large magnetoresistance (XMR), and is theoretically predicted to possess band anticrossing with nontrivial topological properties. Here, the observations of the Fermi-surface topology and band dispe…
▽ More
By employing angle-resolved photoemission spectroscopy combined with first-principles calculations, we performed a systematic investigation on the electronic structure of LaBi, which exhibits extremely large magnetoresistance (XMR), and is theoretically predicted to possess band anticrossing with nontrivial topological properties. Here, the observations of the Fermi-surface topology and band dispersions are similar to previous studies on LaSb [Phys. Rev. Lett. 117, 127204 (2016)], a topologically trivial XMR semimetal, except the existence of a band inversion along the $Γ$-$X$ direction, with one massless and one gapped Dirac-like surface state at the $X$ and $Γ$ points, respectively. The odd number of massless Dirac cones suggests that LaBi is analogous to the time-reversal $Z_2$ nontrivial topological insulator. These findings open up a new series for exploring novel topological states and investigating their evolution from the perspective of topological phase transition within the family of rare-earth monopnictides.
△ Less
Submitted 23 March, 2017; v1 submitted 12 December, 2016;
originally announced December 2016.
-
Engineering the structural and electronic phases of MoTe2 through W substitution
Authors:
D. Rhodes,
D. A. Chenet,
B. E. Janicek,
C. Nyby,
Y. Lin,
W. Jin,
D. Edelberg,
E. Mannebach,
N. Finney,
A. Antony,
T. Schiros,
T. Klarr,
A. Mazzoni,
M. Chin,
Y. -c Chiu,
W. Zheng,
Q. R. Zhang,
F. Ernst,
J. I. Dadap,
X. Tong,
J. Ma,
R. Lou,
S. Wang,
T. Qian,
H. Ding
, et al. (8 additional authors not shown)
Abstract:
MoTe$_2$ is an exfoliable transition metal dichalcogenide (TMD) which crystallizes in three symmetries, the semiconducting trigonal-prismatic $2H-$phase, the semimetallic $1T^{\prime}$ monoclinic phase, and the semimetallic orthorhombic $T_d$ structure. The $2H-$phase displays a band gap of $\sim 1$ eV making it appealing for flexible and transparent optoelectronics. The $T_d-$phase is predicted t…
▽ More
MoTe$_2$ is an exfoliable transition metal dichalcogenide (TMD) which crystallizes in three symmetries, the semiconducting trigonal-prismatic $2H-$phase, the semimetallic $1T^{\prime}$ monoclinic phase, and the semimetallic orthorhombic $T_d$ structure. The $2H-$phase displays a band gap of $\sim 1$ eV making it appealing for flexible and transparent optoelectronics. The $T_d-$phase is predicted to possess unique topological properties which might lead to topologically protected non-dissipative transport channels. Recently, it was argued that it is possible to locally induce phase-transformations in TMDs, through chemical doping, local heating, or electric-field to achieve ohmic contacts or to induce useful functionalities such as electronic phase-change memory elements. The combination of semiconducting and topological elements based upon the same compound, might produce a new generation of high performance, low dissipation optoelectronic elements. Here, we show that it is possible to engineer the phases of MoTe$_2$ through W substitution by unveiling the phase-diagram of the Mo$_{1-x}$W$_x$Te$_2$ solid solution which displays a semiconducting to semimetallic transition as a function of $x$. We find that only $\sim 8$ \% of W stabilizes the $T_d-$phase at room temperature. Photoemission spectroscopy, indicates that this phase possesses a Fermi surface akin to that of WTe$_2$.
△ Less
Submitted 8 October, 2016;
originally announced October 2016.
-
Compensated semimetal LaSb with unsaturated magnetoresistance
Authors:
L. -K. Zeng,
R. Lou,
D. -S. Wu,
Q. N. Xu,
P. -J. Guo,
L. -Y. Kong,
Y. -G. Zhong,
J. -Z. Ma,
B. -B. Fu,
P. Richard,
P. Wang,
G. T. Liu,
L. Lu,
Y. -B. Huang,
C. Fang,
S. -S. Sun,
Q. Wang,
L. Wang,
Y. -G. Shi,
H. M. Weng,
H. -C. Lei,
K. Liu,
S. -C. Wang,
T. Qian,
J. -L. Luo
, et al. (1 additional authors not shown)
Abstract:
By combining angle-resolved photoemission spectroscopy and quantum oscillation measurements, we performed a comprehensive investigation on the electronic structure of LaSb, which exhibits near-quadratic extremely large magnetoresistance (XMR) without any sign of saturation at magnetic fields as high as 40 T. We clearly resolve one spherical and one intersecting-ellipsoidal hole Fermi surfaces (FSs…
▽ More
By combining angle-resolved photoemission spectroscopy and quantum oscillation measurements, we performed a comprehensive investigation on the electronic structure of LaSb, which exhibits near-quadratic extremely large magnetoresistance (XMR) without any sign of saturation at magnetic fields as high as 40 T. We clearly resolve one spherical and one intersecting-ellipsoidal hole Fermi surfaces (FSs) at the Brillouin zone (BZ) center $Γ$ and one ellipsoidal electron FS at the BZ boundary $X$. The hole and electron carriers calculated from the enclosed FS volumes are perfectly compensated, and the carrier compensation is unaffected by temperature. We further reveal that LaSb is topologically trivial but share many similarities with the Weyl semimetal TaAs family in the bulk electronic structure. Based on these results, we have examined the mechanisms that have been proposed so far to explain the near-quadratic XMR in semimetals.
△ Less
Submitted 19 September, 2016; v1 submitted 27 April, 2016;
originally announced April 2016.
-
Magnetoresistance and Shubnikov-de Hass oscillation in YSb
Authors:
Qiao-He Yu,
Yi-Yan Wang,
Rui Lou,
Peng-Jie Guo,
Sheng Xu,
Kai Liu,
Shancai Wang,
Tian-Long Xia
Abstract:
YSb crystals are grown and the transport properties under magnetic field are measured. The resistivity exhibits metallic behavior under zero magnetic field and the low temperature resistivity shows a clear upturn once a moderate magnetic field is applied. The upturn is greatly enhanced by increasing magnetic field, finally resulting in a metal-to-insulator-like transition. With temperature further…
▽ More
YSb crystals are grown and the transport properties under magnetic field are measured. The resistivity exhibits metallic behavior under zero magnetic field and the low temperature resistivity shows a clear upturn once a moderate magnetic field is applied. The upturn is greatly enhanced by increasing magnetic field, finally resulting in a metal-to-insulator-like transition. With temperature further decreased, a resistivity plateau emerges after the insulator-like regime. At low temperature (2.5 K) and high field (14 T), the transverse magnetoresistance (MR) is quite large (3.47 $\times 10^4\%$ ). In addition, Shubnikov-de Haas (SdH) oscillation has also been observed in YSb. Periodic behavior of the oscillation amplitude reveals the related information about Fermi surface and two major oscillation frequencies can be obtained from the FFT spectra of the oscillations. The trivial Berry phase extracted from SdH oscillation, band structure revealed by angle-resolved photoemission spectroscopy (ARPES) and first-principles calculations demonstrate that YSb is a topologically trivial material.
△ Less
Submitted 5 March, 2017; v1 submitted 20 April, 2016;
originally announced April 2016.
-
Emergence of topological bands on the surface of ZrSnTe crystal
Authors:
R. Lou,
J. -Z. Ma,
Q. -N. Xu,
B. -B. Fu,
L. -Y. Kong,
Y. -G. Shi,
P. Richard,
H. -M. Weng,
Z. Fang,
S. -S. Sun,
Q. Wang,
H. -C. Lei,
T. Qian,
H. Ding,
S. -C. Wang
Abstract:
By using angle-resolved photoemission spectroscopy combined with first-principles calculations, we reveal that the topmost unit cell of ZrSnTe crystal hosts two-dimensional (2D) electronic bands of topological insulator (TI) state, though such a TI state is defined with a curved Fermi level instead of a global band gap. Furthermore, we find that by modifying the dangling bonds on the surface throu…
▽ More
By using angle-resolved photoemission spectroscopy combined with first-principles calculations, we reveal that the topmost unit cell of ZrSnTe crystal hosts two-dimensional (2D) electronic bands of topological insulator (TI) state, though such a TI state is defined with a curved Fermi level instead of a global band gap. Furthermore, we find that by modifying the dangling bonds on the surface through hydrogenation, this 2D band structure can be manipulated so that the expected global energy gap is most likely to be realized. This facilitates the practical applications of 2D TI in heterostructural devices and those with surface decoration and coverage. Since ZrSnTe belongs to a large family of compounds having the similar crystal and band structures, our findings shed light on identifying more 2D TI candidates and superconductor-TI heterojunctions supporting topological superconductors.
△ Less
Submitted 19 September, 2016; v1 submitted 27 January, 2016;
originally announced January 2016.
-
Interplay between multiple charge-density waves and the relationship with superconductivity in Pd$_x$HoTe$_{3}$
Authors:
Rui Lou,
Yipeng Cai,
Zhonghao Liu,
Tian Qian,
Lingxiao Zhao,
Yu Li,
Kai Liu,
Zhiqing Han,
Dandan Zhang,
Junbao He,
Genfu Chen,
Hong Ding,
Shancai Wang
Abstract:
HoTe$_{3}$, a member of the rare-earth tritelluride ($R$Te$_{3}$) family, and its Pd-intercalated compounds, Pd$_x$HoTe$_{3}$, where superconductivity (SC) sets in as the charge-density wave (CDW) transition is suppressed by the intercalation of a small amount of Pd, are investigated using angle-resolved photoemission spectroscopy (ARPES) and electrical resistivity. Two incommensurate CDWs with pe…
▽ More
HoTe$_{3}$, a member of the rare-earth tritelluride ($R$Te$_{3}$) family, and its Pd-intercalated compounds, Pd$_x$HoTe$_{3}$, where superconductivity (SC) sets in as the charge-density wave (CDW) transition is suppressed by the intercalation of a small amount of Pd, are investigated using angle-resolved photoemission spectroscopy (ARPES) and electrical resistivity. Two incommensurate CDWs with perpendicular nesting vectors are observed in HoTe$_{3}$ at low temperatures. With a slight Pd intercalation ($x$ = 0.01), the large CDW gap decreases and the small one increases. The momentum dependence of the gaps along the inner Fermi surface (FS) evolves from orthorhombicity to near tetragonality, manifesting the competition between two CDW orders. At $x$ = 0.02, both CDW gaps decreases with the emergence of SC. Further increasing the content of Pd for $x$ = 0.04 will completely suppress the CDW instabilities and give rise to the maximal SC order. The evolution of the electronic structures and electron-phonon couplings (EPCs) of the multiple CDWs upon Pd intercalation are carefully scrutinized. We discuss the interplay between multiple CDW orders, and the competition between CDW and SC in detail.
△ Less
Submitted 19 September, 2016; v1 submitted 7 January, 2016;
originally announced January 2016.
-
Sudden gap-closure across the topological phase transition in Bi$_{2-x}$In$_{x}$Se$_{3}$
Authors:
Rui Lou,
Zhonghao Liu,
Wencan Jin,
Haifeng Wang,
Zhiqing Han,
Kai Liu,
Xueyun Wang,
Tian Qian,
Yevhen Kushnirenko,
Sang-Wook Cheong,
Richard M. Osgood, Jr.,
Hong Ding,
Shancai Wang
Abstract:
The phase transition from a topological insulator to a trivial band insulator is studied by angle-resoled photoemission spectroscopy on Bi$_{2-x}$In$_{x}$Se$_{3}$ single crystals. We first report the complete evolution of the bulk band structures throughout the transition. The robust surface state and the bulk gap size ($\sim$ 0.50 eV) show no significant change upon doping for $x$ = 0.05, 0.10 an…
▽ More
The phase transition from a topological insulator to a trivial band insulator is studied by angle-resoled photoemission spectroscopy on Bi$_{2-x}$In$_{x}$Se$_{3}$ single crystals. We first report the complete evolution of the bulk band structures throughout the transition. The robust surface state and the bulk gap size ($\sim$ 0.50 eV) show no significant change upon doping for $x$ = 0.05, 0.10 and 0.175. At $x$ $\geq$ 0.225, the surface state completely disappears and the bulk gap size increases, suggesting a sudden gap-closure and topological phase transition around $x \sim$ 0.175$-$0.225. We discuss the underlying mechanism of the phase transition, proposing that it is governed by the combined effect of spin-orbit coupling and interactions upon band hybridization. Our study provides a new venue to investigate the mechanism of the topological phase transition induced by non-magnetic impurities.
△ Less
Submitted 19 September, 2016; v1 submitted 26 March, 2015;
originally announced March 2015.