subscribe to arXiv mailings

arXiv:2405.19853 [pdf]

Correlated Electronic Structure and Density-Wave Gap in Trilayer Nickelate La4Ni3O10

Authors: X. Du, Y. D. Li, Y. T. Cao, C. Y. Pei, M. X. Zhang, W. X. Zhao, K. Y. Zhai, R. Z. Xu, Z. K. Liu, Z. W. Li, J. K. Zhao, G. Li, Y. L. Chen, Y. P. Qi, H. J. Guo, L. X. Yang

Abstract: The discovery of pressurized superconductivity at 80 K in La3Ni2O7 officially brings nickelates into the family of high-temperature superconductors, which gives rise to not only new insights but also mysteries in the strongly correlated superconductivity. More recently, the sibling compound La4Ni3O10 was also shown to be superconducting below about 25 K under pressure, further boosting the popular… ▽ More The discovery of pressurized superconductivity at 80 K in La3Ni2O7 officially brings nickelates into the family of high-temperature superconductors, which gives rise to not only new insights but also mysteries in the strongly correlated superconductivity. More recently, the sibling compound La4Ni3O10 was also shown to be superconducting below about 25 K under pressure, further boosting the popularity of nickelates in the Ruddlesden-Popper phase. In this study, combining high-resolution angle-resolved photoemission spectroscopy and ab initio calculation, we systematically investigate the electronic structures of La4Ni3O10 at ambient pressure. We reveal a high resemblance of La4Ni3O10 with La3Ni2O7 in the orbital-dependent fermiology and electronic structure, suggesting a similar electronic correlation between the two compounds. The temperature-dependent measurements imply an orbital-dependent energy gap related to the density-wave transition in La4Ni3O10. By comparing the theoretical pressure-dependent electronic structure, clues about the superconducting high-pressure phase can be deduced from the ambient measurements, providing crucial information for deciphering the unconventional superconductivity in nickelates. △ Less

Submitted 30 May, 2024; originally announced May 2024.

arXiv:2403.05012 [pdf]

Ultrafast Dynamics of Bilayer and Trilayer Nickelate Superconductors

Authors: Y. D. Li, Y. T. Cao, L. Y. Liu, P. Peng, H. Lin, C. Y. Pei, M. X. Zhang, H. Wu, X. Du, W. X. Zhao, K. Y. Zhai, J. K. Zhao, M. -L. Lin, P. H. Tan, Y. P. Qi, G. Li, H. J. Guo, Luyi Yang, L. X. Yang

Abstract: In addition to the pressurized high-temperature superconductivity, bilayer and trilayer nickelate superconductors Lan+1NinO3n+1 (n = 2 and 3) exhibit many intriguing properties at ambient pressure, such as orbital-dependent electronic correlation, non-Fermi liquid behavior, and density-wave transitions. Here, using ultrafast reflectivity measurement, we observe a drastic difference between the ult… ▽ More In addition to the pressurized high-temperature superconductivity, bilayer and trilayer nickelate superconductors Lan+1NinO3n+1 (n = 2 and 3) exhibit many intriguing properties at ambient pressure, such as orbital-dependent electronic correlation, non-Fermi liquid behavior, and density-wave transitions. Here, using ultrafast reflectivity measurement, we observe a drastic difference between the ultrafast dynamics of the bilayer and trilayer nickelates at ambient pressure. Firstly, we observe a coherent phonon mode in La4Ni3O10 involving the collective vibration of La, Ni, and O atoms, which is absent in La3Ni2O7. Secondly, the temperature-dependent relaxation time diverges near the density-wave transition temperature of La4Ni3O10, in drastic contrast to kink-like changes in La3Ni2O7. Moreover, we estimate the electron-phonon coupling constants to be 0.05~0.07 and 0.12~0.16 for La3Ni2O7 and La4Ni3O10, respectively, suggesting a relatively minor role of electron-phonon coupling in the electronic properties of Lan+1NinO3n+1. Our work not only sheds light on the relevant microscopic interaction but also establishes a foundation for further studying the interplay between superconductivity and density-wave transitions in nickelate superconductors. △ Less

Submitted 7 March, 2024; originally announced March 2024.

arXiv:2312.07141 [pdf, other]

Multilingual large language models leak human stereotypes across language boundaries

Authors: Yang Trista Cao, Anna Sotnikova, Jieyu Zhao, Linda X. Zou, Rachel Rudinger, Hal Daume III

Abstract: Multilingual large language models have been increasingly popular for their proficiency in processing and generating text across various languages. Previous research has shown that the presence of stereotypes and biases in monolingual large language models can be attributed to the nature of their training data, which is collected from humans and reflects societal biases. Multilingual language mode… ▽ More Multilingual large language models have been increasingly popular for their proficiency in processing and generating text across various languages. Previous research has shown that the presence of stereotypes and biases in monolingual large language models can be attributed to the nature of their training data, which is collected from humans and reflects societal biases. Multilingual language models undergo the same training procedure as monolingual ones, albeit with training data sourced from various languages. This raises the question: do stereotypes present in one social context leak across languages within the model? In our work, we first define the term ``stereotype leakage'' and propose a framework for its measurement. With this framework, we investigate how stereotypical associations leak across four languages: English, Russian, Chinese, and Hindi. To quantify the stereotype leakage, we employ an approach from social psychology, measuring stereotypes via group-trait associations. We evaluate human stereotypes and stereotypical associations manifested in multilingual large language models such as mBERT, mT5, and GPT-3.5. Our findings show a noticeable leakage of positive, negative, and non-polar associations across all languages. Notably, Hindi within multilingual models appears to be the most susceptible to influence from other languages, while Chinese is the least. Additionally, GPT-3.5 exhibits a better alignment with human scores than other models. WARNING: This paper contains model outputs which could be offensive in nature. △ Less

Submitted 8 May, 2024; v1 submitted 12 December, 2023; originally announced December 2023.

arXiv:2311.07879 [pdf, other]

Toxicity Detection is NOT all you Need: Measuring the Gaps to Supporting Volunteer Content Moderators

Authors: Yang Trista Cao, Lovely-Frances Domingo, Sarah Ann Gilbert, Michelle Mazurek, Katie Shilton, Hal Daumé III

Abstract: Extensive efforts in automated approaches for content moderation have been focused on developing models to identify toxic, offensive, and hateful content with the aim of lightening the load for moderators. Yet, it remains uncertain whether improvements on those tasks have truly addressed moderators' needs in accomplishing their work. In this paper, we surface gaps between past research efforts tha… ▽ More Extensive efforts in automated approaches for content moderation have been focused on developing models to identify toxic, offensive, and hateful content with the aim of lightening the load for moderators. Yet, it remains uncertain whether improvements on those tasks have truly addressed moderators' needs in accomplishing their work. In this paper, we surface gaps between past research efforts that have aimed to provide automation for aspects of content moderation and the needs of volunteer content moderators, regarding identifying violations of various moderation rules. To do so, we conduct a model review on Hugging Face to reveal the availability of models to cover various moderation rules and guidelines from three exemplar forums. We further put state-of-the-art LLMs to the test, evaluating how well these models perform in flagging violations of platform rules from one particular forum. Finally, we conduct a user survey study with volunteer moderators to gain insight into their perspectives on useful moderation models. Overall, we observe a non-trivial gap, as missing developed models and LLMs exhibit moderate to low performance on a significant portion of the rules. Moderators' reports provide guides for future work on developing moderation assistant models. △ Less

Submitted 16 February, 2024; v1 submitted 13 November, 2023; originally announced November 2023.

arXiv:2308.16634 [pdf, other]

doi 10.1103/PhysRevC.108.024610

Effect of initial-state geometric configurations on the nuclear liquid-gas phase transition

Authors: Y. T. Cao, X. G. Deng, Y. G. Ma

Abstract: Within the framework of an extended quantum molecular dynamics model, we simulated $^{40}$Ca + $^{16}$O collisions at beam energies ranging from 60 to 150 MeV/nucleon for $^{16}$O with different $α$-cluster configurations. Results imply that different $α$-cluster configurations lead to different yields of deuteron, triton, $^3$He and $^4$He, but not for proton and neutron. We discuss the effect of… ▽ More Within the framework of an extended quantum molecular dynamics model, we simulated $^{40}$Ca + $^{16}$O collisions at beam energies ranging from 60 to 150 MeV/nucleon for $^{16}$O with different $α$-cluster configurations. Results imply that different $α$-cluster configurations lead to different yields of deuteron, triton, $^3$He and $^4$He, but not for proton and neutron. We discuss the effect of geometric fluctuations which are presented by double ratios of light nuclei, namely $\mathcal{O}_\text{p-d-t}$ and $\mathcal{O}_\text{p-d-He}$. It is found that magnitude hierarchy of geometric fluctuations is chain, kite, square and tetrahedron structure of $^{16}$O. $\mathcal{O}_\text{p-d-t}$ has maximum value around 80 -- 100 MeV/nucleon which could be related to liquid-gas phase transition, that is consistent with results from the charge distribution of the heaviest fragments in the collisions. △ Less

Submitted 31 August, 2023; originally announced August 2023.

Comments: 10 pages, 8 figures

Journal ref: Physical Review C 108, 024610 (2023)

arXiv:2210.14966 [pdf, other]

What's Different between Visual Question Answering for Machine "Understanding" Versus for Accessibility?

Authors: Yang Trista Cao, Kyle Seelman, Kyungjun Lee, Hal Daumé III

Abstract: In visual question answering (VQA), a machine must answer a question given an associated image. Recently, accessibility researchers have explored whether VQA can be deployed in a real-world setting where users with visual impairments learn about their environment by capturing their visual surroundings and asking questions. However, most of the existing benchmarking datasets for VQA focus on machin… ▽ More In visual question answering (VQA), a machine must answer a question given an associated image. Recently, accessibility researchers have explored whether VQA can be deployed in a real-world setting where users with visual impairments learn about their environment by capturing their visual surroundings and asking questions. However, most of the existing benchmarking datasets for VQA focus on machine "understanding" and it remains unclear how progress on those datasets corresponds to improvements in this real-world use case. We aim to answer this question by evaluating discrepancies between machine "understanding" datasets (VQA-v2) and accessibility datasets (VizWiz) by evaluating a variety of VQA models. Based on our findings, we discuss opportunities and challenges in VQA for accessibility and suggest directions for future work. △ Less

Submitted 26 October, 2022; originally announced October 2022.

Journal ref: AACL-IJCNLP 2022 The 2nd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 12th International Joint Conference on Natural Language Processing

arXiv:2206.11684 [pdf, other]

Theory-Grounded Measurement of U.S. Social Stereotypes in English Language Models

Authors: Yang Trista Cao, Anna Sotnikova, Hal Daumé III, Rachel Rudinger, Linda Zou

Abstract: NLP models trained on text have been shown to reproduce human stereotypes, which can magnify harms to marginalized groups when systems are deployed at scale. We adapt the Agency-Belief-Communion (ABC) stereotype model of Koch et al. (2016) from social psychology as a framework for the systematic study and discovery of stereotypic group-trait associations in language models (LMs). We introduce the… ▽ More NLP models trained on text have been shown to reproduce human stereotypes, which can magnify harms to marginalized groups when systems are deployed at scale. We adapt the Agency-Belief-Communion (ABC) stereotype model of Koch et al. (2016) from social psychology as a framework for the systematic study and discovery of stereotypic group-trait associations in language models (LMs). We introduce the sensitivity test (SeT) for measuring stereotypical associations from language models. To evaluate SeT and other measures using the ABC model, we collect group-trait judgments from U.S.-based subjects to compare with English LM stereotypes. Finally, we extend this framework to measure LM stereotyping of intersectional identities. △ Less

Submitted 23 June, 2022; originally announced June 2022.

arXiv:2203.13928 [pdf, other]

On the Intrinsic and Extrinsic Fairness Evaluation Metrics for Contextualized Language Representations

Authors: Yang Trista Cao, Yada Pruksachatkun, Kai-Wei Chang, Rahul Gupta, Varun Kumar, Jwala Dhamala, Aram Galstyan

Abstract: Multiple metrics have been introduced to measure fairness in various natural language processing tasks. These metrics can be roughly categorized into two categories: 1) \emph{extrinsic metrics} for evaluating fairness in downstream applications and 2) \emph{intrinsic metrics} for estimating fairness in upstream contextualized language representation models. In this paper, we conduct an extensive c… ▽ More Multiple metrics have been introduced to measure fairness in various natural language processing tasks. These metrics can be roughly categorized into two categories: 1) \emph{extrinsic metrics} for evaluating fairness in downstream applications and 2) \emph{intrinsic metrics} for estimating fairness in upstream contextualized language representation models. In this paper, we conduct an extensive correlation study between intrinsic and extrinsic metrics across bias notions using 19 contextualized language models. We find that intrinsic and extrinsic metrics do not necessarily correlate in their original setting, even when correcting for metric misalignments, noise in evaluation datasets, and confounding factors such as experiment configuration for extrinsic metrics. %al △ Less

Submitted 25 March, 2022; originally announced March 2022.

Journal ref: ACL 2022

arXiv:1910.13913 [pdf, other]

doi 10.18653/v1/2020.acl-main.418

Toward Gender-Inclusive Coreference Resolution

Authors: Yang Trista Cao, Hal Daumé III

Abstract: Correctly resolving textual mentions of people fundamentally entails making inferences about those people. Such inferences raise the risk of systemic biases in coreference resolution systems, including biases that can harm binary and non-binary trans and cis stakeholders. To better understand such biases, we foreground nuanced conceptualizations of gender from sociology and sociolinguistics, and d… ▽ More Correctly resolving textual mentions of people fundamentally entails making inferences about those people. Such inferences raise the risk of systemic biases in coreference resolution systems, including biases that can harm binary and non-binary trans and cis stakeholders. To better understand such biases, we foreground nuanced conceptualizations of gender from sociology and sociolinguistics, and develop two new datasets for interrogating bias in crowd annotations and in existing coreference resolution systems. Through these studies, conducted on English text, we confirm that without acknowledging and building systems that recognize the complexity of gender, we build systems that lead to many potential harms. △ Less

Submitted 2 December, 2020; v1 submitted 30 October, 2019; originally announced October 2019.

Comments: 28 pages; ACL version

Journal ref: Association for Computational Linguistics. Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics (2020) 4568-4595

Showing 1–9 of 9 results for author: Cao, Y T