subscribe to arXiv mailings

arXiv:2407.10088 [pdf, other]

Predictability of weakly turbulent systems from spatially sparse observations using data assimilation and machine learning

Authors: Vikrant Gupta, Yuanqing Chen, Minping Wan

Abstract: We apply two data assimilation (DA) methods, a smoother and a filter, and a model-free machine learning (ML) shallow network to forecast two weakly turbulent systems. We analyse the effect of the spatial sparsity of observations on accuracy of the predictions obtained from these data-driven methods. Based on the results, we divide the spatial sparsity levels in three zones. First is the good-predi… ▽ More We apply two data assimilation (DA) methods, a smoother and a filter, and a model-free machine learning (ML) shallow network to forecast two weakly turbulent systems. We analyse the effect of the spatial sparsity of observations on accuracy of the predictions obtained from these data-driven methods. Based on the results, we divide the spatial sparsity levels in three zones. First is the good-predictions zone in which both DA and ML methods work. We find that in the good-predictions zone the observations remain dense enough to accurately capture the fractal manifold of the system's dynamics, which is measured using the correlation dimension. The accuracy of the DA methods in this zone remains almost as good as for full-resolution observations. Second is the reasonable-predictions zone in which the DA methods still work but at reduced prediction accuracy. Third is the bad-predictions zone in which even the DA methods fail. We find that the sparsity level up to which the DA methods work is almost the same up to which chaos synchronisation of these systems can be achieved. The main implications of these results are that they (i) firmly establish the spatial resolution up to which the data-driven methods can be utilised, (ii) provide measures to determine if adding more sensors will improve the predictions, and (iii) quantify the advantage (in terms of the required measurement resolution) of using the governing equations within data-driven methods. We also discuss the applicability of these results to fully developed turbulence. △ Less

Submitted 14 July, 2024; originally announced July 2024.

arXiv:2407.00372 [pdf, other]

Study of semileptonic $B\to DP\ell^+ν_\ell$ decays based on the SU(3) flavor symmetry

Authors: Ru-Min Wang, Yi-Jie Zhang, Meng-Yuan Wan, Xiao-Dong Cheng, Yuan-Guo Xu

Abstract: Decays $B\to DP\ell^+ν_\ell~(\ell=e,μ,τ)$ with the non-resonance, the charmed vector resonances, the charmed scalar resonances and the charmed tensor resonances are calculated by using the SU(3) flavor symmetry. Firstly, the decay amplitudes of different modes are related by the SU(3) flavor symmetry. Then, relevant experiential data are used to constrain nonperturbative coefficients in the non-re… ▽ More Decays $B\to DP\ell^+ν_\ell~(\ell=e,μ,τ)$ with the non-resonance, the charmed vector resonances, the charmed scalar resonances and the charmed tensor resonances are calculated by using the SU(3) flavor symmetry. Firstly, the decay amplitudes of different modes are related by the SU(3) flavor symmetry. Then, relevant experiential data are used to constrain nonperturbative coefficients in the non-resonant and various resonant $B\to DP\ell^+ν_\ell$ decays. Finally, using the constrained nonperturbative coefficients, the branching ratios of not-yet-measured $B\to D^*P\ell^+ν_\ell$ decays with the non-resonant and various charmed resonant contributions are predicted. Many branching ratios are predicted for the first time. We find that $B\to Dη'\ell^+ν_\ell, B_s\to D_sη'\ell^+ν_\ell$ decays only receive the non-resonant contributions, $B\to D_sK\ell^+ν_\ell$, $B_s\to DK\ell^+ν_\ell$, $B\to Dη\ell^+ν_\ell$ and $B_s\to D_sη\ell^+ν_\ell$ decays receive both non-resonant and charmed tensor resonant contributions, $B^+\to D^-π^+\ell^+ν_\ell$ decays receive the non-resonant, the charmed scalar resonant and the charmed tensor resonant contributions, and other $B\to Dπ\ell^+ν_\ell$ decays receive all four kinds of contributions. These results can be used to test the SU(3) flavor symmetry approach in the four-body semileptonic decays by the future LHCb and Belle-II experiments. △ Less

Submitted 29 June, 2024; originally announced July 2024.

Comments: 16 pages. arXiv admin note: text overlap with arXiv:2403.14929

arXiv:2405.04656 [pdf, other]

Corporate Communication Companion (CCC): An LLM-empowered Writing Assistant for Workplace Social Media

Authors: Zhuoran Lu, Sheshera Mysore, Tara Safavi, Jennifer Neville, Longqi Yang, Mengting Wan

Abstract: Workplace social media platforms enable employees to cultivate their professional image and connect with colleagues in a semi-formal environment. While semi-formal corporate communication poses a unique set of challenges, large language models (LLMs) have shown great promise in helping users draft and edit their social media posts. However, LLMs may fail to capture individualized tones and voices… ▽ More Workplace social media platforms enable employees to cultivate their professional image and connect with colleagues in a semi-formal environment. While semi-formal corporate communication poses a unique set of challenges, large language models (LLMs) have shown great promise in helping users draft and edit their social media posts. However, LLMs may fail to capture individualized tones and voices in such workplace use cases, as they often generate text using a "one-size-fits-all" approach that can be perceived as generic and bland. In this paper, we present Corporate Communication Companion (CCC), an LLM-empowered interactive system that helps people compose customized and individualized workplace social media posts. Using need-finding interviews to motivate our system design, CCC decomposes the writing process into two core functions, outline and edit: First, it suggests post outlines based on users' job status and previous posts, and next provides edits with attributions that users can contextually customize. We conducted a within-subjects user study asking participants both to write posts and evaluate posts written by others. The results show that CCC enhances users' writing experience, and audience members rate CCC-enhanced posts as higher quality than posts written using a non-customized writing assistant. We conclude by discussing the implications of LLM-empowered corporate communication. △ Less

Submitted 7 May, 2024; originally announced May 2024.

arXiv:2404.10133 [pdf, other]

WB LUTs: Contrastive Learning for White Balancing Lookup Tables

Authors: Sai Kumar Reddy Manne, Michael Wan

Abstract: Automatic white balancing (AWB), one of the first steps in an integrated signal processing (ISP) pipeline, aims to correct the color cast induced by the scene illuminant. An incorrect white balance (WB) setting or AWB failure can lead to an undesired blue or red tint in the rendered sRGB image. To address this, recent methods pose the post-capture WB correction problem as an image-to-image transla… ▽ More Automatic white balancing (AWB), one of the first steps in an integrated signal processing (ISP) pipeline, aims to correct the color cast induced by the scene illuminant. An incorrect white balance (WB) setting or AWB failure can lead to an undesired blue or red tint in the rendered sRGB image. To address this, recent methods pose the post-capture WB correction problem as an image-to-image translation task and train deep neural networks to learn the necessary color adjustments at a lower resolution. These low resolution outputs are post-processed to generate high resolution WB corrected images, forming a bottleneck in the end-to-end run time. In this paper we present a 3D Lookup Table (LUT) based WB correction model called WB LUTs that can generate high resolution outputs in real time. We introduce a contrastive learning framework with a novel hard sample mining strategy, which improves the WB correction quality of baseline 3D LUTs by 25.5%. Experimental results demonstrate that the proposed WB LUTs perform competitively against state-of-the-art models on two benchmark datasets while being 300 times faster using 12.7 times less memory. Our model and code are available at https://github.com/skrmanne/3DLUT_sRGB_WB. △ Less

Submitted 15 April, 2024; originally announced April 2024.

arXiv:2404.10130 [pdf, other]

NOISe: Nuclei-Aware Osteoclast Instance Segmentation for Mouse-to-Human Domain Transfer

Authors: Sai Kumar Reddy Manne, Brendan Martin, Tyler Roy, Ryan Neilson, Rebecca Peters, Meghana Chillara, Christine W. Lary, Katherine J. Motyl, Michael Wan

Abstract: Osteoclast cell image analysis plays a key role in osteoporosis research, but it typically involves extensive manual image processing and hand annotations by a trained expert. In the last few years, a handful of machine learning approaches for osteoclast image analysis have been developed, but none have addressed the full instance segmentation task required to produce the same output as that of th… ▽ More Osteoclast cell image analysis plays a key role in osteoporosis research, but it typically involves extensive manual image processing and hand annotations by a trained expert. In the last few years, a handful of machine learning approaches for osteoclast image analysis have been developed, but none have addressed the full instance segmentation task required to produce the same output as that of the human expert led process. Furthermore, none of the prior, fully automated algorithms have publicly available code, pretrained models, or annotated datasets, inhibiting reproduction and extension of their work. We present a new dataset with ~2*10^5 expert annotated mouse osteoclast masks, together with a deep learning instance segmentation method which works for both in vitro mouse osteoclast cells on plastic tissue culture plates and human osteoclast cells on bone chips. To our knowledge, this is the first work to automate the full osteoclast instance segmentation task. Our method achieves a performance of 0.82 mAP_0.5 (mean average precision at intersection-over-union threshold of 0.5) in cross validation for mouse osteoclasts. We present a novel nuclei-aware osteoclast instance segmentation training strategy (NOISe) based on the unique biology of osteoclasts, to improve the model's generalizability and boost the mAP_0.5 from 0.60 to 0.82 on human osteoclasts. We publish our annotated mouse osteoclast image dataset, instance segmentation models, and code at github.com/michaelwwan/noise to enable reproducibility and to provide a public tool to accelerate osteoporosis research. △ Less

Submitted 15 April, 2024; originally announced April 2024.

arXiv:2404.04268 [pdf]

The Use of Generative Search Engines for Knowledge Work and Complex Tasks

Authors: Siddharth Suri, Scott Counts, Leijie Wang, Chacha Chen, Mengting Wan, Tara Safavi, Jennifer Neville, Chirag Shah, Ryen W. White, Reid Andersen, Georg Buscher, Sathish Manivannan, Nagu Rangan, Longqi Yang

Abstract: Until recently, search engines were the predominant method for people to access online information. The recent emergence of large language models (LLMs) has given machines new capabilities such as the ability to generate new digital artifacts like text, images, code etc., resulting in a new tool, a generative search engine, which combines the capabilities of LLMs with a traditional search engine.… ▽ More Until recently, search engines were the predominant method for people to access online information. The recent emergence of large language models (LLMs) has given machines new capabilities such as the ability to generate new digital artifacts like text, images, code etc., resulting in a new tool, a generative search engine, which combines the capabilities of LLMs with a traditional search engine. Through the empirical analysis of Bing Copilot (Bing Chat), one of the first publicly available generative search engines, we analyze the types and complexity of tasks that people use Bing Copilot for compared to Bing Search. Findings indicate that people use the generative search engine for more knowledge work tasks that are higher in cognitive complexity than were commonly done with a traditional search engine. △ Less

Submitted 19 March, 2024; originally announced April 2024.

Comments: 32 pages, 3 figures, 4 tables

ACM Class: J.4

arXiv:2404.01897 [pdf, other]

Continuous Spiking Graph Neural Networks

Authors: Nan Yin, Mengzhu Wan, Li Shen, Hitesh Laxmichand Patel, Baopu Li, Bin Gu, Huan Xiong

Abstract: Continuous graph neural networks (CGNNs) have garnered significant attention due to their ability to generalize existing discrete graph neural networks (GNNs) by introducing continuous dynamics. They typically draw inspiration from diffusion-based methods to introduce a novel propagation scheme, which is analyzed using ordinary differential equations (ODE). However, the implementation of CGNNs req… ▽ More Continuous graph neural networks (CGNNs) have garnered significant attention due to their ability to generalize existing discrete graph neural networks (GNNs) by introducing continuous dynamics. They typically draw inspiration from diffusion-based methods to introduce a novel propagation scheme, which is analyzed using ordinary differential equations (ODE). However, the implementation of CGNNs requires significant computational power, making them challenging to deploy on battery-powered devices. Inspired by recent spiking neural networks (SNNs), which emulate a biological inference process and provide an energy-efficient neural architecture, we incorporate the SNNs with CGNNs in a unified framework, named Continuous Spiking Graph Neural Networks (COS-GNN). We employ SNNs for graph node representation at each time step, which are further integrated into the ODE process along with time. To enhance information preservation and mitigate information loss in SNNs, we introduce the high-order structure of COS-GNN, which utilizes the second-order ODE for spiking representation and continuous propagation. Moreover, we provide the theoretical proof that COS-GNN effectively mitigates the issues of exploding and vanishing gradients, enabling us to capture long-range dependencies between nodes. Experimental results on graph-based learning tasks demonstrate the effectiveness of the proposed COS-GNN over competitive baselines. △ Less

Submitted 2 April, 2024; originally announced April 2024.

arXiv:2403.14929 [pdf, other]

Four-body Semileptonic Decays $B\to D^*P\ell^+ν_\ell$ with the SU(3) Flavor Symmetry

Authors: Meng-Yuan Wan, Yuan-Guo Xu, Qi-Lin Jia, Yue-Xin Liu, Yi-Jie Zhang

Abstract: We present a complete study of the $B\to D^*P\ell^+ν_\ell~(\ell=e,μ,τ)$ decays with the non-resonant, the charmed axial vector resonant and the charmed tensor resonant contributions by using the SU(3) flavor symmetry. Relevant amplitude relations between different decay modes are obtained by the SU(3) flavor symmetry. We then predict non-measured branching ratios of the $B\to D^*P\ell^+ν_\ell$ dec… ▽ More We present a complete study of the $B\to D^*P\ell^+ν_\ell~(\ell=e,μ,τ)$ decays with the non-resonant, the charmed axial vector resonant and the charmed tensor resonant contributions by using the SU(3) flavor symmetry. Relevant amplitude relations between different decay modes are obtained by the SU(3) flavor symmetry. We then predict non-measured branching ratios of the $B\to D^*P\ell^+ν_\ell$ decays with the non-resonant and the charmed resonant contributions by using present experimental data of the $B\to D^*P\ell'^+ν_{\ell'}~(\ell'=e,μ)$ decays within $2σ$ errors. We have found that $B^{0,+}\to D^*η\ell^+ν_\ell$, $B^{0,+}\to D^*η'\ell^+ν_\ell$, $B^{0}_s\to D^*_s η\ell^+ν_\ell$, $B^{0}_s\to D^*_sη'\ell^+ν_\ell$ and $B^{0,+}\to D^{*}_sK\ell^+ν_\ell$ decays only receive non-resonant contributions. Decays $B^0_s\to D_s^{*-}π^0\ell^+ν_\ell$ only receive the $D'_{s1}$ resonant contributions. Other decays receive all three kinds of contributions, and three kinds of contributions are important in most of decays. △ Less

Submitted 21 March, 2024; originally announced March 2024.

Comments: 15 pages, 1 figure

arXiv:2403.12726 [pdf]

Small Distance Increment Method for Measuring Complex Permittivity With mmWave Radar

Authors: Hang Song, Hyun Joon Kim, Mingxia Wan, Bo Wei, Takamaro Kikkawa, Jun-ichi Takada

Abstract: Measuring the complex permittivity of material is essential in many scenarios such as quality check and component analysis. Generally, measurement methods for characterizing the material are based on the usage of vector network analyzer, which is large and not easy for on-site measurement, especially in high frequency range such as millimeter wave (mmWave). In addition, some measurement methods re… ▽ More Measuring the complex permittivity of material is essential in many scenarios such as quality check and component analysis. Generally, measurement methods for characterizing the material are based on the usage of vector network analyzer, which is large and not easy for on-site measurement, especially in high frequency range such as millimeter wave (mmWave). In addition, some measurement methods require the destruction of samples, which is not suitable for non-destructive inspection. In this work, a small distance increment (SDI) method is proposed to non-destructively measure the complex permittivity of material. In SDI, the transmitter and receiver are formed as the monostatic radar, which is facing towards the material under test (MUT). During the measurement, the distance between radar and MUT changes with small increments and the signals are recorded at each position. A mathematical model is formulated to depict the relationship among the complex permittivity, distance increment, and measured signals. By fitting the model, the complex permittivity of MUT is estimated. To implement and evaluate the proposed SDI method, a commercial off-the-shelf mmWave radar is utilized and the measurement system is developed. Then, the evaluation was carried out on the acrylic plate. With the proposed method, the estimated complex permittivity of acrylic plate shows good agreement with the literature values, demonstrating the efficacy of SDI method for characterizing the complex permittivity of material. △ Less

Submitted 19 March, 2024; originally announced March 2024.

arXiv:2403.12388 [pdf, other]

Interpretable User Satisfaction Estimation for Conversational Systems with Large Language Models

Authors: Ying-Chun Lin, Jennifer Neville, Jack W. Stokes, Longqi Yang, Tara Safavi, Mengting Wan, Scott Counts, Siddharth Suri, Reid Andersen, Xiaofeng Xu, Deepak Gupta, Sujay Kumar Jauhar, Xia Song, Georg Buscher, Saurabh Tiwary, Brent Hecht, Jaime Teevan

Abstract: Accurate and interpretable user satisfaction estimation (USE) is critical for understanding, evaluating, and continuously improving conversational systems. Users express their satisfaction or dissatisfaction with diverse conversational patterns in both general-purpose (ChatGPT and Bing Copilot) and task-oriented (customer service chatbot) conversational systems. Existing approaches based on featur… ▽ More Accurate and interpretable user satisfaction estimation (USE) is critical for understanding, evaluating, and continuously improving conversational systems. Users express their satisfaction or dissatisfaction with diverse conversational patterns in both general-purpose (ChatGPT and Bing Copilot) and task-oriented (customer service chatbot) conversational systems. Existing approaches based on featurized ML models or text embeddings fall short in extracting generalizable patterns and are hard to interpret. In this work, we show that LLMs can extract interpretable signals of user satisfaction from their natural language utterances more effectively than embedding-based approaches. Moreover, an LLM can be tailored for USE via an iterative prompting framework using supervision from labeled examples. The resulting method, Supervised Prompting for User satisfaction Rubrics (SPUR), not only has higher accuracy but is more interpretable as it scores user satisfaction via learned rubrics with a detailed breakdown. △ Less

Submitted 8 June, 2024; v1 submitted 18 March, 2024; originally announced March 2024.

arXiv:2403.12173 [pdf, other]

TnT-LLM: Text Mining at Scale with Large Language Models

Authors: Mengting Wan, Tara Safavi, Sujay Kumar Jauhar, Yujin Kim, Scott Counts, Jennifer Neville, Siddharth Suri, Chirag Shah, Ryen W White, Longqi Yang, Reid Andersen, Georg Buscher, Dhruv Joshi, Nagu Rangan

Abstract: Transforming unstructured text into structured and meaningful forms, organized by useful category labels, is a fundamental step in text mining for downstream analysis and application. However, most existing methods for producing label taxonomies and building text-based label classifiers still rely heavily on domain expertise and manual curation, making the process expensive and time-consuming. Thi… ▽ More Transforming unstructured text into structured and meaningful forms, organized by useful category labels, is a fundamental step in text mining for downstream analysis and application. However, most existing methods for producing label taxonomies and building text-based label classifiers still rely heavily on domain expertise and manual curation, making the process expensive and time-consuming. This is particularly challenging when the label space is under-specified and large-scale data annotations are unavailable. In this paper, we address these challenges with Large Language Models (LLMs), whose prompt-based interface facilitates the induction and use of large-scale pseudo labels. We propose TnT-LLM, a two-phase framework that employs LLMs to automate the process of end-to-end label generation and assignment with minimal human effort for any given use-case. In the first phase, we introduce a zero-shot, multi-stage reasoning approach which enables LLMs to produce and refine a label taxonomy iteratively. In the second phase, LLMs are used as data labelers that yield training samples so that lightweight supervised classifiers can be reliably built, deployed, and served at scale. We apply TnT-LLM to the analysis of user intent and conversational domain for Bing Copilot (formerly Bing Chat), an open-domain chat-based search engine. Extensive experiments using both human and automatic evaluation metrics demonstrate that TnT-LLM generates more accurate and relevant label taxonomies when compared against state-of-the-art baselines, and achieves a favorable balance between accuracy and efficiency for classification at scale. We also share our practical experiences and insights on the challenges and opportunities of using LLMs for large-scale text mining in real-world applications. △ Less

Submitted 18 March, 2024; originally announced March 2024.

Comments: 9 pages main content, 8 pages references and appendix

arXiv:2402.02158 [pdf, other]

PatSTEG: Modeling Formation Dynamics of Patent Citation Networks via The Semantic-Topological Evolutionary Graph

Authors: Ran Miao, Xueyu Chen, Liang Hu, Zhifei Zhang, Minghua Wan, Qi Zhang, Cairong Zhao

Abstract: Patent documents in the patent database (PatDB) are crucial for research, development, and innovation as they contain valuable technical information. However, PatDB presents a multifaceted challenge compared to publicly available preprocessed databases due to the intricate nature of the patent text and the inherent sparsity within the patent citation network. Although patent text analysis and cita… ▽ More Patent documents in the patent database (PatDB) are crucial for research, development, and innovation as they contain valuable technical information. However, PatDB presents a multifaceted challenge compared to publicly available preprocessed databases due to the intricate nature of the patent text and the inherent sparsity within the patent citation network. Although patent text analysis and citation analysis bring new opportunities to explore patent data mining, no existing work exploits the complementation of them. To this end, we propose a joint semantic-topological evolutionary graph learning approach (PatSTEG) to model the formation dynamics of patent citation networks. More specifically, we first create a real-world dataset of Chinese patents named CNPat and leverage its patent texts and citations to construct a patent citation network. Then, PatSTEG is modeled to study the evolutionary dynamics of patent citation formation by considering the semantic and topological information jointly. Extensive experiments are conducted on CNPat and public datasets to prove the superiority of PatSTEG over other state-of-the-art methods. All the results provide valuable references for patent literature research and technical exploration. △ Less

Submitted 3 February, 2024; originally announced February 2024.

arXiv:2401.15398 [pdf, ps, other]

Resolvent analysis for predicting energetic structures in the far wake of a wind turbine

Authors: Dachuan Feng, Vikrant Gupta, Larry K. B. Li, Minping Wan

Abstract: A thorough understanding of the energetic flow structures that form in the far wake of a wind turbine is essential for accurate turbine wake modeling and wind farm performance estimation. We use resolvent analysis to explore such flow structures for a turbine operating in a neutral atmospheric boundary layer and validate our results against data-driven modes extracted through spectral proper ortho… ▽ More A thorough understanding of the energetic flow structures that form in the far wake of a wind turbine is essential for accurate turbine wake modeling and wind farm performance estimation. We use resolvent analysis to explore such flow structures for a turbine operating in a neutral atmospheric boundary layer and validate our results against data-driven modes extracted through spectral proper orthogonal decomposition. Our results confirm that convective instabilities play a dominant role in generating turbulent kinetic energy (TKE) in the far wake. Additionally, we find evidence of the non-modal Orr mechanism contributing to TKE generation, particularly at low Strouhal numbers. The resolvent analysis method requires only the mean wake velocity and eddy viscosity profiles as inputs but can capture the energetic modes and TKE spectra in the far wake. In this specific application, the resolvent analysis method approximates the wake to be axisymmetric, which suggests that it can be paired with engineering wake models. Overall this study demonstrates the use of resolvent analysis as a viable tool for estimating TKE and for uncovering the mechanism of TKE generation. △ Less

Submitted 27 January, 2024; originally announced January 2024.

arXiv:2312.04416 [pdf, other]

Monitoring Sustainable Global Development Along Shared Socioeconomic Pathways

Authors: Michelle W. L. Wan, Jeffrey N. Clark, Edward A. Small, Elena Fillola Mayoral, Raúl Santos-Rodríguez

Abstract: Sustainable global development is one of the most prevalent challenges facing the world today, hinging on the equilibrium between socioeconomic growth and environmental sustainability. We propose approaches to monitor and quantify sustainable development along the Shared Socioeconomic Pathways (SSPs), including mathematically derived scoring algorithms, and machine learning methods. These integrat… ▽ More Sustainable global development is one of the most prevalent challenges facing the world today, hinging on the equilibrium between socioeconomic growth and environmental sustainability. We propose approaches to monitor and quantify sustainable development along the Shared Socioeconomic Pathways (SSPs), including mathematically derived scoring algorithms, and machine learning methods. These integrate socioeconomic and environmental datasets, to produce an interpretable metric for SSP alignment. An initial study demonstrates promising results, laying the groundwork for the application of different methods to the monitoring of sustainable global development. △ Less

Submitted 7 December, 2023; originally announced December 2023.

Comments: 5 pages, 1 figure. Presented at NeurIPS 2023 Workshop: Tackling Climate Change with Machine Learning

arXiv:2311.09417 [pdf]

Preliminary Design of CSNS-II Linac SRF LLRF

Authors: Zhexin Xie, Kai Guo, Zhencheng Mu, Xinpeng Ma, Nan Gan, Maliang Wan, Bo Wang, Linyan Rong, Hui Zhang, Hexin Wang

Abstract: China Spallation Neutron Source(CSNS) target power will upgrade to 500 kW(CSNS-II) from 300kW, energy gain of H-Linac will up to 300 MeV from 80 MeV using about 50 superconductor cavities. LLRF is an important device for controlling the amplitude and phase of the SRF cavity field to be less than 0.6% and 0.6 deg. The parameters and requirements for CSNS-II Linac LLRF are presented here. The prelim… ▽ More China Spallation Neutron Source(CSNS) target power will upgrade to 500 kW(CSNS-II) from 300kW, energy gain of H-Linac will up to 300 MeV from 80 MeV using about 50 superconductor cavities. LLRF is an important device for controlling the amplitude and phase of the SRF cavity field to be less than 0.6% and 0.6 deg. The parameters and requirements for CSNS-II Linac LLRF are presented here. The preliminary design work and algorithm verification progress and results at C-ADS Injector-I are introduced. △ Less

Submitted 16 November, 2023; originally announced November 2023.

Comments: Talk presented at LLRF Workshop 2023(LLRF2023, arXiv:2310.03199)

Report number: LLRF2023/11

arXiv:2311.09180 [pdf, other]

PEARL: Personalizing Large Language Model Writing Assistants with Generation-Calibrated Retrievers

Authors: Sheshera Mysore, Zhuoran Lu, Mengting Wan, Longqi Yang, Steve Menezes, Tina Baghaee, Emmanuel Barajas Gonzalez, Jennifer Neville, Tara Safavi

Abstract: Powerful large language models have facilitated the development of writing assistants that promise to significantly improve the quality and efficiency of composition and communication. However, a barrier to effective assistance is the lack of personalization in LLM outputs to the author's communication style and specialized knowledge. In this paper, we address this challenge by proposing PEARL, a… ▽ More Powerful large language models have facilitated the development of writing assistants that promise to significantly improve the quality and efficiency of composition and communication. However, a barrier to effective assistance is the lack of personalization in LLM outputs to the author's communication style and specialized knowledge. In this paper, we address this challenge by proposing PEARL, a retrieval-augmented LLM writing assistant personalized with a generation-calibrated retriever. Our retriever is trained to select historic user-authored documents for prompt augmentation, such that they are likely to best personalize LLM generations for a user request. We propose two key novelties for training our retriever: 1) A training data selection method that identifies user requests likely to benefit from personalization and documents that provide that benefit; and 2) A scale-calibrating KL-divergence objective that ensures that our retriever closely tracks the benefit of a document for personalized generation. We demonstrate the effectiveness of PEARL in generating personalized workplace social media posts and Reddit comments. Finally, we showcase the potential of a generation-calibrated retriever to double as a performance predictor and further improve low-quality generations via LLM chaining. △ Less

Submitted 15 November, 2023; originally announced November 2023.

Comments: Pre-print, work in progress

arXiv:2311.07095 [pdf, other]

Revisit to the yield ratio of triton and $^3$He as an indicator of neutron-rich neck emission

Authors: Yijie Wang, Mengting Wan, Xinyue Diao, Sheng Xiao, Yuhao Qin, Zhi Qin, Dong Guo, Dawei Si, Boyuan Zhang, Baiting Tian, Fenhai Guan, Qianghua Wu, Xianglun Wei, Herun Yang, Peng Ma, Rongjiang Hu, Limin Duan, Fangfang Duan, Junbing Ma, Shiwei Xu, Qiang Hu, Zhen Bai, Yanyun Yang, Jiansong Wang, Wenbo Liu , et al. (12 additional authors not shown)

Abstract: The neutron rich neck zone created in heavy ion reaction is experimentally probed by the production of the $A=3$ isobars. The energy spectra and angular distributions of triton and $^3$He are measured with the CSHINE detector in $^{86}$Kr +$^{208}$Pb reactions at 25 MeV/u. While the energy spectrum of $^{3}$He is harder than that of triton, known as "$^{3}$He-puzzle", the yield ratio… ▽ More The neutron rich neck zone created in heavy ion reaction is experimentally probed by the production of the $A=3$ isobars. The energy spectra and angular distributions of triton and $^3$He are measured with the CSHINE detector in $^{86}$Kr +$^{208}$Pb reactions at 25 MeV/u. While the energy spectrum of $^{3}$He is harder than that of triton, known as "$^{3}$He-puzzle", the yield ratio $R({\rm t/^3He})$ presents a robust rising trend with the polar angle in laboratory. Using the fission fragments to reconstruct the fission plane, the enhancement of out-plane $R({\rm t/^3He})$ is confirmed in comparison to the in-plane ratios. Transport model simulations reproduce qualitatively the experimental trends, but the quantitative agreement is not achieved. The results demonstrate that a neutron rich neck zone is formed in the reactions. Further studies are called for to understand the clustering and the isospin dynamics related to neck formation. △ Less

Submitted 13 November, 2023; originally announced November 2023.

arXiv:2310.16138 [pdf, other]

Subtle Signals: Video-based Detection of Infant Non-nutritive Sucking as a Neurodevelopmental Cue

Authors: Shaotong Zhu, Michael Wan, Sai Kumar Reddy Manne, Emily Zimmerman, Sarah Ostadabbas

Abstract: Non-nutritive sucking (NNS), which refers to the act of sucking on a pacifier, finger, or similar object without nutrient intake, plays a crucial role in assessing healthy early development. In the case of preterm infants, NNS behavior is a key component in determining their readiness for feeding. In older infants, the characteristics of NNS behavior offer valuable insights into neural and motor d… ▽ More Non-nutritive sucking (NNS), which refers to the act of sucking on a pacifier, finger, or similar object without nutrient intake, plays a crucial role in assessing healthy early development. In the case of preterm infants, NNS behavior is a key component in determining their readiness for feeding. In older infants, the characteristics of NNS behavior offer valuable insights into neural and motor development. Additionally, NNS activity has been proposed as a potential safeguard against sudden infant death syndrome (SIDS). However, the clinical application of NNS assessment is currently hindered by labor-intensive and subjective finger-in-mouth evaluations. Consequently, researchers often resort to expensive pressure transducers for objective NNS signal measurement. To enhance the accessibility and reliability of NNS signal monitoring for both clinicians and researchers, we introduce a vision-based algorithm designed for non-contact detection of NNS activity using baby monitor footage in natural settings. Our approach involves a comprehensive exploration of optical flow and temporal convolutional networks, enabling the detection and amplification of subtle infant-sucking signals. We successfully classify short video clips of uniform length into NNS and non-NNS periods. Furthermore, we investigate manual and learning-based techniques to piece together local classification results, facilitating the segmentation of longer mixed-activity videos into NNS and non-NNS segments of varying duration. Our research introduces two novel datasets of annotated infant videos, including one sourced from our clinical study featuring 19 infant subjects and 183 hours of overnight baby monitor footage. △ Less

Submitted 24 October, 2023; originally announced October 2023.

arXiv:2310.07197 [pdf]

doi 10.1088/1674-1056/ad04cb

MatChat: A Large Language Model and Application Service Platform for Materials Science

Authors: Ziyi Chen, Fankai Xie, Meng Wan, Yang Yuan, Miao Liu, Zongguo Wang, Sheng Meng, Yangang Wang

Abstract: The prediction of chemical synthesis pathways plays a pivotal role in materials science research. Challenges, such as the complexity of synthesis pathways and the lack of comprehensive datasets, currently hinder our ability to predict these chemical processes accurately. However, recent advancements in generative artificial intelligence (GAI), including automated text generation and question-answe… ▽ More The prediction of chemical synthesis pathways plays a pivotal role in materials science research. Challenges, such as the complexity of synthesis pathways and the lack of comprehensive datasets, currently hinder our ability to predict these chemical processes accurately. However, recent advancements in generative artificial intelligence (GAI), including automated text generation and question-answering systems, coupled with fine-tuning techniques, have facilitated the deployment of large-scale AI models tailored to specific domains. In this study, we harness the power of the LLaMA2-7B model and enhance it through a learning process that incorporates 13,878 pieces of structured material knowledge data. This specialized AI model, named MatChat, focuses on predicting inorganic material synthesis pathways. MatChat exhibits remarkable proficiency in generating and reasoning with knowledge in materials science. Although MatChat requires further refinement to meet the diverse material design needs, this research undeniably highlights its impressive reasoning capabilities and innovative potential in the field of materials science. MatChat is now accessible online and open for use, with both the model and its application framework available as open source. This study establishes a robust foundation for collaborative innovation in the integration of generative AI in materials science. △ Less

Submitted 11 October, 2023; originally announced October 2023.

Journal ref: Chinese Physics B 32, 118104 (2023)

arXiv:2309.15965 [pdf, other]

TraCE: Trajectory Counterfactual Explanation Scores

Authors: Jeffrey N. Clark, Edward A. Small, Nawid Keshtmand, Michelle W. L. Wan, Elena Fillola Mayoral, Enrico Werner, Christopher P. Bourdeaux, Raul Santos-Rodriguez

Abstract: Counterfactual explanations, and their associated algorithmic recourse, are typically leveraged to understand, explain, and potentially alter a prediction coming from a black-box classifier. In this paper, we propose to extend the use of counterfactuals to evaluate progress in sequential decision making tasks. To this end, we introduce a model-agnostic modular framework, TraCE (Trajectory Counterf… ▽ More Counterfactual explanations, and their associated algorithmic recourse, are typically leveraged to understand, explain, and potentially alter a prediction coming from a black-box classifier. In this paper, we propose to extend the use of counterfactuals to evaluate progress in sequential decision making tasks. To this end, we introduce a model-agnostic modular framework, TraCE (Trajectory Counterfactual Explanation) scores, which is able to distill and condense progress in highly complex scenarios into a single value. We demonstrate TraCE's utility across domains by showcasing its main properties in two case studies spanning healthcare and climate change. △ Less

Submitted 26 January, 2024; v1 submitted 27 September, 2023; originally announced September 2023.

Comments: 10 pages, 4 figures, appendix

arXiv:2309.13063 [pdf, other]

Using Large Language Models to Generate, Validate, and Apply User Intent Taxonomies

Authors: Chirag Shah, Ryen W. White, Reid Andersen, Georg Buscher, Scott Counts, Sarkar Snigdha Sarathi Das, Ali Montazer, Sathish Manivannan, Jennifer Neville, Xiaochuan Ni, Nagu Rangan, Tara Safavi, Siddharth Suri, Mengting Wan, Leijie Wang, Longqi Yang

Abstract: Log data can reveal valuable information about how users interact with Web search services, what they want, and how satisfied they are. However, analyzing user intents in log data is not easy, especially for emerging forms of Web search such as AI-driven chat. To understand user intents from log data, we need a way to label them with meaningful categories that capture their diversity and dynamics.… ▽ More Log data can reveal valuable information about how users interact with Web search services, what they want, and how satisfied they are. However, analyzing user intents in log data is not easy, especially for emerging forms of Web search such as AI-driven chat. To understand user intents from log data, we need a way to label them with meaningful categories that capture their diversity and dynamics. Existing methods rely on manual or machine-learned labeling, which are either expensive or inflexible for large and dynamic datasets. We propose a novel solution using large language models (LLMs), which can generate rich and relevant concepts, descriptions, and examples for user intents. However, using LLMs to generate a user intent taxonomy and apply it for log analysis can be problematic for two main reasons: (1) such a taxonomy is not externally validated; and (2) there may be an undesirable feedback loop. To address this, we propose a new methodology with human experts and assessors to verify the quality of the LLM-generated taxonomy. We also present an end-to-end pipeline that uses an LLM with human-in-the-loop to produce, refine, and apply labels for user intent analysis in log data. We demonstrate its effectiveness by uncovering new insights into user intents from search and chat logs from the Microsoft Bing commercial search engine. The proposed work's novelty stems from the method for generating purpose-driven user intent taxonomies with strong validation. This method not only helps remove methodological and practical bottlenecks from intent-focused research, but also provides a new framework for generating, validating, and applying other kinds of taxonomies in a scalable and adaptable way with reasonable human effort. △ Less

Submitted 9 May, 2024; v1 submitted 14 September, 2023; originally announced September 2023.

Report number: MSR-TR-2023-32

arXiv:2309.08827 [pdf, other]

S3-DST: Structured Open-Domain Dialogue Segmentation and State Tracking in the Era of LLMs

Authors: Sarkar Snigdha Sarathi Das, Chirag Shah, Mengting Wan, Jennifer Neville, Longqi Yang, Reid Andersen, Georg Buscher, Tara Safavi

Abstract: The traditional Dialogue State Tracking (DST) problem aims to track user preferences and intents in user-agent conversations. While sufficient for task-oriented dialogue systems supporting narrow domain applications, the advent of Large Language Model (LLM)-based chat systems has introduced many real-world intricacies in open-domain dialogues. These intricacies manifest in the form of increased co… ▽ More The traditional Dialogue State Tracking (DST) problem aims to track user preferences and intents in user-agent conversations. While sufficient for task-oriented dialogue systems supporting narrow domain applications, the advent of Large Language Model (LLM)-based chat systems has introduced many real-world intricacies in open-domain dialogues. These intricacies manifest in the form of increased complexity in contextual interactions, extended dialogue sessions encompassing a diverse array of topics, and more frequent contextual shifts. To handle these intricacies arising from evolving LLM-based chat systems, we propose joint dialogue segmentation and state tracking per segment in open-domain dialogue systems. Assuming a zero-shot setting appropriate to a true open-domain dialogue system, we propose S3-DST, a structured prompting technique that harnesses Pre-Analytical Recollection, a novel grounding mechanism we designed for improving long context tracking. To demonstrate the efficacy of our proposed approach in joint segmentation and state tracking, we evaluate S3-DST on a proprietary anonymized open-domain dialogue dataset, as well as publicly available DST and segmentation datasets. Across all datasets and settings, S3-DST consistently outperforms the state-of-the-art, demonstrating its potency and robustness the next generation of LLM-based chat systems. △ Less

Submitted 15 September, 2023; originally announced September 2023.

arXiv:2308.16676 [pdf, other]

Twofold Structured Features-Based Siamese Network for Infrared Target Tracking

Authors: Wei-Jie Yan, Yun-Kai Xu, Qian Chen, Xiao-Fang Kong, Guo-Hua Gu, A-Jun Shao, Min-Jie Wan

Abstract: Nowadays, infrared target tracking has been a critical technology in the field of computer vision and has many applications, such as motion analysis, pedestrian surveillance, intelligent detection, and so forth. Unfortunately, due to the lack of color, texture and other detailed information, tracking drift often occurs when the tracker encounters infrared targets that vary in size or shape. To add… ▽ More Nowadays, infrared target tracking has been a critical technology in the field of computer vision and has many applications, such as motion analysis, pedestrian surveillance, intelligent detection, and so forth. Unfortunately, due to the lack of color, texture and other detailed information, tracking drift often occurs when the tracker encounters infrared targets that vary in size or shape. To address this issue, we present a twofold structured features-based Siamese network for infrared target tracking. First of all, in order to improve the discriminative capacity for infrared targets, a novel feature fusion network is proposed to fuse both shallow spatial information and deep semantic information into the extracted features in a comprehensive manner. Then, a multi-template update module based on template update mechanism is designed to effectively deal with interferences from target appearance changes which are prone to cause early tracking failures. Finally, both qualitative and quantitative experiments are carried out on VOT-TIR 2016 dataset, which demonstrates that our method achieves the balance of promising tracking performance and real-time tracking speed against other out-of-the-art trackers. △ Less

Submitted 26 June, 2024; v1 submitted 31 August, 2023; originally announced August 2023.

Comments: 13 pages,9 figures,references added

arXiv:2307.13110 [pdf, other]

Automatic Infant Respiration Estimation from Video: A Deep Flow-based Algorithm and a Novel Public Benchmark

Authors: Sai Kumar Reddy Manne, Shaotong Zhu, Sarah Ostadabbas, Michael Wan

Abstract: Respiration is a critical vital sign for infants, and continuous respiratory monitoring is particularly important for newborns. However, neonates are sensitive and contact-based sensors present challenges in comfort, hygiene, and skin health, especially for preterm babies. As a step toward fully automatic, continuous, and contactless respiratory monitoring, we develop a deep-learning method for es… ▽ More Respiration is a critical vital sign for infants, and continuous respiratory monitoring is particularly important for newborns. However, neonates are sensitive and contact-based sensors present challenges in comfort, hygiene, and skin health, especially for preterm babies. As a step toward fully automatic, continuous, and contactless respiratory monitoring, we develop a deep-learning method for estimating respiratory rate and waveform from plain video footage in natural settings. Our automated infant respiration flow-based network (AIRFlowNet) combines video-extracted optical flow input and spatiotemporal convolutional processing tuned to the infant domain. We support our model with the first public annotated infant respiration dataset with 125 videos (AIR-125), drawn from eight infant subjects, set varied pose, lighting, and camera conditions. We include manual respiration annotations and optimize AIRFlowNet training on them using a novel spectral bandpass loss function. When trained and tested on the AIR-125 infant data, our method significantly outperforms other state-of-the-art methods in respiratory rate estimation, achieving a mean absolute error of $\sim$2.9 breaths per minute, compared to $\sim$4.7--6.2 for other public models designed for adult subjects and more uniform environments. △ Less

Submitted 24 July, 2023; originally announced July 2023.

arXiv:2304.07952 [pdf, ps, other]

doi 10.1140/epjp/s13360-023-03978-3

Shadows and quasinormal modes of a charged non-commutative black hole by different methods

Authors: Zening Yan, Xiaoji Zhang, Maoyuan Wan, Chen Wu

Abstract: In this paper, we calculated the quasinormal modes (QNMs) of a charged non-commutative black hole in scalar, electromagnetic and gravitational fields by three methods. We gave the influence of non-commutative parameter $θ$ and charge $Q$ on QNMs in different fields. Thereafter, we calculated the shadow radius of the black hole and provided the valid range of $θ$ and $Q$ using the constraints on th… ▽ More In this paper, we calculated the quasinormal modes (QNMs) of a charged non-commutative black hole in scalar, electromagnetic and gravitational fields by three methods. We gave the influence of non-commutative parameter $θ$ and charge $Q$ on QNMs in different fields. Thereafter, we calculated the shadow radius of the black hole and provided the valid range of $θ$ and $Q$ using the constraints on the shadow radius of $\text{M87}^{\ast}$ and $\text{Sgr A}^{\ast}$ from the Event Horizon Telescope (EHT). In addition, we estimated the ``relative deviation'' of the shadow radius ($δ_{R_{s}}$) between non-commutative spacetime and commutative spacetime. We found that the maximum values of $δ_{R_{s}}$ decreases with the increase of charge $Q$. In other words, the non-commutativity of spacetime becomes harder to distinguish as the charge of the black hole increases. △ Less

Submitted 16 April, 2023; originally announced April 2023.

Comments: 25 pages, 12 figures

Journal ref: European Physical Journal Plus (2023) 138:377

arXiv:2304.03441 [pdf, other]

doi 10.1145/3543507.3583400

Large-Scale Analysis of New Employee Network Dynamics

Authors: Yulin Yu, Longqi Yang, Siân Lindley, Mengting Wan

Abstract: The COVID-19 pandemic has accelerated digital transformations across industries, but also introduced new challenges into workplaces, including the difficulties of effectively socializing with colleagues when working remotely. This challenge is exacerbated for new employees who need to develop workplace networks from the outset. In this paper, by analyzing a large-scale telemetry dataset of more th… ▽ More The COVID-19 pandemic has accelerated digital transformations across industries, but also introduced new challenges into workplaces, including the difficulties of effectively socializing with colleagues when working remotely. This challenge is exacerbated for new employees who need to develop workplace networks from the outset. In this paper, by analyzing a large-scale telemetry dataset of more than 10,000 Microsoft employees who joined the company in the first three months of 2022, we describe how new employees interact and telecommute with their colleagues during their ``onboarding'' period. Our results reveal that although new hires are gradually expanding networks over time, there still exists significant gaps between their network statistics and those of tenured employees even after the six-month onboarding phase. We also observe that heterogeneity exists among new employees in how their networks change over time, where employees whose job tasks do not necessarily require extensive and diverse connections could be at a disadvantaged position in this onboarding process. By investigating how web-based people recommendations in organizational knowledge base facilitate new employees naturally expand their networks, we also demonstrate the potential of web-based applications for addressing the aforementioned socialization challenges. Altogether, our findings provide insights on new employee network dynamics in remote and hybrid work environments, which may help guide organizational leaders and web application developers on quantifying and improving the socialization experiences of new employees in digital workplaces. △ Less

Submitted 6 April, 2023; originally announced April 2023.

Comments: Accepted at the International World Wide Web Conference (WWW,2023)

arXiv:2303.16867 [pdf, other]

A Video-based End-to-end Pipeline for Non-nutritive Sucking Action Recognition and Segmentation in Young Infants

Authors: Shaotong Zhu, Michael Wan, Elaheh Hatamimajoumerd, Kashish Jain, Samuel Zlota, Cholpady Vikram Kamath, Cassandra B. Rowan, Emma C. Grace, Matthew S. Goodwin, Marie J. Hayes, Rebecca A. Schwartz-Mette, Emily Zimmerman, Sarah Ostadabbas

Abstract: We present an end-to-end computer vision pipeline to detect non-nutritive sucking (NNS) -- an infant sucking pattern with no nutrition delivered -- as a potential biomarker for developmental delays, using off-the-shelf baby monitor video footage. One barrier to clinical (or algorithmic) assessment of NNS stems from its sparsity, requiring experts to wade through hours of footage to find minutes of… ▽ More We present an end-to-end computer vision pipeline to detect non-nutritive sucking (NNS) -- an infant sucking pattern with no nutrition delivered -- as a potential biomarker for developmental delays, using off-the-shelf baby monitor video footage. One barrier to clinical (or algorithmic) assessment of NNS stems from its sparsity, requiring experts to wade through hours of footage to find minutes of relevant activity. Our NNS activity segmentation algorithm solves this problem by identifying periods of NNS with high certainty -- up to 94.0\% average precision and 84.9\% average recall across 30 heterogeneous 60 s clips, drawn from our manually annotated NNS clinical in-crib dataset of 183 hours of overnight baby monitor footage from 19 infants. Our method is based on an underlying NNS action recognition algorithm, which uses spatiotemporal deep learning networks and infant-specific pose estimation, achieving 94.9\% accuracy in binary classification of 960 2.5 s balanced NNS vs. non-NNS clips. Tested on our second, independent, and public NNS in-the-wild dataset, NNS recognition classification reaches 92.3\% accuracy, and NNS segmentation achieves 90.8\% precision and 84.2\% recall. △ Less

Submitted 29 March, 2023; originally announced March 2023.

arXiv:2303.08786 [pdf, other]

doi 10.3390/atmos14101590

Evolution of a Stratified Turbulent Cloud under Rotation

Authors: Tianyi Li, Minping Wan, Shiyi Chen

Abstract: Localized turbulence is common in geophysical flows, where the roles of rotation and stratification are paramount. In this study, we investigate the evolution of a stratified turbulent cloud under rotation. Recognizing that a turbulent cloud is composed of vortices of varying scales and shapes, we start our investigation with a single eddy using analytical solutions derived from a linearized syste… ▽ More Localized turbulence is common in geophysical flows, where the roles of rotation and stratification are paramount. In this study, we investigate the evolution of a stratified turbulent cloud under rotation. Recognizing that a turbulent cloud is composed of vortices of varying scales and shapes, we start our investigation with a single eddy using analytical solutions derived from a linearized system. Compared to an eddy under pure rotation, the stratified eddy shows the physical manifestation of a known potential vorticity mode, appearing as a static stable vortex. In addition, the expected shift from inertial waves to inertial-gravity waves is observed. In our numerical simulations of the turbulent cloud, carried out at a constant Rossby number over a range of Froude numbers, stratification causes columnar structures to deviate from vertical alignment. This deviation increases with increasing stratification, slowing the expansion rate of the cloud. The observed characteristics of these columnar structures are consistent with the predictions of linear theory, particularly in their tilt angles and vertical growth rates, suggesting a significant influence of inertial-gravity waves. Using Lagrangian particle tracking, we have identified regions where wave activity dominates over turbulence. In scenarios of milder stratification, these inertial-gravity waves are responsible for a significant energy transfer away from the turbulent cloud, a phenomenon that attenuates with increasing stratification. △ Less

Submitted 3 November, 2023; v1 submitted 15 March, 2023; originally announced March 2023.

Journal ref: Atmosphere, vol. 14, no. 10, article 1590 (2023)

arXiv:2303.07057 [pdf]

Transition from turbulence-dominated to instability-dominated combustion regime in lean hydrogen-air flames

Authors: Hsu Chew Lee, Peng Dai, Minping Wan, Vladimir A. Sabelnikov, Andrei N. Lipatnikov

Abstract: Recent complex-chemistry direct numerical simulations of lean hydrogen-air flames propagating in forced turbulence in a box were continued by switching-off the turbulence forcing. Results show that a decrease in burning velocity U_T(t), caused by the turbulence decay, is reversed when the turbulence becomes weak and a peak of U_T(t) appears, with the peak magnitudes and associated Karlovitz number… ▽ More Recent complex-chemistry direct numerical simulations of lean hydrogen-air flames propagating in forced turbulence in a box were continued by switching-off the turbulence forcing. Results show that a decrease in burning velocity U_T(t), caused by the turbulence decay, is reversed when the turbulence becomes weak and a peak of U_T(t) appears, with the peak magnitudes and associated Karlovitz numbers being similar in two different cases. These results (i) are attributed to activation of laminar flame instabilities, which have been suppressed by intense turbulence, and (ii) are argued to indicate that the instabilities can substantially affect U_T in sufficiently weak turbulence only. △ Less

Submitted 13 March, 2023; originally announced March 2023.

arXiv:2302.11899 [pdf, other]

Turbulent burning velocity and thermo-diffusive instability of premixed flames

Authors: HsuChew Lee, BuChen Wu, Peng Dai, Minping Wan, Andrei Lipatnikov

Abstract: Reported in the paper are results of unsteady three-dimensional direct numerical simulations of laminar and turbulent, lean hydrogen-air, complex-chemistry flames propagating in forced turbulence in a box. To explore the eventual influence of thermo-diffusive instability of laminar flames on turbulent burning velocity, (i) a critical length scale Λn that bounds regimes of unstable and stable lamin… ▽ More Reported in the paper are results of unsteady three-dimensional direct numerical simulations of laminar and turbulent, lean hydrogen-air, complex-chemistry flames propagating in forced turbulence in a box. To explore the eventual influence of thermo-diffusive instability of laminar flames on turbulent burning velocity, (i) a critical length scale Λn that bounds regimes of unstable and stable laminar combustion is numerically determined by gradually decreasing the width Λ of computational domain until a stable laminar flame is obtained and (ii) simulations of turbulent flames are performed by varying the width from Λ < Λn (in this case, the instability is suppressed) to Λ > Λn (in this case, the instability may grow). Moreover, simulations are performed either using mixture-averaged transport properties (low Lewis number flames) or setting diffusivities of all species equal to heat diffusivity of the mixture (equidiffusive flames), with all other things being equal. Obtained results show a significant increase in turbulent burning velocity UT when the boundary Λ = Λn is crossed in weak turbulence, but almost equal values of UT are computed at Λ < Λn and Λ > Λn in moderately turbulent flames characterized by Karlovitz number equal to 3.4 or larger. These results imply that thermo-diffusive instability of laminar premixed flames substantially affects burning velocity in weak turbulence only, in line with a simple criterion proposed by Chomiak and Lipatnikov (Phys. Rev. E 107, 015102, 2023). △ Less

Submitted 23 February, 2023; originally announced February 2023.

arXiv:2301.07449 [pdf, ps, other]

doi 10.1140/epjc/s10052-022-11167-2

Absorption and scattering of a high dimensional noncommutative black hole

Authors: Mao-Yuan Wan, Chen Wu

Abstract: In this work, we investigate the scattering of massless plane scalar waves by the high dimensional noncommutative Schwarzschild-Tangherlini black hole. We use the partial wave approach to determine the scattering and absorption cross sections in the incident wavelength range. Our numerical results demonstrate that the bigger the noncommutative parameter, the smaller the maximum value of the relate… ▽ More In this work, we investigate the scattering of massless plane scalar waves by the high dimensional noncommutative Schwarzschild-Tangherlini black hole. We use the partial wave approach to determine the scattering and absorption cross sections in the incident wavelength range. Our numerical results demonstrate that the bigger the noncommutative parameter, the smaller the maximum value of the related partial absorption cross section, however the tendency is slightly. We also discovered that when the noncommutative parameter is weak, the absorption cross section of the high dimensional black hole oscillates in the low frequency zone. The total absorption cross section oscillates around the geometrical optical limit in the high frequency range, and the scattering characteristics of black holes with various parameters are visibly different. The influence on the differential scattering cross section is particularly pronounced at large angles. △ Less

Submitted 18 January, 2023; originally announced January 2023.

Comments: 10 pages and 4 figures

Journal ref: Eur. Phys. J. C (2023) 83:28

arXiv:2212.03617 [pdf, other]

Energy transfer and third-order law in forced anisotropic MHD turbulence with hyperviscosity

Authors: Bin Jiang, Cheng Li, Yan Yang, Kangcheng Zhou, William, H. Matthaeus, Minping Wan

Abstract: The Kolmogorov-Yaglom (third-order) law, links energy transfer rates in the inertial range of magneto-hydrodynamic (MHD) turbulence with third-order structure functions. Anisotropy, a typical property in the solar wind, largely challenges the applicability of the third-order law with isotropic assumption. To shed light on the energy transfer process in the presence of anisotropy, the present study… ▽ More The Kolmogorov-Yaglom (third-order) law, links energy transfer rates in the inertial range of magneto-hydrodynamic (MHD) turbulence with third-order structure functions. Anisotropy, a typical property in the solar wind, largely challenges the applicability of the third-order law with isotropic assumption. To shed light on the energy transfer process in the presence of anisotropy, the present study conducted direct numerical simulations (DNSs) on forced MHD turbulence with normal and hyper-viscosity under various strengths of the external magnetic field ($B_0$), and calculated three forms of third-order structure function with or without averaging azimuthal or polar angles to $B_0$ direction. Correspondingly, three forms of estimated energy transfer rates were studied systematically with various $B_0$. The result shows that the peak of the estimated longitudinal transfer rate occurs at larger scales as closer to the $B_0$ direction, and its maximum shifts away from the $B_0$ direction at larger $B_0$. Compared with normal viscous cases, hyper-viscous cases can attain better separation of the inertial range from the dissipation range, thus facilitating the analyses of the inertial range properties and the estimation of the energy cascade rates. The direction-averaged third-order structure function over a spherical surface proposed in literature predicts the energy transfer rates and inertial range accurately, even at very high $B_0$. With limited statistics, the calculation of the third-order structure function shows a stronger dependence on averaging of azimuthal angles than the time, especially at high $B_0$ cases. These findings provide insights into the anisotropic effect on the estimation of energy transfer rates. △ Less

Submitted 7 December, 2022; originally announced December 2022.

arXiv:2212.01798 [pdf, ps, other]

doi 10.1007/s10714-022-03034-y

Absorption and scattering of massless scalar wave from Regular Black Holes

Authors: Mao-Yuan Wan, Chen Wu

Abstract: In this work, we numerically investigate the scattering and absorption cross section of the massless scalar field from some well-known regular black holes by using the partial wave approach. Our computational results indicate that the larger the parameters, the lower the associated total absorption cross section maxima. When compared to the Schwarzschild black hole, the scattering cross section is… ▽ More In this work, we numerically investigate the scattering and absorption cross section of the massless scalar field from some well-known regular black holes by using the partial wave approach. Our computational results indicate that the larger the parameters, the lower the associated total absorption cross section maxima. When compared to the Schwarzschild black hole, the scattering cross section is enhanced in some regular black hole spacetimes, meanwhile the scattering width is narrow in the forward orientation. Moreover, it is found that the null geodesics of the critical impact parameter and the geometrical optical value in the high frequency regime have similar changing behavior. △ Less

Submitted 5 December, 2022; v1 submitted 4 December, 2022; originally announced December 2022.

Comments: 15 pages, 5 figures

Journal ref: General Relativity and Gravitation (2022)54:148

arXiv:2212.00001 [pdf, other]

Model-Free Forecasting of Partially Observable Spatiotemporally Chaotic Systems

Authors: Vikrant Gupta, Larry K. B. Li, Shiyi Chen, Minping Wan

Abstract: Reservoir computing is a powerful tool for forecasting turbulence because its simple architecture has the computational efficiency to handle large systems. Its implementation, however, often requires full state-vector measurements and knowledge of the system nonlinearities. We use nonlinear projector functions to expand the system measurements to a high dimensional space and then feed them to a re… ▽ More Reservoir computing is a powerful tool for forecasting turbulence because its simple architecture has the computational efficiency to handle large systems. Its implementation, however, often requires full state-vector measurements and knowledge of the system nonlinearities. We use nonlinear projector functions to expand the system measurements to a high dimensional space and then feed them to a reservoir to obtain forecasts. We demonstrate the application of such reservoir computing networks on spatiotemporally chaotic systems, which model several features of turbulence. We show that using radial basis functions as nonlinear projectors enables complex system nonlinearities to be captured robustly even with only partial observations and without knowing the governing equations. Finally, we show that when measurements are sparse or incomplete and noisy, such that even the governing equations become inaccurate, our networks can still produce reasonably accurate forecasts, thus paving the way towards model-free forecasting of practical turbulent systems. △ Less

Submitted 12 October, 2022; originally announced December 2022.

Comments: 15 pages, 7 figures, currently submitted to neural networks

arXiv:2211.06365 [pdf, other]

Situating Recommender Systems in Practice: Towards Inductive Learning and Incremental Updates

Authors: Tobias Schnabel, Mengting Wan, Longqi Yang

Abstract: With information systems becoming larger scale, recommendation systems are a topic of growing interest in machine learning research and industry. Even though progress on improving model design has been rapid in research, we argue that many advances fail to translate into practice because of two limiting assumptions. First, most approaches focus on a transductive learning setting which cannot handl… ▽ More With information systems becoming larger scale, recommendation systems are a topic of growing interest in machine learning research and industry. Even though progress on improving model design has been rapid in research, we argue that many advances fail to translate into practice because of two limiting assumptions. First, most approaches focus on a transductive learning setting which cannot handle unseen users or items and second, many existing methods are developed for static settings that cannot incorporate new data as it becomes available. We argue that these are largely impractical assumptions on real-world platforms where new user interactions happen in real time. In this survey paper, we formalize both concepts and contextualize recommender systems work from the last six years. We then discuss why and how future work should move towards inductive learning and incremental updates for recommendation model design and evaluation. In addition, we present best practices and fundamental open challenges for future research. △ Less

Submitted 11 November, 2022; originally announced November 2022.

arXiv:2210.15022 [pdf, other]

Automatic Assessment of Infant Face and Upper-Body Symmetry as Early Signs of Torticollis

Authors: Michael Wan, Xiaofei Huang, Bethany Tunik, Sarah Ostadabbas

Abstract: We apply computer vision pose estimation techniques developed expressly for the data-scarce infant domain to the study of torticollis, a common condition in infants for which early identification and treatment is critical. Specifically, we use a combination of facial landmark and body joint estimation techniques designed for infants to estimate a range of geometric measures pertaining to face and… ▽ More We apply computer vision pose estimation techniques developed expressly for the data-scarce infant domain to the study of torticollis, a common condition in infants for which early identification and treatment is critical. Specifically, we use a combination of facial landmark and body joint estimation techniques designed for infants to estimate a range of geometric measures pertaining to face and upper body symmetry, drawn from an array of sources in the physical therapy and ophthalmology research literature in torticollis. We gauge performance with a range of metrics and show that the estimates of most these geometric measures are successful, yielding strong to very strong Spearman's $ρ$ correlation with ground truth values. Furthermore, we show that these estimates, derived from pose estimation neural networks designed for the infant domain, cleanly outperform estimates derived from more widely known networks designed for the adult domain △ Less

Submitted 7 November, 2022; v1 submitted 26 October, 2022; originally announced October 2022.

arXiv:2210.11921 [pdf, other]

doi 10.1017/jfm.2023.573

Multi-scale data reconstruction of turbulent rotating flows with Gappy POD, Extended POD and Generative Adversarial Networks

Authors: Tianyi Li, Michele Buzzicotti, Luca Biferale, Fabio Bonaccorso, Shiyi Chen, Minping Wan

Abstract: Data reconstruction of rotating turbulent snapshots is investigated utilizing data-driven tools. This problem is crucial for numerous geophysical applications and fundamental aspects, given the concurrent effects of direct and inverse energy cascades, which lead to non-Gaussian statistics at both large and small scales. Data assimilation also serves as a tool to rank physical features within turbu… ▽ More Data reconstruction of rotating turbulent snapshots is investigated utilizing data-driven tools. This problem is crucial for numerous geophysical applications and fundamental aspects, given the concurrent effects of direct and inverse energy cascades, which lead to non-Gaussian statistics at both large and small scales. Data assimilation also serves as a tool to rank physical features within turbulence, by evaluating the performance of reconstruction in terms of the quality and quantity of the information used. Additionally, benchmarking various reconstruction techniques is essential to assess the trade-off between quantitative supremacy, implementation complexity, and explicability. In this study, we use linear and non-linear tools based on the Proper Orthogonal Decomposition (POD) and Generative Adversarial Network (GAN) for reconstructing rotating turbulence snapshots with spatial damages (inpainting). We focus on accurately reproducing both statistical properties and instantaneous velocity fields. Different gap sizes and gap geometries are investigated in order to assess the importance of coherency and multi-scale properties of the missing information. Surprisingly enough, concerning point-wise reconstruction, the non-linear GAN does not outperform one of the linear POD techniques. On the other hand, supremacy of the GAN approach is shown when the statistical multi-scale properties are compared. Similarly, extreme events in the gap region are better predicted when using GAN. The balance between point-wise error and statistical properties is controlled by the adversarial ratio, which determines the relative importance of the generator and the discriminator in the GAN training. Robustness against the measurement noise is also discussed. △ Less

Submitted 3 November, 2023; v1 submitted 21 October, 2022; originally announced October 2022.

Journal ref: J. Fluid Mech. 971, A3 (2023)

arXiv:2209.04079 [pdf, other]

doi 10.1103/PhysRevC.107.L041601

Observing the Ping-pong Modality of Isospin Degree of Freedom in Cluster Emission from Heavy Ion Reactions

Authors: Yijie Wang, Fenhai Guan, Xinyue Diao, Mengting Wan, Yuhao Qin, Zhi Qin, Qianghua Wu, Dong Guo, Dawei Si, Sheng Xiao, Boyuan Zhang, Yaopeng Zhang, Baiting Tian, Xianglun Wei, Herun Yang, Peng Ma, Rongjiang Hu, Limin Duan, Fangfang Duan, Qiang Hu, Junbing Ma, Shiwei Xu, Zhen Bai, Yanyun Yang, Jiansong Wang , et al. (14 additional authors not shown)

Abstract: Two-body correlations of the isotope-resolved light and heavy clusters are measured in $^{86}$Kr+$^{\rm 208}$Pb reactions at 25 MeV/u. The yield and kinetic variables of the $A=3$ isobars, triton and $^3$He, are analyzed in coincidence with the heavy clusters of $7\le A \le 14$ emitted at the earlier chance. While the velocity spectra of both triton and $^3$He exhibit scaling behavior over the typ… ▽ More Two-body correlations of the isotope-resolved light and heavy clusters are measured in $^{86}$Kr+$^{\rm 208}$Pb reactions at 25 MeV/u. The yield and kinetic variables of the $A=3$ isobars, triton and $^3$He, are analyzed in coincidence with the heavy clusters of $7\le A \le 14$ emitted at the earlier chance. While the velocity spectra of both triton and $^3$He exhibit scaling behavior over the type of the heavy clusters, the yield ratios of ${\rm t/^3He}$ correlate reversely to the neutron-to-proton ratio $N/Z$ of the latter, showing the ping-pong modality of the $N/Z$ of emitted clusters. The commonality that the $N/Z$ of the residues keeps the initial system value is extended to the cluster emission in heavy ion reactions. The comparison of transport model calculations to the data is discussed. △ Less

Submitted 8 September, 2022; originally announced September 2022.

arXiv:2207.09352 [pdf, other]

Computer Vision to the Rescue: Infant Postural Symmetry Estimation from Incongruent Annotations

Authors: Xiaofei Huang, Michael Wan, Lingfei Luan, Bethany Tunik, Sarah Ostadabbas

Abstract: Bilateral postural symmetry plays a key role as a potential risk marker for autism spectrum disorder (ASD) and as a symptom of congenital muscular torticollis (CMT) in infants, but current methods of assessing symmetry require laborious clinical expert assessments. In this paper, we develop a computer vision based infant symmetry assessment system, leveraging 3D human pose estimation for infants.… ▽ More Bilateral postural symmetry plays a key role as a potential risk marker for autism spectrum disorder (ASD) and as a symptom of congenital muscular torticollis (CMT) in infants, but current methods of assessing symmetry require laborious clinical expert assessments. In this paper, we develop a computer vision based infant symmetry assessment system, leveraging 3D human pose estimation for infants. Evaluation and calibration of our system against ground truth assessments is complicated by our findings from a survey of human ratings of angle and symmetry, that such ratings exhibit low inter-rater reliability. To rectify this, we develop a Bayesian estimator of the ground truth derived from a probabilistic graphical model of fallible human raters. We show that the 3D infant pose estimation model can achieve 68% area under the receiver operating characteristic curve performance in predicting the Bayesian aggregate labels, compared to only 61% from a 2D infant pose estimation model and 60% from a 3D adult pose estimation model, highlighting the importance of 3D poses and infant domain knowledge in assessing infant body symmetry. Our survey analysis also suggests that human ratings are susceptible to higher levels of bias and inconsistency, and hence our final 3D pose-based symmetry assessment system is calibrated but not directly supervised by Bayesian aggregate human ratings, yielding higher levels of consistency and lower levels of inter-limb assessment bias. △ Less

Submitted 19 July, 2022; originally announced July 2022.

arXiv:2207.04049 [pdf, other]

Learning Causal Effects on Hypergraphs

Authors: Jing Ma, Mengting Wan, Longqi Yang, Jundong Li, Brent Hecht, Jaime Teevan

Abstract: Hypergraphs provide an effective abstraction for modeling multi-way group interactions among nodes, where each hyperedge can connect any number of nodes. Different from most existing studies which leverage statistical dependencies, we study hypergraphs from the perspective of causality. Specifically, in this paper, we focus on the problem of individual treatment effect (ITE) estimation on hypergra… ▽ More Hypergraphs provide an effective abstraction for modeling multi-way group interactions among nodes, where each hyperedge can connect any number of nodes. Different from most existing studies which leverage statistical dependencies, we study hypergraphs from the perspective of causality. Specifically, in this paper, we focus on the problem of individual treatment effect (ITE) estimation on hypergraphs, aiming to estimate how much an intervention (e.g., wearing face covering) would causally affect an outcome (e.g., COVID-19 infection) of each individual node. Existing works on ITE estimation either assume that the outcome on one individual should not be influenced by the treatment assignments on other individuals (i.e., no interference), or assume the interference only exists between pairs of connected individuals in an ordinary graph. We argue that these assumptions can be unrealistic on real-world hypergraphs, where higher-order interference can affect the ultimate ITE estimations due to the presence of group interactions. In this work, we investigate high-order interference modeling, and propose a new causality learning framework powered by hypergraph neural networks. Extensive experiments on real-world hypergraphs verify the superiority of our framework over existing baselines. △ Less

Submitted 7 July, 2022; originally announced July 2022.

arXiv:2204.13483 [pdf]

doi 10.1109/ITSC55140.2022.9922539

TJ4DRadSet: A 4D Radar Dataset for Autonomous Driving

Authors: Lianqing Zheng, Zhixiong Ma, Xichan Zhu, Bin Tan, Sen Li, Kai Long, Weiqi Sun, Sihan Chen, Lu Zhang, Mengyue Wan, Libo Huang, Jie Bai

Abstract: The next-generation high-resolution automotive radar (4D radar) can provide additional elevation measurement and denser point clouds, which has great potential for 3D sensing in autonomous driving. In this paper, we introduce a dataset named TJ4DRadSet with 4D radar points for autonomous driving research. The dataset was collected in various driving scenarios, with a total of 7757 synchronized fra… ▽ More The next-generation high-resolution automotive radar (4D radar) can provide additional elevation measurement and denser point clouds, which has great potential for 3D sensing in autonomous driving. In this paper, we introduce a dataset named TJ4DRadSet with 4D radar points for autonomous driving research. The dataset was collected in various driving scenarios, with a total of 7757 synchronized frames in 44 consecutive sequences, which are well annotated with 3D bounding boxes and track ids. We provide a 4D radar-based 3D object detection baseline for our dataset to demonstrate the effectiveness of deep learning methods for 4D radar point clouds. The dataset can be accessed via the following link: https://github.com/TJRadarLab/TJ4DRadSet. △ Less

Submitted 27 July, 2022; v1 submitted 28 April, 2022; originally announced April 2022.

Comments: 2022 IEEE International Intelligent Transportation Systems Conference (ITSC 2022)

arXiv:2202.02409 [pdf, other]

doi 10.3847/1538-4357/ac5d3e

Pressure-Strain Interaction as the Energy Dissipation Estimate in Collisionless Plasma

Authors: Yan Yang, William H. Matthaeus, Sohom Roy, Vadim Roytershteyn, Tulasi Parashar, Riddhi Bandyopadhyay, Minping Wan

Abstract: The dissipative mechanism in weakly collisional plasma is a topic that pervades decades of studies without a consensus solution. We compare several energy dissipation estimates based on energy transfer processes in plasma turbulence and provide justification for the pressure-strain interaction as a direct estimate of the energy dissipation rate. The global and scale-by-scale energy balances are ex… ▽ More The dissipative mechanism in weakly collisional plasma is a topic that pervades decades of studies without a consensus solution. We compare several energy dissipation estimates based on energy transfer processes in plasma turbulence and provide justification for the pressure-strain interaction as a direct estimate of the energy dissipation rate. The global and scale-by-scale energy balances are examined in 2.5D and 3D kinetic simulations. We show that the global internal energy increase and the temperature enhancement of each species are directly tracked by the pressure-strain interaction. The incompressive part of the pressure-strain interaction dominates over its compressive part in all simulations considered. The scale-by-scale energy balance is quantified by scale filtered Vlasov-Maxwell equations, a kinetic plasma approach, and the lag dependent von Kármán-Howarth equation, an approach based on fluid models. We find that the energy balance is exactly satisfied across all scales, but the lack of a well-defined inertial range influences the distribution of the energy budget among different terms in the inertial range. Therefore, the widespread use of the Yaglom relation to estimating dissipation rate is questionable in some cases, especially when the scale separation in the system is not clearly defined. In contrast, the pressure-strain interaction balances exactly the dissipation rate at kinetic scales regardless of the scale separation. △ Less

Submitted 4 February, 2022; originally announced February 2022.

Comments: 19 pages, 7 figures

arXiv:2201.03662 [pdf, other]

doi 10.1145/3488560.3498391

Learning Fair Node Representations with Graph Counterfactual Fairness

Authors: Jing Ma, Ruocheng Guo, Mengting Wan, Longqi Yang, Aidong Zhang, Jundong Li

Abstract: Fair machine learning aims to mitigate the biases of model predictions against certain subpopulations regarding sensitive attributes such as race and gender. Among the many existing fairness notions, counterfactual fairness measures the model fairness from a causal perspective by comparing the predictions of each individual from the original data and the counterfactuals. In counterfactuals, the se… ▽ More Fair machine learning aims to mitigate the biases of model predictions against certain subpopulations regarding sensitive attributes such as race and gender. Among the many existing fairness notions, counterfactual fairness measures the model fairness from a causal perspective by comparing the predictions of each individual from the original data and the counterfactuals. In counterfactuals, the sensitive attribute values of this individual had been modified. Recently, a few works extend counterfactual fairness to graph data, but most of them neglect the following facts that can lead to biases: 1) the sensitive attributes of each node's neighbors may causally affect the prediction w.r.t. this node; 2) the sensitive attributes may causally affect other features and the graph structure. To tackle these issues, in this paper, we propose a novel fairness notion - graph counterfactual fairness, which considers the biases led by the above facts. To learn node representations towards graph counterfactual fairness, we propose a novel framework based on counterfactual data augmentation. In this framework, we generate counterfactuals corresponding to perturbations on each node's and their neighbors' sensitive attributes. Then we enforce fairness by minimizing the discrepancy between the representations learned from the original graph and the counterfactuals for each node. Experiments on both synthetic and real-world graphs show that our framework outperforms the state-of-the-art baselines in graph counterfactual fairness, and also achieves comparable prediction performance. △ Less

Submitted 10 January, 2022; originally announced January 2022.

Comments: 9 pages, 4 figures

arXiv:2112.08873 [pdf, other]

Non-equilibrium time-relaxation kinetic model for compressible turbulence modeling

Authors: Guiyu Cao, Liang Pan, Kun Xu, Minping Wan, Shiyi Chen

Abstract: For the first time, the non-equilibrium time-relaxation kinetic model (NTRKM) is proposed for compressible turbulence modeling on unresolved grids. Within the non-equilibrium time-relaxation framework, NTRKM is extended in the form of modified Bhatnagar-Gross-Krook model. Based on the first-order Chapman-Enskog expansion, NTRKM connects with the six-variable macroscopic governing equations. The fi… ▽ More For the first time, the non-equilibrium time-relaxation kinetic model (NTRKM) is proposed for compressible turbulence modeling on unresolved grids. Within the non-equilibrium time-relaxation framework, NTRKM is extended in the form of modified Bhatnagar-Gross-Krook model. Based on the first-order Chapman-Enskog expansion, NTRKM connects with the six-variable macroscopic governing equations. The first five governing equations correspond to the conservative laws in mass, momentum and total energy, while the sixth equation governs the evolution of unresolved turbulence kinetic energy Kutke. The unknowns in NTRKM, including turbulent relaxation time and source term, are determined by essential gradient-type assumption and standard dynamic modeling approach. Current generalized kinetic model on unresolved grids consequently offers a profound mesoscopic understanding for one-equation subgrid-scale turbulence kinetic energy Ksgs model in compressible large eddy simulation. To solve NTRKM accurately and robustly, a non-equilibrium gas-kinetic scheme is developed, which succeeds the well-established gas-kinetic scheme for simulating Navier-Stokes equations. Three-dimensional decaying compressible isotropic turbulence and temporal compressible plane mixing layer on unresolved grids are simulated to evaluate the generalized kinetic model and non-equilibrium gas-kinetic scheme. The performance of key turbulent quantities up to second-order statistics confirms that NTRKM is comparable with the widely-used eddy-viscosity Smagorinsky model (SM) and dynamic Smagorinsky model (DSM). Specifically, compared with the DNS solution in temporal compressible plane mixing layer, the performance of NTRKM is much closer with DSM and better than SM. This study provides a workable approach for compressible turbulence modeling on unresolved grids. △ Less

Submitted 16 December, 2021; originally announced December 2021.

Comments: 35 pages

arXiv:2112.06286 [pdf, other]

doi 10.3847/1538-4357/abdf58

Effects of forcing mechanisms on the multiscale properties of magnetohydrodynamics

Authors: Yan Yang, Moritz Linkmann, Luca Biferale, Minping Wan

Abstract: We performed numerical simulations to study the response of magnetohydrodynamics (MHD) to large-scale stochastic forcing mechanisms parametrized by one parameter, $0 \le a \le1$, going from direct injection on the velocity field ($a = 1$) to stirring acts on the magnetic field only ($a = 0$). We study the multi-scale properties of the energy transfer, by splitting the total flux in channels mediat… ▽ More We performed numerical simulations to study the response of magnetohydrodynamics (MHD) to large-scale stochastic forcing mechanisms parametrized by one parameter, $0 \le a \le1$, going from direct injection on the velocity field ($a = 1$) to stirring acts on the magnetic field only ($a = 0$). We study the multi-scale properties of the energy transfer, by splitting the total flux in channels mediated by (i) the kinetic non-linear advection, (ii) the Lorentz force, (iii) the magnetic advection and (iv) magnetic stretching term. We further decompose the fluxes in two sub-channels given by heterochiral and homochiral components in order to distinguish forward, inverse and flux-loop cascades. We show that there exists a quasi-singular role of the magnetic forcing mechanism for $a \sim 1$: a small injection on the magnetic field $a < 1$ can strongly deplete the mean flux of kinetic energy transfer throughout the kinetic non-linear advection channel. We also show that this negligible mean flux is the result of a flux-loop balance between heterochiral (direct) and homochiral (inverse) transfers. Conversely, both homochiral and heterochiral channels transfer energy forward for the other three channels. Cross exchange between velocity and the magnetic field is reversed around $a = 0.4$ and except when $a \sim 1$ we always observe that heterochiral mixed velocity-magnetic energy triads tend to move energy from magnetic to velocity fields. Our study is an attempt to further characterize the multi-scale nature of MHD dynamics, by disentangling different properties of the total energy transfer mechanisms, which can be useful for improving sub-grid-modelling. △ Less

Submitted 12 December, 2021; originally announced December 2021.

Journal ref: The Astrophysical Journal 909 (2021): 175

arXiv:2111.03015 [pdf, other]

Modeling Techniques for Machine Learning Fairness: A Survey

Authors: Mingyang Wan, Daochen Zha, Ninghao Liu, Na Zou

Abstract: Machine learning models are becoming pervasive in high-stakes applications. Despite their clear benefits in terms of performance, the models could show discrimination against minority groups and result in fairness issues in a decision-making process, leading to severe negative impacts on the individuals and the society. In recent years, various techniques have been developed to mitigate the unfair… ▽ More Machine learning models are becoming pervasive in high-stakes applications. Despite their clear benefits in terms of performance, the models could show discrimination against minority groups and result in fairness issues in a decision-making process, leading to severe negative impacts on the individuals and the society. In recent years, various techniques have been developed to mitigate the unfairness for machine learning models. Among them, in-processing methods have drawn increasing attention from the community, where fairness is directly taken into consideration during model design to induce intrinsically fair models and fundamentally mitigate fairness issues in outputs and representations. In this survey, we review the current progress of in-processing fairness mitigation techniques. Based on where the fairness is achieved in the model, we categorize them into explicit and implicit methods, where the former directly incorporates fairness metrics in training objectives, and the latter focuses on refining latent representation learning. Finally, we conclude the survey with a discussion of the research challenges in this community to motivate future exploration. △ Less

Submitted 9 April, 2022; v1 submitted 4 November, 2021; originally announced November 2021.

Comments: 26 pages, 4 figures

arXiv:2110.08935 [pdf, other]

InfAnFace: Bridging the infant-adult domain gap in facial landmark estimation in the wild

Authors: Michael Wan, Shaotong Zhu, Lingfei Luan, Gulati Prateek, Xiaofei Huang, Rebecca Schwartz-Mette, Marie Hayes, Emily Zimmerman, Sarah Ostadabbas

Abstract: We lay the groundwork for research in the algorithmic comprehension of infant faces, in anticipation of applications from healthcare to psychology, especially in the early prediction of developmental disorders. Specifically, we introduce the first-ever dataset of infant faces annotated with facial landmark coordinates and pose attributes, demonstrate the inadequacies of existing facial landmark es… ▽ More We lay the groundwork for research in the algorithmic comprehension of infant faces, in anticipation of applications from healthcare to psychology, especially in the early prediction of developmental disorders. Specifically, we introduce the first-ever dataset of infant faces annotated with facial landmark coordinates and pose attributes, demonstrate the inadequacies of existing facial landmark estimation algorithms in the infant domain, and train new state-of-the-art models that significantly improve upon those algorithms using domain adaptation techniques. We touch on the closely related task of facial detection for infants, and also on a challenging case study of infrared baby monitor images gathered by our lab as part of in-field research into the aforementioned developmental issues. △ Less

Submitted 26 May, 2022; v1 submitted 17 October, 2021; originally announced October 2021.

arXiv:2109.09031 [pdf, other]

Hindsight Foresight Relabeling for Meta-Reinforcement Learning

Authors: Michael Wan, Jian Peng, Tanmay Gangwani

Abstract: Meta-reinforcement learning (meta-RL) algorithms allow for agents to learn new behaviors from small amounts of experience, mitigating the sample inefficiency problem in RL. However, while meta-RL agents can adapt quickly to new tasks at test time after experiencing only a few trajectories, the meta-training process is still sample-inefficient. Prior works have found that in the multi-task RL setti… ▽ More Meta-reinforcement learning (meta-RL) algorithms allow for agents to learn new behaviors from small amounts of experience, mitigating the sample inefficiency problem in RL. However, while meta-RL agents can adapt quickly to new tasks at test time after experiencing only a few trajectories, the meta-training process is still sample-inefficient. Prior works have found that in the multi-task RL setting, relabeling past transitions and thus sharing experience among tasks can improve sample efficiency and asymptotic performance. We apply this idea to the meta-RL setting and devise a new relabeling method called Hindsight Foresight Relabeling (HFR). We construct a relabeling distribution using the combination of "hindsight", which is used to relabel trajectories using reward functions from the training task distribution, and "foresight", which takes the relabeled trajectories and computes the utility of each trajectory for each task. HFR is easy to implement and readily compatible with existing meta-RL algorithms. We find that HFR improves performance when compared to other relabeling methods on a variety of meta-RL tasks. △ Less

Submitted 25 April, 2022; v1 submitted 18 September, 2021; originally announced September 2021.

Comments: ICLR 2022 camera-ready

arXiv:2107.08609 [pdf, other]

High-order gas-kinetic scheme in general curvilinear coordinate for iLES of compressible wall-bounded turbulent flows

Authors: Guiyu Cao, Kun Xu, Liang Pan, Minping Wan, Shiyi Chen

Abstract: In this paper, a high-order gas-kinetic scheme in general curvilinear coordinate (HGKS-cur) is developed for the numerical simulation of compressible turbulence. Based on the coordinate transformation, the Bhatnagar-Gross-Krook (BGK) equation is transformed from physical space to computational space. To deal with the general mesh given by discretized points, the geometrical metrics need to be cons… ▽ More In this paper, a high-order gas-kinetic scheme in general curvilinear coordinate (HGKS-cur) is developed for the numerical simulation of compressible turbulence. Based on the coordinate transformation, the Bhatnagar-Gross-Krook (BGK) equation is transformed from physical space to computational space. To deal with the general mesh given by discretized points, the geometrical metrics need to be constructed by the dimension-by-dimension Lagrangian interpolation. The multidimensional weighted essentially non-oscillatory (WENO) reconstruction is adopted in the computational domain for spatial accuracy, where the reconstructed variables are the cell averaged Jacobian and the Jacobian-weighted conservative variables. The two-stage fourth-order method, which was developed for spatial-temporal coupled flow solvers, is used for temporal discretization. The numerical examples for inviscid and laminar flows validate the accuracy and geometrical conservation law of HGKS-cur. As a direct application, HGKS-cur is implemented for the implicit large eddy simulation (iLES) in compressible wall-bounded turbulent flows, including the compressible turbulent channel flow and compressible turbulent flow over periodic hills. The iLES results with HGKS-cur are in good agreement with the refereed spectral methods and high-order finite volume methods. The performance of HGKS-cur demonstrates its capability as a powerful tool for the numerical simulation of compressible wall-bounded turbulent flows and massively separated flows. △ Less

Submitted 19 July, 2021; originally announced July 2021.

Comments: high-order gas-kinetic scheme, general curvilinear coordinate, implicit large eddy simulation, wall-bounded turbulent flows, compressible turbulence

arXiv:2107.04951 [pdf, ps, other]

doi 10.1017/jfm.2022.35

Acceleration of tracer and light particles in compressible homogeneous isotropic turbulence

Authors: Xiangjun Wang, Minping Wan, Luca Biferale

Abstract: The accelerations of tracer and light particles in compressible homogeneous isotropic turbulence (CHIT) is investigated by using data from direct numerical simulations (DNS) up to turbulent Mach number $M_t =1$. For tracer particles, the flatness factor of acceleration components, $F_a$, increases gradually for $M_t \in [0.3, 1]$. On the contrary, $F_a$ for light particles develops a maximum aroun… ▽ More The accelerations of tracer and light particles in compressible homogeneous isotropic turbulence (CHIT) is investigated by using data from direct numerical simulations (DNS) up to turbulent Mach number $M_t =1$. For tracer particles, the flatness factor of acceleration components, $F_a$, increases gradually for $M_t \in [0.3, 1]$. On the contrary, $F_a$ for light particles develops a maximum around $M_t \sim 0.6$. The PDF of longitudinal acceleration of tracers is increasingly skewed towards the negative value as $M_t$ increases. By contrast, for light particles, the skewness factor of longitudinal acceleration, $S_a$, firstly becomes more negative with the increase of $M_t$, and then goes back to $0$ when $M_t$ is larger than $0.6$. Similarly, differences among tracers and light particles appear also in the zero-crossing time of acceleration correlation. It is argued that all these phenomenons are intimately linked to the flow structures in compression regions, e.g. close to shocklets. △ Less

Submitted 10 July, 2021; originally announced July 2021.

Showing 1–50 of 96 results for author: Wan, M