subscribe to arXiv mailings

Estimating Treatment Effects under Recommender Interference: A Structured Neural Networks Approach

Authors: Ruohan Zhan, Shichao Han, Yuchen Hu, Zhenling Jiang

Abstract: Recommender systems are essential for content-sharing platforms by curating personalized content. To evaluate updates to recommender systems targeting content creators, platforms frequently rely on creator-side randomized experiments. The treatment effect measures the change in outcomes when a new algorithm is implemented compared to the status quo. We show that the standard difference-in-means es… ▽ More Recommender systems are essential for content-sharing platforms by curating personalized content. To evaluate updates to recommender systems targeting content creators, platforms frequently rely on creator-side randomized experiments. The treatment effect measures the change in outcomes when a new algorithm is implemented compared to the status quo. We show that the standard difference-in-means estimator can lead to biased estimates due to recommender interference that arises when treated and control creators compete for exposure. We propose a "recommender choice model" that describes which item gets exposed from a pool containing both treated and control items. By combining a structural choice model with neural networks, this framework directly models the interference pathway while accounting for rich viewer-content heterogeneity. We construct a debiased estimator of the treatment effect and prove it is $\sqrt n$-consistent and asymptotically normal with potentially correlated samples. We validate our estimator's empirical performance with a field experiment on Weixin short-video platform. In addition to the standard creator-side experiment, we conduct a costly double-sided randomization design to obtain a benchmark estimate free from interference bias. We show that the proposed estimator yields results comparable to the benchmark, whereas the standard difference-in-means estimator can exhibit significant bias and even produce reversed signs. △ Less

Submitted 5 July, 2024; v1 submitted 20 June, 2024; originally announced June 2024.

arXiv:2406.05017 [pdf, other]

Adaptively Learning to Select-Rank in Online Platforms

Authors: Jingyuan Wang, Perry Dong, Ying Jin, Ruohan Zhan, Zhengyuan Zhou

Abstract: Ranking algorithms are fundamental to various online platforms across e-commerce sites to content streaming services. Our research addresses the challenge of adaptively ranking items from a candidate pool for heterogeneous users, a key component in personalizing user experience. We develop a user response model that considers diverse user preferences and the varying effects of item positions, aimi… ▽ More Ranking algorithms are fundamental to various online platforms across e-commerce sites to content streaming services. Our research addresses the challenge of adaptively ranking items from a candidate pool for heterogeneous users, a key component in personalizing user experience. We develop a user response model that considers diverse user preferences and the varying effects of item positions, aiming to optimize overall user satisfaction with the ranked list. We frame this problem within a contextual bandits framework, with each ranked list as an action. Our approach incorporates an upper confidence bound to adjust predicted user satisfaction scores and selects the ranking action that maximizes these adjusted scores, efficiently solved via maximum weight imperfect matching. We demonstrate that our algorithm achieves a cumulative regret bound of $O(d\sqrt{NKT})$ for ranking $K$ out of $N$ items in a $d$-dimensional context space over $T$ rounds, under the assumption that user responses follow a generalized linear model. This regret alleviates dependence on the ambient action space, whose cardinality grows exponentially with $N$ and $K$ (thus rendering direct application of existing adaptive learning algorithms -- such as UCB or Thompson sampling -- infeasible). Experiments conducted on both simulated and real-world datasets demonstrate our algorithm outperforms the baseline. △ Less

Submitted 7 June, 2024; originally announced June 2024.

Comments: 25 pages in total. Includes 4 figures and a pdf. International conference on machine learning. PMLR, 2024

arXiv:2406.03189 [pdf, other]

Novel Atmospheric Dynamics Shape Inner Edge of Habitable Zone Around White Dwarfs

Authors: Ruizhi Zhan, Daniel D. B. Koll, Feng Ding

Abstract: White dwarfs offer a unique opportunity to search nearby stellar systems for signs of life, but the habitable zone around these stars is still poorly understood. Since white dwarfs are compact stars with low luminosity, any planets in their habitable zone should be tidally locked, like planets around M-dwarfs. Unlike planets around M-dwarfs, however, habitable white dwarf planets have to rotate ve… ▽ More White dwarfs offer a unique opportunity to search nearby stellar systems for signs of life, but the habitable zone around these stars is still poorly understood. Since white dwarfs are compact stars with low luminosity, any planets in their habitable zone should be tidally locked, like planets around M-dwarfs. Unlike planets around M-dwarfs, however, habitable white dwarf planets have to rotate very rapidly, with orbital periods ranging from hours to several days. Here we use the ExoCAM Global Climate Model (GCM) to investigate the inner edge of the habitable zone (HZ) around white dwarfs. Our simulations show habitable planets with ultrashort orbital periods ($P\lesssim$1 day) enter a ``bat rotation" regime, which differs from typical atmospheric circulation regimes around M dwarfs. Bat rotators feature mean equatorial subrotation and a displacement of the surface's hottest regions from the equator towards the midlatitudes. We qualitatively explain the onset of bat rotation using shallow water theory. The resulting circulation shifts increase dayside cloud cover and decrease stratospheric water vapor, expanding the white dwarf habitable zone by $\sim$50\% compared to estimates based on 1D models. The James Webb Space Telescope (JWST) should be able to quickly characterize bat rotators around nearby white dwarfs thanks to their distinct thermal phase curves. Our work underlines that tidally locked planets on ultrashort orbits may exhibit unique atmospheric dynamics, and guides future habitability studies of white dwarf systems. △ Less

Submitted 5 June, 2024; originally announced June 2024.

arXiv:2405.20451 [pdf, other]

Statistical Properties of Robust Satisficing

Authors: Zhiyi Li, Yunbei Xu, Ruohan Zhan

Abstract: The Robust Satisficing (RS) model is an emerging approach to robust optimization, offering streamlined procedures and robust generalization across various applications. However, the statistical theory of RS remains unexplored in the literature. This paper fills in the gap by comprehensively analyzing the theoretical properties of the RS model. Notably, the RS structure offers a more straightforwar… ▽ More The Robust Satisficing (RS) model is an emerging approach to robust optimization, offering streamlined procedures and robust generalization across various applications. However, the statistical theory of RS remains unexplored in the literature. This paper fills in the gap by comprehensively analyzing the theoretical properties of the RS model. Notably, the RS structure offers a more straightforward path to deriving statistical guarantees compared to the seminal Distributionally Robust Optimization (DRO), resulting in a richer set of results. In particular, we establish two-sided confidence intervals for the optimal loss without the need to solve a minimax optimization problem explicitly. We further provide finite-sample generalization error bounds for the RS optimizer. Importantly, our results extend to scenarios involving distribution shifts, where discrepancies exist between the sampling and target distributions. Our numerical experiments show that the RS model consistently outperforms the baseline empirical risk minimization in small-sample regimes and under distribution shifts. Furthermore, compared to the DRO model, the RS model exhibits lower sensitivity to hyperparameter tuning, highlighting its practicability for robustness considerations. △ Less

Submitted 30 May, 2024; originally announced May 2024.

arXiv:2405.04286 [pdf, other]

Who Wrote This? The Key to Zero-Shot LLM-Generated Text Detection Is GECScore

Authors: Junchao Wu, Runzhe Zhan, Derek F. Wong, Shu Yang, Xuebo Liu, Lidia S. Chao, Min Zhang

Abstract: The efficacy of an large language model (LLM) generated text detector depends substantially on the availability of sizable training data. White-box zero-shot detectors, which require no such data, are nonetheless limited by the accessibility of the source model of the LLM-generated text. In this paper, we propose an simple but effective black-box zero-shot detection approach, predicated on the obs… ▽ More The efficacy of an large language model (LLM) generated text detector depends substantially on the availability of sizable training data. White-box zero-shot detectors, which require no such data, are nonetheless limited by the accessibility of the source model of the LLM-generated text. In this paper, we propose an simple but effective black-box zero-shot detection approach, predicated on the observation that human-written texts typically contain more grammatical errors than LLM-generated texts. This approach entails computing the Grammar Error Correction Score (GECScore) for the given text to distinguish between human-written and LLM-generated text. Extensive experimental results show that our method outperforms current state-of-the-art (SOTA) zero-shot and supervised methods, achieving an average AUROC of 98.7% and showing strong robustness against paraphrase and adversarial perturbation attacks. △ Less

Submitted 7 May, 2024; originally announced May 2024.

arXiv:2404.16766 [pdf, other]

Prefix Text as a Yarn: Eliciting Non-English Alignment in Foundation Language Model

Authors: Runzhe Zhan, Xinyi Yang, Derek F. Wong, Lidia S. Chao, Yue Zhang

Abstract: While supervised fine-tuning (SFT) has been a straightforward approach for tailoring the output of foundation large language model (LLM) to specific preferences, concerns have been raised about the depth of this alignment, with some critiques suggesting it is merely "superficial". We critically examine this hypothesis within the scope of cross-lingual generation tasks, proposing that the effective… ▽ More While supervised fine-tuning (SFT) has been a straightforward approach for tailoring the output of foundation large language model (LLM) to specific preferences, concerns have been raised about the depth of this alignment, with some critiques suggesting it is merely "superficial". We critically examine this hypothesis within the scope of cross-lingual generation tasks, proposing that the effectiveness of SFT may be constrained by its reliance on prior tokens to guide cross-lingual generation. Based on this crucial insight, and in response to the challenges posed by the costly and limited availability of non-English data for SFT, we introduce a novel training-free alignment method named PreTTY, which employs minimal task-related prior tokens to bridge the foundation LLM and the SFT LLM, achieving comparable performance without training. Experiments on machine translation and part-of-speech tagging across eight languages demonstrate the efficacy of PreTTY in cross-lingual settings. Remarkably, by initiating the decoding process with only one or two prior tokens, foundation LLMs can achieve performance comparable to their SFT counterparts. This method presents a cost-effective alternative to SFT and advances the democratization of multilingual LLMs. △ Less

Submitted 25 April, 2024; originally announced April 2024.

arXiv:2404.12161 [pdf, other]

doi 10.3847/1538-4357/ad49a1

The 2018 outburst of MAXI J1820+070 as seen by Insight-HXMT

Authors: Ningyue Fan, Songyu Li, Rui Zhan, Honghui Liu, Zuobin Zhang, Cosimo Bambi, Long Ji, Xiang Ma, James F. Steiner, Shuang-Nan Zhang, Menglei Zhou

Abstract: We present an analysis of the whole 2018 outburst of the black hole X-ray binary MAXI J1820+070 with Insight-HXMT data. We focus our study on the temporal evolution of the parameters of the source. We employ two different models to fit the disk's thermal spectrum: the Newtonian model DISKBB and the relativistic model NKBB. These two models provide different pictures of the source in the soft state… ▽ More We present an analysis of the whole 2018 outburst of the black hole X-ray binary MAXI J1820+070 with Insight-HXMT data. We focus our study on the temporal evolution of the parameters of the source. We employ two different models to fit the disk's thermal spectrum: the Newtonian model DISKBB and the relativistic model NKBB. These two models provide different pictures of the source in the soft state. With DISKBB, we find that the inner edge of the disk is close to the innermost stable circular orbit of a fast-rotating black hole and the corona changes geometry from the hard to the soft state. With NKBB, we find that the disk is truncated in the soft state and that the coronal geometry does not change significantly during the whole outburst. However, the model with NKBB can predict an untruncated disk around a fast-rotating black hole if we assume that the disk inclination angle is around $30^\circ$ (instead of $\sim 60^\circ$, which is the inclination angle of the jet and is usually adopted as the disk inclination angle in the literature) and we employ a high-density reflection model. In such a case, we measure a high value of the black hole spin parameter with observations in the soft state, in agreement with the high spin value found from the analysis of the reflection features and in disagreement with the low spin value found by previous continuum-fitting method measurements with the disk inclination angle set to the value of the jet inclination angle. △ Less

Submitted 1 July, 2024; v1 submitted 18 April, 2024; originally announced April 2024.

Comments: 14 pages, 8 figures. v2: refereed version

Journal ref: Astrophys.J. 969: 61 (2024)

arXiv:2403.11621 [pdf, other]

Let's Focus on Neuron: Neuron-Level Supervised Fine-tuning for Large Language Model

Authors: Haoyun Xu, Runzhe Zhan, Derek F. Wong, Lidia S. Chao

Abstract: Large Language Models (LLMs) are composed of neurons that exhibit various behaviors and roles, which become increasingly diversified as models scale. Recent studies have revealed that not all neurons are active across different datasets, and this sparsity correlates positively with the task-specific ability, leading to advancements in model pruning and training efficiency. Traditional fine-tuning… ▽ More Large Language Models (LLMs) are composed of neurons that exhibit various behaviors and roles, which become increasingly diversified as models scale. Recent studies have revealed that not all neurons are active across different datasets, and this sparsity correlates positively with the task-specific ability, leading to advancements in model pruning and training efficiency. Traditional fine-tuning methods engage all parameters of LLMs, which is computationally expensive and may not be necessary. In contrast, Parameter-Efficient Fine-Tuning (PEFT) approaches aim to minimize the number of trainable parameters, yet they still operate at a relatively macro scale (e.g., layer-level). We introduce Neuron-Level Fine-Tuning (NeFT), a novel approach that refines the granularity of parameter training down to the individual neuron, enabling more precise and computationally efficient model updates. The experimental results show that NeFT not only exceeded the performance of full-parameter fine-tuning and PEFT but also provided insights into the analysis of neurons. △ Less

Submitted 18 March, 2024; originally announced March 2024.

arXiv:2310.14724 [pdf, other]

A Survey on LLM-Generated Text Detection: Necessity, Methods, and Future Directions

Authors: Junchao Wu, Shu Yang, Runzhe Zhan, Yulin Yuan, Derek F. Wong, Lidia S. Chao

Abstract: The powerful ability to understand, follow, and generate complex language emerging from large language models (LLMs) makes LLM-generated text flood many areas of our daily lives at an incredible speed and is widely accepted by humans. As LLMs continue to expand, there is an imperative need to develop detectors that can detect LLM-generated text. This is crucial to mitigate potential misuse of LLMs… ▽ More The powerful ability to understand, follow, and generate complex language emerging from large language models (LLMs) makes LLM-generated text flood many areas of our daily lives at an incredible speed and is widely accepted by humans. As LLMs continue to expand, there is an imperative need to develop detectors that can detect LLM-generated text. This is crucial to mitigate potential misuse of LLMs and safeguard realms like artistic expression and social networks from harmful influence of LLM-generated content. The LLM-generated text detection aims to discern if a piece of text was produced by an LLM, which is essentially a binary classification task. The detector techniques have witnessed notable advancements recently, propelled by innovations in watermarking techniques, statistics-based detectors, neural-base detectors, and human-assisted methods. In this survey, we collate recent research breakthroughs in this area and underscore the pressing need to bolster detector research. We also delve into prevalent datasets, elucidating their limitations and developmental requirements. Furthermore, we analyze various LLM-generated text detection paradigms, shedding light on challenges like out-of-distribution problems, potential attacks, real-world data issues and the lack of effective evaluation framework. Conclusively, we highlight interesting directions for future research in LLM-generated text detection to advance the implementation of responsible artificial intelligence (AI). Our aim with this survey is to provide a clear and comprehensive introduction for newcomers while also offering seasoned researchers a valuable update in the field of LLM-generated text detection. The useful resources are publicly available at: https://github.com/NLP2CT/LLM-generated-Text-Detection. △ Less

Submitted 19 April, 2024; v1 submitted 23 October, 2023; originally announced October 2023.

arXiv:2310.08908 [pdf, other]

Human-in-the-loop Machine Translation with Large Language Model

Authors: Xinyi Yang, Runzhe Zhan, Derek F. Wong, Junchao Wu, Lidia S. Chao

Abstract: The large language model (LLM) has garnered significant attention due to its in-context learning mechanisms and emergent capabilities. The research community has conducted several pilot studies to apply LLMs to machine translation tasks and evaluate their performance from diverse perspectives. However, previous research has primarily focused on the LLM itself and has not explored human interventio… ▽ More The large language model (LLM) has garnered significant attention due to its in-context learning mechanisms and emergent capabilities. The research community has conducted several pilot studies to apply LLMs to machine translation tasks and evaluate their performance from diverse perspectives. However, previous research has primarily focused on the LLM itself and has not explored human intervention in the inference process of LLM. The characteristics of LLM, such as in-context learning and prompt engineering, closely mirror human cognitive abilities in language tasks, offering an intuitive solution for human-in-the-loop generation. In this study, we propose a human-in-the-loop pipeline that guides LLMs to produce customized outputs with revision instructions. The pipeline initiates by prompting the LLM to produce a draft translation, followed by the utilization of automatic retrieval or human feedback as supervision signals to enhance the LLM's translation through in-context learning. The human-machine interactions generated in this pipeline are also stored in an external database to expand the in-context retrieval database, enabling us to leverage human supervision in an offline setting. We evaluate the proposed pipeline using GPT-3.5-turbo API on five domain-specific benchmarks for German-English translation. The results demonstrate the effectiveness of the pipeline in tailoring in-domain translations and improving translation performance compared to direct translation. Additionally, we discuss the results from the following perspectives: 1) the effectiveness of different in-context retrieval methods; 2) the construction of a retrieval database under low-resource scenarios; 3) the observed domains differences; 4) the quantitative analysis of linguistic statistics; and 5) the qualitative analysis of translation cases. The code and data are available at https://github.com/NLP2CT/HIL-MT/. △ Less

Submitted 13 October, 2023; originally announced October 2023.

Comments: Accepted to MT Summit 2023

arXiv:2307.02108 [pdf, other]

Proportional Response: Contextual Bandits for Simple and Cumulative Regret Minimization

Authors: Sanath Kumar Krishnamurthy, Ruohan Zhan, Susan Athey, Emma Brunskill

Abstract: In many applications, e.g. in healthcare and e-commerce, the goal of a contextual bandit may be to learn an optimal treatment assignment policy at the end of the experiment. That is, to minimize simple regret. However, this objective remains understudied. We propose a new family of computationally efficient bandit algorithms for the stochastic contextual bandit setting, where a tuning parameter de… ▽ More In many applications, e.g. in healthcare and e-commerce, the goal of a contextual bandit may be to learn an optimal treatment assignment policy at the end of the experiment. That is, to minimize simple regret. However, this objective remains understudied. We propose a new family of computationally efficient bandit algorithms for the stochastic contextual bandit setting, where a tuning parameter determines the weight placed on cumulative regret minimization (where we establish near-optimal minimax guarantees) versus simple regret minimization (where we establish state-of-the-art guarantees). Our algorithms work with any function class, are robust to model misspecification, and can be used in continuous arm settings. This flexibility comes from constructing and relying on "conformal arm sets" (CASs). CASs provide a set of arms for every context, encompassing the context-specific optimal arm with a certain probability across the context distribution. Our positive results on simple and cumulative regret guarantees are contrasted with a negative result, which shows that no algorithm can achieve instance-dependent simple regret guarantees while simultaneously achieving minimax optimal cumulative regret guarantees. △ Less

Submitted 2 November, 2023; v1 submitted 5 July, 2023; originally announced July 2023.

arXiv:2302.08854 [pdf, other]

Post Reinforcement Learning Inference

Authors: Vasilis Syrgkanis, Ruohan Zhan

Abstract: We consider estimation and inference using data collected from reinforcement learning algorithms. These algorithms, characterized by their adaptive experimentation, interact with individual units over multiple stages, dynamically adjusting their strategies based on previous interactions. Our goal is to evaluate a counterfactual policy post-data collection and estimate structural parameters, like d… ▽ More We consider estimation and inference using data collected from reinforcement learning algorithms. These algorithms, characterized by their adaptive experimentation, interact with individual units over multiple stages, dynamically adjusting their strategies based on previous interactions. Our goal is to evaluate a counterfactual policy post-data collection and estimate structural parameters, like dynamic treatment effects, which can be used for credit assignment and determining the effect of earlier actions on final outcomes. Such parameters of interest can be framed as solutions to moment equations, but not minimizers of a population loss function, leading to Z-estimation approaches for static data. However, in the adaptive data collection environment of reinforcement learning, where algorithms deploy nonstationary behavior policies, standard estimators do not achieve asymptotic normality due to the fluctuating variance. We propose a weighted Z-estimation approach with carefully designed adaptive weights to stabilize the time-varying estimation variance. We identify proper weighting schemes to restore the consistency and asymptotic normality of the weighted Z-estimators for target parameters, which allows for hypothesis testing and constructing uniform confidence regions. Primary applications include dynamic treatment effect estimation and dynamic off-policy evaluation. △ Less

Submitted 10 May, 2024; v1 submitted 17 February, 2023; originally announced February 2023.

arXiv:2302.01680 [pdf, other]

Two-Stage Constrained Actor-Critic for Short Video Recommendation

Authors: Qingpeng Cai, Zhenghai Xue, Chi Zhang, Wanqi Xue, Shuchang Liu, Ruohan Zhan, Xueliang Wang, Tianyou Zuo, Wentao Xie, Dong Zheng, Peng Jiang, Kun Gai

Abstract: The wide popularity of short videos on social media poses new opportunities and challenges to optimize recommender systems on the video-sharing platforms. Users sequentially interact with the system and provide complex and multi-faceted responses, including watch time and various types of interactions with multiple videos. One the one hand, the platforms aims at optimizing the users' cumulative wa… ▽ More The wide popularity of short videos on social media poses new opportunities and challenges to optimize recommender systems on the video-sharing platforms. Users sequentially interact with the system and provide complex and multi-faceted responses, including watch time and various types of interactions with multiple videos. One the one hand, the platforms aims at optimizing the users' cumulative watch time (main goal) in long term, which can be effectively optimized by Reinforcement Learning. On the other hand, the platforms also needs to satisfy the constraint of accommodating the responses of multiple user interactions (auxiliary goals) such like, follow, share etc. In this paper, we formulate the problem of short video recommendation as a Constrained Markov Decision Process (CMDP). We find that traditional constrained reinforcement learning algorithms can not work well in this setting. We propose a novel two-stage constrained actor-critic method: At stage one, we learn individual policies to optimize each auxiliary signal. At stage two, we learn a policy to (i) optimize the main signal and (ii) stay close to policies learned at the first stage, which effectively guarantees the performance of this main policy on the auxiliaries. Through extensive offline evaluations, we demonstrate effectiveness of our method over alternatives in both optimizing the main goal as well as balancing the others. We further show the advantage of our method in live experiments of short video recommendations, where it significantly outperforms other baselines in terms of both watch time and interactions. Our approach has been fully launched in the production system to optimize user experiences on the platform. △ Less

Submitted 9 January, 2024; v1 submitted 3 February, 2023; originally announced February 2023.

Comments: Code Available at https://github.com/AIDefender/TSCAC. arXiv admin note: substantial text overlap with arXiv:2205.13248

Journal ref: The Web Conference 2023

arXiv:2206.06003 [pdf, other]

Deconfounding Duration Bias in Watch-time Prediction for Video Recommendation

Authors: Ruohan Zhan, Changhua Pei, Qiang Su, Jianfeng Wen, Xueliang Wang, Guanyu Mu, Dong Zheng, Peng Jiang

Abstract: Watch-time prediction remains to be a key factor in reinforcing user engagement via video recommendations. It has become increasingly important given the ever-growing popularity of online videos. However, prediction of watch time not only depends on the match between the user and the video but is often mislead by the duration of the video itself. With the goal of improving watch time, recommendati… ▽ More Watch-time prediction remains to be a key factor in reinforcing user engagement via video recommendations. It has become increasingly important given the ever-growing popularity of online videos. However, prediction of watch time not only depends on the match between the user and the video but is often mislead by the duration of the video itself. With the goal of improving watch time, recommendation is always biased towards videos with long duration. Models trained on this imbalanced data face the risk of bias amplification, which misguides platforms to over-recommend videos with long duration but overlook the underlying user interests. This paper presents the first work to study duration bias in watch-time prediction for video recommendation. We employ a causal graph illuminating that duration is a confounding factor that concurrently affects video exposure and watch-time prediction -- the first effect on video causes the bias issue and should be eliminated, while the second effect on watch time originates from video intrinsic characteristics and should be preserved. To remove the undesired bias but leverage the natural effect, we propose a Duration Deconfounded Quantile-based (D2Q) watch-time prediction framework, which allows for scalability to perform on industry production systems. Through extensive offline evaluation and live experiments, we showcase the effectiveness of this duration-deconfounding framework by significantly outperforming the state-of-the-art baselines. We have fully launched our approach on Kuaishou App, which has substantially improved real-time video consumption due to more accurate watch-time predictions. △ Less

Submitted 13 June, 2022; originally announced June 2022.

Comments: 10 pages

arXiv:2206.02620 [pdf, other]

ResAct: Reinforcing Long-term Engagement in Sequential Recommendation with Residual Actor

Authors: Wanqi Xue, Qingpeng Cai, Ruohan Zhan, Dong Zheng, Peng Jiang, Kun Gai, Bo An

Abstract: Long-term engagement is preferred over immediate engagement in sequential recommendation as it directly affects product operational metrics such as daily active users (DAUs) and dwell time. Meanwhile, reinforcement learning (RL) is widely regarded as a promising framework for optimizing long-term engagement in sequential recommendation. However, due to expensive online interactions, it is very dif… ▽ More Long-term engagement is preferred over immediate engagement in sequential recommendation as it directly affects product operational metrics such as daily active users (DAUs) and dwell time. Meanwhile, reinforcement learning (RL) is widely regarded as a promising framework for optimizing long-term engagement in sequential recommendation. However, due to expensive online interactions, it is very difficult for RL algorithms to perform state-action value estimation, exploration and feature extraction when optimizing long-term engagement. In this paper, we propose ResAct which seeks a policy that is close to, but better than, the online-serving policy. In this way, we can collect sufficient data near the learned policy so that state-action values can be properly estimated, and there is no need to perform online exploration. ResAct optimizes the policy by first reconstructing the online behaviors and then improving it via a Residual Actor. To extract long-term information, ResAct utilizes two information-theoretical regularizers to confirm the expressiveness and conciseness of features. We conduct experiments on a benchmark dataset and a large-scale industrial dataset which consists of tens of millions of recommendation requests. Experimental results show that our method significantly outperforms the state-of-the-art baselines in various long-term engagement optimization tasks. △ Less

Submitted 16 June, 2023; v1 submitted 31 May, 2022; originally announced June 2022.

Comments: Accpetd by ICLR 2023

arXiv:2205.13248 [pdf, other]

Constrained Reinforcement Learning for Short Video Recommendation

Authors: Qingpeng Cai, Ruohan Zhan, Chi Zhang, Jie Zheng, Guangwei Ding, Pinghua Gong, Dong Zheng, Peng Jiang

Abstract: The wide popularity of short videos on social media poses new opportunities and challenges to optimize recommender systems on the video-sharing platforms. Users provide complex and multi-faceted responses towards recommendations, including watch time and various types of interactions with videos. As a result, established recommendation algorithms that concern a single objective are not adequate to… ▽ More The wide popularity of short videos on social media poses new opportunities and challenges to optimize recommender systems on the video-sharing platforms. Users provide complex and multi-faceted responses towards recommendations, including watch time and various types of interactions with videos. As a result, established recommendation algorithms that concern a single objective are not adequate to meet this new demand of optimizing comprehensive user experiences. In this paper, we formulate the problem of short video recommendation as a constrained Markov Decision Process (MDP), where platforms want to optimize the main goal of user watch time in long term, with the constraint of accommodating the auxiliary responses of user interactions such as sharing/downloading videos. To solve the constrained MDP, we propose a two-stage reinforcement learning approach based on actor-critic framework. At stage one, we learn individual policies to optimize each auxiliary response. At stage two, we learn a policy to (i) optimize the main response and (ii) stay close to policies learned at the first stage, which effectively guarantees the performance of this main policy on the auxiliaries. Through extensive simulations, we demonstrate effectiveness of our approach over alternatives in both optimizing the main goal as well as balancing the others. We further show the advantage of our approach in live experiments of short video recommendations, where it significantly outperforms other baselines in terms of watch time and interactions from video views. Our approach has been fully launched in the production system to optimize user experiences on the platform. △ Less

Submitted 26 May, 2022; originally announced May 2022.

arXiv:2202.08992 [pdf, other]

Enhanced Multi-Objective A* Using Balanced Binary Search Trees

Authors: Zhongqiang Ren, Richard Zhan, Sivakumar Rathinam, Maxim Likhachev, Howie Choset

Abstract: This work addresses a Multi-Objective Shortest Path Problem (MO-SPP) on a graph where the goal is to find a set of Pareto-optimal solutions from a start node to a destination in the graph. A family of approaches based on MOA* have been developed to solve MO-SPP in the literature. Typically, these approaches maintain a "frontier" set at each node during the search process to keep track of the non-d… ▽ More This work addresses a Multi-Objective Shortest Path Problem (MO-SPP) on a graph where the goal is to find a set of Pareto-optimal solutions from a start node to a destination in the graph. A family of approaches based on MOA* have been developed to solve MO-SPP in the literature. Typically, these approaches maintain a "frontier" set at each node during the search process to keep track of the non-dominated, partial paths to reach that node. This search process becomes computationally expensive when the number of objectives increases as the number of Pareto-optimal solutions becomes large. In this work, we introduce a new method to efficiently maintain these frontiers for multiple objectives by incrementally constructing balanced binary search trees within the MOA* search framework. We first show that our approach correctly finds the Pareto-optimal front, and then provide extensive simulation results for problems with three, four and five objectives to show that our method runs faster than existing techniques by up to an order of magnitude. △ Less

Submitted 28 May, 2022; v1 submitted 17 February, 2022; originally announced February 2022.

Comments: Accepted to SoCS 2022, 11 pages, 4 figures

arXiv:2111.04079 [pdf, other]

Variance-Aware Machine Translation Test Sets

Authors: Runzhe Zhan, Xuebo Liu, Derek F. Wong, Lidia S. Chao

Abstract: We release 70 small and discriminative test sets for machine translation (MT) evaluation called variance-aware test sets (VAT), covering 35 translation directions from WMT16 to WMT20 competitions. VAT is automatically created by a novel variance-aware filtering method that filters the indiscriminative test instances of the current MT test sets without any human labor. Experimental results show tha… ▽ More We release 70 small and discriminative test sets for machine translation (MT) evaluation called variance-aware test sets (VAT), covering 35 translation directions from WMT16 to WMT20 competitions. VAT is automatically created by a novel variance-aware filtering method that filters the indiscriminative test instances of the current MT test sets without any human labor. Experimental results show that VAT outperforms the original WMT test sets in terms of the correlation with human judgement across mainstream language pairs and test sets. Further analysis on the properties of VAT reveals the challenging linguistic features (e.g., translation of low-frequency words and proper nouns) for competitive MT systems, providing guidance for constructing future MT test sets. The test sets and the code for preparing variance-aware MT test sets are freely available at https://github.com/NLP2CT/Variance-Aware-MT-Test-Sets . △ Less

Submitted 7 November, 2021; originally announced November 2021.

Comments: Accepted to NeurIPS 2021 Datasets and Benchmarks Track

arXiv:2107.14402 [pdf, other]

Difficulty-Aware Machine Translation Evaluation

Authors: Runzhe Zhan, Xuebo Liu, Derek F. Wong, Lidia S. Chao

Abstract: The high-quality translation results produced by machine translation (MT) systems still pose a huge challenge for automatic evaluation. Current MT evaluation pays the same attention to each sentence component, while the questions of real-world examinations (e.g., university examinations) have different difficulties and weightings. In this paper, we propose a novel difficulty-aware MT evaluation me… ▽ More The high-quality translation results produced by machine translation (MT) systems still pose a huge challenge for automatic evaluation. Current MT evaluation pays the same attention to each sentence component, while the questions of real-world examinations (e.g., university examinations) have different difficulties and weightings. In this paper, we propose a novel difficulty-aware MT evaluation metric, expanding the evaluation dimension by taking translation difficulty into consideration. A translation that fails to be predicted by most MT systems will be treated as a difficult one and assigned a large weight in the final score function, and conversely. Experimental results on the WMT19 English-German Metrics shared tasks show that our proposed method outperforms commonly used MT metrics in terms of human correlation. In particular, our proposed method performs well even when all the MT systems are very competitive, which is when most existing metrics fail to distinguish between them. The source code is freely available at https://github.com/NLP2CT/Difficulty-Aware-MT-Evaluation. △ Less

Submitted 29 July, 2021; originally announced July 2021.

Comments: Accepted to ACL 2021

arXiv:2106.02029 [pdf, other]

Off-Policy Evaluation via Adaptive Weighting with Data from Contextual Bandits

Authors: Ruohan Zhan, Vitor Hadad, David A. Hirshberg, Susan Athey

Abstract: It has become increasingly common for data to be collected adaptively, for example using contextual bandits. Historical data of this type can be used to evaluate other treatment assignment policies to guide future innovation or experiments. However, policy evaluation is challenging if the target policy differs from the one used to collect data, and popular estimators, including doubly robust (DR)… ▽ More It has become increasingly common for data to be collected adaptively, for example using contextual bandits. Historical data of this type can be used to evaluate other treatment assignment policies to guide future innovation or experiments. However, policy evaluation is challenging if the target policy differs from the one used to collect data, and popular estimators, including doubly robust (DR) estimators, can be plagued by bias, excessive variance, or both. In particular, when the pattern of treatment assignment in the collected data looks little like the pattern generated by the policy to be evaluated, the importance weights used in DR estimators explode, leading to excessive variance. In this paper, we improve the DR estimator by adaptively weighting observations to control its variance. We show that a t-statistic based on our improved estimator is asymptotically normal under certain conditions, allowing us to form confidence intervals and test hypotheses. Using synthetic data and public benchmarks, we provide empirical evidence for our estimator's improved accuracy and inferential properties relative to existing alternatives. △ Less

Submitted 10 June, 2021; v1 submitted 3 June, 2021; originally announced June 2021.

arXiv:2105.02377 [pdf, other]

Towards Content Provider Aware Recommender Systems: A Simulation Study on the Interplay between User and Provider Utilities

Authors: Ruohan Zhan, Konstantina Christakopoulou, Ya Le, Jayden Ooi, Martin Mladenov, Alex Beutel, Craig Boutilier, Ed H. Chi, Minmin Chen

Abstract: Most existing recommender systems focus primarily on matching users to content which maximizes user satisfaction on the platform. It is increasingly obvious, however, that content providers have a critical influence on user satisfaction through content creation, largely determining the content pool available for recommendation. A natural question thus arises: can we design recommenders taking into… ▽ More Most existing recommender systems focus primarily on matching users to content which maximizes user satisfaction on the platform. It is increasingly obvious, however, that content providers have a critical influence on user satisfaction through content creation, largely determining the content pool available for recommendation. A natural question thus arises: can we design recommenders taking into account the long-term utility of both users and content providers? By doing so, we hope to sustain more providers and a more diverse content pool for long-term user satisfaction. Understanding the full impact of recommendations on both user and provider groups is challenging. This paper aims to serve as a research investigation of one approach toward building a provider-aware recommender, and evaluating its impact in a simulated setup. To characterize the user-recommender-provider interdependence, we complement user modeling by formalizing provider dynamics as well. The resulting joint dynamical system gives rise to a weakly-coupled partially observable Markov decision process driven by recommender actions and user feedback to providers. We then build a REINFORCE recommender agent, coined EcoAgent, to optimize a joint objective of user utility and the counterfactual utility lift of the provider associated with the recommended content, which we show to be equivalent to maximizing overall user utility and the utilities of all providers on the platform under some mild assumptions. To evaluate our approach, we introduce a simulation environment capturing the key interactions among users, providers, and the recommender. We offer a number of simulated experiments that shed light on both the benefits and the limitations of our approach. These results help understand how and when a provider-aware recommender agent is of benefit in building multi-stakeholder recommender systems. △ Less

Submitted 5 May, 2021; originally announced May 2021.

arXiv:2105.02344 [pdf, other]

Policy Learning with Adaptively Collected Data

Authors: Ruohan Zhan, Zhimei Ren, Susan Athey, Zhengyuan Zhou

Abstract: Learning optimal policies from historical data enables personalization in a wide variety of applications including healthcare, digital recommendations, and online education. The growing policy learning literature focuses on settings where the data collection rule stays fixed throughout the experiment. However, adaptive data collection is becoming more common in practice, from two primary sources:… ▽ More Learning optimal policies from historical data enables personalization in a wide variety of applications including healthcare, digital recommendations, and online education. The growing policy learning literature focuses on settings where the data collection rule stays fixed throughout the experiment. However, adaptive data collection is becoming more common in practice, from two primary sources: 1) data collected from adaptive experiments that are designed to improve inferential efficiency; 2) data collected from production systems that progressively evolve an operational policy to improve performance over time (e.g. contextual bandits). Yet adaptivity complicates the optimal policy identification ex post, since samples are dependent, and each treatment may not receive enough observations for each type of individual. In this paper, we make initial research inquiries into addressing the challenges of learning the optimal policy with adaptively collected data. We propose an algorithm based on generalized augmented inverse propensity weighted (AIPW) estimators, which non-uniformly reweight the elements of a standard AIPW estimator to control worst-case estimation variance. We establish a finite-sample regret upper bound for our algorithm and complement it with a regret lower bound that quantifies the fundamental difficulty of policy learning with adaptive data. When equipped with the best weighting scheme, our algorithm achieves minimax rate optimal regret guarantees even with diminishing exploration. Finally, we demonstrate our algorithm's effectiveness using both synthetic data and public benchmark datasets. △ Less

Submitted 16 November, 2022; v1 submitted 5 May, 2021; originally announced May 2021.

Comments: Improved the upper bound; added simulations

arXiv:2104.00429 [pdf]

Tunable Hyperbolic Phonon Polaritons in a Gradiently-Suspended Van Der Waals α-MoO3

Authors: Zebo Zheng, Fengsheng Sun, Wuchao Huang, Xuexian Chen, Yanlin Ke, Runze Zhan, Huanjun Chen, Shaozhi Deng

Abstract: Highly confined and low-loss hyperbolic phonon polaritons (HPhPs) sustained in van der Waals crystals exhibit outstanding capabilities of concentrating long-wave electromagnetic fields deep to the subwavelength region. Precise tuning on the HPhP propagation characteristics remains a great challenge for practical applications such as nanophotonic devices and circuits. Here, we show that by taking a… ▽ More Highly confined and low-loss hyperbolic phonon polaritons (HPhPs) sustained in van der Waals crystals exhibit outstanding capabilities of concentrating long-wave electromagnetic fields deep to the subwavelength region. Precise tuning on the HPhP propagation characteristics remains a great challenge for practical applications such as nanophotonic devices and circuits. Here, we show that by taking advantage of the varying air gaps in a van der Waals α-MoO3 crystal suspended gradiently, it is able to tune the wavelengths and dampings of the HPhPs propagating inside the α-MoO3. The results indicate that the dependences of polariton wavelength on gap distance for HPhPs in lower and upper Reststrahlen bands are opposite to each other. Most interestingly, the tuning range of the polariton wavelengths for HPhPs in the lower band, which exhibit in-plane hyperbolicities, is wider than that for the HPhPs in the upper band of out-of-plane hyperbolicities. A polariton wavelength elongation up to 160% and a reduction of damping rate up to 35% are obtained. These findings can not only provide fundamental insights into manipulation of light by polaritonic crystals at nanoscale, but also open up new opportunities for tunable nanophotonic applications. △ Less

Submitted 1 April, 2021; originally announced April 2021.

arXiv:2103.02262 [pdf, other]

Meta-Curriculum Learning for Domain Adaptation in Neural Machine Translation

Authors: Runzhe Zhan, Xuebo Liu, Derek F. Wong, Lidia S. Chao

Abstract: Meta-learning has been sufficiently validated to be beneficial for low-resource neural machine translation (NMT). However, we find that meta-trained NMT fails to improve the translation performance of the domain unseen at the meta-training stage. In this paper, we aim to alleviate this issue by proposing a novel meta-curriculum learning for domain adaptation in NMT. During meta-training, the NMT f… ▽ More Meta-learning has been sufficiently validated to be beneficial for low-resource neural machine translation (NMT). However, we find that meta-trained NMT fails to improve the translation performance of the domain unseen at the meta-training stage. In this paper, we aim to alleviate this issue by proposing a novel meta-curriculum learning for domain adaptation in NMT. During meta-training, the NMT first learns the similar curricula from each domain to avoid falling into a bad local optimum early, and finally learns the curricula of individualities to improve the model robustness for learning domain-specific knowledge. Experimental results on 10 different low-resource domains show that meta-curriculum learning can improve the translation performance of both familiar and unfamiliar domains. All the codes and data are freely available at https://github.com/NLP2CT/Meta-Curriculum. △ Less

Submitted 3 March, 2021; originally announced March 2021.

Comments: Accepted to AAAI 2021

arXiv:2103.00416 [pdf]

doi 10.1016/j.xcrp.2021.100526

A Spontaneously Formed Plasmonic-MoTe2 Hybrid Platform for Ultrasensitive Raman Enhancement

Authors: Li Tao, Zhiyong Li, Kun Chen, Yaoqiang Zhou, Hao Li, Ximiao Wang, Runze Zhan, Xiangyu Hou, Yu Zhao, Junling Xu, Teng Qiu, Xi Wan, Jian-Bin Xu

Abstract: To develop highly sensitive, stable and repeatable surface-enhanced Raman scattering (SERS) substrates is crucial for analytical detection, which is a challenge for traditional metallic structures. Herein, by taking advantage of the high surface activity of 1T' transition metal telluride, we have fabricated high-density gold nanoparticles (AuNPs) that are spontaneously in-situ prepared on the 1T'… ▽ More To develop highly sensitive, stable and repeatable surface-enhanced Raman scattering (SERS) substrates is crucial for analytical detection, which is a challenge for traditional metallic structures. Herein, by taking advantage of the high surface activity of 1T' transition metal telluride, we have fabricated high-density gold nanoparticles (AuNPs) that are spontaneously in-situ prepared on the 1T' MoTe2 atomic layers via a facile method, forming a plasmonic-2D material hybrid SERS substrate. This AuNP formation is unique to the 1T' phase, which is repressed in 2H MoTe2 with less surface activity. The hybrid structure generates coupling effects of electromagnetic and chemical enhancements, as well as excellent molecule adsorption, leading to the ultrasensitive (4*10^-17 M) and reproducible detection. Additionally, the immense fluorescence and photobleaching phenomena are mostly avoided. Flexible SERS tapes have been demonstrated in practical applications. Our approach facilitates the ultrasensitive SERS detection by a facile method, as well as the better mechanistic understanding of SERS beyond plasmonic effects. △ Less

Submitted 28 July, 2021; v1 submitted 28 February, 2021; originally announced March 2021.

Journal ref: Cell Reports Physical Science, 2021, 2, 100526

arXiv:2010.12775 [pdf]

The Discovery of Tunable Universality Class in Superconducting $β$-W Thin Films

Authors: Ce Huang, Enze Zhang, Yong Zhang, Jinglei Zhang, Faxian Xiu, Haiwen Liu, Xiaoyi Xie, Linfeng Ai, Yunkun Yang, Minhao Zhao, Junjie Qi, Lun Li, Shanshan Liu, Zihan Li, Runze Zhan, Ya-Qing Bie, Xufeng Kou, Shaozhi Deng, X. C. Xie

Abstract: The interplay between quenched disorder and critical behavior in quantum phase transitions is conceptually fascinating and of fundamental importance for understanding phase transitions. However, it is still unclear whether or not the quenched disorder influences the universality class of quantum phase transitions. More crucially, the absence of superconducting-metal transitions under in-plane magn… ▽ More The interplay between quenched disorder and critical behavior in quantum phase transitions is conceptually fascinating and of fundamental importance for understanding phase transitions. However, it is still unclear whether or not the quenched disorder influences the universality class of quantum phase transitions. More crucially, the absence of superconducting-metal transitions under in-plane magnetic fields in 2D superconductors imposes constraints on the universality of quantum criticality. Here, we discover the tunable universality class of superconductor-metal transition by changing the disorder strength in $β$-W films with varying thickness. The finite-size scaling uncovers the switch of universality class: quantum Griffiths singularity to multiple quantum criticality at a critical thickness of $t_{c \perp 1}\sim 8 nm$ and then from multiple quantum criticality to single criticality at $t_{c\perp 2}\sim 16 nm$. Moreover, the superconducting-metal transition is observed for the first time under in-plane magnetic fields and the universality class is changed at $t_{c \parallel }\sim 8 nm$. The discovery of tunable universality class under both out-of-plane and in-plane magnetic fields provides broad information for the disorder effect on superconducting-metal transitions and quantum criticality. △ Less

Submitted 24 October, 2020; originally announced October 2020.

arXiv:2001.04580 [pdf, other]

Distortion Agnostic Deep Watermarking

Authors: Xiyang Luo, Ruohan Zhan, Huiwen Chang, Feng Yang, Peyman Milanfar

Abstract: Watermarking is the process of embedding information into an image that can survive under distortions, while requiring the encoded image to have little or no perceptual difference from the original image. Recently, deep learning-based methods achieved impressive results in both visual quality and message payload under a wide variety of image distortions. However, these methods all require differen… ▽ More Watermarking is the process of embedding information into an image that can survive under distortions, while requiring the encoded image to have little or no perceptual difference from the original image. Recently, deep learning-based methods achieved impressive results in both visual quality and message payload under a wide variety of image distortions. However, these methods all require differentiable models for the image distortions at training time, and may generalize poorly to unknown distortions. This is undesirable since the types of distortions applied to watermarked images are usually unknown and non-differentiable. In this paper, we propose a new framework for distortion-agnostic watermarking, where the image distortion is not explicitly modeled during training. Instead, the robustness of our system comes from two sources: adversarial training and channel coding. Compared to training on a fixed set of distortions and noise levels, our method achieves comparable or better results on distortions available during training, and better performance on unknown distortions. △ Less

Submitted 13 January, 2020; originally announced January 2020.

arXiv:1912.12593 [pdf]

Polariton Waveguide Modes in Two-Dimensional Van der Waals Crystals: An Analytical Model and Correlative Scanning Near-Field Optical Microscopy Studies

Authors: Fengsheng Sun, Wuchao Huang, Zebo Zheng, Ningsheng Xu, Yanlin Ke, Runze Zhan, Huanjun Chen, Shaozhi Deng

Abstract: Two-dimensional van der Waals (vdW) crystals can sustain various types of polaritons with strong electromagnetic confinements, making them highly attractive for the nanoscale photonic and optoelectronic applications. While extensive experimental and numerical studies are devoted to the polaritons of the vdW crystals, analytical models are sparse. Particularly, applying such a model to describe the… ▽ More Two-dimensional van der Waals (vdW) crystals can sustain various types of polaritons with strong electromagnetic confinements, making them highly attractive for the nanoscale photonic and optoelectronic applications. While extensive experimental and numerical studies are devoted to the polaritons of the vdW crystals, analytical models are sparse. Particularly, applying such a model to describe the polariton behaviors visualized by state-of-art near-field optical microscopy requires further investigation. Herein, we develop an analytical waveguide model to describe the polariton propagations in vdW crystals. The dispersion contours, dispersion relations, and electromagnetic field distributions of different polariton waveguide modes are derived. The model is verified by near-field optical imaging and numerical simulation of phonon polaritons in the α-MoO3, a typical vdW biaxial crystals. The model can be extended to other types of polaritons in vdW crystals, thus allowing for describing and understanding their localized electromagnetic behaviors analytically. △ Less

Submitted 1 October, 2020; v1 submitted 29 December, 2019; originally announced December 2019.

arXiv:1911.02768 [pdf, other]

Confidence Intervals for Policy Evaluation in Adaptive Experiments

Authors: Vitor Hadad, David A. Hirshberg, Ruohan Zhan, Stefan Wager, Susan Athey

Abstract: Adaptive experiment designs can dramatically improve statistical efficiency in randomized trials, but they also complicate statistical inference. For example, it is now well known that the sample mean is biased in adaptive trials. Inferential challenges are exacerbated when our parameter of interest differs from the parameter the trial was designed to target, such as when we are interested in esti… ▽ More Adaptive experiment designs can dramatically improve statistical efficiency in randomized trials, but they also complicate statistical inference. For example, it is now well known that the sample mean is biased in adaptive trials. Inferential challenges are exacerbated when our parameter of interest differs from the parameter the trial was designed to target, such as when we are interested in estimating the value of a sub-optimal treatment after running a trial to determine the optimal treatment using a stochastic bandit design. In this context, typical estimators that use inverse propensity weighting to eliminate sampling bias can be problematic: their distributions become skewed and heavy-tailed as the propensity scores decay to zero. In this paper, we present a class of estimators that overcome these issues. Our approach is to adaptively reweight the terms of an augmented inverse propensity weighting estimator to control the contribution of each term to the estimator's variance. This adaptive weighting scheme prevents estimates from becoming heavy-tailed, ensuring asymptotically correct coverage. It also reduces variance, allowing us to test hypotheses with greater power - especially hypotheses that were not targeted by the experimental design. We validate the accuracy of the resulting estimates and their confidence intervals in numerical experiments and show our methods compare favorably to existing alternatives in terms of RMSE and coverage. △ Less

Submitted 12 February, 2021; v1 submitted 7 November, 2019; originally announced November 2019.

arXiv:1902.05097 [pdf]

doi 10.1109/TGRS.2019.2924221

Knowledge-aided Two-dimensional Autofocus for Spotlight SAR Filtered Backprojection Imagery

Authors: Xinhua Mao, Lan Ding, Yudong Zhang, Ronghui Zhan, Shan Li

Abstract: Filtered backprojection (FBP) algorithm is a popular choice for complicated trajectory SAR image formation processing due to its inherent nonlinear motion compensation capability. However, how to efficiently autofocus the defocused FBP imagery when the motion measurement is not accurate enough is still a challenging problem. In this paper, a new interpretation of the FBP derivation is presented fr… ▽ More Filtered backprojection (FBP) algorithm is a popular choice for complicated trajectory SAR image formation processing due to its inherent nonlinear motion compensation capability. However, how to efficiently autofocus the defocused FBP imagery when the motion measurement is not accurate enough is still a challenging problem. In this paper, a new interpretation of the FBP derivation is presented from the Fourier transform point of view. Based on this new viewpoint, the property of the residual 2-D phase error in FBP imagery is analyzed in detail. Then, by incorporating the derived a priori knowledge on the 2-D phase error, an accurate and efficient 2-D autofocus approach is proposed. The new approach performs the parameter estimation in a dimension-reduced parameter subspace by exploiting the a priori analytical structure of the 2-D phase error, therefore possesses much higher accuracy and efficiency than conventional blind methods. Finally, experimental results clearly demonstrate the effectiveness and robustness of the proposed method. △ Less

Submitted 13 February, 2019; originally announced February 2019.

Comments: 14 pages, 24 figures

arXiv:1810.11812 [pdf, ps, other]

doi 10.1063/1.5081452

Topologically nontrivial phases in superconducting transition metal carbides

Authors: Richard Zhan, Xuan Luo

Abstract: Topological superconductors have shown great potential in the search for unique quasiparticles such as Majorana fermions. Combining nontrivial band topology and superconductivity can lead to topological superconductivity due to the proximity effect. In this work, we used first principles calculations to predict that rock-salt phases of VC and CrC are superconducting with topologically nontrivial s… ▽ More Topological superconductors have shown great potential in the search for unique quasiparticles such as Majorana fermions. Combining nontrivial band topology and superconductivity can lead to topological superconductivity due to the proximity effect. In this work, we used first principles calculations to predict that rock-salt phases of VC and CrC are superconducting with topologically nontrivial states. The phonon dispersions of these transition metal carbides displayed no imaginary frequencies, which suggests dynamic stability. Additionally, the presence of soft acoustic phonon bands supports the existence of Bardeen-Cooper-Schrieffer superconductivity in rock-salt VC and CrC. Therefore, these transition metal carbides are practical candidates for studying topological superconductors and their associated Majorana bound states. △ Less

Submitted 20 February, 2019; v1 submitted 28 October, 2018; originally announced October 2018.

Journal ref: J. Appl. Phys. 125, 053903 (2019)

arXiv:1601.00811 [pdf, other]

CT Image Reconstruction by Spatial-Radon Domain Data-Driven Tight Frame Regularization

Authors: Ruohan Zhan, Bin Dong

Abstract: This paper proposes a spatial-Radon domain CT image reconstruction model based on data-driven tight frames (SRD-DDTF). The proposed SRD-DDTF model combines the idea of joint image and Radon domain inpainting model of \cite{Dong2013X} and that of the data-driven tight frames for image denoising \cite{cai2014data}. It is different from existing models in that both CT image and its corresponding high… ▽ More This paper proposes a spatial-Radon domain CT image reconstruction model based on data-driven tight frames (SRD-DDTF). The proposed SRD-DDTF model combines the idea of joint image and Radon domain inpainting model of \cite{Dong2013X} and that of the data-driven tight frames for image denoising \cite{cai2014data}. It is different from existing models in that both CT image and its corresponding high quality projection image are reconstructed simultaneously using sparsity priors by tight frames that are adaptively learned from the data to provide optimal sparse approximations. An alternative minimization algorithm is designed to solve the proposed model which is nonsmooth and nonconvex. Convergence analysis of the algorithm is provided. Numerical experiments showed that the SRD-DDTF model is superior to the model by \cite{Dong2013X} especially in recovering some subtle structures in the images. △ Less

Submitted 26 January, 2016; v1 submitted 5 January, 2016; originally announced January 2016.

arXiv:1402.2829 [pdf, other]

doi 10.1103/PhysRevB.89.245434

Tunable terahertz radiation from graphene induced by moving electrons

Authors: T. R. Zhan, D. Z. Han, X. H. Hu, X. H. Liu, S. T. Chui, J. Zi

Abstract: Based on a structure consisting of a single graphene layer situated on a periodic dielectric grating, we show theoretically that intense terahertz (THz) radiations can be generated by an electron bunch moving atop the graphene layer. The underlying physics lies in the fact that a moving electron bunch with rather low electron energy ($\sim$1 keV) can efficiently excite graphene plasmons (GPs) of T… ▽ More Based on a structure consisting of a single graphene layer situated on a periodic dielectric grating, we show theoretically that intense terahertz (THz) radiations can be generated by an electron bunch moving atop the graphene layer. The underlying physics lies in the fact that a moving electron bunch with rather low electron energy ($\sim$1 keV) can efficiently excite graphene plasmons (GPs) of THz frequencies with a strong confinement of near-fields. GPs can be further scattered into free space by the grating for those satisfying the phase matching condition. The radiation patterns can be controlled by varying the velocity of the moving electrons. Importantly, the radiation frequencies can be tuned by varying the Fermi level of the graphene layer, offering tunable THz radiations that can cover a wide frequency range. Our results could pave the way toward developing tunable and miniature THz radiation sources based on graphene. △ Less

Submitted 12 February, 2014; originally announced February 2014.

Showing 1–33 of 33 results for author: Zhan, R