-
Faraday laser pumped cesium beam clock
Authors:
Hangbo Shi,
Xiaomin Qin,
Haijun Chen,
Yufei Yan,
Ziqi Lu,
Zhiyang Wang,
Zijie Liu,
Xiaolei Guan,
Qiang Wei,
Tiantian Shi,
Jingbiao Chen
Abstract:
We realize a high-performance compact optically pumped cesium beam clock using Faraday laser simultaneously as pumping and detection lasers. The Faraday laser, which is frequency stabilized by modulation transfer spectroscopy (MTS) technique, has narrow linewidth and superior frequency stability. Measured by optical heterodyne method between two identical systems, the linewidth of the Faraday lase…
▽ More
We realize a high-performance compact optically pumped cesium beam clock using Faraday laser simultaneously as pumping and detection lasers. The Faraday laser, which is frequency stabilized by modulation transfer spectroscopy (MTS) technique, has narrow linewidth and superior frequency stability. Measured by optical heterodyne method between two identical systems, the linewidth of the Faraday laser is 2.5 kHz after MTS locking, and the fractional frequency stability of the Faraday laser is optimized to $1.8\times{10}^{-12}/\sqrtτ$. Based on this high-performance Faraday laser, the cesium beam clock realizes a signal-to-noise ratio (SNR) in 1 Hz bandwidth of $39600$ when the cesium oven temperature is 130°C. Frequency-compared with Hydrogen maser, the fractional frequency stability of the Faraday laser pumped cesium beam clock can reach $1.3\times{10}^{-12}/\sqrtτ$ and drops to $1.4\times{10}^{-14}$ at 10000 s when the cesium oven temperature is 110°C. %, which is the best reported result compared with other cesium beam clocks. This Faraday laser pumped cesium beam clock demonstrates its excellent performance, and its great potential in the fields of timekeeping, navigation, and communication. Meanwhile, the Faraday laser, as a high-performance optical frequency standard, can also contribute to the development of other applications in quantum metrology, precision measurement and atomic physics.
△ Less
Submitted 11 July, 2024; v1 submitted 8 July, 2024;
originally announced July 2024.
-
Read Anywhere Pointed: Layout-aware GUI Screen Reading with Tree-of-Lens Grounding
Authors:
Yue Fan,
Lei Ding,
Ching-Chen Kuo,
Shan Jiang,
Yang Zhao,
Xinze Guan,
Jie Yang,
Yi Zhang,
Xin Eric Wang
Abstract:
Graphical User Interfaces (GUIs) are central to our interaction with digital devices. Recently, growing efforts have been made to build models for various GUI understanding tasks. However, these efforts largely overlook an important GUI-referring task: screen reading based on user-indicated points, which we name the Screen Point-and-Read (SPR) task. This task is predominantly handled by rigid acce…
▽ More
Graphical User Interfaces (GUIs) are central to our interaction with digital devices. Recently, growing efforts have been made to build models for various GUI understanding tasks. However, these efforts largely overlook an important GUI-referring task: screen reading based on user-indicated points, which we name the Screen Point-and-Read (SPR) task. This task is predominantly handled by rigid accessible screen reading tools, in great need of new models driven by advancements in Multimodal Large Language Models (MLLMs). In this paper, we propose a Tree-of-Lens (ToL) agent, utilizing a novel ToL grounding mechanism, to address the SPR task. Based on the input point coordinate and the corresponding GUI screenshot, our ToL agent constructs a Hierarchical Layout Tree. Based on the tree, our ToL agent not only comprehends the content of the indicated area but also articulates the layout and spatial relationships between elements. Such layout information is crucial for accurately interpreting information on the screen, distinguishing our ToL agent from other screen reading tools. We also thoroughly evaluate the ToL agent against other baselines on a newly proposed SPR benchmark, which includes GUIs from mobile, web, and operating systems. Last but not least, we test the ToL agent on mobile GUI navigation tasks, demonstrating its utility in identifying incorrect actions along the path of agent execution trajectories. Code and data: screen-point-and-read.github.io
△ Less
Submitted 27 June, 2024;
originally announced June 2024.
-
Tight Toughness and Isolated Toughness for $\{K_2,C_n\}$-factor critical avoidable graph
Authors:
Xiaxia Guan,
Hongxia Ma,
Maoqun Wang
Abstract:
A spannning subgraph $F$ of $G$ is a $\{K_2,C_n\}$-factor if each component of $F$ is either $K_{2}$ or $C_{n}$. A graph $G$ is called a $(\{K_2,C_n\},n)$-factor critical avoidable graph if $G-X-e$ has a $\{K_2,C_n\}$-factor for any $S\subseteq V(G)$ with $|X|=n$ and $e\in E(G-X)$. In this paper, we first obtain a sufficient condition with regard to isolated toughness of a graph $G$ such that $G$…
▽ More
A spannning subgraph $F$ of $G$ is a $\{K_2,C_n\}$-factor if each component of $F$ is either $K_{2}$ or $C_{n}$. A graph $G$ is called a $(\{K_2,C_n\},n)$-factor critical avoidable graph if $G-X-e$ has a $\{K_2,C_n\}$-factor for any $S\subseteq V(G)$ with $|X|=n$ and $e\in E(G-X)$. In this paper, we first obtain a sufficient condition with regard to isolated toughness of a graph $G$ such that $G$ is $\{K_2,C_{n}\}$-factor critical avoidable. In addition, we give a sufficient condition with regard to tight toughness and isolated toughness of a graph $G$ such that $G$ is $\{K_2,C_{2i+1}|i \geqslant 2\}$-factor critical avoidable respectively.
△ Less
Submitted 25 June, 2024;
originally announced June 2024.
-
JobFair: A Framework for Benchmarking Gender Hiring Bias in Large Language Models
Authors:
Ze Wang,
Zekun Wu,
Xin Guan,
Michael Thaler,
Adriano Koshiyama,
Skylar Lu,
Sachin Beepath,
Ediz Ertekin Jr.,
Maria Perez-Ortiz
Abstract:
This paper presents a novel framework for benchmarking hierarchical gender hiring bias in Large Language Models (LLMs) for resume scoring, revealing significant issues of reverse bias and overdebiasing. Our contributions are fourfold: First, we introduce a framework using a real, anonymized resume dataset from the Healthcare, Finance, and Construction industries, meticulously used to avoid confoun…
▽ More
This paper presents a novel framework for benchmarking hierarchical gender hiring bias in Large Language Models (LLMs) for resume scoring, revealing significant issues of reverse bias and overdebiasing. Our contributions are fourfold: First, we introduce a framework using a real, anonymized resume dataset from the Healthcare, Finance, and Construction industries, meticulously used to avoid confounding factors. It evaluates gender hiring biases across hierarchical levels, including Level bias, Spread bias, Taste-based bias, and Statistical bias. This framework can be generalized to other social traits and tasks easily. Second, we propose novel statistical and computational hiring bias metrics based on a counterfactual approach, including Rank After Scoring (RAS), Rank-based Impact Ratio, Permutation Test-Based Metrics, and Fixed Effects Model-based Metrics. These metrics, rooted in labor economics, NLP, and law, enable holistic evaluation of hiring biases. Third, we analyze hiring biases in ten state-of-the-art LLMs. Six out of ten LLMs show significant biases against males in healthcare and finance. An industry-effect regression reveals that the healthcare industry is the most biased against males. GPT-4o and GPT-3.5 are the most biased models, showing significant bias in all three industries. Conversely, Gemini-1.5-Pro, Llama3-8b-Instruct, and Llama3-70b-Instruct are the least biased. The hiring bias of all LLMs, except for Llama3-8b-Instruct and Claude-3-Sonnet, remains consistent regardless of random expansion or reduction of resume content. Finally, we offer a user-friendly demo to facilitate adoption and practical application of the framework.
△ Less
Submitted 17 June, 2024;
originally announced June 2024.
-
On-Policy Fine-grained Knowledge Feedback for Hallucination Mitigation
Authors:
Xueru Wen,
Xinyu Lu,
Xinyan Guan,
Yaojie Lu,
Hongyu Lin,
Ben He,
Xianpei Han,
Le Sun
Abstract:
Hallucination occurs when large language models (LLMs) exhibit behavior that deviates from the boundaries of their knowledge during the response generation process. Previous learning-based methods focus on detecting knowledge boundaries and finetuning models with instance-level feedback, but they suffer from inaccurate signals due to off-policy data sampling and coarse-grained feedback. In this pa…
▽ More
Hallucination occurs when large language models (LLMs) exhibit behavior that deviates from the boundaries of their knowledge during the response generation process. Previous learning-based methods focus on detecting knowledge boundaries and finetuning models with instance-level feedback, but they suffer from inaccurate signals due to off-policy data sampling and coarse-grained feedback. In this paper, we introduce \textit{\b{R}einforcement \b{L}earning \b{f}or \b{H}allucination} (RLFH), a fine-grained feedback-based online reinforcement learning method for hallucination mitigation. Unlike previous learning-based methods, RLFH enables LLMs to explore the boundaries of their internal knowledge and provide on-policy, fine-grained feedback on these explorations. To construct fine-grained feedback for learning reliable generation behavior, RLFH decomposes the outcomes of large models into atomic facts, provides statement-level evaluation signals, and traces back the signals to the tokens of the original responses. Finally, RLFH adopts the online reinforcement algorithm with these token-level rewards to adjust model behavior for hallucination mitigation. For effective on-policy optimization, RLFH also introduces an LLM-based fact assessment framework to verify the truthfulness and helpfulness of atomic facts without human intervention. Experiments on HotpotQA, SQuADv2, and Biography benchmarks demonstrate that RLFH can balance their usage of internal knowledge during the generation process to eliminate the hallucination behavior of LLMs.
△ Less
Submitted 17 June, 2024;
originally announced June 2024.
-
Localized subspace iteration methods for elliptic multiscale problems
Authors:
Xiaofei Guan,
Lijian Jiang,
Yajun Wang,
Zihao Yang
Abstract:
This paper proposes localized subspace iteration (LSI) methods to construct generalized finite element basis functions for elliptic problems with multiscale coefficients. The key components of the proposed method consist of the localization of the original differential operator and the subspace iteration of the corresponding local spectral problems, where the localization is conducted by enforcing…
▽ More
This paper proposes localized subspace iteration (LSI) methods to construct generalized finite element basis functions for elliptic problems with multiscale coefficients. The key components of the proposed method consist of the localization of the original differential operator and the subspace iteration of the corresponding local spectral problems, where the localization is conducted by enforcing the local homogeneous Dirichlet condition and the partition of the unity functions. From a novel perspective, some multiscale methods can be regarded as one iteration step under approximating the eigenspace of the corresponding local spectral problems. Vice versa, new multiscale methods can be designed through subspaces of spectral problem algorithms. Then, we propose the efficient localized standard subspace iteration (LSSI) method and the localized Krylov subspace iteration (LKSI) method based on the standard subspace and Krylov subspace, respectively. Convergence analysis is carried out for the proposed method. Various numerical examples demonstrate the effectiveness of our methods. In addition, the proposed methods show significant superiority in treating long-channel cases over other well-known multiscale methods.
△ Less
Submitted 14 June, 2024;
originally announced June 2024.
-
A semi-implicit stochastic multiscale method for radiative heat transfer problem
Authors:
Shan Zhang,
Yajun Wang,
Xiaofei Guan
Abstract:
In this paper, we propose and analyze a new semi-implicit stochastic multiscale method for the radiative heat transfer problem with additive noise fluctuation in composite materials. In the proposed method, the strong nonlinearity term induced by heat radiation is first approximated, by a semi-implicit predictor-corrected numerical scheme, for each fixed time step, resulting in a spatially random…
▽ More
In this paper, we propose and analyze a new semi-implicit stochastic multiscale method for the radiative heat transfer problem with additive noise fluctuation in composite materials. In the proposed method, the strong nonlinearity term induced by heat radiation is first approximated, by a semi-implicit predictor-corrected numerical scheme, for each fixed time step, resulting in a spatially random multiscale heat transfer equation. Then, the infinite-dimensional stochastic processes are modeled and truncated using a complete orthogonal system, facilitating the reduction of the model's dimensionality in the random space. The resulting low-rank random multiscale heat transfer equation is approximated and computed by using efficient spatial basis functions based multiscale method. The main advantage of the proposed method is that it separates the computational difficulty caused by the spatial multiscale properties, the high-dimensional randomness and the strong nonlinearity of the solution, so they can be overcome separately using different strategies. The convergence analysis is carried out, and the optimal rate of convergence is also obtained for the proposed semi-implicit stochastic multiscale method. Numerical experiments on several test problems for composite materials with various microstructures are also presented to gauge the efficiency and accuracy of the proposed semi-implicit stochastic multiscale method.
△ Less
Submitted 14 June, 2024;
originally announced June 2024.
-
Joint Association, Beamforming, and Resource Allocation for Multi-IRS Enabled MU-MISO Systems With RSMA
Authors:
Chunjie Wang,
Xuhui Zhang,
Huijun Xing,
Liang Xue,
Shuqiang Wang,
Yanyan Shen,
Bo Yang,
Xinping Guan
Abstract:
Intelligent reflecting surface (IRS) and rate-splitting multiple access (RSMA) technologies are at the forefront of enhancing spectrum and energy efficiency in the next generation multi-antenna communication systems. This paper explores a RSMA system with multiple IRSs, and proposes two purpose-driven scheduling schemes, i.e., the exhaustive IRS-aided (EIA) and opportunistic IRS-aided (OIA) scheme…
▽ More
Intelligent reflecting surface (IRS) and rate-splitting multiple access (RSMA) technologies are at the forefront of enhancing spectrum and energy efficiency in the next generation multi-antenna communication systems. This paper explores a RSMA system with multiple IRSs, and proposes two purpose-driven scheduling schemes, i.e., the exhaustive IRS-aided (EIA) and opportunistic IRS-aided (OIA) schemes. The aim is to optimize the system weighted energy efficiency (EE) under the above two schemes, respectively. Specifically, the Dinkelbach, branch and bound, successive convex approximation, and the semidefinite relaxation methods are exploited within the alternating optimization framework to obtain effective solutions to the considered problems. The numerical findings indicate that the EIA scheme exhibits better performance compared to the OIA scheme in diverse scenarios when considering the weighted EE, and the proposed algorithm demonstrates superior performance in comparison to the baseline algorithms.
△ Less
Submitted 5 June, 2024;
originally announced June 2024.
-
Hybrid-Parallel: Achieving High Performance and Energy Efficient Distributed Inference on Robots
Authors:
Zekai Sun,
Xiuxian Guan,
Junming Wang,
Haoze Song,
Yuhao Qing,
Tianxiang Shen,
Dong Huang,
Fangming Liu,
Heming Cui
Abstract:
The rapid advancements in machine learning techniques have led to significant achievements in various real-world robotic tasks. These tasks heavily rely on fast and energy-efficient inference of deep neural network (DNN) models when deployed on robots. To enhance inference performance, distributed inference has emerged as a promising approach, parallelizing inference across multiple powerful GPU d…
▽ More
The rapid advancements in machine learning techniques have led to significant achievements in various real-world robotic tasks. These tasks heavily rely on fast and energy-efficient inference of deep neural network (DNN) models when deployed on robots. To enhance inference performance, distributed inference has emerged as a promising approach, parallelizing inference across multiple powerful GPU devices in modern data centers using techniques such as data parallelism, tensor parallelism, and pipeline parallelism. However, when deployed on real-world robots, existing parallel methods fail to provide low inference latency and meet the energy requirements due to the limited bandwidth of robotic IoT. We present Hybrid-Parallel, a high-performance distributed inference system optimized for robotic IoT. Hybrid-Parallel employs a fine-grained approach to parallelize inference at the granularity of local operators within DNN layers (i.e., operators that can be computed independently with the partial input, such as the convolution kernel in the convolution layer). By doing so, Hybrid-Parallel enables different operators of different layers to be computed and transmitted concurrently, and overlap the computation and transmission phases within the same inference task. The evaluation demonstrate that Hybrid-Parallel reduces inference time by 14.9% ~41.1% and energy consumption per inference by up to 35.3% compared to the state-of-the-art baselines.
△ Less
Submitted 29 May, 2024;
originally announced May 2024.
-
Blade: A package for block-triangular form improved Feynman integrals decomposition
Authors:
Xin Guan,
Xiao Liu,
Yan-Qing Ma,
Wen-Hao Wu
Abstract:
In this article, we present the package Blade as the first implementation of the block-triangular form improved Feynman integral reduction method. The block-triangular form has orders of magnitude fewer equations compared to the plain integration-by-parts system, allowing for strictly block-by-block solutions. This results in faster evaluations and reduced resource consumption. We elucidate the al…
▽ More
In this article, we present the package Blade as the first implementation of the block-triangular form improved Feynman integral reduction method. The block-triangular form has orders of magnitude fewer equations compared to the plain integration-by-parts system, allowing for strictly block-by-block solutions. This results in faster evaluations and reduced resource consumption. We elucidate the algorithms involved in obtaining the block-triangular form along with their implementations. Additionally, we introduce novel algorithms for finding the canonical form and symmetry relations of Feynman integrals, as well as for performing spanning-sector reduction. Our benchmarks for various state-of-the-art problems demonstrate that Blade is remarkably competitive among existing reduction tools. Furthermore, the Blade package offers several distinctive features, including support for complex kinematic variables or masses, user-defined Feynman prescriptions for each propagator, and general integrands.
△ Less
Submitted 23 May, 2024;
originally announced May 2024.
-
ECLIPSE: Semantic Entropy-LCS for Cross-Lingual Industrial Log Parsing
Authors:
Wei Zhang,
Xianfu Cheng,
Yi Zhang,
Jian Yang,
Hongcheng Guo,
Zhoujun Li,
Xiaolin Yin,
Xiangyuan Guan,
Xu Shi,
Liangfan Zheng,
Bo Zhang
Abstract:
Log parsing, a vital task for interpreting the vast and complex data produced within software architectures faces significant challenges in the transition from academic benchmarks to the industrial domain. Existing log parsers, while highly effective on standardized public datasets, struggle to maintain performance and efficiency when confronted with the sheer scale and diversity of real-world ind…
▽ More
Log parsing, a vital task for interpreting the vast and complex data produced within software architectures faces significant challenges in the transition from academic benchmarks to the industrial domain. Existing log parsers, while highly effective on standardized public datasets, struggle to maintain performance and efficiency when confronted with the sheer scale and diversity of real-world industrial logs. These challenges are two-fold: 1) massive log templates: The performance and efficiency of most existing parsers will be significantly reduced when logs of growing quantities and different lengths; 2) Complex and changeable semantics: Traditional template-matching algorithms cannot accurately match the log templates of complicated industrial logs because they cannot utilize cross-language logs with similar semantics. To address these issues, we propose ECLIPSE, Enhanced Cross-Lingual Industrial log Parsing with Semantic Entropy-LCS, since cross-language logs can robustly parse industrial logs. On the one hand, it integrates two efficient data-driven template-matching algorithms and Faiss indexing. On the other hand, driven by the powerful semantic understanding ability of the Large Language Model (LLM), the semantics of log keywords were accurately extracted, and the retrieval space was effectively reduced. Notably, we launch a Chinese and English cross-platform industrial log parsing benchmark ECLIPSE- BENCH to evaluate the performance of mainstream parsers in industrial scenarios. Our experimental results across public benchmarks and ECLIPSE- BENCH underscore the superior performance and robustness of our proposed ECLIPSE. Notably, ECLIPSE both delivers state-of-the-art performance when compared to strong baselines and preserves a significant edge in processing efficiency.
△ Less
Submitted 24 May, 2024; v1 submitted 22 May, 2024;
originally announced May 2024.
-
MIPI 2024 Challenge on Demosaic for HybridEVS Camera: Methods and Results
Authors:
Yaqi Wu,
Zhihao Fan,
Xiaofeng Chu,
Jimmy S. Ren,
Xiaoming Li,
Zongsheng Yue,
Chongyi Li,
Shangcheng Zhou,
Ruicheng Feng,
Yuekun Dai,
Peiqing Yang,
Chen Change Loy,
Senyan Xu,
Zhijing Sun,
Jiaying Zhu,
Yurui Zhu,
Xueyang Fu,
Zheng-Jun Zha,
Jun Cao,
Cheng Li,
Shu Chen,
Liang Ma,
Shiyang Zhou,
Haijin Zeng,
Kai Feng
, et al. (24 additional authors not shown)
Abstract:
The increasing demand for computational photography and imaging on mobile platforms has led to the widespread development and integration of advanced image sensors with novel algorithms in camera systems. However, the scarcity of high-quality data for research and the rare opportunity for in-depth exchange of views from industry and academia constrain the development of mobile intelligent photogra…
▽ More
The increasing demand for computational photography and imaging on mobile platforms has led to the widespread development and integration of advanced image sensors with novel algorithms in camera systems. However, the scarcity of high-quality data for research and the rare opportunity for in-depth exchange of views from industry and academia constrain the development of mobile intelligent photography and imaging (MIPI). Building on the achievements of the previous MIPI Workshops held at ECCV 2022 and CVPR 2023, we introduce our third MIPI challenge including three tracks focusing on novel image sensors and imaging algorithms. In this paper, we summarize and review the Nighttime Flare Removal track on MIPI 2024. In total, 170 participants were successfully registered, and 14 teams submitted results in the final testing phase. The developed solutions in this challenge achieved state-of-the-art performance on Nighttime Flare Removal. More details of this challenge and the link to the dataset can be found at https://mipi-challenge.org/MIPI2024/.
△ Less
Submitted 8 May, 2024;
originally announced May 2024.
-
Joint Identity Verification and Pose Alignment for Partial Fingerprints
Authors:
Xiongjun Guan,
Zhiyu Pan,
Jianjiang Feng,
Jie Zhou
Abstract:
Currently, portable electronic devices are becoming more and more popular. For lightweight considerations, their fingerprint recognition modules usually use limited-size sensors. However, partial fingerprints have few matchable features, especially when there are differences in finger pressing posture or image quality, which makes partial fingerprint verification challenging. Most existing methods…
▽ More
Currently, portable electronic devices are becoming more and more popular. For lightweight considerations, their fingerprint recognition modules usually use limited-size sensors. However, partial fingerprints have few matchable features, especially when there are differences in finger pressing posture or image quality, which makes partial fingerprint verification challenging. Most existing methods regard fingerprint position rectification and identity verification as independent tasks, ignoring the coupling relationship between them -- relative pose estimation typically relies on paired features as anchors, and authentication accuracy tends to improve with more precise pose alignment. In this paper, we propose a novel framework for joint identity verification and pose alignment of partial fingerprint pairs, aiming to leverage their inherent correlation to improve each other. To achieve this, we present a multi-task CNN (Convolutional Neural Network)-Transformer hybrid network, and design a pre-training task to enhance the feature extraction capability. Experiments on multiple public datasets (NIST SD14, FVC2002 DB1A & DB3A, FVC2004 DB1A & DB2A, FVC2006 DB1A) and an in-house dataset show that our method achieves state-of-the-art performance in both partial fingerprint verification and relative pose estimation, while being more efficient than previous methods.
△ Less
Submitted 21 May, 2024; v1 submitted 6 May, 2024;
originally announced May 2024.
-
Deep Learning of ab initio Hessians for Transition State Optimization
Authors:
Eric C. -Y. Yuan,
Anup Kumar,
Xingyi Guan,
Eric D. Hermes,
Andrew S. Rosen,
Judit Zádor,
Teresa Head-Gordon,
Samuel M. Blau
Abstract:
Identifying transition states -- saddle points on the potential energy surface connecting reactant and product minima -- is central to predicting kinetic barriers and understanding chemical reaction mechanisms. In this work, we train an equivariant neural network potential, NewtonNet, on an ab initio dataset of thousands of organic reactions from which we derive the analytical Hessians from the fu…
▽ More
Identifying transition states -- saddle points on the potential energy surface connecting reactant and product minima -- is central to predicting kinetic barriers and understanding chemical reaction mechanisms. In this work, we train an equivariant neural network potential, NewtonNet, on an ab initio dataset of thousands of organic reactions from which we derive the analytical Hessians from the fully differentiable machine learning (ML) model. By reducing the computational cost by several orders of magnitude relative to the Density Functional Theory (DFT) ab initio source, we can afford to use the learned Hessians at every step for the saddle point optimizations. We have implemented our ML Hessian algorithm in Sella, an open source software package designed to optimize atomic systems to find saddle point structures, in order to compare transition state optimization against quasi-Newton Hessian updates using DFT or the ML model. We show that the full ML Hessian robustly finds the transition states of 240 unseen organic reactions, even when the quality of the initial guess structures are degraded, while reducing the number of optimization steps to convergence by 2--3$\times$ compared to the quasi-Newton DFT and ML methods. All data generation, NewtonNet model, and ML transition state finding methods are available in an automated workflow.
△ Less
Submitted 3 May, 2024;
originally announced May 2024.
-
Latent Fingerprint Matching via Dense Minutia Descriptor
Authors:
Zhiyu Pan,
Yongjie Duan,
Xiongjun Guan,
Jianjiang Feng,
Jie Zhou
Abstract:
Latent fingerprint matching is a daunting task, primarily due to the poor quality of latent fingerprints. In this study, we propose a deep-learning based dense minutia descriptor (DMD) for latent fingerprint matching. A DMD is obtained by extracting the fingerprint patch aligned by its central minutia, capturing detailed minutia information and texture information. Our dense descriptor takes the f…
▽ More
Latent fingerprint matching is a daunting task, primarily due to the poor quality of latent fingerprints. In this study, we propose a deep-learning based dense minutia descriptor (DMD) for latent fingerprint matching. A DMD is obtained by extracting the fingerprint patch aligned by its central minutia, capturing detailed minutia information and texture information. Our dense descriptor takes the form of a three-dimensional representation, with two dimensions associated with the original image plane and the other dimension representing the abstract features. Additionally, the extraction process outputs the fingerprint segmentation map, ensuring that the descriptor is only valid in the foreground region. The matching between two descriptors occurs in their overlapping regions, with a score normalization strategy to reduce the impact brought by the differences outside the valid area. Our descriptor achieves state-of-the-art performance on several latent fingerprint datasets. Overall, our DMD is more representative and interpretable compared to previous methods.
△ Less
Submitted 5 July, 2024; v1 submitted 2 May, 2024;
originally announced May 2024.
-
Deformability, inherent mechanical properties and chemical bonding of Al11Nd3 in Al-Nd target material
Authors:
Xue-Qian Wang,
Run-Xin Song,
Xu Guan,
Shuan Li,
Shuchen Sun,
Hongbo Yang,
Daogao Wu,
Ganfeng Tu,
Song Li,
Hai-Le Yan,
Liang Zuo
Abstract:
Microstructure uniformity of the Al-Nd target materials with Al11Nd3 significantly affects the performance of the fabricated film, which is widely used as wiring material in largesize thin-film transistor liquid crystal display (TFT-LCD) panels. Understanding the inherent mechanical properties and chemical bonds of Al11Nd3 is crucial for homogenizing the Al-Nd target. Here, by a combined experimen…
▽ More
Microstructure uniformity of the Al-Nd target materials with Al11Nd3 significantly affects the performance of the fabricated film, which is widely used as wiring material in largesize thin-film transistor liquid crystal display (TFT-LCD) panels. Understanding the inherent mechanical properties and chemical bonds of Al11Nd3 is crucial for homogenizing the Al-Nd target. Here, by a combined experimental and ab-initio theoretical study, the microstructure and deformability of the Al-3wt%Nd alloy and the inherent mechanical properties and chemical bonds of Al11Nd3 are investigated comprehensively. The Al-3wt%Nd alloy is composed of the pre-eutectic α-Al matrix and the eutectic α-Al and a high stable α-Al11Nd3 phases. During the plastic deformation, the eutectic microstructure transforms from a cellular to a lamellar shape, while the morphology and dimension of α-Al11Nd3 are not changed significantly. By examining ideal tensile strength, elastic moduli, hardness and brittleness-ductility, the hardnessbrittleness of α-Al11Nd3 is quantitatively evaluated, accounting for its difficulties of plastic deformation and fragmentation. Combining band structure, population analysis, topological analysis and crystal orbital Hamilton population, it is revealed that α-Al11Nd3 possesses two types of chemical bonds: the Nd-Al and Al-Al bonds. The former is a typical ionic bond with electron transfer from Nd to Al, while the latter, dominated by both 3s-3p and 3p-3p interactions, is a weak covalent bond. The mixed chemical bond is responsible for the high hardness-brittleness of α-Al11Nd3. This work is expected to lay a foundation for Al-Nd alloy and catalyze the fabrication of high-quality Al-Nd target materials.
△ Less
Submitted 27 April, 2024;
originally announced April 2024.
-
A Survey of Deep Learning Library Testing Methods
Authors:
Xiaoyu Zhang,
Weipeng Jiang,
Chao Shen,
Qi Li,
Qian Wang,
Chenhao Lin,
Xiaohong Guan
Abstract:
In recent years, software systems powered by deep learning (DL) techniques have significantly facilitated people's lives in many aspects. As the backbone of these DL systems, various DL libraries undertake the underlying optimization and computation. However, like traditional software, DL libraries are not immune to bugs, which can pose serious threats to users' personal property and safety. Study…
▽ More
In recent years, software systems powered by deep learning (DL) techniques have significantly facilitated people's lives in many aspects. As the backbone of these DL systems, various DL libraries undertake the underlying optimization and computation. However, like traditional software, DL libraries are not immune to bugs, which can pose serious threats to users' personal property and safety. Studying the characteristics of DL libraries, their associated bugs, and the corresponding testing methods is crucial for enhancing the security of DL systems and advancing the widespread application of DL technology. This paper provides an overview of the testing research related to various DL libraries, discusses the strengths and weaknesses of existing methods, and provides guidance and reference for the application of the DL library. This paper first introduces the workflow of DL underlying libraries and the characteristics of three kinds of DL libraries involved, namely DL framework, DL compiler, and DL hardware library. It then provides definitions for DL underlying library bugs and testing. Additionally, this paper summarizes the existing testing methods and tools tailored to these DL libraries separately and analyzes their effectiveness and limitations. It also discusses the existing challenges of DL library testing and outlines potential directions for future research.
△ Less
Submitted 27 April, 2024;
originally announced April 2024.
-
Regression of Dense Distortion Field from a Single Fingerprint Image
Authors:
Xiongjun Guan,
Yongjie Duan,
Jianjiang Feng,
Jie Zhou
Abstract:
Skin distortion is a long standing challenge in fingerprint matching, which causes false non-matches. Previous studies have shown that the recognition rate can be improved by estimating the distortion field from a distorted fingerprint and then rectifying it into a normal fingerprint. However, existing rectification methods are based on principal component representation of distortion fields, whic…
▽ More
Skin distortion is a long standing challenge in fingerprint matching, which causes false non-matches. Previous studies have shown that the recognition rate can be improved by estimating the distortion field from a distorted fingerprint and then rectifying it into a normal fingerprint. However, existing rectification methods are based on principal component representation of distortion fields, which is not accurate and are very sensitive to finger pose. In this paper, we propose a rectification method where a self-reference based network is utilized to directly estimate the dense distortion field of distorted fingerprint instead of its low dimensional representation. This method can output accurate distortion fields of distorted fingerprints with various finger poses and distortion patterns. We conducted experiments on FVC2004 DB1\_A, expanded Tsinghua Distorted Fingerprint database (with additional distorted fingerprints in diverse finger poses and distortion patterns) and a latent fingerprint database. Experimental results demonstrate that our proposed method achieves the state-of-the-art rectification performance in terms of distortion field estimation and rectified fingerprint matching.
△ Less
Submitted 26 April, 2024;
originally announced April 2024.
-
Phase-aggregated Dual-branch Network for Efficient Fingerprint Dense Registration
Authors:
Xiongjun Guan,
Jianjiang Feng,
Jie Zhou
Abstract:
Fingerprint dense registration aims to finely align fingerprint pairs at the pixel level, thereby reducing intra-class differences caused by distortion. Unfortunately, traditional methods exhibited subpar performance when dealing with low-quality fingerprints while suffering from slow inference speed. Although deep learning based approaches shows significant improvement in these aspects, their reg…
▽ More
Fingerprint dense registration aims to finely align fingerprint pairs at the pixel level, thereby reducing intra-class differences caused by distortion. Unfortunately, traditional methods exhibited subpar performance when dealing with low-quality fingerprints while suffering from slow inference speed. Although deep learning based approaches shows significant improvement in these aspects, their registration accuracy is still unsatisfactory. In this paper, we propose a Phase-aggregated Dual-branch Registration Network (PDRNet) to aggregate the advantages of both types of methods. A dual-branch structure with multi-stage interactions is introduced between correlation information at high resolution and texture feature at low resolution, to perceive local fine differences while ensuring global stability. Extensive experiments are conducted on more comprehensive databases compared to previous works. Experimental results demonstrate that our method reaches the state-of-the-art registration performance in terms of accuracy and robustness, while maintaining considerable competitiveness in efficiency.
△ Less
Submitted 26 April, 2024;
originally announced April 2024.
-
Pose-Specific 3D Fingerprint Unfolding
Authors:
Xiongjun Guan,
Jianjiang Feng,
Jie Zhou
Abstract:
In order to make 3D fingerprints compatible with traditional 2D flat fingerprints, a common practice is to unfold the 3D fingerprint into a 2D rolled fingerprint, which is then matched with the flat fingerprints by traditional 2D fingerprint recognition algorithms. The problem with this method is that there may be large elastic deformation between the unfolded rolled fingerprint and flat fingerpri…
▽ More
In order to make 3D fingerprints compatible with traditional 2D flat fingerprints, a common practice is to unfold the 3D fingerprint into a 2D rolled fingerprint, which is then matched with the flat fingerprints by traditional 2D fingerprint recognition algorithms. The problem with this method is that there may be large elastic deformation between the unfolded rolled fingerprint and flat fingerprint, which affects the recognition rate. In this paper, we propose a pose-specific 3D fingerprint unfolding algorithm to unfold the 3D fingerprint using the same pose as the flat fingerprint. Our experiments show that the proposed unfolding algorithm improves the compatibility between 3D fingerprint and flat fingerprint and thus leads to higher genuine matching scores.
△ Less
Submitted 26 April, 2024;
originally announced April 2024.
-
Direct Regression of Distortion Field from a Single Fingerprint Image
Authors:
Xiongjun Guan,
Yongjie Duan,
Jianjiang Feng,
Jie Zhou
Abstract:
Skin distortion is a long standing challenge in fingerprint matching, which causes false non-matches. Previous studies have shown that the recognition rate can be improved by estimating the distortion field from a distorted fingerprint and then rectifying it into a normal fingerprint. However, existing rectification methods are based on principal component representation of distortion fields, whic…
▽ More
Skin distortion is a long standing challenge in fingerprint matching, which causes false non-matches. Previous studies have shown that the recognition rate can be improved by estimating the distortion field from a distorted fingerprint and then rectifying it into a normal fingerprint. However, existing rectification methods are based on principal component representation of distortion fields, which is not accurate and are very sensitive to finger pose. In this paper, we propose a rectification method where a self-reference based network is utilized to directly estimate the dense distortion field of distorted fingerprint instead of its low dimensional representation. This method can output accurate distortion fields of distorted fingerprints with various finger poses. Considering the limited number and variety of distorted fingerprints in the existing public dataset, we collected more distorted fingerprints with diverse finger poses and distortion patterns as a new database. Experimental results demonstrate that our proposed method achieves the state-of-the-art rectification performance in terms of distortion field estimation and rectified fingerprint matching.
△ Less
Submitted 26 April, 2024;
originally announced April 2024.
-
Real-Time 4K Super-Resolution of Compressed AVIF Images. AIS 2024 Challenge Survey
Authors:
Marcos V. Conde,
Zhijun Lei,
Wen Li,
Cosmin Stejerean,
Ioannis Katsavounidis,
Radu Timofte,
Kihwan Yoon,
Ganzorig Gankhuyag,
Jiangtao Lv,
Long Sun,
Jinshan Pan,
Jiangxin Dong,
Jinhui Tang,
Zhiyuan Li,
Hao Wei,
Chenyang Ge,
Dongyang Zhang,
Tianle Liu,
Huaian Chen,
Yi Jin,
Menghan Zhou,
Yiqiang Yan,
Si Gao,
Biao Wu,
Shaoli Liu
, et al. (50 additional authors not shown)
Abstract:
This paper introduces a novel benchmark as part of the AIS 2024 Real-Time Image Super-Resolution (RTSR) Challenge, which aims to upscale compressed images from 540p to 4K resolution (4x factor) in real-time on commercial GPUs. For this, we use a diverse test set containing a variety of 4K images ranging from digital art to gaming and photography. The images are compressed using the modern AVIF cod…
▽ More
This paper introduces a novel benchmark as part of the AIS 2024 Real-Time Image Super-Resolution (RTSR) Challenge, which aims to upscale compressed images from 540p to 4K resolution (4x factor) in real-time on commercial GPUs. For this, we use a diverse test set containing a variety of 4K images ranging from digital art to gaming and photography. The images are compressed using the modern AVIF codec, instead of JPEG. All the proposed methods improve PSNR fidelity over Lanczos interpolation, and process images under 10ms. Out of the 160 participants, 25 teams submitted their code and models. The solutions present novel designs tailored for memory-efficiency and runtime on edge devices. This survey describes the best solutions for real-time SR of compressed high-resolution images.
△ Less
Submitted 25 April, 2024;
originally announced April 2024.
-
NTIRE 2024 Challenge on Image Super-Resolution ($\times$4): Methods and Results
Authors:
Zheng Chen,
Zongwei Wu,
Eduard Zamfir,
Kai Zhang,
Yulun Zhang,
Radu Timofte,
Xiaokang Yang,
Hongyuan Yu,
Cheng Wan,
Yuxin Hong,
Zhijuan Huang,
Yajun Zou,
Yuan Huang,
Jiamin Lin,
Bingnan Han,
Xianyu Guan,
Yongsheng Yu,
Daoan Zhang,
Xuanwu Yin,
Kunlong Zuo,
Jinhua Hao,
Kai Zhao,
Kun Yuan,
Ming Sun,
Chao Zhou
, et al. (63 additional authors not shown)
Abstract:
This paper reviews the NTIRE 2024 challenge on image super-resolution ($\times$4), highlighting the solutions proposed and the outcomes obtained. The challenge involves generating corresponding high-resolution (HR) images, magnified by a factor of four, from low-resolution (LR) inputs using prior information. The LR images originate from bicubic downsampling degradation. The aim of the challenge i…
▽ More
This paper reviews the NTIRE 2024 challenge on image super-resolution ($\times$4), highlighting the solutions proposed and the outcomes obtained. The challenge involves generating corresponding high-resolution (HR) images, magnified by a factor of four, from low-resolution (LR) inputs using prior information. The LR images originate from bicubic downsampling degradation. The aim of the challenge is to obtain designs/solutions with the most advanced SR performance, with no constraints on computational resources (e.g., model size and FLOPs) or training data. The track of this challenge assesses performance with the PSNR metric on the DIV2K testing dataset. The competition attracted 199 registrants, with 20 teams submitting valid entries. This collective endeavour not only pushes the boundaries of performance in single-image SR but also offers a comprehensive overview of current trends in this field.
△ Less
Submitted 15 April, 2024;
originally announced April 2024.
-
Stochastic-Robust Planning of Networked Hydrogen-Electrical Microgrids: A Study on Induced Refueling Demand
Authors:
Xunhang Sun,
Xiaoyu Cao,
Bo Zeng,
Qiaozhu Zhai,
Tamer Başar,
Xiaohong Guan
Abstract:
Hydrogen-electrical (HE) microgrids are increasingly assuming an important role on the pathway toward decarbonization of energy and transportation systems. This paper studies networked HE microgrids planning (NHEMP), considering a critical but often-overlooked issue, i.e., the demand-inducing effect (DIE) associated with infrastructure development decisions. Specifically, higher refueling capaciti…
▽ More
Hydrogen-electrical (HE) microgrids are increasingly assuming an important role on the pathway toward decarbonization of energy and transportation systems. This paper studies networked HE microgrids planning (NHEMP), considering a critical but often-overlooked issue, i.e., the demand-inducing effect (DIE) associated with infrastructure development decisions. Specifically, higher refueling capacities will attract more refueling demand of hydrogen-powered vehicles (HVs). To capture such interactions between investment decisions and induced refueling demand, we introduce a decision-dependent uncertainty (DDU) set and build a trilevel stochastic-robust formulation. The upper-level determines optimal investment strategies for HE microgrids, the lower-level optimizes the risk-aware operation schedules across a series of stochastic scenarios, and, for each scenario, the middle-level identifies the "worst" situation of refueling demand within an individual DDU set to ensure economic feasibility. Then, an adaptive and exact decomposition algorithm, based on Parametric Column-and-Constraint Generation (PC&CG), is customized and developed to address the computational challenge and to quantitatively analyze the impact of DIE. Case studies on an IEEE exemplary system validate the effectiveness of the proposed NHEMP model and the PC&CG algorithm. It is worth highlighting that DIE can make an important contribution to the economic benefits of NHEMP, yet its significance will gradually decrease when the main bottleneck transits to other system restrictions.
△ Less
Submitted 31 March, 2024;
originally announced April 2024.
-
Group Benefits Instances Selection for Data Purification
Authors:
Zhenhuang Cai,
Chuanyi Zhang,
Dan Huang,
Yuanbo Chen,
Xiuyun Guan,
Yazhou Yao
Abstract:
Manually annotating datasets for training deep models is very labor-intensive and time-consuming. To overcome such inferiority, directly leveraging web images to conduct training data becomes a natural choice. Nevertheless, the presence of label noise in web data usually degrades the model performance. Existing methods for combating label noise are typically designed and tested on synthetic noisy…
▽ More
Manually annotating datasets for training deep models is very labor-intensive and time-consuming. To overcome such inferiority, directly leveraging web images to conduct training data becomes a natural choice. Nevertheless, the presence of label noise in web data usually degrades the model performance. Existing methods for combating label noise are typically designed and tested on synthetic noisy datasets. However, they tend to fail to achieve satisfying results on real-world noisy datasets. To this end, we propose a method named GRIP to alleviate the noisy label problem for both synthetic and real-world datasets. Specifically, GRIP utilizes a group regularization strategy that estimates class soft labels to improve noise robustness. Soft label supervision reduces overfitting on noisy labels and learns inter-class similarities to benefit classification. Furthermore, an instance purification operation globally identifies noisy labels by measuring the difference between each training sample and its class soft label. Through operations at both group and instance levels, our approach integrates the advantages of noise-robust and noise-cleaning methods and remarkably alleviates the performance degradation caused by noisy labels. Comprehensive experimental results on synthetic and real-world datasets demonstrate the superiority of GRIP over the existing state-of-the-art methods.
△ Less
Submitted 22 March, 2024;
originally announced March 2024.
-
Two-scale Analysis for Multiscale Landau-Lifshitz-Gilbert Equation: Theory and Numerical Methods
Authors:
Xiaofei Guan,
Hang Qi,
Zhiwei Sun
Abstract:
This paper discusses the theory and numerical method of two-scale analysis for the multiscale Landau-Lifshitz-Gilbert equation in composite ferromagnetic materials. The novelty of this work can be summarized in three aspects: Firstly, the more realistic and complex model is considered, including the effects of the exchange field, anisotropy field, stray field, and external magnetic field. The expl…
▽ More
This paper discusses the theory and numerical method of two-scale analysis for the multiscale Landau-Lifshitz-Gilbert equation in composite ferromagnetic materials. The novelty of this work can be summarized in three aspects: Firstly, the more realistic and complex model is considered, including the effects of the exchange field, anisotropy field, stray field, and external magnetic field. The explicit convergence orders in the $H^1$ norm between the classical solution and the two-scale solution are obtained. Secondly, we propose a robust numerical framework, which is employed in several comprehensive experiments to validate the convergence results for the Periodic and Neumann problems. Thirdly, we design an improved implicit numerical scheme to reduce the required number of iterations and relaxes the constraints on the time step size, which can significantly improve computational efficiency. Specifically, the projection and the expansion methods are given to overcome the inherent non-consistency in the initial data between the multiscale problem and homogenized problem.
△ Less
Submitted 22 March, 2024;
originally announced March 2024.
-
AGRNav: Efficient and Energy-Saving Autonomous Navigation for Air-Ground Robots in Occlusion-Prone Environments
Authors:
Junming Wang,
Zekai Sun,
Xiuxian Guan,
Tianxiang Shen,
Zongyuan Zhang,
Tianyang Duan,
Dong Huang,
Shixiong Zhao,
Heming Cui
Abstract:
The exceptional mobility and long endurance of air-ground robots are raising interest in their usage to navigate complex environments (e.g., forests and large buildings). However, such environments often contain occluded and unknown regions, and without accurate prediction of unobserved obstacles, the movement of the air-ground robot often suffers a suboptimal trajectory under existing mapping-bas…
▽ More
The exceptional mobility and long endurance of air-ground robots are raising interest in their usage to navigate complex environments (e.g., forests and large buildings). However, such environments often contain occluded and unknown regions, and without accurate prediction of unobserved obstacles, the movement of the air-ground robot often suffers a suboptimal trajectory under existing mapping-based and learning-based navigation methods. In this work, we present AGRNav, a novel framework designed to search for safe and energy-saving air-ground hybrid paths. AGRNav contains a lightweight semantic scene completion network (SCONet) with self-attention to enable accurate obstacle predictions by capturing contextual information and occlusion area features. The framework subsequently employs a query-based method for low-latency updates of prediction results to the grid map. Finally, based on the updated map, the hierarchical path planner efficiently searches for energy-saving paths for navigation. We validate AGRNav's performance through benchmarks in both simulated and real-world environments, demonstrating its superiority over classical and state-of-the-art methods. The open-source code is available at https://github.com/jmwang0117/AGRNav.
△ Less
Submitted 18 March, 2024;
originally announced March 2024.
-
ABC-Channel: An Advanced Blockchain-based Covert Channel
Authors:
Xiaobo Ma,
Pengyu Pan,
Jianfeng Li,
Wei Wang,
Weizhi Meng,
Xiaohong Guan
Abstract:
Establishing efficient and robust covert channels is crucial for secure communication within insecure network environments. With its inherent benefits of decentralization and anonymization, blockchain has gained considerable attention in developing covert channels. To guarantee a highly secure covert channel, channel negotiation should be contactless before the communication, carrier transaction f…
▽ More
Establishing efficient and robust covert channels is crucial for secure communication within insecure network environments. With its inherent benefits of decentralization and anonymization, blockchain has gained considerable attention in developing covert channels. To guarantee a highly secure covert channel, channel negotiation should be contactless before the communication, carrier transaction features must be indistinguishable from normal transactions during the communication, and communication identities must be untraceable after the communication. Such a full-lifecycle covert channel is indispensable to defend against a versatile adversary who intercepts two communicating parties comprehensively (e.g., on-chain and off-chain). Unfortunately, it has not been thoroughly investigated in the literature. We make the first effort to achieve a full-lifecycle covert channel, a novel blockchain-based covert channel named ABC-Channel. We tackle a series of challenges, such as off-chain contact dependency, increased masquerading difficulties as growing transaction volume, and time-evolving, communicable yet untraceable identities, to achieve contactless channel negotiation, indistinguishable transaction features, and untraceable communication identities, respectively. We develop a working prototype to validate ABC-Channel and conduct extensive tests on the Bitcoin testnet. The experimental results demonstrate that ABC-Channel achieves substantially secure covert capabilities. In comparison to existing methods, it also exhibits state-of-the-art transmission efficiency.
△ Less
Submitted 24 March, 2024; v1 submitted 10 March, 2024;
originally announced March 2024.
-
A direct proof of well-definedness for the polymatroid Tutte polynomial
Authors:
Xiaxia Guan,
Xian'an Jin
Abstract:
For a polymatroid $P$ over $[n]$, Bernardi, Kálmán and Postnikov [\emph{Adv. Math.} 402 (2022) 108355] introduced the polymatroid Tutte polynomial $\mathscr{T}_{P}$ relying on the order $1<2<\cdots<n$ of $[n]$, which generalizes the classical Tutte polynomial from matroids to polymatroids. They proved the independence of this order by the fact that $\mathscr{T}_{P}$ is equivalent to another polyno…
▽ More
For a polymatroid $P$ over $[n]$, Bernardi, Kálmán and Postnikov [\emph{Adv. Math.} 402 (2022) 108355] introduced the polymatroid Tutte polynomial $\mathscr{T}_{P}$ relying on the order $1<2<\cdots<n$ of $[n]$, which generalizes the classical Tutte polynomial from matroids to polymatroids. They proved the independence of this order by the fact that $\mathscr{T}_{P}$ is equivalent to another polynomial that only depends on $P$. In this paper, similar to the Tutte's original proof of the well-definedness of the Tutte polynomial defined by the summation over all spanning trees using activities depending on the order of edges, we give a direct and elementary proof of the well-definedness of the polymatroid Tutte polynomial.
△ Less
Submitted 9 March, 2024;
originally announced March 2024.
-
Approaching the double-Heisenberg-scaling sensitivity in the Tavis-Cummings model
Authors:
Yuguo Su,
Hai-Long Shi,
Xiao-Guang Wang,
Chaohong Lee,
Xi-Wen Guan
Abstract:
The pursuit of quantum-enhanced parameter estimations without the need for nonclassical initial states has long been driven by the goal of achieving experimentally accessible quantum metrology. In this paper, employing a coherent averaging mechanism, we prove that the prototypical cavity-quantum electrodynamics (QED) system, such as the Tavis-Cummings (TC) model, enables us to achieve not only the…
▽ More
The pursuit of quantum-enhanced parameter estimations without the need for nonclassical initial states has long been driven by the goal of achieving experimentally accessible quantum metrology. In this paper, employing a coherent averaging mechanism, we prove that the prototypical cavity-quantum electrodynamics (QED) system, such as the Tavis-Cummings (TC) model, enables us to achieve not only the Heisenberg scaling (HS) precision in terms of the average photon number but also the double-HS sensitivity concerning both the average photon and atom numbers. Such a double sensibility can be experimentally realized by introducing either photon- or atom-number fluctuations through quantum squeezing. Furthermore, we discuss the methodology to achieve this double-HS precision in a realistic experimental circumstance where the squeezing is not perfect. Our results provide insights into understanding the coherent averaging mechanism for evaluating quantum-enhanced precision measurements and also present a usable metrological application of the cavity QED systems.
△ Less
Submitted 8 March, 2024;
originally announced March 2024.
-
Model-Free Load Frequency Control of Nonlinear Power Systems Based on Deep Reinforcement Learning
Authors:
Xiaodi Chen,
Meng Zhang,
Zhengguang Wu,
Ligang Wu,
Xiaohong Guan
Abstract:
Load frequency control (LFC) is widely employed in power systems to stabilize frequency fluctuation and guarantee power quality. However, most existing LFC methods rely on accurate power system modeling and usually ignore the nonlinear characteristics of the system, limiting controllers' performance. To solve these problems, this paper proposes a model-free LFC method for nonlinear power systems b…
▽ More
Load frequency control (LFC) is widely employed in power systems to stabilize frequency fluctuation and guarantee power quality. However, most existing LFC methods rely on accurate power system modeling and usually ignore the nonlinear characteristics of the system, limiting controllers' performance. To solve these problems, this paper proposes a model-free LFC method for nonlinear power systems based on deep deterministic policy gradient (DDPG) framework. The proposed method establishes an emulator network to emulate power system dynamics. After defining the action-value function, the emulator network is applied for control actions evaluation instead of the critic network. Then the actor network controller is effectively optimized by estimating the policy gradient based on zeroth-order optimization (ZOO) and backpropagation algorithm. Simulation results and corresponding comparisons demonstrate the designed controller can generate appropriate control actions and has strong adaptability for nonlinear power systems.
△ Less
Submitted 7 March, 2024;
originally announced March 2024.
-
OAG-Bench: A Human-Curated Benchmark for Academic Graph Mining
Authors:
Fanjin Zhang,
Shijie Shi,
Yifan Zhu,
Bo Chen,
Yukuo Cen,
Jifan Yu,
Yelin Chen,
Lulu Wang,
Qingfei Zhao,
Yuqing Cheng,
Tianyi Han,
Yuwei An,
Dan Zhang,
Weng Lam Tam,
Kun Cao,
Yunhe Pang,
Xinyu Guan,
Huihui Yuan,
Jian Song,
Xiaoyan Li,
Yuxiao Dong,
Jie Tang
Abstract:
With the rapid proliferation of scientific literature, versatile academic knowledge services increasingly rely on comprehensive academic graph mining. Despite the availability of public academic graphs, benchmarks, and datasets, these resources often fall short in multi-aspect and fine-grained annotations, are constrained to specific task types and domains, or lack underlying real academic graphs.…
▽ More
With the rapid proliferation of scientific literature, versatile academic knowledge services increasingly rely on comprehensive academic graph mining. Despite the availability of public academic graphs, benchmarks, and datasets, these resources often fall short in multi-aspect and fine-grained annotations, are constrained to specific task types and domains, or lack underlying real academic graphs. In this paper, we present OAG-Bench, a comprehensive, multi-aspect, and fine-grained human-curated benchmark based on the Open Academic Graph (OAG). OAG-Bench covers 10 tasks, 20 datasets, 70+ baselines, and 120+ experimental results to date. We propose new data annotation strategies for certain tasks and offer a suite of data pre-processing codes, algorithm implementations, and standardized evaluation protocols to facilitate academic graph mining. Extensive experiments reveal that even advanced algorithms like large language models (LLMs) encounter difficulties in addressing key challenges in certain tasks, such as paper source tracing and scholar profiling. We also introduce the Open Academic Graph Challenge (OAG-Challenge) to encourage community input and sharing. We envisage that OAG-Bench can serve as a common ground for the community to evaluate and compare algorithms in academic graph mining, thereby accelerating algorithm development and advancement in this field. OAG-Bench is accessible at https://www.aminer.cn/data/.
△ Less
Submitted 20 June, 2024; v1 submitted 24 February, 2024;
originally announced February 2024.
-
Enhancing Source Code Representations for Deep Learning with Static Analysis
Authors:
Xueting Guan,
Christoph Treude
Abstract:
Deep learning techniques applied to program analysis tasks such as code classification, summarization, and bug detection have seen widespread interest. Traditional approaches, however, treat programming source code as natural language text, which may neglect significant structural or semantic details. Additionally, most current methods of representing source code focus solely on the code, without…
▽ More
Deep learning techniques applied to program analysis tasks such as code classification, summarization, and bug detection have seen widespread interest. Traditional approaches, however, treat programming source code as natural language text, which may neglect significant structural or semantic details. Additionally, most current methods of representing source code focus solely on the code, without considering beneficial additional context. This paper explores the integration of static analysis and additional context such as bug reports and design patterns into source code representations for deep learning models. We use the Abstract Syntax Tree-based Neural Network (ASTNN) method and augment it with additional context information obtained from bug reports and design patterns, creating an enriched source code representation that significantly enhances the performance of common software engineering tasks such as code classification and code clone detection. Utilizing existing open-source code data, our approach improves the representation and processing of source code, thereby improving task performance.
△ Less
Submitted 14 February, 2024;
originally announced February 2024.
-
On the connected coalition number
Authors:
Xiaxia Guan,
Maoqun Wang
Abstract:
For a graph $G=(V,E)$, a pair of vertex disjoint sets $A_{1}$ and $A_{2}$ form a connected coalition of $G$, if $A_{1}\cup A_{2}$ is a connected dominating set, but neither $A_{1}$ nor $A_{2}$ is a connected dominating set. A connected coalition partition of $G$ is a partition $Φ$ of $V(G)$ such that each set in $Φ$ either consists of only a singe vertex with the degree $|V(G)|-1$, or forms a conn…
▽ More
For a graph $G=(V,E)$, a pair of vertex disjoint sets $A_{1}$ and $A_{2}$ form a connected coalition of $G$, if $A_{1}\cup A_{2}$ is a connected dominating set, but neither $A_{1}$ nor $A_{2}$ is a connected dominating set. A connected coalition partition of $G$ is a partition $Φ$ of $V(G)$ such that each set in $Φ$ either consists of only a singe vertex with the degree $|V(G)|-1$, or forms a connected coalition of $G$ with another set in $Φ$. The connected coalition number of $G$, denoted by $CC(G)$, is the largest possible size of a connected coalition partition of $G$. In this paper, we characterize graphs that satisfy $CC(G)=2$. Moreover, we obtain the connected coalition number for unicycle graphs and for the corona product and join of two graphs. Finally, we give a lower bound on the connected coalition number of the Cartesian product and the lexicographic product of two graphs.
△ Less
Submitted 1 February, 2024;
originally announced February 2024.
-
Muffin or Chihuahua? Challenging Multimodal Large Language Models with Multipanel VQA
Authors:
Yue Fan,
Jing Gu,
Kaiwen Zhou,
Qianqi Yan,
Shan Jiang,
Ching-Chen Kuo,
Xinze Guan,
Xin Eric Wang
Abstract:
Multipanel images, commonly seen as web screenshots, posters, etc., pervade our daily lives. These images, characterized by their composition of multiple subfigures in distinct layouts, effectively convey information to people. Toward building advanced multimodal AI applications, such as agents that understand complex scenes and navigate through webpages, the skill of multipanel visual reasoning i…
▽ More
Multipanel images, commonly seen as web screenshots, posters, etc., pervade our daily lives. These images, characterized by their composition of multiple subfigures in distinct layouts, effectively convey information to people. Toward building advanced multimodal AI applications, such as agents that understand complex scenes and navigate through webpages, the skill of multipanel visual reasoning is essential, and a comprehensive evaluation of models in this regard is important. Therefore, we introduce Multipanel Visual Question Answering (MultipanelVQA), a novel benchmark comprising 6,600 triplets of questions, answers, and multipanel images that specifically challenge models in comprehending multipanel images. Our evaluation shows that questions in the MultipanelVQA benchmark pose significant challenges to the state-of-the-art Multimodal Large Language Models (MLLMs) tested, even though humans can attain approximately 99% accuracy on these questions. Distinctively, the MultipanelVQA benchmark features synthetically generated multipanel images specifically crafted to isolate and assess the impact of various factors, such as the layout, on MLLMs' multipanel image comprehension abilities. As a result, in addition to benchmarking the capabilities of MLLMs in understanding multipanel images, we analyze various factors of the multipanel image that affect MLLMs' performance with synthetic data and offer insights for enhancement. Code and data are released at https://sites.google.com/view/multipanelvqa/home.
△ Less
Submitted 27 June, 2024; v1 submitted 28 January, 2024;
originally announced January 2024.
-
Relations between near-field enhancements and Purcell factors in hybrid nanostructures of plasmonic antennas and dielectric cavities
Authors:
Xu-Tao Tang,
Lin Ma,
Yue You,
Xiao-Jing Du,
Hua Qiu,
Xi-Hua Guan,
Jun He,
Zhong-Jian Yang
Abstract:
Strong near-field enhancements (NFEs) of nanophotonic structures are believed to be closely related to high Purcell factors (FP). Here, we theoretically show that the correlation is partially correct; the extinction cross section (σ) response is also critical in determining FP. The divergence between NFE and FP is especially pronounced in plasmonic-dielectric hybrid systems, where the plasmonic an…
▽ More
Strong near-field enhancements (NFEs) of nanophotonic structures are believed to be closely related to high Purcell factors (FP). Here, we theoretically show that the correlation is partially correct; the extinction cross section (σ) response is also critical in determining FP. The divergence between NFE and FP is especially pronounced in plasmonic-dielectric hybrid systems, where the plasmonic antenna supports dipolar plasmon modes and the dielectric cavity hosts Mie-like resonances. The cavity's enhanced-field environment can boost the antenna's NFEs, but the FP is not increased concurrently due to the larger effective σ that is intrinsic to the FP calculations. Interestingly, the peak FP for the coupled system can be predicted by using the NFE and σ responses. Furthermore, the limits for FP of coupled systems are considered; they are determined by the sum of the FP of a redshifted (or modified, if applicable) antenna and an individual cavity. This contrasts starkly with the behavior of NFE which is closely associated with the multiplicative effects of the NFEs provided by the antenna and the dielectric cavity. The differing behaviors of NFE and FP in hybrid cavities have varied impacts on relevant nanophotonic applications such as fluorescence, Raman scattering and enhanced light-matter interactions.
△ Less
Submitted 14 January, 2024;
originally announced January 2024.
-
Adaptive Regularized Low-Rank Tensor Decomposition for Hyperspectral Image Denoising and Destriping
Authors:
Dongyi Li,
Dong Chu,
Xiaobin Guan,
Wei He,
Huanfeng Shen
Abstract:
Hyperspectral images (HSIs) are inevitably degraded by a mixture of various types of noise, such as Gaussian noise, impulse noise, stripe noise, and dead pixels, which greatly limits the subsequent applications. Although various denoising methods have already been developed, accurately recovering the spatial-spectral structure of HSIs remains a challenging problem to be addressed. Furthermore, ser…
▽ More
Hyperspectral images (HSIs) are inevitably degraded by a mixture of various types of noise, such as Gaussian noise, impulse noise, stripe noise, and dead pixels, which greatly limits the subsequent applications. Although various denoising methods have already been developed, accurately recovering the spatial-spectral structure of HSIs remains a challenging problem to be addressed. Furthermore, serious stripe noise, which is common in real HSIs, is still not fully separated by the previous models. In this paper, we propose an adaptive hyperLaplacian regularized low-rank tensor decomposition (LRTDAHL) method for HSI denoising and destriping. On the one hand, the stripe noise is separately modeled by the tensor decomposition, which can effectively encode the spatial-spectral correlation of the stripe noise. On the other hand, adaptive hyper-Laplacian spatial-spectral regularization is introduced to represent the distribution structure of different HSI gradient data by adaptively estimating the optimal hyper-Laplacian parameter, which can reduce the spatial information loss and over-smoothing caused by the previous total variation regularization. The proposed model is solved using the alternating direction method of multipliers (ADMM) algorithm. Extensive simulation and real-data experiments all demonstrate the effectiveness and superiority of the proposed method.
△ Less
Submitted 11 January, 2024;
originally announced January 2024.
-
Catalyst-free MBE growth of PbSnTe nanowires with tunable aspect ratio
Authors:
Mathijs G. C. Mientjes,
Xin Guan,
Pim J. H. Lueb,
Marcel A. Verheijen,
Erik P. A. M. Bakkers
Abstract:
Topological crystalline insulators (TCIs) are interesting for their topological surface states, which hold great promise for scattering-free transport channels and fault-tolerant quantum computing. A promising TCI is SnTe. However, Sn-vacancies form in SnTe, causing a high hole density, hindering topological transport from the surface being measured. This issue could be relieved by using nanowires…
▽ More
Topological crystalline insulators (TCIs) are interesting for their topological surface states, which hold great promise for scattering-free transport channels and fault-tolerant quantum computing. A promising TCI is SnTe. However, Sn-vacancies form in SnTe, causing a high hole density, hindering topological transport from the surface being measured. This issue could be relieved by using nanowires with a high surface-to-volume ratio. Furthermore, SnTe can be alloyed with Pb reducing the Sn-vacancies while maintaining its topological phase. Here we present the catalyst-free growth of monocrystalline PbSnTe in molecular beam epitaxy (MBE). By the addition of a pre-deposition stage before the growth, we have control over the nucleation phase and thereby increase the nanowire yield. This facilitates tuning the nanowire aspect ratio by a factor of four by varying the growth parameters. These results allow us to grow specific morphologies for future transport experiments to probe the topological surface states in a Pb1-xSnxTe-based platform.
△ Less
Submitted 10 January, 2024;
originally announced January 2024.
-
Distributionally Robust Frequency-Constrained Microgrid Scheduling Towards Seamless Islanding
Authors:
Lun Yang,
Haoxiang Yang,
Xiaoyu Cao,
Xiaohong Guan
Abstract:
Unscheduled islanding events of microgrids result in the transition between grid-connected and islanded modes and induce a sudden and unknown power imbalance, posing a threat to frequency security. To achieve seamless islanding, we propose a distributionally robust frequency-constrained microgrid scheduling model considering unscheduled islanding events. This model co-optimizes unit commitments, p…
▽ More
Unscheduled islanding events of microgrids result in the transition between grid-connected and islanded modes and induce a sudden and unknown power imbalance, posing a threat to frequency security. To achieve seamless islanding, we propose a distributionally robust frequency-constrained microgrid scheduling model considering unscheduled islanding events. This model co-optimizes unit commitments, power dispatch, upward/downward primary frequency response reserves, virtual inertia provisions from renewable energy sources (RESs), deloading ratios of RESs, and battery operations, while ensuring the system frequency security during unscheduled islanding. We establish an affine relationship between the actual power exchange and RES uncertainty in grid-connected mode, describe RES uncertainty with a Wasserstein-metric ambiguity set, and formulate frequency constraints under uncertain post-islanding power imbalance as distributionally robust quadratic chance constraints, which are further transformed by a tight conic relaxation. We solve the proposed mixed-integer convex program and demonstrate its effectiveness through case studies.
△ Less
Submitted 6 January, 2024;
originally announced January 2024.
-
A novel boundary-integral algorithm for nonlinear unsteady surface and interfacial waves
Authors:
Xin Guan,
Jean-Marc Vanden-Broeck
Abstract:
We devise a new time-stepping algorithm for two-dimensional nonlinear unsteady surface and interfacial waves. The algorithm uses Cauchy's integral formula, which only requires information on the interface, to solve Laplace equation by using iterative techniques. We derive Eulerian and mixed Eulerian-Lagrangian descriptions by using arclength to parameterize the interface which is updated through i…
▽ More
We devise a new time-stepping algorithm for two-dimensional nonlinear unsteady surface and interfacial waves. The algorithm uses Cauchy's integral formula, which only requires information on the interface, to solve Laplace equation by using iterative techniques. We derive Eulerian and mixed Eulerian-Lagrangian descriptions by using arclength to parameterize the interface which is updated through its inclination angle and velocity potential at each time step. The algorithm shows broad applicability and excellent numerical accuracy in various numerical simulations, including wave breaking, collisions of solitary waves, vortex roll-up, etc. We especially focus on the stability of symmetric interfacial gravity-capillary solitary waves in deep water. Linear stability analysis is performed using a new formulation which possesses excellent numerical efficiency and robustness. It is shown that the depression/elevation solitary waves are linearly stable/unstable except the portion where the monotonicity of energy curve changes firstly. These results are supported by our fully nonlinear simulations, especially the head-on collisions of solitary waves.
△ Less
Submitted 20 December, 2023;
originally announced December 2023.
-
Joint Trading and Scheduling among Coupled Carbon-Electricity-Heat-Gas Industrial Clusters
Authors:
Dafeng Zhu,
Bo Yang,
Yu Wu,
Haoran Deng,
Zhaoyang Dong,
Kai Ma,
Xinping Guan
Abstract:
This paper presents a carbon-energy coupling management framework for an industrial park, where the carbon flow model accompanying multi-energy flows is adopted to track and suppress carbon emissions on the user side. To deal with the quadratic constraint of gas flows, a bound tightening algorithm for constraints relaxation is adopted. The synergies among the carbon capture, energy storage, power-…
▽ More
This paper presents a carbon-energy coupling management framework for an industrial park, where the carbon flow model accompanying multi-energy flows is adopted to track and suppress carbon emissions on the user side. To deal with the quadratic constraint of gas flows, a bound tightening algorithm for constraints relaxation is adopted. The synergies among the carbon capture, energy storage, power-to-gas further consume renewable energy and reduce carbon emissions. Aiming at carbon emissions disparities and supply-demand imbalances, this paper proposes a carbon trading ladder reward and punishment mechanism and an energy trading and scheduling method based on Lyapunov optimization and matching game to maximize the long-term benefits of each industrial cluster without knowing the prior information of random variables. Case studies show that our proposed trading method can reduce overall costs and carbon emissions while relieving energy pressure, which is important for Environmental, Social and Governance (ESG).
△ Less
Submitted 20 December, 2023;
originally announced December 2023.
-
Enlighten-Your-Voice: When Multimodal Meets Zero-shot Low-light Image Enhancement
Authors:
Xiaofeng Zhang,
Zishan Xu,
Hao Tang,
Chaochen Gu,
Wei Chen,
Shanying Zhu,
Xinping Guan
Abstract:
Low-light image enhancement is a crucial visual task, and many unsupervised methods tend to overlook the degradation of visible information in low-light scenes, which adversely affects the fusion of complementary information and hinders the generation of satisfactory results. To address this, our study introduces "Enlighten-Your-Voice", a multimodal enhancement framework that innovatively enriches…
▽ More
Low-light image enhancement is a crucial visual task, and many unsupervised methods tend to overlook the degradation of visible information in low-light scenes, which adversely affects the fusion of complementary information and hinders the generation of satisfactory results. To address this, our study introduces "Enlighten-Your-Voice", a multimodal enhancement framework that innovatively enriches user interaction through voice and textual commands. This approach does not merely signify a technical leap but also represents a paradigm shift in user engagement. Our model is equipped with a Dual Collaborative Attention Module (DCAM) that meticulously caters to distinct content and color discrepancies, thereby facilitating nuanced enhancements. Complementarily, we introduce a Semantic Feature Fusion (SFM) plug-and-play module that synergizes semantic context with low-light enhancement operations, sharpening the algorithm's efficacy. Crucially, "Enlighten-Your-Voice" showcases remarkable generalization in unsupervised zero-shot scenarios. The source code can be accessed from https://github.com/zhangbaijin/Enlighten-Your-Voice
△ Less
Submitted 1 February, 2024; v1 submitted 15 December, 2023;
originally announced December 2023.
-
The FAST all sky HI survey (FASHI): The first release of catalog
Authors:
Chuan-Peng Zhang,
M. Zhu,
P. Jiang,
C. Cheng,
J. Wang,
J. Wang,
J. -L. Xu,
X. -L. Liu,
N. -P. Yu,
L. Qian,
H. Yu,
M. Ai,
Y. Jing,
C. Xu,
Z. Liu,
X. Guan,
C. Sun,
Q. Yang,
M. Huang,
Q. Hao,
FAST Collaboration
Abstract:
The FAST All Sky HI survey (FASHI) was designed to cover the entire sky observable by the Five-hundred-meter Aperture Spherical radio Telescope (FAST), spanning approximately 22000 square degrees of declination between -14 deg and +66 deg, and in the frequency range of 1050-1450 MHz, with the expectation of eventually detecting more than 100000 HI sources. Between August 2020 and June 2023, FASHI…
▽ More
The FAST All Sky HI survey (FASHI) was designed to cover the entire sky observable by the Five-hundred-meter Aperture Spherical radio Telescope (FAST), spanning approximately 22000 square degrees of declination between -14 deg and +66 deg, and in the frequency range of 1050-1450 MHz, with the expectation of eventually detecting more than 100000 HI sources. Between August 2020 and June 2023, FASHI had covered more than 7600 square degrees, which is approximately 35% of the total sky observable by FAST. It has a median detection sensitivity of around 0.76 mJy/beam and a spectral line velocity resolution of ~6.4 km/s at a frequency of ~1.4 GHz. As of now, a total of 41741 extragalactic HI sources have been detected in the frequency range 1305.5-1419.5 MHz, corresponding to a redshift limit of z<0.09. By cross-matching FASHI sources with the Siena Galaxy Atlas (SGA) and the Sloan Digital Sky Survey (SDSS) catalogs, we found that 16972 (40.7%) sources have spectroscopic redshifts and 10975 (26.3%) sources have only photometric redshifts. Most of the remaining 13794 (33.0%) HI sources are located in the direction of the Galactic plane, making their optical counterparts difficult to identify due to high extinction or high contamination of Galactic stellar sources. Based on current survey results, the FASHI survey is an unprecedented blind extragalactic HI survey. It has higher spectral and spatial resolution and broader coverage than the Arecibo Legacy Fast ALFA Survey (ALFALFA). When completed, FASHI will provide the largest extragalactic HI catalog and an objective view of HI content and large-scale structure in the local universe.
△ Less
Submitted 10 December, 2023;
originally announced December 2023.
-
Accelerating Level-Value Adjustment for the Polyak Stepsize
Authors:
Anbang Liu,
Mikhail A. Bragin,
Xi Chen,
Xiaohong Guan
Abstract:
The Polyak stepsize formula has been widely used for convex optimization. However, stepsize computations require the generally unknown optimal value. Dynamic estimations of the optimal value are thus usually needed. In this paper, guided by a decision-based procedure through a novel easy-to-solve ``Polyak Stepsize Violation Detector'' (PSVD) linear constraint satisfaction problem, a series of leve…
▽ More
The Polyak stepsize formula has been widely used for convex optimization. However, stepsize computations require the generally unknown optimal value. Dynamic estimations of the optimal value are thus usually needed. In this paper, guided by a decision-based procedure through a novel easy-to-solve ``Polyak Stepsize Violation Detector'' (PSVD) linear constraint satisfaction problem, a series of level values is constructed to successively estimate the optimal value to guarantee convergence for subgradient as well as for approximate subgradient methods. Through a series of empirical tests of convex optimization problems with diverse characteristics, we illustrate the practical advantages of our approach as compared to existing methods.
△ Less
Submitted 14 January, 2024; v1 submitted 30 November, 2023;
originally announced November 2023.
-
65 GOPS/neuron Photonic Tensor Core with Thin-film Lithium Niobate Photonics
Authors:
Zhongjin Lin,
Bhavin J. Shastri,
Shangxuan Yu,
Jingxiang Song,
Yuntao Zhu,
Arman Safarnejadian,
Wangning Cai,
Yanmei Lin,
Wei Ke,
Mustafa Hammood,
Tianye Wang,
Mengyue Xu,
Zibo Zheng,
Mohammed Al-Qadasi,
Omid Esmaeeli,
Mohamed Rahim,
Grzegorz Pakulski,
Jens Schmid,
Pedro Barrios,
Weihong Jiang,
Hugh Morison,
Matthew Mitchell,
Xiaogang Qiang,
Xun Guan,
Nicolas A. F. Jaeger
, et al. (6 additional authors not shown)
Abstract:
Photonics offers a transformative approach to artificial intelligence (AI) and neuromorphic computing by providing low latency, high bandwidth, and energy-efficient computations. Here, we introduce a photonic tensor core processor enabled by time-multiplexed inputs and charge-integrated outputs. This fully integrated processor, comprising only two thin-film lithium niobate (TFLN) modulators, a III…
▽ More
Photonics offers a transformative approach to artificial intelligence (AI) and neuromorphic computing by providing low latency, high bandwidth, and energy-efficient computations. Here, we introduce a photonic tensor core processor enabled by time-multiplexed inputs and charge-integrated outputs. This fully integrated processor, comprising only two thin-film lithium niobate (TFLN) modulators, a III-V laser, and a charge-integration photoreceiver, can implement an entire layer of a neural network. It can execute 65 billion operations per second (GOPS) per neuron, including simultaneous weight updates-a hitherto unachieved speed. Our processor stands out from conventional photonic processors, which have static weights set during training, as it supports fast "hardware-in-the-loop" training, and can dynamically adjust the inputs (fan-in) and outputs (fan-out) within a layer, thereby enhancing its versatility. Our processor can perform large-scale dot-product operations with vector dimensions up to 131,072. Furthermore, it successfully classifies (supervised learning) and clusters (unsupervised learning) 112*112-pixel images after "hardware-in-the-loop" training. To handle "hardware-in-the-loop" training for clustering AI tasks, we provide a solution for multiplications involving two negative numbers based on our processor.
△ Less
Submitted 30 November, 2023; v1 submitted 28 November, 2023;
originally announced November 2023.
-
Spin-Orbital Coupling in All-Inorganic Metal-Halide Perovskites: the Hidden Force that Matters
Authors:
Pradeep Raja Anandan,
Muhammad Nadeem,
Chun-Ho Lin,
Simrjit Singh,
Xinwei Guan,
Jiyun Kim,
Shamim Shahroki,
Md Zahidur Rahaman,
Xun Geng,
Jing-Kai Huang,
Hien Nguyen,
Hanlin Hu,
Pankaj Sharma,
Jan Seidel,
Xiaolin Wang,
Tom Wu
Abstract:
Highlighted with improved long-term thermal and environmental stability, all-inorganic metal halide perovskites exhibit tunable physical properties, cost-effective synthesis, and satisfactory optoelectronic performance, attracting increasing research interests worldwide. However, a less explored feature of these materials is their strong spin-orbit coupling (SOC), which is the hidden force influen…
▽ More
Highlighted with improved long-term thermal and environmental stability, all-inorganic metal halide perovskites exhibit tunable physical properties, cost-effective synthesis, and satisfactory optoelectronic performance, attracting increasing research interests worldwide. However, a less explored feature of these materials is their strong spin-orbit coupling (SOC), which is the hidden force influencing not only band structure but also properties including magnetoresistance, spin lifetime and singlet-triplet splitting. This review provides an overview of the fundamental aspects and the latest progress of the SOC and debate regarding Rashba effects in all-inorganic metal halide perovskites, providing critical insights into the physical phenomena and potential applications. Meanwhile, crystal structures and photophysics of all-inorganic perovskite are discussed in the context of SOC, along with the related experimental and characterization techniques. Furthermore, a recent understanding of the band topology in the all-inorganic halide perovskites is introduced to push the boundary even further for the novel applications of all-inorganic halide perovskites. Finally, an outlook is given on the potential directions of breakthroughs via leveraging the SOC in halide perovskites.
△ Less
Submitted 28 November, 2023;
originally announced November 2023.
-
Liquid-shaped microlens for scalable production of ultrahigh-resolution OCT microendoscope
Authors:
Chao Xu,
Xin Guan,
Syeda Aimen Abbasi,
Neng Xia,
To Ngai,
Li Zhang,
Ho-Pui Ho,
Sze Hang Calvin Ng,
Wu Yuan
Abstract:
Endoscopic optical coherence tomography (OCT) is a valuable tool for providing diagnostic images of internal organs and guiding interventions in real time. Miniaturized OCT endoscopes are essential for imaging small and convoluted luminal organs while minimizing invasiveness. However, current methods for fabricating miniature fiber probes have limited ability to correct optical aberrations, leadin…
▽ More
Endoscopic optical coherence tomography (OCT) is a valuable tool for providing diagnostic images of internal organs and guiding interventions in real time. Miniaturized OCT endoscopes are essential for imaging small and convoluted luminal organs while minimizing invasiveness. However, current methods for fabricating miniature fiber probes have limited ability to correct optical aberrations, leading to suboptimal imaging performance. In this study, we introduce a new paradigm of liquid shaping technique for the rapid and scalable fabrication of ultrathin and high-performance OCT microendoscopes suitable for minimally invasive clinical applications. This technique enables the flexible customization of freeform microlenses with sub-nanometer optical surface roughness by regulating the minimum energy state of curable optical liquid on a wettability-modified substrate and precisely controlling the liquid volume and physical boundary on a substrate. Using this technique, we simultaneously fabricated 800-nm OCT microendoscopes with a diameter of approximately 0.6 mm and evaluated their ultrahigh-resolution imaging performance in the esophagus of rats and the aorta and brain of mice.
△ Less
Submitted 27 November, 2023;
originally announced November 2023.
-
Mitigating Large Language Model Hallucinations via Autonomous Knowledge Graph-based Retrofitting
Authors:
Xinyan Guan,
Yanjiang Liu,
Hongyu Lin,
Yaojie Lu,
Ben He,
Xianpei Han,
Le Sun
Abstract:
Incorporating factual knowledge in knowledge graph is regarded as a promising approach for mitigating the hallucination of large language models (LLMs). Existing methods usually only use the user's input to query the knowledge graph, thus failing to address the factual hallucination generated by LLMs during its reasoning process. To address this problem, this paper proposes Knowledge Graph-based R…
▽ More
Incorporating factual knowledge in knowledge graph is regarded as a promising approach for mitigating the hallucination of large language models (LLMs). Existing methods usually only use the user's input to query the knowledge graph, thus failing to address the factual hallucination generated by LLMs during its reasoning process. To address this problem, this paper proposes Knowledge Graph-based Retrofitting (KGR), a new framework that incorporates LLMs with KGs to mitigate factual hallucination during the reasoning process by retrofitting the initial draft responses of LLMs based on the factual knowledge stored in KGs. Specifically, KGR leverages LLMs to extract, select, validate, and retrofit factual statements within the model-generated responses, which enables an autonomous knowledge verifying and refining procedure without any additional manual efforts. Experiments show that KGR can significantly improve the performance of LLMs on factual QA benchmarks especially when involving complex reasoning processes, which demonstrates the necessity and effectiveness of KGR in mitigating hallucination and enhancing the reliability of LLMs.
△ Less
Submitted 22 November, 2023;
originally announced November 2023.
-
Resilient Clock Synchronization Architecture for Industrial Time-Sensitive Networking
Authors:
Yafei Sun,
Qimin Xu,
Cailian Chen,
Xinping Guan
Abstract:
Time-Sensitive Networking (TSN) is a promising industrial Internet of Things technology. Clock synchronization provides unified time reference, which is critical to the deterministic communication of TSN. However, changes in internal network status and external work environments of devices both degrade practical synchronization performance. This paper proposes a temperature-resilient architecture…
▽ More
Time-Sensitive Networking (TSN) is a promising industrial Internet of Things technology. Clock synchronization provides unified time reference, which is critical to the deterministic communication of TSN. However, changes in internal network status and external work environments of devices both degrade practical synchronization performance. This paper proposes a temperature-resilient architecture considering delay asymmetry (TACD) to enhance the timing accuracy under the impacts of internal delay and external thermal changes. In TACD, an anti-delay-asymmetry method is developed, which employs a partial variational Bayesian algorithm to promote adaptability to non-stationary delay variation. An optimized skew estimator is further proposed, fusing the temperature skew model for ambiance perception with the traditional linear clock model to compensate for nonlinear error caused by temperature changes. Theoretical derivation of skew estimation lower bound proves the promotion of optimal accuracy after the fusion of clock models. Evaluations based on measured delay data demonstrate accuracy advantages regardless of internal or external influences.
△ Less
Submitted 4 October, 2023;
originally announced October 2023.
-
A deletion-contraction formula and monotonicity properties for the polymatroid Tutte polynomial
Authors:
Xiaxia Guan,
Xian'an Jin,
Tamás Kálmán
Abstract:
The Tutte polynomial is a crucial invariant of matroids. The polymatroid Tutte polynomial $\mathscr{T}_{P}(x,y)$, introduced by Bernardi et al., is an extension of the classical Tutte polynomial from matroids to polymatroids $P$. In this paper, we first obtain a deletion-contraction formula for $\mathscr{T}_{P}(x,y)$. Then we prove two natural monotonicity properties, for containment and for minor…
▽ More
The Tutte polynomial is a crucial invariant of matroids. The polymatroid Tutte polynomial $\mathscr{T}_{P}(x,y)$, introduced by Bernardi et al., is an extension of the classical Tutte polynomial from matroids to polymatroids $P$. In this paper, we first obtain a deletion-contraction formula for $\mathscr{T}_{P}(x,y)$. Then we prove two natural monotonicity properties, for containment and for minors of the interior polynomial $x^{n}\mathscr{T}_{P}(x^{-1},1)$ and the exterior polynomial $y^{n}\mathscr{T}_{P}(1,y^{-1})$, for polymatroids $P$ over $[n]$. We show by a counter-example that these monotonicity properties do not extend to $\mathscr{T}_{P}(x,y)$. Using deletion-contraction, we obtain formulas for the coefficients of terms of degree $n-1$ in $\mathscr{T}_{P}(x,y)$. Finally, for all $k\geq 0$, we characterize hypergraphs $\mathcal{H}=(V,E)$ so that the coefficient of $y^{k}$ in the exterior polynomial of the associated polymatroid $P_{\mathcal{H}}$ attains its maximal value $\binom{|V|+k-2}{k}$.
△ Less
Submitted 24 September, 2023;
originally announced September 2023.