-
COMET: "Cone of experience" enhanced large multimodal model for mathematical problem generation
Authors:
Sannyuya Liu,
Jintian Feng,
Zongkai Yang,
Yawei Luo,
Qian Wan,
Xiaoxuan Shen,
Jianwen Sun
Abstract:
The automatic generation of high-quality mathematical problems is practically valuable in many educational scenarios. Large multimodal models provide a novel technical approach to mathematical problem generation because of their wide success in cross-modal data scenarios. However, the traditional practice of separating problem solving from problem generation, together with the mainstream fine-tuning framework of monotonous data structures and homogeneous training objectives, limits the application of large multimodal models to mathematical problem generation. To address these challenges, this paper proposes COMET, a "Cone of Experience" enhanced large multimodal model for mathematical problem generation. First, from the perspective of mutual ability promotion and application logic, we unify stem generation and problem solving into mathematical problem generation. Second, a three-stage fine-tuning framework guided by the "Cone of Experience" is proposed. The framework divides the fine-tuning data into symbolic experience, iconic experience, and direct experience to draw parallels with experiences in the career growth of teachers. Several fine-grained data construction and injection methods are designed in this framework. Finally, we construct a Chinese multimodal mathematical problem dataset to fill the gap in Chinese multimodal data in this field. Combining objective and subjective indicators, experiments on multiple datasets verify the effectiveness of the proposed framework and model.
Submitted 15 July, 2024;
originally announced July 2024.
-
Modified hybrid inflation in no-scale SUGRA with suppressed $R$-symmetry breaking
Authors:
Qian Wan,
Da-Xin Zhang
Abstract:
A well-motivated cosmological hybrid inflation scenario based on no-scale SUGRA is considered. It is demonstrated that an extra suppressed $R$-symmetry breaking term $S^n$ with $n\geq 4$ needs to be included in order to realize successful inflation. The resulting potential is found to be similar (but not identical) to the one in the Starobinsky inflation model. A relatively larger tensor-to-scalar ratio $r\sim 10^{-2}$ and a spectral index $n_s\approx 0.965$ are obtained, which are approximately independent of $n$.
Submitted 8 July, 2024;
originally announced July 2024.
-
Full Iso-recursive Types
Authors:
Litao Zhou,
Qianyong Wan,
Bruno C. d. S. Oliveira
Abstract:
There are two well-known formulations of recursive types: iso-recursive and equi-recursive types. Abadi and Fiore [1996] have shown that iso- and equi-recursive types have the same expressive power. However, their encoding of equi-recursive types in terms of iso-recursive types requires explicit coercions. These coercions come with significant additional computational overhead, and complicate reasoning about the equivalence of the two formulations of recursive types.
This paper proposes a generalization of iso-recursive types called full iso-recursive types. Full iso-recursive types allow encoding all programs with equi-recursive types without computational overhead. Instead of explicit term coercions, all type transformations are captured by computationally irrelevant casts, which can be erased at runtime without affecting the semantics of the program. Consequently, reasoning about the equivalence between the two approaches can be greatly simplified. We present a calculus called $λ^μ_{Fi}$, which extends the simply typed lambda calculus (STLC) with full iso-recursive types. The $λ^μ_{Fi}$ calculus is proved to be type sound, and shown to have the same expressive power as a calculus with equi-recursive types. We also extend our results to subtyping, and show that equi-recursive subtyping can be expressed in terms of iso-recursive subtyping with cast operators.
Submitted 7 July, 2024; v1 submitted 30 June, 2024;
originally announced July 2024.
-
Image anomaly detection and prediction scheme based on SSA optimized ResNet50-BiGRU model
Authors:
Qianhui Wan,
Zecheng Zhang,
Liheng Jiang,
Zhaoqi Wang,
Yan Zhou
Abstract:
Image anomaly detection is a popular research direction, with many methods emerging in recent years due to rapid advancements in computing. The use of artificial intelligence for image anomaly detection has been widely studied. By analyzing images of athlete posture and movement, it is possible to predict injury status and suggest necessary adjustments. Most existing methods rely on convolutional networks to extract information from irrelevant pixel data, limiting model accuracy. This paper introduces a network combining Residual Network (ResNet) and Bidirectional Gated Recurrent Unit (BiGRU), which can predict potential injury types and provide early warnings by analyzing changes in muscle and bone poses from video images. To address the high complexity of this network, the Sparrow search algorithm was used for optimization. Experiments conducted on four datasets demonstrated that our model has the smallest error in image anomaly detection compared to other models, showing strong adaptability. This provides a new approach for anomaly detection and predictive analysis in images, contributing to the sustainable development of human health and performance.
Submitted 20 June, 2024; v1 submitted 20 June, 2024;
originally announced June 2024.
-
Bose-Einstein condensation of polaritons at room temperature in a GaAs/AlGaAs structure
Authors:
Hassan Alnatah,
Qi Yao,
Qiaochu Wan,
Jonathan Beaumariage,
Ken West,
Kirk Baldwin,
Loren N. Pfeiffer,
David W. Snoke
Abstract:
We report the canonical properties of Bose-Einstein condensation of polaritons, seen previously in many low-temperature experiments, at room temperature in a GaAs/AlGaAs structure. These effects include a nonlinear energy shift of the polaritons, showing that they are not non-interacting photons, and dramatic line narrowing due to coherence, giving coherent emission with spectral width of 0.24 meV at room temperature with no external stabilization. This opens up the possibility of room temperature nonlinear optical devices based on polariton condensation.
Submitted 19 June, 2024;
originally announced June 2024.
-
Simulation of DAMPE silicon microstrip detectors in the $\rm Allpix^{2}$ framework
Authors:
Yu-Xin Cui,
Xiang Li,
Shen Wang,
Chuan Yue,
Qiang Wan,
Shi-Jun Lei,
Guan-Wen Yuan,
Yi-Ming Hu,
Jia-Ju Wei,
Jian-Hua Guo
Abstract:
Silicon strip detectors have been widely utilized in space experiments for gamma-ray and cosmic-ray detections thanks to their high spatial resolution and stable performance. For a silicon micro-strip detector, the Monte Carlo simulation is recognized as a practical and cost-effective approach to verify the detector performance. In this study, a technique for the simulation of the silicon micro-strip detector with the $\rm Allpix^{2}$ framework is developed. By incorporating the electric field into the particle transport simulation based on Geant4, this framework could precisely emulate the carrier drift in the silicon micro-strip detector. The simulation results are validated using the beam test data as well as the flight data of the DAMPE experiment, which suggests that the $\rm Allpix^{2}$ framework is a powerful tool to obtain the performance of the silicon micro-strip detector.
Submitted 3 June, 2024;
originally announced June 2024.
-
PTA: Enhancing Multimodal Sentiment Analysis through Pipelined Prediction and Translation-based Alignment
Authors:
Shezheng Song,
Shasha Li,
Shan Zhao,
Chengyu Wang,
Xiaopeng Li,
Jie Yu,
Qian Wan,
Jun Ma,
Tianwei Yan,
Wentao Ma,
Xiaoguang Mao
Abstract:
Multimodal aspect-based sentiment analysis (MABSA) aims to understand opinions in a granular manner, advancing human-computer interaction and other fields. Traditionally, MABSA methods use a joint prediction approach to identify aspects and sentiments simultaneously. However, we argue that joint models are not always superior. Our analysis shows that joint models struggle to align relevant text tokens with image patches, leading to misalignment and ineffective image utilization.
In contrast, a pipeline framework first identifies aspects through MATE (Multimodal Aspect Term Extraction) and then aligns these aspects with image patches for sentiment classification (MASC: Multimodal Aspect-Oriented Sentiment Classification). This method is better suited for multimodal scenarios where effective image use is crucial. We present three key observations: (a) MATE and MASC have different feature requirements, with MATE focusing on token-level features and MASC on sequence-level features; (b) the aspect identified by MATE is crucial for effective image utilization; and (c) images play a trivial role in previous MABSA methods due to high noise.
Based on these observations, we propose a pipeline framework that first predicts the aspect and then uses translation-based alignment (TBA) to enhance multimodal semantic consistency for better image utilization. Our method achieves state-of-the-art (SOTA) performance on widely used MABSA datasets Twitter-15 and Twitter-17. This demonstrates the effectiveness of the pipeline approach and its potential to provide valuable insights for future MABSA research.
For reproducibility, the code and checkpoint will be released.
Submitted 13 June, 2024; v1 submitted 22 May, 2024;
originally announced June 2024.
-
DEGAP: Dual Event-Guided Adaptive Prefixes for Templated-Based Event Argument Extraction with Slot Querying
Authors:
Guanghui Wang,
Dexi Liu,
Jian-Yun Nie,
Qizhi Wan,
Rong Hu,
Xiping Liu,
Wanlong Liu,
Jiaming Liu
Abstract:
Recent advancements in event argument extraction (EAE) involve incorporating useful auxiliary information into models during training and inference, such as retrieved instances and event templates. These methods face two challenges: (1) the retrieval results may be irrelevant and (2) templates are developed independently for each event without considering their possible relationship. In this work, we propose DEGAP to address these challenges through simple yet effective components: dual prefixes, i.e., learnable prompt vectors, where the instance-oriented prefix and the template-oriented prefix are trained to learn information from different event instances and templates. Additionally, we propose an event-guided adaptive gating mechanism, which can adaptively leverage possible connections between different events and thus capture relevant information from the prefixes. Finally, these event-guided prefixes provide relevant information as cues to the EAE model without retrieval. Extensive experiments demonstrate that our method achieves new state-of-the-art performance on four datasets (ACE05, RAMS, WIKIEVENTS, and MLEE). Further analysis shows the impact of different components.
Submitted 15 June, 2024; v1 submitted 21 May, 2024;
originally announced May 2024.
-
A point cloud processing method of mmWave radar over automotive scenario
Authors:
Qingmian Wan,
Hongli Peng,
Xing Liao,
Kuayue Liu
Abstract:
This paper introduces in detail the effective method of comprehensive target judgment by using radar RA map and point cloud map. Different output of radar can effectively judge the road boundary of target and the relative coordinates of target, avoid the error of output caused by excessive processing information, and greatly improve the processing efficiency of DBSCAN of the measured target.
Submitted 23 March, 2024;
originally announced April 2024.
-
Quantum simulation of honeycomb lattice model by high-order moiré pattern
Authors:
Qiang Wan,
Chunlong Wu,
Xun-Jiang Luo,
Shenghao Dai,
Cao Peng,
Renzhe Li,
Shangkun Mo,
Keming Zhao,
Wen-Xuan Qiu,
Hao Zhong,
Yiwei Li,
Chendong Zhang,
Fengcheng Wu,
Nan Xu
Abstract:
Moiré superlattices have become an emergent solid-state platform for simulating quantum lattice models. However, in a single moiré device, Hamiltonian parameters such as the lattice constant, hopping, and interaction terms can hardly be manipulated, limiting the controllability and accessibility of moiré quantum simulators. Here, by combining angle-resolved photoemission spectroscopy and theoretical analysis, we demonstrate that high-order moiré patterns in graphene-monolayered xenon/krypton heterostructures can simulate the honeycomb model at the mesoscale, with in-situ tunable Hamiltonian parameters. The length scale of the simulated lattice constant can be tuned by annealing processes, which adjust the intervalley interaction and hopping parameters of the simulated honeycomb lattice in situ. The sign of the lattice constant can be switched by choosing a xenon or krypton monolayer deposited on graphene, which controls the sublattice degree of freedom and the valley arrangement of Dirac fermions. Our work establishes a novel path for experimentally simulating the honeycomb model with tunable parameters by high-order moiré patterns.
Submitted 18 April, 2024;
originally announced April 2024.
-
YOLOOC: YOLO-based Open-Class Incremental Object Detection with Novel Class Discovery
Authors:
Qian Wan,
Xiang Xiang,
Qinhao Zhou
Abstract:
Because of its practical use, open-world object detection (OWOD) has attracted considerable attention recently. The challenge is how a model can detect novel classes and then incrementally learn them without forgetting previously known classes. Previous approaches hinge on strongly-supervised or weakly-supervised novel-class data for novel-class detection, which may not apply to real applications. We construct a new benchmark in which novel classes are only encountered at the inference stage. We also propose a new OWOD detector, YOLOOC, based on the YOLO architecture yet designed for the Open-Class setup. We introduce label smoothing to prevent the detector from over-confidently mapping novel classes to known classes and to discover novel classes. Extensive experiments conducted on our more realistic setup demonstrate the effectiveness of our method for discovering novel classes in our new benchmark.
Submitted 22 April, 2024; v1 submitted 30 March, 2024;
originally announced April 2024.
-
Surface region band enhancement in noble gas adsorption assisted ARPES on kagome superconductor RbV3Sb5
Authors:
Cao Peng,
Yiwei Li,
Xu Chen,
Shenghao Dai,
Zewen Wu,
Chunlong Wu,
Qiang Wan,
Keming Zhao,
Renzhe Li,
Shangkun Mo,
Dingkun Qin,
Shuming Yu,
Hao Zhong,
Shengjun Yuan,
Jiangang Guo,
Nan Xu
Abstract:
Electronic states near surface regions can be distinct from bulk states, which are paramount in understanding various physical phenomena occurring at surfaces and in applications in semiconductors, energy, and catalysis. Here, we report an abnormal surface region band enhancement effect in angle-resolved photoemission spectroscopy on kagome superconductor RbV3Sb5, by depositing noble gases with fine control. In contrast to conventional surface contamination, the intensity of surface region Sb band can be enhanced more than three times with noble gas adsorption. In the meantime, a hole-dope effect is observed for the enhanced surface region band, with other bands hardly changing. The doping effect is more pronounced with heavier noble gases. We propose that noble gas atoms selectively fill into alkali metal vacancy sites on the surface, which improves the surface condition, boosts surface region bands, and effectively dopes it with the Pauli repulsion mechanism. Our results provide a novel and reversible way to improve surface conditions and tune surface region bands by controlled surface noble gas deposition.
Submitted 17 March, 2024;
originally announced March 2024.
-
Metamorpheus: Interactive, Affective, and Creative Dream Narration Through Metaphorical Visual Storytelling
Authors:
Qian Wan,
Xin Feng,
Yining Bei,
Zhiqi Gao,
Zhicong Lu
Abstract:
Human emotions are essentially molded by lived experiences, from which we construct personalised meaning. Engagement in such a meaning-making process has been practiced as an intervention in various psychotherapies to promote wellness. Nevertheless, supporting the recollection and recounting of lived experiences in everyday life remains underexplored in HCI. It also remains unknown how technologies such as generative AI models can facilitate the meaning-making process, and ultimately support affective mindfulness. In this paper we present Metamorpheus, an affective interface that engages users in a creative visual storytelling of emotional experiences during dreams. Metamorpheus arranges the storyline based on a dream's emotional arc, and provokes self-reflection through the creation of metaphorical images and text depictions. The system provides metaphor suggestions, and generates visual metaphors and text depictions using generative AI models, while users can apply generations to recolour and re-arrange the interface to be visually affective. Our experience-centred evaluation manifests that, by interacting with Metamorpheus, users can recall their dreams in vivid detail, through which they relive and reflect upon their experiences in a meaningful way.
Submitted 1 March, 2024;
originally announced March 2024.
-
A Point Cloud Enhancement Method for 4D mmWave Radar Imagery
Authors:
Qingmian Wan,
Hongli Peng,
Xing Liao,
Kuayue Liu,
Junfa Mao
Abstract:
A point cloud enhancement method for 4D mmWave radar imagery is proposed in this paper. Based on the patch antenna and MIMO array theories, the MIMO array with small redundancy and high SNR is designed to provide the probability of high angular resolution and detection rate. The antenna array is deployed using a ladder shape in the vertical direction to decrease the redundancy and improve the resolution in the horizontal direction under the constraints of physical factors. Considering the complicated environment of the real world with non-uniformly distributed clutter, the dynamic detection method is used to solve the weak target sensing problem. The window size of the CFAR detector is assumed to be variable and is determined using an optimization method, making it adaptive to different environments, especially when weak targets exist. The angular resolution increase obtained with the FT-based DOA method and the designed antenna array is described, which provides the basis of accurate detection and a dense point cloud. To verify the performance of the proposed method, simulations and practical measurements are carried out, whose results show that the accuracy and the point cloud density are improved in comparison with the original manufacturer mmWave radar of TI AWR2243.
Submitted 29 January, 2024;
originally announced January 2024.
-
High Resolution Image Quality Database
Authors:
Huang Huang,
Qiang Wan,
Jari Korhonen
Abstract:
With technology for digital photography and high resolution displays rapidly evolving and gaining popularity, there is a growing demand for blind image quality assessment (BIQA) models for high resolution images. Unfortunately, the publicly available large scale image quality databases used for training BIQA models contain mostly low or general resolution images. Since image resizing affects image quality, we assume that the accuracy of BIQA models trained on low resolution images would not be optimal for high resolution images. Therefore, we created a new high resolution image quality database (HRIQ), consisting of 1120 images with resolution of 2880x2160 pixels. We conducted a subjective study to collect the subjective quality ratings for HRIQ in a controlled laboratory setting, resulting in accurate MOS at high resolution. To demonstrate the importance of a high resolution image quality database for training BIQA models to predict mean opinion scores (MOS) of high resolution images accurately, we trained and tested several traditional and deep learning based BIQA methods on different resolution versions of our database. The database is publicly available in https://github.com/jarikorhonen/hriq.
Submitted 29 January, 2024;
originally announced January 2024.
-
Harnessing Diffusion Models for Visual Perception with Meta Prompts
Authors:
Qiang Wan,
Zilong Huang,
Bingyi Kang,
Jiashi Feng,
Li Zhang
Abstract:
The issue of generative pretraining for vision models has persisted as a long-standing conundrum. At present, the text-to-image (T2I) diffusion model demonstrates remarkable proficiency in generating high-definition images matching textual inputs, a feat made possible through its pre-training on large-scale image-text pairs. This leads to a natural inquiry: can diffusion models be utilized to tackle visual perception tasks? In this paper, we propose a simple yet effective scheme to harness a diffusion model for visual perception tasks. Our key insight is to introduce learnable embeddings (meta prompts) to the pre-trained diffusion models to extract proper features for perception. The effect of meta prompts is two-fold. First, as a direct replacement of the text embeddings in the T2I models, they can activate task-relevant features during feature extraction. Second, they are used to re-arrange the extracted features to ensure that the model focuses on the most pertinent features for the task at hand. Additionally, we design a recurrent refinement training strategy that fully leverages the property of diffusion models, thereby yielding stronger visual features. Extensive experiments across various benchmarks validate the effectiveness of our approach. Our approach achieves new performance records in depth estimation tasks on NYU depth V2 and KITTI, and in the semantic segmentation task on CityScapes. Concurrently, the proposed method attains results comparable to the current state-of-the-art in semantic segmentation on ADE20K and pose estimation on COCO datasets, further exemplifying its robustness and versatility.
Submitted 22 December, 2023;
originally announced December 2023.
-
Optimal energy storage in the Tavis-Cummings quantum battery
Authors:
Hui-Yu Yang,
Hai-Long Shi,
Qing-Kun Wan,
Kun Zhang,
Xiao-Hui Wang,
Wen-Li Yang
Abstract:
The Tavis-Cummings (TC) model, which serves as a natural physical realization of a quantum battery, comprises $N_b$ atoms as battery cells that collectively interact with a shared photon field, functioning as the charger, initially containing $n_0$ photons. In this study, we introduce the invariant subspace method to effectively represent the quantum dynamics of the TC battery. Our findings indicate that in the limiting case of $n_0\!\gg\! N_b$ or $N_b\!\gg\! n_0$, a distinct SU(2) symmetry emerges in the dynamics, thereby ensuring the realization of optimal energy storage. We also establish a negative relationship between the battery-charger entanglement and the energy storage capacity. As a result, we demonstrate that the asymptotically optimal energy storage can be achieved in the scenario where $N_b\!=\!n_0\!\gg\! 1$. Our approach not only enhances our comprehension of the algebraic structure inherent in the TC model but also contributes to the broader theoretical framework of quantum batteries. Furthermore, it provides crucial insights into the relation between energy transfer and quantum correlations.
Submitted 8 January, 2024; v1 submitted 20 December, 2023;
originally announced December 2023.
-
Role of Centrosymmetry in the Photophysics of Molecular Aggregates
Authors:
Qingyun Wan,
Chi-Ming Che
Abstract:
To understand the photophysics of molecular aggregates, the exciton model of J- and H-aggregates has been extensively utilized. However, it lacks consideration of crystal symmetry. Although discrete molecules may lack symmetry, their aggregates can exhibit a high degree of symmetry. Herein, we utilized group theory to study the optical properties of centrosymmetric molecular aggregates, showing that their optical selection rules (transition dipole moment and spin-orbit coupling) are determined by the symmetry of singlet and triplet excited states and the intermolecular orbital overlap. Symmetry-forbidden electronic transitions are closely related to ultralong organic phosphorescence. Our model's scope is broad, as over 50% of organic crystals belong to centrosymmetric space groups according to the Cambridge Structural Database.
Submitted 8 October, 2023;
originally announced October 2023.
-
The Agricultural Spraying Vehicle Routing Problem With Splittable Edge Demands
Authors:
Qian Wan,
Rodolfo García-Flores,
Simon A. Bowly,
Philip Kilby,
Andreas T. Ernst
Abstract:
In horticulture, spraying applications occur multiple times throughout any crop year. This paper presents a splittable agricultural chemical spraying vehicle routing problem and formulates it as a mixed integer linear program. The main difference from the classical capacitated arc routing problem (CARP) is that our problem allows the demand on a single demand edge to be split amongst robotic sprayers. We use theoretical insights about the optimal solution structure to improve the formulation and provide two different formulations of the splittable capacitated arc routing problem (SCARP): a basic spray formulation and a large edge demands formulation for problems with large edge demands. This study presents solution methods consisting of lazy constraints, symmetry elimination constraints, and a heuristic repair method. Computational experiments on a data set based on the properties of real-world agricultural orchard fields reveal that the proposed methods can solve the SCARP with different properties. We also report computational results on classical benchmark sets from previous CARP literature. The results indicate that the SCARP model can provide cheaper solutions in some instances when compared with the classical CARP literature. In addition, the heuristic repair method significantly improves the quality of the solution by decreasing the upper bound when solving large-scale problems.
Submitted 29 August, 2023;
originally announced August 2023.
-
Bose condensation of upper-branch exciton-polaritons in a transferrable microcavity
Authors:
Xingzhou Chen,
Hassan Alnatah,
Danqun Mao,
Mengyao Xu,
Qiaochu Wan,
Jonathan Beaumariage,
Wei Xie,
Hongxing Xu,
Zhe-Yu Shi,
David Snoke,
Zheng Sun,
Jian Wu
Abstract:
Exciton-polaritons are composite bosonic quasiparticles arising from the strong coupling of excitonic transitions and optical modes. Exciton-polaritons have triggered wide exploration in the past decades not only due to their rich quantum phenomena such as superfluidity, superconductivity and quantized vortices but also due to their potential applications for unconventional coherent light sources and all-optical control elements. Here, we report the observation of Bose-Einstein condensation of the upper polariton branch in a transferrable WS$_2$ monolayer microcavity. Near the condensation threshold, we observe a nonlinear increase in upper polariton intensity. This sharp increase in intensity is accompanied by a decrease of the linewidth and an increase of the upper polariton temporal coherence, all of which are hallmarks of Bose-Einstein condensation. By simulating the quantum Boltzmann equation, we show that the upper polariton condensation only occurs for a particular range of particle density. We can attribute the creation of Bose condensation of the upper polariton to the following requirements: 1) the upper polariton is more excitonic than the lower one; 2) there is relatively more pumping in the upper branch; and 3) the conversion time from the upper to the lower polariton branch is long compared to the lifetime of the upper polaritons.
Submitted 28 August, 2023;
originally announced August 2023.
-
Investigating VTubing as a Reconstruction of Streamer Self-Presentation: Identity, Performance, and Gender
Authors:
Qian Wan,
Zhicong Lu
Abstract:
VTubers, or Virtual YouTubers, are live streamers who create streaming content using animated 2D or 3D virtual avatars. In recent years, there has been a significant increase in the number of VTuber creators and viewers across the globe. This practice has drawn research attention to topics such as viewers' engagement behaviors and perceptions. However, although animated avatars offer more identity and performance flexibility than traditional live streaming where one uses their own body, little research has focused on how this flexibility influences how creators present themselves. This research thus seeks to fill this gap by presenting results from a qualitative study of 16 Chinese-speaking VTubers' streaming practices. The data revealed that the virtual avatars that were used while live streaming afforded creators opportunities to present themselves using inflated presentations and resulted in inclusive interactions with viewers. The results also unveiled the inflated, and often sexualized, gender expressions of VTubers while they were situated in misogynistic environments. The socio-technical facets of VTubing were found to potentially reduce sexual harassment and sexism, whilst also raising self-objectification concerns.
Submitted 29 February, 2024; v1 submitted 20 July, 2023;
originally announced July 2023.
-
"It Felt Like Having a Second Mind": Investigating Human-AI Co-creativity in Prewriting with Large Language Models
Authors:
Qian Wan,
Siying Hu,
Yu Zhang,
Piaohong Wang,
Bo Wen,
Zhicong Lu
Abstract:
Prewriting is the process of discovering and developing ideas before a first draft, which requires divergent thinking and often implies unstructured strategies such as diagramming, outlining, free-writing, etc. Although large language models (LLMs) have been demonstrated to be useful for a variety of tasks including creative writing, little is known about how users would collaborate with LLMs to support prewriting. The preferred collaborative role and initiative of LLMs during such a creativity process is also unclear. To investigate human-LLM collaboration patterns and dynamics during prewriting, we conducted a three-session qualitative study with 15 participants in two creative tasks: story writing and slogan writing. The findings indicated that during collaborative prewriting, there appears to be a three-stage iterative Human-AI Co-creativity process that includes Ideation, Illumination, and Implementation stages. This collaborative process champions the human in a dominant role, in addition to mixed and shifting levels of initiative that exist between humans and LLMs. This research also reports on collaboration breakdowns that occur during this process, user perceptions of using existing LLMs during Human-AI Co-creativity, and discusses design implications to support this co-creativity process.
Submitted 29 February, 2024; v1 submitted 20 July, 2023;
originally announced July 2023.
-
X-ray metal line emission from the hot circumgalactic medium: probing the effects of supermassive black hole feedback
Authors:
Nhut Truong,
Annalisa Pillepich,
Dylan Nelson,
Ákos Bogdán,
Gerrit Schellenberger,
Priyanka Chakraborty,
William R. Forman,
Ralph Kraft,
Maxim Markevitch,
Anna Ogorzalek,
Benjamin D. Oppenheimer,
Arnab Sarkar,
Sylvain Veilleux,
Mark Vogelsberger,
Q. Daniel Wan,
Norbert Werner,
Irina Zhuravleva,
John Zuhone
Abstract:
We derive predictions from state-of-the-art cosmological galaxy simulations for the spatial distribution of the hot circumgalactic medium (CGM, ${\rm [0.1-1]R_{200c}}$) through its emission lines in the X-ray soft band ($[0.3-1.3]$ keV). In particular, we compare IllustrisTNG, EAGLE, and SIMBA and focus on galaxies with stellar mass $10^{10-11.6}\, {\rm M}_\odot$ at $z=0$. The three simulation models return significantly different surface brightness radial profiles of prominent emission lines from ionized metals such as OVII(f), OVIII, and FeXVII as a function of galaxy mass. Likewise, the three simulations predict varying azimuthal distributions of line emission with respect to the galactic stellar planes, with IllustrisTNG predicting the strongest angular modulation of CGM physical properties at radial range ${\gtrsim0.3-0.5\,R_{200c}}$. This anisotropic signal is more prominent for higher-energy lines, where it can manifest as X-ray eROSITA-like bubbles. Despite different models of stellar and supermassive black hole (SMBH) feedback, the three simulations consistently predict a dichotomy between star-forming and quiescent galaxies at the Milky-Way and Andromeda mass range, where the former are X-ray brighter than the latter. This is a signature of SMBH-driven outflows, which are responsible for quenching star formation. Finally, we explore the prospect of testing these predictions with a microcalorimeter-based X-ray mission concept with a large field-of-view. Such a mission would probe the extended hot CGM via soft X-ray line emission, determine the physical properties of the CGM, including temperature, from the measurement of line ratios, and provide critical constraints on the efficiency and impact of SMBH feedback on the CGM.
Submitted 26 August, 2023; v1 submitted 3 July, 2023;
originally announced July 2023.
-
Token-Event-Role Structure-based Multi-Channel Document-Level Event Extraction
Authors:
Qizhi Wan,
Changxuan Wan,
Keli Xiao,
Hui Xiong,
Dexi Liu,
Xiping Liu
Abstract:
Document-level event extraction is a long-standing challenging information retrieval problem involving a sequence of sub-tasks: entity extraction, event type judgment, and event type-specific multi-event extraction. However, addressing the problem as multiple learning tasks leads to increased model complexity. Also, existing methods insufficiently utilize the correlation of entities crossing different events, resulting in limited event extraction performance. This paper introduces a novel framework for document-level event extraction, incorporating a new data structure called token-event-role and a multi-channel argument role prediction module. The proposed data structure enables our model to uncover the primary role of tokens in multiple events, facilitating a more comprehensive understanding of event relationships. By leveraging the multi-channel prediction module, we transform entity and multi-event extraction into a single task of predicting token-event pairs, thereby reducing the overall parameter size and enhancing model efficiency. The results demonstrate that our approach outperforms the state-of-the-art method by 9.5 percentage points in terms of the F1 score, highlighting its superior performance in event extraction. Furthermore, an ablation study confirms the significant value of the proposed data structure in improving event extraction tasks, further validating its importance in enhancing the overall performance of the framework.
Submitted 30 June, 2023;
originally announced June 2023.
-
Inactivated COVID-19 Vaccination did not affect In vitro fertilization (IVF) / Intra-Cytoplasmic Sperm Injection (ICSI) cycle outcomes
Authors:
Qi Wan,
Ying Ling Yao,
XingYu Lv,
Li Hong Geng,
Yue Wang,
Enoch Appiah Adu-Gyamfi,
Xue Jiao Wang,
Yue Qian,
Juan Yang,
Ming Xing Chend,
Zhao Hui Zhong,
Yuan Li,
Yu Bin Ding
Abstract:
Background: The objective of this study is to evaluate the impact of COVID-19 inactivated vaccine administration on the outcomes of in vitro fertilization (IVF) and intracytoplasmic sperm injection (ICSI) cycles in infertile couples in China. Methods: We collected data from the CYART prospective cohort, which included couples undergoing IVF treatment from January 2021 to September 2022 at Sichuan Jinxin Xinan Women & Children's Hospital. Based on whether they received vaccination before ovarian stimulation, the couples were divided into the vaccination group and the non-vaccination group. We compared the laboratory parameters and pregnancy outcomes between the two groups. Findings: After performing propensity score matching (PSM), the analysis demonstrated similar clinical pregnancy rates, biochemical pregnancy and ongoing pregnancy rates between vaccinated and unvaccinated women. No significant disparities were found in terms of embryo development and laboratory parameters among the groups. Moreover, male vaccination had no impact on patient performance or pregnancy outcomes in assisted reproductive technology treatments. Additionally, there were no significant differences observed in the effects of vaccination on embryo development and pregnancy outcomes among couples undergoing ART. Interpretation: The findings suggest that COVID-19 vaccination did not have a significant effect on patients undergoing IVF/ICSI with fresh embryo transfer. Therefore, it is recommended that couples should receive COVID-19 vaccination as scheduled to help mitigate the COVID-19 pandemic.
Submitted 13 June, 2023;
originally announced June 2023.
-
Quantum-Enhanced Metrology in Cavity Magnomechanics
Authors:
Qing-Kun Wan,
Hai-Long Shi,
Xi-Wen Guan
Abstract:
Magnons, as fundamental quasiparticles emerging from elementary spin excitations, hold great promise for innovating quantum technologies in information coding and processing. Here we discover subtle roles of entanglement in a metrological scheme based on an experimentally feasible cavity magnomechanical system, where the magnons are responsible for sensing a weak magnetic field whereas the cavity field carries out a precision measurement of the weak field. By establishing exact relations between the Fisher information and entanglement, we show that for the weak coupling case the measurement precision can reach the Heisenberg limit, whereas quantum criticality enables us to enhance measurement precision for the strong coupling case. In particular, we also find that the entanglement between magnons and photons is of crucial importance during the dynamical encoding process, but the presence of such entanglement in the measurement process dramatically reduces the final measurement precision.
Submitted 19 January, 2024; v1 submitted 13 May, 2023;
originally announced May 2023.
-
MassNet: A Deep Learning Approach for Body Weight Extraction from A Single Pressure Image
Authors:
Ziyu Wu,
Quan Wan,
Mingjie Zhao,
Yi Ke,
Yiran Fang,
Zhen Liang,
Fangting Xie,
Jingyuan Cheng
Abstract:
Body weight, as an essential physiological trait, is of considerable significance in many applications like body management, rehabilitation, and drug dosing for patient-specific treatments. Previous works on the body weight estimation task are mainly vision-based, using 2D/3D, depth, or infrared images, facing problems in illumination, occlusions, and especially privacy issues. The pressure mapping mattress is a non-invasive and privacy-preserving tool to obtain the pressure distribution image over the bed surface, which strongly correlates with the body weight of the lying person. To extract the body weight from this image, we propose a deep learning-based model, including a dual-branch network to extract the deep features and pose features respectively. A contrastive learning module is also combined with the deep-feature branch to help mine the mutual factors across different postures of every single subject. The two groups of features are then concatenated for the body weight regression task. To test the model's performance over different hardware and posture settings, we create a pressure image dataset of 10 subjects and 23 postures, using a self-made pressure-sensing bedsheet. This dataset, which is made public together with this paper, and an existing public dataset are used for validation. The results show that our model outperforms the state-of-the-art algorithms on both datasets. Our research constitutes an important step toward fully automatic weight estimation in both clinical and at-home practice. Our dataset is available for research purposes at: https://github.com/USTCWzy/MassEstimation.
Submitted 17 March, 2023;
originally announced March 2023.
-
SeaFormer++: Squeeze-enhanced Axial Transformer for Mobile Visual Recognition
Authors:
Qiang Wan,
Zilong Huang,
Jiachen Lu,
Gang Yu,
Li Zhang
Abstract:
Since the introduction of Vision Transformers, the landscape of many computer vision tasks (e.g., semantic segmentation), which had been overwhelmingly dominated by CNNs, has recently been significantly revolutionized. However, the computational cost and memory requirements render these methods unsuitable for mobile devices. In this paper, we introduce a new method, the squeeze-enhanced Axial Transformer (SeaFormer), for mobile visual recognition. Specifically, we design a generic attention block characterized by the formulation of squeeze Axial and detail enhancement. It can be further used to create a family of backbone architectures with superior cost-effectiveness. Coupled with a light segmentation head, we achieve the best trade-off between segmentation accuracy and latency on ARM-based mobile devices on the ADE20K, Cityscapes, Pascal Context and COCO-Stuff datasets. Critically, we beat both the mobile-friendly rivals and Transformer-based counterparts with better performance and lower latency without bells and whistles. Furthermore, we incorporate a feature upsampling-based multi-resolution distillation technique, further reducing the inference latency of the proposed framework. Beyond semantic segmentation, we further apply the proposed SeaFormer architecture to image classification and object detection problems, demonstrating the potential of serving as a versatile mobile-friendly backbone. Our code and models are made publicly available at https://github.com/fudan-zvg/SeaFormer.
Submitted 17 June, 2024; v1 submitted 30 January, 2023;
originally announced January 2023.
-
An extended study on the supersymmetric SO(10) models with natural doublet-triplet splitting
Authors:
Qian Wan,
Da-Xin Zhang
Abstract:
In the supersymmetric SO(10) models, the doublet-triplet splitting problem can be solved through the Dimopoulos-Wilczek mechanism. This mechanism is extended in the non-renormalizable version. Improvement on the realistic model is also made.
Submitted 25 April, 2023; v1 submitted 4 October, 2022;
originally announced October 2022.
-
QbyE-MLPMixer: Query-by-Example Open-Vocabulary Keyword Spotting using MLPMixer
Authors:
Jinmiao Huang,
Waseem Gharbieh,
Qianhui Wan,
Han Suk Shim,
Chul Lee
Abstract:
Current keyword spotting systems are typically trained with a large amount of pre-defined keywords. Recognizing keywords in an open-vocabulary setting is essential for personalizing smart device interaction. Towards this goal, we propose a pure MLP-based neural network that is based on MLPMixer - an MLP model architecture that effectively replaces the attention mechanism in Vision Transformers. We investigate different ways of adapting the MLPMixer architecture to the QbyE open-vocabulary keyword spotting task. Comparisons with the state-of-the-art RNN and CNN models show that our method achieves better performance in challenging situations (10dB and 6dB environments) on both the publicly available Hey-Snips dataset and a larger scale internal dataset with 400 speakers. Our proposed model also has a smaller number of parameters and MACs compared to the baseline models.
Submitted 23 June, 2022;
originally announced June 2022.
-
Entanglement, Coherence, and Extractable Work in Quantum Batteries
Authors:
Hai-Long Shi,
Shu Ding,
Qing-Kun Wan,
Xiao-Hui Wang,
Wen-Li Yang
Abstract:
We investigate the connection between quantum resources and extractable work in quantum batteries. We demonstrate that quantum coherence in the battery or the battery-charger entanglement is a necessary resource for generating nonzero extractable work during the charging process. At the end of the charging process, we also establish a tight link of coherence and entanglement with the final extractable work: coherence naturally promotes the coherent work while coherence and entanglement inhibit the incoherent work. We also show that obtaining maximally coherent work is faster than obtaining maximally incoherent work. Examples ranging from the central-spin battery and the Tavis-Cummings battery to the spin-chain battery are given to illustrate these results.
Submitted 23 September, 2022; v1 submitted 23 May, 2022;
originally announced May 2022.
-
Cross Pairwise Ranking for Unbiased Item Recommendation
Authors:
Qi Wan,
Xiangnan He,
Xiang Wang,
Jiancan Wu,
Wei Guo,
Ruiming Tang
Abstract:
Most recommender systems optimize the model on observed interaction data, which is affected by the previous exposure mechanism and exhibits many biases like popularity bias. The loss functions, such as the most commonly used pointwise Binary Cross-Entropy and pairwise Bayesian Personalized Ranking, are not designed to consider the biases in observed data. As a result, the model optimized on the loss would inherit the data biases, or even worse, amplify the biases. For example, a few popular items take up more and more exposure opportunities, severely hurting the recommendation quality on niche items, which is known as the notorious Matthew effect. In this work, we develop a new learning paradigm named Cross Pairwise Ranking (CPR) that achieves unbiased recommendation without knowing the exposure mechanism. Distinct from inverse propensity scoring (IPS), we change the loss term of a sample: we sample multiple observed interactions at once and form the loss as the combination of their predictions. We prove in theory that this offsets the influence of user/item propensity on the learning, removing the influence of data biases caused by the exposure mechanism. In contrast to IPS, our proposed CPR ensures unbiased learning for each training instance without the need to set the propensity scores. Experimental results demonstrate the superiority of CPR over state-of-the-art debiasing solutions in both model generalization and training efficiency. The code is available at https://github.com/Qcactus/CPR.
Submitted 26 April, 2022;
originally announced April 2022.
-
Tailoring Dirac fermions by in-situ tunable high-order moire pattern in graphene-monolayer xenon heterostructure
Authors:
Chunlong Wu,
Qiang Wan,
Cao Peng,
Shangkun Mo,
Renzhe Li,
Keming Zhao,
Yanping Guo,
Shengjun Yuan,
Fengcheng Wu,
Chendong Zhang,
Nan Xu
Abstract:
A variety of novel quantum phases have been achieved in twisted bilayer graphene (tBLG) and other moire superlattices recently, including correlated insulators, superconductivity, magnetism, and topological states. These phenomena are very sensitive to the moire superlattices, which can hardly be changed rapidly or intensely. Here, we report the experimental realization of a high-order moire pattern (a high-order interference pattern) in a graphene-monolayer xenon heterostructure (G/mXe), with the moire period tuned in situ from a few nanometers to infinity by changing the lattice constant of Xe through different annealing temperatures and pressures. We use angle-resolved photoemission spectroscopy to directly observe that replicas of the graphene Dirac cone emerge and move close to each other in momentum space as the moire pattern continuously expands in real space. When the moire period approaches infinity, the replicas finally overlap with each other and an energy gap is observed at the Dirac point induced by intervalley coupling, which is a manifestation of Kekule distortion. We construct a continuum moire Hamiltonian, which can explain the experimental results well. The form of the moire Hamiltonian in G/mXe is similar to that in tBLG, and a moire band with narrow bandwidth is predicted in G/mXe. However, the moire Hamiltonian couples Dirac fermions from different valleys in G/mXe, instead of ones from different layers in tBLG. Our work demonstrates a novel platform to study the continuous evolution of the moire pattern and its modulation effect on electronic structure, and provides an unprecedented approach for tailoring Dirac fermions with tunable intervalley coupling.
Submitted 17 March, 2022;
originally announced March 2022.
-
Building time-surfaces by exploiting the complex volatility of an ECRAM memristor
Authors:
Marco Rasetto,
Qingzhou Wan,
Himanshu Akolkar,
Feng Xiong,
Bertram Shi,
Ryad Benosman
Abstract:
Memristors have emerged as a promising technology for efficient neuromorphic architectures owing to their ability to act as programmable synapses, combining processing and memory into a single device. Although they are most commonly used for static encoding of synaptic weights, recent work has begun to investigate the use of their dynamical properties, such as Short Term Plasticity (STP), to integrate events over time in event-based architectures. However, we are still far from completely understanding the range of possible behaviors and how they might be exploited in neuromorphic computation. This work focuses on a newly developed Li$_x$WO$_3$-based three-terminal memristor that exhibits tunable STP and a conductance response modeled by a double exponential decay. We derive a stochastic model of the device from experimental data and investigate how device stochasticity, STP, and the double exponential decay affect accuracy in a hierarchy of time-surfaces (HOTS) architecture. We found that the device's stochasticity does not affect accuracy, that STP can reduce the effect of salt and pepper noise in signals from event-based sensors, and that the double exponential decay improves accuracy by integrating temporal information over multiple time scales. Our approach can be generalized to study other memristive devices to build a better understanding of how control over temporal dynamics can enable neuromorphic engineers to fine-tune devices and architectures to fit their problems at hand.
Submitted 15 April, 2024; v1 submitted 29 January, 2022;
originally announced January 2022.
-
Real-Time Computer-Generated EIA for Light Field Display by Pre-Calculating and Pre-Storing the Invariable Voxel-Pixel Mapping
Authors:
Quanzhen Wan
Abstract:
Generating the elemental image array (EIA) for light field display, especially integral imaging light field display, has relied on a virtual camera array, novel sampling algorithms, high-performance hardware, or correspondingly complex algorithms, which hinders its application. Without sacrificing accuracy and precision, we develop a novel set of algorithms to achieve video-level EIA generation. The invariable voxel-to-pixel relationship is pre-calculated and pre-stored as a lookup table or mapping. With this lookup table, the voxel array can be quickly mapped to an EIA without depending on any high-end hardware.
△ Less
Submitted 27 April, 2022; v1 submitted 17 January, 2022;
originally announced January 2022.
-
A Real-Time Rendering Method for Light Field Display
Authors:
Quanzhen Wan
Abstract:
A real-time elemental image array (EIA) generation method that neither sacrifices accuracy nor relies on high-performance hardware is developed, using ray tracing and a pre-stored voxel-pixel lookup table (LUT). The method benefits from both offline and online workflows, and experiments will verify its effectiveness.
Submitted 27 April, 2022; v1 submitted 20 January, 2022;
originally announced January 2022.
-
Ferrotronics for the creation of band gaps in Graphene
Authors:
Qifang Wan,
Zhuocong Xiao,
Ahmed Kursumovic,
Judith L. MacManus-Driscoll,
Colm Durkan
Abstract:
We experimentally demonstrate a simple graphene/ferroelectric device, termed a Ferrotronic (electronic effect from ferroelectric) device, in which the band structure of single-layer graphene is modified. The device architecture consists of graphene deposited on a ferroelectric substrate which encodes a periodic surface potential achieved through domain engineering. This structure takes advantage of the nature of conduction through graphene to modulate the Fermi velocity of the charge carriers by the variations in surface potential, leading to the emergence of energy mini-bands and a band gap at the superlattice Brillouin zone boundary. Our work represents a simple route to building circuits whose functionality is controlled by the underlying substrate.
Submitted 14 December, 2021;
originally announced December 2021.
-
Assistive Tele-op: Leveraging Transformers to Collect Robotic Task Demonstrations
Authors:
Henry M. Clever,
Ankur Handa,
Hammad Mazhar,
Kevin Parker,
Omer Shapira,
Qian Wan,
Yashraj Narang,
Iretiayo Akinola,
Maya Cakmak,
Dieter Fox
Abstract:
Sharing autonomy between robots and human operators could facilitate data collection of robotic task demonstrations to continuously improve learned models. Yet, the means to communicate intent and reason about the future are disparate between humans and robots. We present Assistive Tele-op, a virtual reality (VR) system for collecting robot task demonstrations that displays an autonomous trajectory forecast to communicate the robot's intent. As the robot moves, the user can switch between autonomous and manual control when desired. This allows users to collect task demonstrations with both a high success rate and with greater ease than manual teleoperation systems. Our system is powered by transformers, which can provide a window of potential states and actions far into the future -- with almost no added computation time. A key insight is that human intent can be injected at any location within the transformer sequence if the user decides that the model-predicted actions are inappropriate. At every time step, the user can (1) do nothing and allow autonomous operation to continue while observing the robot's future plan sequence, or (2) take over and momentarily prescribe a different set of actions to nudge the model back on track. We host the videos and other supplementary material at https://sites.google.com/view/assistive-teleop.
Submitted 9 December, 2021;
originally announced December 2021.
-
A Computational Efficient Maximum Likelihood Direct Position Determination Approach for Multiple Emitters Using Angle and Doppler Measurements
Authors:
Ziqiang Wang,
Yimao Sun,
Qun Wan,
Lei Xie,
Ning Liu
Abstract:
Emitter localization is widely applied in the military and civilian fields. In this paper, we tackle the problem of position estimation for multiple stationary emitters using Doppler frequency shifts and angles by moving receivers. The computational load for the exhaustive maximum likelihood (ML) direct position determination (DPD) search is insufferable. Based on Pincus' theorem and the importance sampling (IS) concept, we propose a novel non-iterative ML DPD method. The proposed method transforms the original multidimensional grid search into random variable generation with multiple low-dimensional pseudo-probability density functions (PDFs), and the circular mean is used for superior position estimation performance. The computational complexity of the proposed method is modest, and the off-grid problem that most existing DPD techniques face is significantly alleviated. Moreover, it can be implemented in parallel separately. Simulation results demonstrate that the proposed ML DPD estimator can achieve better estimation accuracy than state-of-the-art DPD techniques. With a reasonable parameter choice, the estimation performance of the proposed technique is very close to the Cramér-Rao lower bound (CRLB), even in the adverse conditions of low signal-to-noise ratio (SNR) levels.
Submitted 3 December, 2021;
originally announced December 2021.
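The abstract above mentions the circular mean without defining it. The sketch below shows only the generic textbook definition: average the unit vectors of the angles and take the direction of the result. How the paper applies it to its importance-sampling draws is not stated in the abstract, so the function name and the sample values here are assumptions for illustration.

```python
import math

def circular_mean(angles_rad):
    """Mean direction of angles given in radians, returned in [-pi, pi]."""
    s = sum(math.sin(a) for a in angles_rad)
    c = sum(math.cos(a) for a in angles_rad)
    return math.atan2(s, c)

# Two bearings on either side of 0 degrees: the arithmetic mean would give
# 180 degrees, while the circular mean stays close to 0.
samples = [math.radians(350.0), math.radians(10.0)]
print(math.degrees(circular_mean(samples)))
```

Averaging on the unit circle avoids the wrap-around error that an ordinary mean makes when samples straddle the 0/360 degree boundary.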
-
Coarse-To-Fine Incremental Few-Shot Learning
Authors:
Xiang Xiang,
Yuwen Tan,
Qian Wan,
Jing Ma
Abstract:
Different from fine-tuning models pre-trained on a large-scale dataset of preset classes, class-incremental learning (CIL) aims to recognize novel classes over time without forgetting pre-trained classes. However, a given model will be challenged by test images with finer-grained classes, e.g., a basenji is at most recognized as a dog. Such images form a new training set (i.e., support set) so that the incremental model is hoped to recognize a basenji (i.e., query) as a basenji next time. This paper formulates such a hybrid natural problem of coarse-to-fine few-shot (C2FS) recognition as a CIL problem named C2FSCIL, and proposes a simple, effective, and theoretically-sound strategy Knowe: to learn, normalize, and freeze a classifier's weights from fine labels, once learning an embedding space contrastively from coarse labels. Besides, as CIL aims at a stability-plasticity balance, new overall performance metrics are proposed. In that sense, on CIFAR-100, BREEDS, and tieredImageNet, Knowe outperforms all recent relevant CIL/FSCIL methods that are tailored to the new problem setting for the first time.
Submitted 24 November, 2021;
originally announced November 2021.
-
Shift-BNN: Highly-Efficient Probabilistic Bayesian Neural Network Training via Memory-Friendly Pattern Retrieving
Authors:
Qiyu Wan,
Haojun Xia,
Xingyao Zhang,
Lening Wang,
Shuaiwen Leon Song,
Xin Fu
Abstract:
Bayesian Neural Networks (BNNs) that possess a property of uncertainty estimation have been increasingly adopted in a wide range of safety-critical AI applications which demand reliable and robust decision making, e.g., self-driving, rescue robots, medical image diagnosis. The training procedure of a probabilistic BNN model involves training an ensemble of sampled DNN models, which induces orders of magnitude larger volume of data movement than training a single DNN model. In this paper, we reveal that the root cause for BNN training inefficiency originates from the massive off-chip data transfer by Gaussian Random Variables (GRVs). To tackle this challenge, we propose a novel design that eliminates all the off-chip data transfer by GRVs through the reversed shifting of Linear Feedback Shift Registers (LFSRs) without incurring any training accuracy loss. To efficiently support our LFSR reversion strategy at the hardware level, we explore the design space of the current DNN accelerators and identify the optimal computation mapping scheme to best accommodate our strategy. By leveraging this finding, we design and prototype the first highly efficient BNN training accelerator, named Shift-BNN, that is low-cost and scalable. Extensive evaluation on five representative BNN models demonstrates that Shift-BNN achieves an average of 4.9x (up to 10.8x) boost in energy efficiency and 1.6x (up to 2.8x) speedup over the baseline DNN training accelerator.
Submitted 7 October, 2021;
originally announced October 2021.
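The idea of reversed LFSR shifting in this abstract can be illustrated with a small software sketch: because an LFSR step is invertible, a sequence of pseudo-random states produced in the forward direction can be regenerated in reverse order from the final state instead of being stored. The 16-bit register width, the tap positions (a common textbook polynomial, x^16 + x^14 + x^13 + x^11 + 1), and the function names are assumptions for illustration; the abstract does not specify the accelerator's actual LFSR configuration.

```python
MASK = 0xFFFF  # 16-bit register (illustrative width)

def lfsr_forward(state):
    # Feedback bit from taps at bit positions 0, 2, 3, and 5.
    bit = (state ^ (state >> 2) ^ (state >> 3) ^ (state >> 5)) & 1
    return (state >> 1) | (bit << 15)

def lfsr_backward(state):
    # Recover the bit that the forward step shifted out, then undo the shift.
    bit0 = ((state >> 15) ^ (state >> 1) ^ (state >> 2) ^ (state >> 4)) & 1
    return ((state << 1) & MASK) | bit0

seed = 0xACE1
forward = [seed]
for _ in range(5):
    forward.append(lfsr_forward(forward[-1]))

# Walk back from the last state: the same values reappear in reverse order,
# so none of the intermediate states needs to be kept in memory.
backward = [forward[-1]]
for _ in range(5):
    backward.append(lfsr_backward(backward[-1]))

print(backward == forward[::-1])
```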
-
AdjointBackMapV2: Precise Reconstruction of Arbitrary CNN Unit's Activation via Adjoint Operators
Authors:
Qing Wan,
Siu Wun Cheung,
Yoonsuck Choe
Abstract:
Adjoint operators have been found to be effective in the exploration of CNN's inner workings [1]. However, the previous no-bias assumption restricted its generalization. We overcome the restriction via embedding input images into an extended normed space that includes bias in all CNN layers as part of the extended space and propose an adjoint-operator-based algorithm that maps high-level weights back to the extended input space for reconstructing an effective hypersurface. Such hypersurface can be computed for an arbitrary unit in the CNN, and we prove that this reconstructed hypersurface, when multiplied by the original input (through an inner product), will precisely replicate the output value of each unit. We show experimental results based on the CIFAR-10 and CIFAR-100 data sets where the proposed approach achieves near 0 activation value reconstruction error.
Submitted 9 November, 2023; v1 submitted 4 October, 2021;
originally announced October 2021.
-
A Variational Bayesian Inference-Inspired Unrolled Deep Network for MIMO Detection
Authors:
Qian Wan,
Jun Fang,
Yinsen Huang,
Huiping Duan,
Hongbin Li
Abstract:
The great success of deep learning (DL) has inspired researchers to develop more accurate and efficient symbol detectors for multi-input multi-output (MIMO) systems. Existing DL-based MIMO detectors, however, suffer several drawbacks. To address these issues, in this paper, we develop a model-driven DL detector based on variational Bayesian inference. Specifically, the proposed unrolled DL architecture is inspired by an inverse-free variational Bayesian learning framework which circumvents matrix inversion via maximizing a relaxed evidence lower bound. Two networks are respectively developed for independent and identically distributed (i.i.d.) Gaussian channels and arbitrarily correlated channels. The proposed networks, referred to as VBINet, have only a few learnable parameters and thus can be efficiently trained with a moderate amount of training samples. The proposed VBINet-based detectors can work in both offline and online training modes. An important advantage of our proposed networks over state-of-the-art MIMO detection networks such as OAMPNet and MMNet is that the VBINet can automatically learn the noise variance from data, thus yielding a significant performance improvement over the OAMPNet and MMNet in the presence of noise variance uncertainty. Simulation results show that the proposed VBINet-based detectors achieve competitive performance for both i.i.d. Gaussian and realistic 3GPP MIMO channels.
Submitted 11 January, 2022; v1 submitted 25 September, 2021;
originally announced September 2021.
-
Geometric Fabrics: Generalizing Classical Mechanics to Capture the Physics of Behavior
Authors:
Karl Van Wyk,
Mandy Xie,
Anqi Li,
Muhammad Asif Rana,
Buck Babich,
Bryan Peele,
Qian Wan,
Iretiayo Akinola,
Balakumar Sundaralingam,
Dieter Fox,
Byron Boots,
Nathan D. Ratliff
Abstract:
Classical mechanical systems are central to controller design in energy shaping methods of geometric control. However, their expressivity is limited by position-only metrics and the intimate link between metric and geometry. Recent work on Riemannian Motion Policies (RMPs) has shown that shedding these restrictions results in powerful design tools, but at the expense of theoretical stability guarantees. In this work, we generalize classical mechanics to what we call geometric fabrics, whose expressivity and theory enable the design of systems that outperform RMPs in practice. Geometric fabrics strictly generalize classical mechanics forming a new physics of behavior by first generalizing them to Finsler geometries and then explicitly bending them to shape their behavior while maintaining stability. We develop the theory of fabrics and present both a collection of controlled experiments examining their theoretical properties and a set of robot system experiments showing improved performance over a well-engineered and hardened implementation of RMPs, our current state-of-the-art in controller design.
Submitted 18 January, 2022; v1 submitted 21 September, 2021;
originally announced September 2021.
-
Highly Efficient Ultrathin Light Emitting Diodes based on Perovskite Nanocrystals
Authors:
Qun Wan,
Weilin Zheng,
Chen Zoub,
Francesco Carulli,
Congyang Zhang,
Haili Song,
Mingming Liu,
Qinggang Zhang,
Lih Y. Lin,
Long Kong,
Liang Li,
Sergio Brovelli
Abstract:
Light-emitting diodes based on perovskite nanocrystals (PNC-LEDs) have gained great interest for next-generation display and lighting technologies, prized for their color purity, high brightness and luminous efficiency approaching the intrinsic limit imposed by extraction of electroluminescence from the device structure. Although the time is ripe for the development of effective light outcoupling strategies to further boost the device performance, this technologically relevant aspect of PNC-LEDs is still without a definitive solution. Here, following theoretical guidelines and without the integration of complex photonic structures, we realize stable PNC-LEDs with EQE as high as 29.2% (average EQE=24.7%), which substantially break the outcoupling limit of common PNC-LEDs and systematically surpass any previous perovskite-based device. Key to such unprecedented performance is channeling the recombination zone in PNC emissive layers as thin as 10 nm, which we achieve by finely balancing the electron and hole transport using CsPbBr3 PNCs resurfaced with a nickel oxide layer. The ultra-thin approach is general and, in principle, applicable to other perovskite nanostructures for fabricating highly efficient, color-tunable transparent LEDs ideal for unobtrusive screens and displays, and it is compatible with the integration of photonic components for further enhanced performance.
Submitted 3 June, 2021;
originally announced June 2021.
-
RFCBF: enhance the performance and stability of Fast Correlation-Based Filter
Authors:
Xiongshi Deng,
Min Li,
Lei Wang,
Qikang Wan
Abstract:
Feature selection is a preprocessing step which plays a crucial role in the domain of machine learning and data mining. Feature selection methods have been shown to be effective in removing redundant and irrelevant features, improving the learning algorithm's prediction performance. Among the various methods of feature selection based on redundancy, the fast correlation-based filter (FCBF) is one of the most effective. In this paper, we propose a novel extension of FCBF, called RFCBF, which incorporates a resampling technique to improve classification accuracy. We performed comprehensive experiments to compare RFCBF with other state-of-the-art feature selection methods using the KNN classifier on 12 publicly available data sets. The experimental results show that the RFCBF algorithm yields significantly better results than previous state-of-the-art methods in terms of classification accuracy and runtime.
Submitted 30 May, 2021;
originally announced May 2021.
-
Inherited Weak Topological Insulator Signatures in Topological Hourglass Semimetal Nb3XTe6 (X = Si, Ge)
Authors:
Q. Wan,
T. Y. Yang,
S. Li,
M. Yang,
Z. Zhu,
C. L. Wu,
C. Peng,
S. K. Mo,
W. Wu,
Z. H. Chen,
Y. B. Huang,
L. L. Lev,
V. N. Strocov,
J. Hu,
Z. Q. Mao,
Hao Zheng,
J. F. Jia,
Y. G. Shi,
Shengyuan A. Yang,
N. Xu
Abstract:
Using spin-resolved and angle-resolved photoemission spectroscopy and first-principles calculations, we have identified bulk band inversion and spin polarized surface state evolved from a weak topological insulator (TI) phase in van der Waals materials Nb3XTe6 (X = Si, Ge). The fingerprints of weak TI homologically emerge with hourglass fermions, as multi nodal chains composed by the same pair of valence and conduction bands gapped by spin orbit coupling. The novel topological state, with a pair of valence and conduction bands encoding both weak TI and hourglass semimetal nature, is essential and guaranteed by nonsymmorphic symmetry. It is distinct from TIs studied previously based on band inversions without symmetry protections.
Submitted 15 April, 2021;
originally announced April 2021.
-
Observation of multi Dirac fermion cloning induced by moiré potential in graphene-SiC heterostructure
Authors:
C. L. Wu,
Q. Wan,
C. Peng,
S. K. Mo,
R. Z. Li,
K. M. Zhao,
Y. P. Guo,
C. D. Zhang,
N. Xu
Abstract:
We reexamine the electronic structure of graphene on a SiC substrate by angle-resolved photoemission spectroscopy. We directly observe multiple cloning of the Dirac cone, in addition to the replicas previously attributed to reconstruction. The locations, relative distances and anisotropy of the Dirac cone replicas fully agree with the moiré pattern of the graphene-SiC heterostructure. Our results provide a straightforward example of moiré potential modulation in engineering electronic structure with Dirac fermions.
Submitted 23 March, 2021;
originally announced March 2021.
-
Interaction between optical pulse and tumor using finite element analysis
Authors:
Xianlin Song,
Ao Teng,
Jianshuang Wei,
Hao Chen,
Yang Zhao,
Jianheng Chen,
Fangwei Liu,
Qianxiang Wan,
Guoning Huang,
Lingfang Song,
Aojie Zhao,
Bo Li,
Zihao Li,
Qiming He,
Jinhong Zhang
Abstract:
Photoacoustic imaging is an emerging technology based on the photoacoustic effect that has developed rapidly in recent years. It combines the high contrast of optical imaging and the high penetration and high resolution of acoustic imaging. As a non-destructive biological tissue imaging technology, photoacoustic imaging has important application value in the field of biomedicine. With its high-efficiency bioimaging capabilities and excellent biosafety performance, it has been favored by researchers. The visualization of photoacoustic imaging has great research significance in the early diagnosis of some diseases, especially tumors. In photoacoustic imaging, light transmission and thermal effects are important processes. This article is based on COMSOL software and uses finite element analysis to construct a physical model for simulation. By directing laser pulses into stomach tissue containing a tumor, the physical processes of light transmission and biological heat transfer were studied, a photothermal model composed of two physical fields was built, and finally a series of visualization graphics was obtained. This work has certain theoretical guiding significance for further promoting the application of photoacoustic imaging in the field of biomedicine.
Submitted 19 January, 2021;
originally announced January 2021.
-
AdjointBackMap: Reconstructing Effective Decision Hypersurfaces from CNN Layers Using Adjoint Operators
Authors:
Qing Wan,
Yoonsuck Choe
Abstract:
There are several effective methods in explaining the inner workings of convolutional neural networks (CNNs). However, in general, finding the inverse of the function performed by CNNs as a whole is an ill-posed problem. In this paper, we propose a method based on adjoint operators to reconstruct, given an arbitrary unit in the CNN (except for the first convolutional layer), its effective hypersurface in the input space that replicates that unit's decision surface conditioned on a particular input image. Our results show that the hypersurface reconstructed this way, when multiplied by the original input image, would give nearly the exact output value of that unit. We find that the CNN unit's decision surface is largely conditioned on the input, and this may explain why adversarial inputs can effectively deceive CNNs.
Submitted 29 March, 2021; v1 submitted 16 December, 2020;
originally announced December 2020.