-
The instrumentation program at the Large Binocular Telescope Observatory in 2024
Authors:
Joseph C. Shields,
Jason Chu,
Albert Conrad,
Jonathan Crass,
Justin R. Crepp,
Steve Ertel,
Jacopo Farinato,
Ilya Ilyin,
Olga Kuhn,
Luca Marafatto,
Fernando Pedichini,
Roberto Piazzesi,
Richard W. Pogge,
Jennifer Power,
Sam Ragland,
Robert Reynolds,
James Riedl,
Mark Smithwright,
Klaus G. Strassmeier,
David Thompson
Abstract:
The Large Binocular Telescope, with its expansive collecting area, angular resolving power, and advanced optical design, provides a robust platform for development and operation of advanced instrumentation for astronomical research. The LBT currently hosts a mature suite of instruments for spectroscopy and imaging at optical through mid-infrared wavelengths, supported by sophisticated adaptive optics systems. This contribution summarizes the current state of instrumentation, including upgrades to existing instruments and commissioning of second generation instruments now in progress. The LBT is soliciting proposals for next generation instrument concepts, with participation open to consortium members and others interested in participation in the Observatory.
Submitted 15 July, 2024;
originally announced July 2024.
-
Highly Accelerated MRI via Implicit Neural Representation Guided Posterior Sampling of Diffusion Models
Authors:
Jiayue Chu,
Chenhe Du,
Xiyue Lin,
Yuyao Zhang,
Hongjiang Wei
Abstract:
Reconstructing high-fidelity magnetic resonance (MR) images from under-sampled k-space is a commonly used strategy to reduce scan time. The posterior sampling of diffusion models based on the real measurement data holds significant promise of improved reconstruction accuracy. However, traditional posterior sampling methods often lack effective data consistency guidance, leading to inaccurate and unstable reconstructions. Implicit neural representation (INR) has emerged as a powerful paradigm for solving inverse problems by modeling a signal's attributes as a continuous function of spatial coordinates. In this study, we present a novel posterior sampler for diffusion models using INR, named DiffINR. The INR-based component incorporates both the diffusion prior distribution and the MRI physical model to ensure high data fidelity. DiffINR demonstrates superior performance on experimental datasets with remarkable accuracy, even under high acceleration factors (up to R=12 in single-channel reconstruction). Notably, our proposed framework can be a generalizable framework to solve inverse problems in other medical imaging tasks.
Submitted 2 July, 2024;
originally announced July 2024.
-
Noise-induced quantum synchronization and maximally entangled mixed states in superconducting circuits
Authors:
Ziyu Tao,
Finn Schmolke,
Chang-Kang Hu,
Wenhui Huang,
Yuxuan Zhou,
Jiawei Zhang,
Ji Chu,
Libo Zhang,
Xuandong Sun,
Zecheng Guo,
Jingjing Niu,
Wenle Weng,
Song Liu,
Youpeng Zhong,
Dian Tan,
Dapeng Yu,
Eric Lutz
Abstract:
Random fluctuations can lead to cooperative effects in complex systems. We here report the experimental observation of noise-induced quantum synchronization in a chain of superconducting transmon qubits with nearest-neighbor interactions. The application of Gaussian white noise to a single site leads to synchronous oscillations in the entire chain. We show that the two synchronized end qubits are entangled, with nonzero concurrence, and that they belong to a class of generalized Bell states known as maximally entangled mixed states, whose entanglement cannot be increased by any global unitary. We further demonstrate the stability against frequency detuning of both synchronization and entanglement by determining the corresponding generalized Arnold tongue diagrams. Our results highlight the constructive influence of noise in a quantum many-body system and uncover the potential role of synchronization for mixed-state quantum information science.
Submitted 14 June, 2024;
originally announced June 2024.
-
Ferromagnetism and Topology of the Higher Flat Band in a Fractional Chern Insulator
Authors:
Heonjoon Park,
Jiaqi Cai,
Eric Anderson,
Xiao-Wei Zhang,
Xiaoyu Liu,
William Holtzmann,
Weijie Li,
Chong Wang,
Chaowei Hu,
Yuzhou Zhao,
Takashi Taniguchi,
Kenji Watanabe,
Jihui Yang,
David Cobden,
Jiun-Haw Chu,
Nicolas Regnault,
B. Andrei Bernevig,
Liang Fu,
Ting Cao,
Di Xiao,
Xiaodong Xu
Abstract:
The recent observation of the fractional quantum anomalous Hall effect in moiré fractional Chern insulators (FCI) provides opportunities for investigating zero magnetic field anyons. So far, both experimental and theoretical results suggest that filling > 1/3 FCI states in the first Chern band share features with those of the lowest Landau level (LL). To create the possibility of realizing non-Abelian anyons, one route is to engineer higher flat Chern bands that mimic higher LLs. Here, we investigate the interaction, topology, and ferromagnetism of the second moiré miniband in twisted MoTe2 bilayer (tMoTe2). Around filling factor v = -3, i.e., half-filling of the second miniband, we uncover spontaneous ferromagnetism and an incipient Chern insulator state. By measuring the anomalous Hall effect as a function of twist angle, we find that the Chern numbers (C) of the top two moiré flat bands have opposite sign (C = -+1) at twist angles above 3.1° but the same sign (C = -1) around 2.6°. This observation is consistent with the recently predicted twist-angle dependent band topology, resulting from the competition between moiré ferroelectricity and piezoelectricity. As we increase the magnetic field, only the small twist-angle device (2.6°) experiences a topological phase transition with an emergent C = -2 state. This is attributed to a Zeeman field-induced band crossing between opposite valleys, with the determined C = -1 for the top two bands. Our results lay a firm foundation for understanding the higher flat Chern bands, which is essential for the prediction or discovery of non-Abelian FCIs.
Submitted 13 June, 2024;
originally announced June 2024.
-
Ubiquitous Flat Bands in a Cr-based Kagome Superconductor
Authors:
Yucheng Guo,
Zehao Wang,
Fang Xie,
Yuefei Huang,
Bin Gao,
Ji Seop Oh,
Han Wu,
Zhaoyu Liu,
Zheng Ren,
Yuan Fang,
Ananya Biswas,
Yichen Zhang,
Ziqin Yue,
Cheng Hu,
Chris Jozwiak,
Aaron Bostwick,
Eli Rotenberg,
Makoto Hashimoto,
Donghui Lu,
Junichiro Kono,
Jiun-Haw Chu,
Boris I Yakobson,
Robert J Birgeneau,
Qimiao Si,
Pengcheng Dai
, et al. (1 additional authors not shown)
Abstract:
In the quest for novel quantum states driven by topology and correlation, kagome lattice materials have garnered significant interest due to their distinctive electronic band structures, featuring flat bands (FBs) arising from the quantum destructive interference of the electronic wave function. The tuning of the FBs to the chemical potential would lead to the possibility of liberating electronic instabilities that lead to emergent electronic orders. Despite extensive studies, direct evidence of FBs tuned to the chemical potential and their participation in emergent electronic orders have been lacking in bulk quantum materials. Here using a combination of Angle-Resolved Photoemission Spectroscopy (ARPES) and Density Functional Theory (DFT), we reveal that the low-energy electronic structure of the recently discovered Cr-based kagome metal superconductor CsCr3Sb5 is dominated by a pervasive FB in close proximity to, and below the Fermi level. A comparative analysis with orbital-projected DFT and polarization dependence measurement uncovers that an orbital-selective renormalization mechanism is needed to reconcile the discrepancy with the DFT calculations, which predict the FB to appear 200 meV above the Fermi level. Furthermore, we observe the FB to shift away from the Fermi level by 20 meV in the low-temperature density wave-ordered phase, highlighting the role of the FB in the emergent electronic order. Our results reveal CsCr3Sb5 to stand out as a promising platform for further exploration into the effects of FBs near the Fermi level on kagome lattices, and their role in emergent orders in bulk quantum materials.
Submitted 12 June, 2024; v1 submitted 7 June, 2024;
originally announced June 2024.
-
An Empirical Study of Excitation and Aggregation Design Adaptions in CLIP4Clip for Video-Text Retrieval
Authors:
Xiaolun Jing,
Genke Yang,
Jian Chu
Abstract:
The CLIP4Clip model, transferred from CLIP, has been the de facto standard for solving the video clip retrieval task from frame-level input, triggering a surge of CLIP4Clip-based models in the video-text retrieval domain. In this work, we rethink the inherent limitation of the widely used mean pooling operation in frame feature aggregation and investigate adaptions of excitation and aggregation design for discriminative video representation generation. We present a novel excitation-and-aggregation design, in which (1) the excitation module captures non-mutually-exclusive relationships among frame features and achieves frame-wise feature recalibration, and (2) the aggregation module learns exclusiveness used for frame representation aggregation. Similarly, we employ a cascade of sequential module and aggregation design to generate discriminative video representation in the sequential type. In addition, we adopt the excitation design in the tight type to obtain representative frame features for multi-modal interaction. The proposed modules are evaluated on three benchmark datasets, MSR-VTT, ActivityNet and DiDeMo, achieving 43.9 R@1 on MSR-VTT, 44.1 R@1 on ActivityNet and 31.0 R@1 on DiDeMo. They outperform the CLIP4Clip results by +1.2% (+0.5%), +4.5% (+1.9%) and +9.5% (+2.7%) relative (absolute) improvements, demonstrating the superiority of our proposed excitation and aggregation designs. We hope our work will serve as an alternative for frame representation aggregation and facilitate future research.
Submitted 8 June, 2024; v1 submitted 25 May, 2024;
originally announced June 2024.
-
Llarull's theorem on punctured sphere with $L^\infty$ metric
Authors:
Jianchun Chu,
Man-Chun Lee,
Jintian Zhu
Abstract:
The classical Llarull theorem states that a smooth metric on $n$-sphere cannot have scalar curvature no less than $n(n-1)$ and dominate the standard spherical metric at the same time unless it is the standard spherical metric. In this work, we prove that Llarull's rigidity theorem holds for $L^{\infty}$ metrics on spheres with finitely many points punctured. This is related to a question of Gromov.
Submitted 4 June, 2024; v1 submitted 30 May, 2024;
originally announced May 2024.
-
Visualizing the microscopic origins of topology in twisted molybdenum ditelluride
Authors:
Ellis Thompson,
Keng Tou Chu,
Florie Mesple,
Xiao-Wei Zhang,
Chaowei Hu,
Yuzhou Zhao,
Heonjoon Park,
Jiaqi Cai,
Eric Anderson,
Kenji Watanabe,
Takashi Taniguchi,
Jihui Yang,
Jiun-Haw Chu,
Xiaodong Xu,
Ting Cao,
Di Xiao,
Matthew Yankowitz
Abstract:
In moiré materials with flat electronic bands and suitable quantum geometry, strong correlations can give rise to novel topological states of matter. The nontrivial band topology of twisted molybdenum ditelluride (tMoTe$_2$) -- responsible for its fractional quantum anomalous Hall (FQAH) states -- is predicted to arise from a layer-pseudospin skyrmion lattice. Tracing the layer polarization of wavefunctions within the moiré unit cell can thus offer crucial insights into the band topology. Here, we use scanning tunneling microscopy and spectroscopy (STM/S) to probe the layer-pseudospin skyrmion textures of tMoTe$_2$. We do this by simultaneously visualizing the moiré lattice structure and the spatial localization of its electronic states. We find that the wavefunctions associated with the topological flat bands exhibit a spatially-dependent layer polarization within the moiré unit cell. This is in excellent agreement with our theoretical modeling, thereby revealing a direct microscopic connection between the structural properties of tMoTe$_2$ and its band topology. Our work enables new pathways for engineering FQAH states with strain, as well as future STM studies of the intertwined correlated and topological states arising in gate-tunable devices.
Submitted 29 May, 2024;
originally announced May 2024.
-
Perspective: Probing elasto-quantum materials with X-ray techniques and in situ anisotropic strain
Authors:
Han Zhang,
Joshua J. Sanchez,
Jiun-Haw Chu,
Jian Liu
Abstract:
Anisotropic lattice deformation plays an important role in the quantum mechanics of solid state physics. The possibility of mediating the competition and cooperation among different order parameters by applying in situ strain/stress on quantum materials has led to discoveries of a variety of elasto-quantum effects on emergent phenomena. It has become increasingly critical to have the capability of combining the in situ strain tuning with X-ray techniques, especially those based on synchrotrons, to probe the microscopic elasto-responses of the lattice, spin, charge, and orbital degrees of freedom. Herein, we briefly review the recent studies that embarked on utilizing elasto-X-ray characterizations on representative material systems and demonstrated the emerging opportunities enabled by this method. With that, we further discuss the promising prospect in this rising area of quantum materials research and the bright future of elasto-X-ray techniques.
Submitted 16 May, 2024;
originally announced May 2024.
-
Large Language Model Informed Patent Image Retrieval
Authors:
Hao-Cheng Lo,
Jung-Mei Chu,
Jieh Hsiang,
Chun-Chieh Cho
Abstract:
In patent prosecution, image-based retrieval systems for identifying similarities between current patent images and prior art are pivotal to ensure the novelty and non-obviousness of patent applications. Despite their growing popularity in recent years, existing attempts, while effective at recognizing images within the same patent, fail to deliver practical value due to their limited generalizability in retrieving relevant prior art. Moreover, this task inherently involves the challenges posed by the abstract visual features of patent images, the skewed distribution of image classifications, and the semantic information of image descriptions. Therefore, we propose a language-informed, distribution-aware multimodal approach to patent image feature learning, which enriches the semantic understanding of patent image by integrating Large Language Models and improves the performance of underrepresented classes with our proposed distribution-aware contrastive losses. Extensive experiments on DeepPatent2 dataset show that our proposed method achieves state-of-the-art or comparable performance in image-based patent retrieval with mAP +53.3%, Recall@10 +41.8%, and MRR@10 +51.9%. Furthermore, through an in-depth user analysis, we explore our model in aiding patent professionals in their image retrieval efforts, highlighting the model's real-world applicability and effectiveness.
Submitted 30 April, 2024;
originally announced April 2024.
-
Local probe of bulk and edge states in a fractional Chern insulator
Authors:
Zhurun Ji,
Heonjoon Park,
Mark E. Barber,
Chaowei Hu,
Kenji Watanabe,
Takashi Taniguchi,
Jiun-Haw Chu,
Xiaodong Xu,
Zhi-xun Shen
Abstract:
Fractional quantum Hall effect (FQHE) is a prime example of topological quantum many-body phenomena, arising from the interplay between strong electron correlation, topological order, and time reversal symmetry breaking. Recently, a lattice analog of FQHE at zero magnetic field has been observed, confirming the existence of a zero-field fractional Chern insulator (FCI). Despite this, the bulk-edge correspondence -- a hallmark of FCI featuring an insulating bulk with conductive edges -- has not been directly observed. In fact, this correspondence has not been visualized in any system for fractional states due to experimental challenges. Here we report the imaging of FCI edge states in twisted MoTe2 by employing a newly developed modality of microwave-impedance microscopy. By tuning the carrier density, we observe the system evolving between metallic and FCI states, the latter of which exhibits insulating bulk and conductive edges as expected from bulk-boundary correspondence. We also observe the evolution of edge states across the topological phase transition from an incompressible Chern insulator state to a metal and finally to a putative charge ordered insulating state as a function of interlayer electric field. The local measurement further reveals tantalizing prospects of neighboring domains with different fractional orders. These findings pave the way for research into topologically protected 1D interfaces between various anyonic states at zero magnetic field, such as topological entanglement entropy, Halperin-Laughlin interfaces, and the creation of non-abelian anyons.
Submitted 10 April, 2024;
originally announced April 2024.
-
Disentanglement of mixed interference fringes in optical interferometers: theory and applications
Authors:
Kaiyuan Yang,
Weilong Wei,
Xiafei Ma,
Botao Chen,
Junqiu Chu,
Xinling Liu,
Yuhua Cheng,
Hu Yang,
Haotong Ma,
Bo Qi,
Zongliang Xie
Abstract:
Optical interferometric imaging enables astronomical observation at extremely high angular resolution. The necessary optical information for imaging, such as the optical path differences and visibilities, is easy to extract from fringes generated by the combination of two beams. With more than two apertures, the image-plane interference pattern becomes an increasingly indistinguishable mixture of fringe spacings and directions. For decades, the state-of-the-art approaches for obtaining two-aperture fringes from an interferometer array composed of many apertures are limited to pairwise combinations using bulk optics. Here, we derive and demonstrate a fringe disentanglement theory that can digitally transform the interference pattern of N apertures to N(N-1)/2 pairwise fringes without any optics, thus providing straightforward methods of information acquisition for interferometers. We demonstrate applications of our technique by both simulation and experiment, showing that this theory can be used for simultaneously sensing pistons and determining the individual visibilities of all combining apertures. Furthermore, we use the proposed theory to phase a 1.5-meter segmented flat telescope, demonstrating its validity for engineering implementation. This theory may not only benefit optical imaging but also interferometry-based measurements, by providing an exceptional capability to simplify the interferometric output generated by a system of many apertures.
Submitted 9 April, 2024;
originally announced April 2024.
-
Dynamic Backtracking in GFlowNets: Enhancing Decision Steps with Reward-Dependent Adjustment Mechanisms
Authors:
Shuai Guo,
Jielei Chu,
Lei Zhu,
Zhaoyu Li,
Tianrui Li
Abstract:
Generative Flow Networks (GFlowNets or GFNs) are probabilistic models predicated on Markov flows, and they employ specific amortization algorithms to learn stochastic policies that generate compositional substances including biomolecules, chemical materials, etc. With a strong ability to generate high-performance biochemical molecules, GFNs accelerate the discovery of scientific substances, effectively overcoming the time-consuming, labor-intensive, and costly shortcomings of conventional material discovery methods. However, previous studies rarely focus on accumulating exploratory experience by adjusting generative structures, which leads to disorientation in complex sampling spaces. Efforts to address this issue, such as LS-GFN, are limited to local greedy searches and lack broader global adjustments. This paper introduces a novel variant of GFNs, the Dynamic Backtracking GFN (DB-GFN), which improves the adaptability of decision-making steps through a reward-based dynamic backtracking mechanism. DB-GFN allows backtracking during the network construction process according to the current state's reward value, thereby correcting disadvantageous decisions and exploring alternative pathways during the exploration process. When applied to generative tasks involving biochemical molecules and genetic material sequences, DB-GFN outperforms GFN models such as LS-GFN and GTB, as well as traditional reinforcement learning methods, in sample quality, sample exploration quantity, and training convergence speed. Additionally, owing to its orthogonal nature, DB-GFN shows great potential in future improvements of GFNs, and it can be integrated with other strategies to achieve higher search performance.
Submitted 13 May, 2024; v1 submitted 8 April, 2024;
originally announced April 2024.
-
Continuously tunable uniaxial strain control of van der Waals heterostructure devices
Authors:
Zhaoyu Liu,
Xuetao Ma,
John Cenker,
Jiaqi Cai,
Zaiyao Fei,
Paul Malinowski,
Joshua Mutch,
Yuzhou Zhao,
Kyle Hwangbo,
Zhong Lin,
Arnab Manna,
Jihui Yang,
David Cobden,
Xiaodong Xu,
Matthew Yankowitz,
Jiun-Haw Chu
Abstract:
Uniaxial strain has been widely used as a powerful tool for investigating and controlling the properties of quantum materials. However, existing strain techniques have so far mostly been limited to use with bulk crystals. Although recent progress has been made in extending the application of strain to two-dimensional van der Waals (vdW) heterostructures, these techniques have been limited to optical characterization and extremely simple electrical device geometries. Here, we report a piezoelectric-based \textit{in situ} uniaxial strain technique enabling simultaneous electrical transport and optical spectroscopy characterization of dual-gated vdW heterostructure devices. Critically, our technique remains compatible with vdW heterostructure devices of arbitrary complexity fabricated on conventional silicon/silicon dioxide wafer substrates. We demonstrate a large and continuously tunable strain of up to $-0.15\%$ at millikelvin temperatures, with larger strain values also likely achievable. We quantify the strain transmission from the silicon wafer to the vdW heterostructure, and further demonstrate the ability of strain to modify the electronic properties of twisted bilayer graphene. Our technique provides a highly versatile new method for exploring the effect of uniaxial strain on both the electrical and optical properties of vdW heterostructures, and can be easily extended to include additional characterization techniques.
Submitted 23 May, 2024; v1 submitted 1 April, 2024;
originally announced April 2024.
-
Incorporating Exponential Smoothing into MLP: A Simple but Effective Sequence Model
Authors:
Jiqun Chu,
Zuoquan Lin
Abstract:
Modeling long-range dependencies in sequential data is a crucial step in sequence learning. A recently developed model, the Structured State Space (S4), demonstrated significant effectiveness in modeling long-range sequences. However, it is unclear whether the success of S4 can be attributed to its intricate parameterization and HiPPO initialization or simply to State Space Models (SSMs). To further investigate the potential of deep SSMs, we start with exponential smoothing (ETS), a simple SSM, and propose a stacked architecture by directly incorporating it into an element-wise MLP. We augment simple ETS with additional parameters and a complex field to reduce the inductive bias. Despite an increase of less than 1\% in the parameters of the element-wise MLP, our models achieve comparable results to S4 on the LRA benchmark.
Submitted 26 March, 2024;
originally announced March 2024.
-
Coupler-Assisted Leakage Reduction for Scalable Quantum Error Correction with Superconducting Qubits
Authors:
Xiaohan Yang,
Ji Chu,
Zechen Guo,
Wenhui Huang,
Yongqi Liang,
Jiawei Liu,
Jiawei Qiu,
Xuandong Sun,
Ziyu Tao,
Jiawei Zhang,
Jiajian Zhang,
Libo Zhang,
Yuxuan Zhou,
Weijie Guo,
Ling Hu,
Ji Jiang,
Yang Liu,
Xiayu Linpeng,
Tingyong Chen,
Yuanzhen Chen,
Jingjing Niu,
Song Liu,
Youpeng Zhong,
Dapeng Yu
Abstract:
Superconducting qubits are a promising platform for building fault-tolerant quantum computers, with recent achievement showing the suppression of logical error with increasing code size. However, leakage into non-computational states, a common issue in practical quantum systems including superconducting circuits, introduces correlated errors that undermine QEC scalability. Here, we propose and demonstrate a leakage reduction scheme utilizing tunable couplers, a widely adopted ingredient in large-scale superconducting quantum processors. Leveraging the strong frequency tunability of the couplers and stray interaction between the couplers and readout resonators, we eliminate state leakage on the couplers, thus suppressing space-correlated errors caused by population propagation among the couplers. Assisted by the couplers, we further reduce leakage to higher qubit levels with high efficiency (98.1%) and low error rate on the computational subspace (0.58%), suppressing time-correlated errors during QEC cycles. The performance of our scheme demonstrates its potential as an indispensable building block for scalable QEC with superconducting qubits.
Submitted 24 March, 2024;
originally announced March 2024.
-
vid-TLDR: Training Free Token merging for Light-weight Video Transformer
Authors:
Joonmyung Choi,
Sanghyeok Lee,
Jaewon Chu,
Minhyuk Choi,
Hyunwoo J. Kim
Abstract:
Video Transformers have become the prevalent solution for various video downstream tasks with superior expressive power and flexibility. However, these video transformers suffer from heavy computational costs induced by the massive number of tokens across the entire video frames, which has been the major barrier to training the model. Further, the patches irrelevant to the main contents, e.g., backgrounds, degrade the generalization performance of models. To tackle these issues, we propose training free token merging for lightweight video Transformer (vid-TLDR) that aims to enhance the efficiency of video Transformers by merging the background tokens without additional training. For vid-TLDR, we introduce a novel approach to capture the salient regions in videos only with the attention map. Further, we introduce the saliency-aware token merging strategy by dropping the background tokens and sharpening the object scores. Our experiments show that vid-TLDR significantly mitigates the computational complexity of video Transformers while achieving competitive performance compared to the base model without vid-TLDR. Code is available at https://github.com/mlvlab/vid-TLDR.
Submitted 30 March, 2024; v1 submitted 20 March, 2024;
originally announced March 2024.
-
A multidisciplinary framework for deconstructing bots' pluripotency in dualistic antagonism
Authors:
Wentao Xu,
Kazutoshi Sasahara,
Jianxun Chu,
Bin Wang,
Wenlu Fan,
Zhiwen Hu
Abstract:
Anthropomorphic social bots are engineered to emulate human verbal communication and generate toxic or inflammatory content across social networking services (SNSs). Bot-disseminated misinformation could subtly yet profoundly reshape societal processes by complexly interweaving factors like repeated disinformation exposure, amplified political polarization, compromised indicators of democratic health, shifted perceptions of national identity, propagation of false social norms, and manipulation of collective memory over time. However, how to extrapolate bots' pluripotency across hybridized, multilingual, and heterogeneous media ecologies from isolated SNS analyses remains largely unknown, underscoring the need for a comprehensive framework to characterise bots' emergent risks to civic discourse. Here we propose an interdisciplinary framework to characterise bots' pluripotency, incorporating quantification of influence, network dynamics monitoring, and interlingual feature analysis. When applied to the geopolitical discourse around the Russo-Ukrainian conflict, results from interlanguage toxicity profiling and network analysis elucidated spatiotemporal trajectories of pro-Russian and pro-Ukrainian humans and bots across hybrid SNSs. Weaponized bots predominantly inhabited X, while humans primarily populated Reddit in the social media warfare. This rigorous framework promises to elucidate interlingual homogeneity and heterogeneity in bots' pluripotent behaviours, revealing synergistic human-bot mechanisms underlying regimes of information manipulation, echo chamber formation, and collective memory manifestation in algorithmically structured societies.
Submitted 11 May, 2024; v1 submitted 23 February, 2024;
originally announced February 2024.
-
Enhancing Security in Blockchain Networks: Anomalies, Frauds, and Advanced Detection Techniques
Authors:
Joerg Osterrieder,
Stephen Chan,
Jeffrey Chu,
Yuanyuan Zhang,
Branka Hadji Misheva,
Codruta Mare
Abstract:
Blockchain technology, a foundational distributed ledger system, enables secure and transparent multi-party transactions. Despite its advantages, blockchain networks are susceptible to anomalies and frauds, posing significant risks to their integrity and security. This paper offers a detailed examination of blockchain's key definitions and properties, alongside a thorough analysis of the various anomalies and frauds that undermine these networks. It describes an array of detection and prevention strategies, encompassing statistical and machine learning methods, game-theoretic solutions, digital forensics, reputation-based systems, and comprehensive risk assessment techniques. Through case studies, we explore practical applications of anomaly and fraud detection in blockchain networks, extracting valuable insights and implications for both current practice and future research. Moreover, we spotlight emerging trends and challenges within the field, proposing directions for future investigation and technological development. Aimed at both practitioners and researchers, this paper seeks to provide a technical, in-depth overview of anomaly and fraud detection within blockchain networks, marking a significant step forward in the search for enhanced network security and reliability.
Submitted 17 February, 2024;
originally announced February 2024.
-
A Deep Learning Approach to Radar-based QPE
Authors:
Ting-Shuo Yo,
Shih-Hao Su,
Jung-Lien Chu,
Chiao-Wei Chang,
Hung-Chi Kuo
Abstract:
In this study, we propose a volume-to-point framework for quantitative precipitation estimation (QPE) based on the Quantitative Precipitation Estimation and Segregation Using Multiple Sensor (QPESUMS) Mosaic Radar data set. With a data volume consisting of the time series of gridded radar reflectivities over the Taiwan area, we used machine learning algorithms to establish a statistical model for QPE in weather stations. The model extracts spatial and temporal features from the input data volume and then associates these features with the location-specific precipitations. In contrast to QPE methods based on the Z-R relation, we leverage the machine learning algorithms to automatically detect the evolution and movement of weather systems and associate these patterns to a location with specific topographic attributes. Specifically, we evaluated this framework with the hourly precipitation data of 45 weather stations in Taipei during 2013-2016. In comparison to the operational QPE scheme used by the Central Weather Bureau, the volume-to-point framework performed comparably well in general cases and excelled in detecting heavy-rainfall events. By using the current results as the reference benchmark, the proposed method can integrate the heterogeneous data sources and potentially improve the forecast in extreme precipitation scenarios.
Submitted 15 February, 2024;
originally announced February 2024.
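For readers unfamiliar with the Z-R baseline that the abstract above contrasts with, the following is a minimal sketch of a power-law Z-R conversion, Z = a * R^b. The coefficients a = 200 and b = 1.6 are the classic Marshall-Palmer values and are an assumption here; the abstract does not say which coefficients the operational scheme uses, and the function names are purely illustrative.

```python
import math


def dbz_to_rain_rate(dbz, a=200.0, b=1.6):
    """Invert Z = a * R**b to get rain rate R in mm/h from reflectivity in dBZ.

    a and b default to the Marshall-Palmer values (an assumption, not
    taken from the paper).
    """
    z_linear = 10.0 ** (dbz / 10.0)  # dBZ -> linear reflectivity Z (mm^6 m^-3)
    return (z_linear / a) ** (1.0 / b)


def rain_rate_to_dbz(rain_rate, a=200.0, b=1.6):
    """Forward relation: rain rate R (mm/h) -> reflectivity in dBZ."""
    return 10.0 * math.log10(a * rain_rate ** b)


if __name__ == "__main__":
    for dbz in (20, 30, 40, 50):
        print(f"{dbz} dBZ -> {dbz_to_rain_rate(dbz):.2f} mm/h")
```

A fixed power law of this kind maps each radar pixel to a rain rate independently, which is the limitation the volume-to-point framework is meant to address by learning from the full spatio-temporal data volume.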
-
Efficient Resource Scheduling for Distributed Infrastructures Using Negotiation Capabilities
Authors:
Junjie Chu,
Prashant Singh,
Salman Toor
Abstract:
In the past few decades, the rapid development of information and internet technologies has spawned massive amounts of data and information. The information explosion drives many enterprises or individuals to seek to rent cloud computing infrastructure to put their applications in the cloud. However, the agreements reached between cloud computing providers and clients are often not efficient. Many factors affect the efficiency, such as the idleness of the providers' cloud computing infrastructure, and the additional cost to the clients. One possible solution is to introduce a comprehensive bargaining game (a type of negotiation), and schedule resources according to the negotiation results. We propose an agent-based auto-negotiation system for resource scheduling based on fuzzy logic. The proposed method can complete a one-to-one auto-negotiation process and generate optimal offers for the provider and client. We compare the impact of different membership functions, fuzzy rule sets, and negotiation scenario cases on the offers to optimize the system. It can be concluded that our proposed method can utilize resources more efficiently and is interpretable, highly flexible, and customizable. We successfully train machine learning models to replace the fuzzy negotiation system to improve processing speed. The article also highlights possible future improvements to the proposed system and machine learning models. All the code and data are available in the open-source repository.
Submitted 13 February, 2024; v1 submitted 10 February, 2024;
originally announced February 2024.
-
Comprehensive Assessment of Jailbreak Attacks Against LLMs
Authors:
Junjie Chu,
Yugeng Liu,
Ziqing Yang,
Xinyue Shen,
Michael Backes,
Yang Zhang
Abstract:
Misuse of the Large Language Models (LLMs) has raised widespread concern. To address this issue, safeguards have been taken to ensure that LLMs align with social ethics. However, recent findings have revealed an unsettling vulnerability bypassing the safeguards of LLMs, known as jailbreak attacks. By applying techniques, such as employing role-playing scenarios, adversarial examples, or subtle subversion of safety objectives as a prompt, LLMs can produce an inappropriate or even harmful response. While researchers have studied several categories of jailbreak attacks, they have done so in isolation. To fill this gap, we present the first large-scale measurement of various jailbreak attack methods. We concentrate on 13 cutting-edge jailbreak methods from four categories, 160 questions from 16 violation categories, and six popular LLMs. Our extensive experimental results demonstrate that the optimized jailbreak prompts consistently achieve the highest attack success rates, as well as exhibit robustness across different LLMs. Some jailbreak prompt datasets, available from the Internet, can also achieve high attack success rates on many LLMs, such as ChatGLM3, GPT-3.5, and PaLM2. Despite the claims from many organizations regarding the coverage of violation categories in their policies, the attack success rates from these categories remain high, indicating the challenges of effectively aligning LLM policies and the ability to counter jailbreak attacks. We also discuss the trade-off between the attack performance and efficiency, as well as show that the transferability of the jailbreak prompts is still viable, becoming an option for black-box models. Overall, our research highlights the necessity of evaluating different jailbreak methods. We hope our study can provide insights for future research on jailbreak attacks and serve as a benchmark tool for evaluating them for practitioners.
Submitted 8 February, 2024;
originally announced February 2024.
-
The Eigenvalue Problem for the Complex Hessian Operator on $m$-Pseudoconvex Manifolds
Authors:
Jianchun Chu,
Yaxiong Liu,
Nicholas McCleerey
Abstract:
We establish $C^{1,1}$-regularity and uniqueness of the first eigenfunction of the complex Hessian operator on strongly $m$-pseudoconvex manifolds, along with a variational formula for the first eigenvalue. From these results, we derive a number of applications, including a bifurcation-type theorem and geometric bounds for the eigenvalue.
Submitted 5 February, 2024;
originally announced February 2024.
-
Conversation Reconstruction Attack Against GPT Models
Authors:
Junjie Chu,
Zeyang Sha,
Michael Backes,
Yang Zhang
Abstract:
In recent times, significant advancements have been made in the field of large language models (LLMs), represented by GPT series models. To optimize task execution, users often engage in multi-round conversations with GPT models hosted in cloud environments. These multi-round conversations, potentially replete with private information, require transmission and storage within the cloud. However, this operational paradigm introduces additional attack surfaces. In this paper, we first introduce a specific Conversation Reconstruction Attack targeting GPT models. Our introduced Conversation Reconstruction Attack is composed of two steps: hijacking a session and reconstructing the conversations. Subsequently, we offer an exhaustive evaluation of the privacy risks inherent in conversations when GPT models are subjected to the proposed attack. However, GPT-4 demonstrates certain robustness to the proposed attacks. We then introduce two advanced attacks aimed at better reconstructing previous conversations, specifically the UNR attack and the PBU attack. Our experimental findings indicate that the PBU attack yields substantial performance across all models, achieving semantic similarity scores exceeding 0.60, while the UNR attack is effective solely on GPT-3.5. Our results reveal the concern about privacy risks associated with conversations involving GPT models and aim to draw the community's attention to prevent the potential misuse of these models' remarkable capabilities. We will responsibly disclose our findings to the suppliers of related large language models.
Submitted 5 February, 2024;
originally announced February 2024.
-
From PARIS to LE-PARIS: Toward Patent Response Automation with Recommender Systems and Collaborative Large Language Models
Authors:
Jung-Mei Chu,
Hao-Cheng Lo,
Jieh Hsiang,
Chun-Chieh Cho
Abstract:
In patent prosecution, timely and effective responses to Office Actions (OAs) are crucial for securing patents. However, past automation and artificial intelligence research has largely overlooked this aspect. To bridge this gap, our study introduces the Patent Office Action Response Intelligence System (PARIS) and its advanced version, the Large Language Model (LLM) Enhanced PARIS (LE-PARIS). These systems are designed to enhance the efficiency of patent attorneys in handling OA responses through collaboration with AI. The systems' key features include the construction of an OA Topics Database, development of Response Templates, and implementation of Recommender Systems and LLM-based Response Generation. To validate the effectiveness of the systems, we have employed a multi-paradigm analysis using the USPTO Office Action database and longitudinal data based on attorney interactions with our systems over six years. Through five studies, we have examined the constructiveness of OA topics (studies 1 and 2) using topic modeling and our proposed Delphi process, the efficacy of our proposed hybrid LLM-based recommender system tailored for OA responses (study 3), the quality of generated responses (study 4), and the systems' practical value in real-world scenarios through user studies (study 5). The results indicate that both PARIS and LE-PARIS significantly achieve key metrics and have a positive impact on attorney performance.
Submitted 4 March, 2024; v1 submitted 1 February, 2024;
originally announced February 2024.
-
The rigidity of eigenvalues on Kähler manifolds with positive Ricci lower bound
Authors:
Jianchun Chu,
Feng Wang,
Kewei Zhang
Abstract:
In this work, optimal rigidity results for eigenvalues on Kähler manifolds with positive Ricci lower bound are established. More precisely, for those Kähler manifolds whose first eigenvalue agrees with the Ricci lower bound, we show that the complex projective space is the only one with the largest multiplicity of the first eigenvalue. Moreover, there is a specific gap between the largest and the second largest multiplicity. In the Kähler--Einstein case, almost rigidity results for eigenvalues are also obtained.
Submitted 28 January, 2024;
originally announced January 2024.
-
Quantum Oscillations Measurement of the Heavy Electron Mass near the van Hove Singularity in a Kagome Metal
Authors:
Elliott Rosenberg,
Jonathan DeStefano,
Yongbin Lee,
Chaowei Hu,
Yue Shi,
David Graf,
Shermane M. Benjamin,
Liqin Ke,
Jiun-Haw Chu
Abstract:
Kagome metals with the Fermi energy tuned near the van Hove singularities (vHss) have been shown to host exotic phases including unconventional superconductivity and a chiral flux phase arising from a charge density wave. However, most quantum oscillation studies of the electronic structure of kagome metals focus on compounds which electronically or magnetically order, obscuring the unperturbed vHs. Here we present quantum oscillation measurements of YV$_6$Sn$_6$, which contains a pristine kagome lattice free from long-range order. We discovered quantum oscillations corresponding to a large orbit ($\approx$70% of the Brillouin Zone area) with the heaviest mass ever observed in vanadium-based kagome metals ($\approx3.3 m_e$), consistent with a Fermi pocket whose Fermi level is near the vHs. Comparison with first-principles calculations suggests that the effective mass of this pocket is highly sensitive to the position of the Fermi level. Our study establishes the enhanced density of states associated with a vHs in a kagome metal, allowing further insight into a potential driving mechanism for the unconventional electronic orderings in this class of materials.
Submitted 26 January, 2024;
originally announced January 2024.
-
Robust Quantum Gates against Correlated Noise in Integrated Quantum Chips
Authors:
Kangyuan Yi,
Yong-Ju Hai,
Kai Luo,
Ji Chu,
Libo Zhang,
Yuxuan Zhou,
Yao Song,
Song Liu,
Tongxing Yan,
Xiu-Hao Deng,
Yuanzhen Chen,
Dapeng Yu
Abstract:
As quantum circuits become more integrated and complex, additional error sources that were previously insignificant start to emerge. Consequently, the fidelity of quantum gates benchmarked under pristine conditions falls short of predicting their performance in realistic circuits. To overcome this problem, we must improve their robustness against pertinent error models besides isolated fidelity. Here we report the experimental realization of robust quantum gates in superconducting quantum circuits based on a geometric framework for diagnosing and correcting various gate errors. Using quantum process tomography and randomized benchmarking, we demonstrate robust single-qubit gates against quasi-static noise and spatially-correlated noise in a broad range of strengths, which are common sources of coherent errors in large-scale quantum circuits. We also apply our method to non-static noise and to the realization of robust two-qubit gates. Our work provides a versatile toolbox for achieving noise-resilient complex quantum circuits.
Submitted 23 May, 2024; v1 submitted 3 January, 2024;
originally announced January 2024.
-
Free resolution of the logarithmic derivation modules of close to free arrangements
Authors:
Junyan Chu
Abstract:
This paper studies the algebraic structure of a new class of hyperplane arrangement $A$ obtained by deleting two hyperplanes from a free arrangement. We provide information on the minimal free resolutions of the logarithmic derivation module of $A$, which can be used to compute a lower bound for the graded Betti numbers of the resolution.
Specifically, for the three-dimensional case, we determine the minimal free resolution of the logarithmic derivation module of $A$. We present illustrative examples of our main theorems to provide insights into the relationship between algebraic and combinatorial properties for close-to-free arrangements.
Submitted 2 January, 2024;
originally announced January 2024.
-
Absence of Weyl nodes in EuCd$_2$As$_2$ revealed by the carrier density dependence of the anomalous Hall effect
Authors:
Yue Shi,
Zhaoyu Liu,
Logan A. Burnett,
Seokhyeong Lee,
Chaowei Hu,
Qianni Jiang,
Jiaqi Cai,
Xiaodong Xu,
Mo Li,
Cheng-Chien Chen,
Jiun-Haw Chu
Abstract:
The antiferromagnetic layered compound EuCd$_2$As$_2$ is widely considered a leading candidate for an ideal Weyl semimetal, featuring a single pair of Weyl nodes in its field-induced ferromagnetic (FM) state. Nevertheless, this view has recently been challenged by an optical spectroscopy study, which suggests that it is a magnetic semiconductor. In this study, we have successfully synthesized highly insulating EuCd$_2$As$_2$ crystals with carrier density reaching as low as $2\times 10^{15}$ $\text{cm}^{-3}$. The magneto-transport measurements revealed a progressive decrease of the anomalous Hall conductivity (AHC) by several orders of magnitude as the carrier density decreases. This behavior contradicts what is expected from the intrinsic AHC generated by the Weyl points, which is independent of carrier density as the Fermi level approaches the charge neutrality point. In contrast, the scaling relationship between AHC and longitudinal conductivity aligns with the characteristics of variable range hopping insulators. Our results suggest that EuCd$_2$As$_2$ is a magnetic semiconductor rather than a topological Weyl semimetal.
Submitted 27 February, 2024; v1 submitted 29 December, 2023;
originally announced January 2024.
-
Towards the Feynman rule for $n$-point gluon Mellin amplitudes in AdS/CFT
Authors:
Jinwei Chu,
Savan Kharel
Abstract:
We investigate the embedding formalism in conjunction with the Mellin transform to determine tree-level gluon amplitudes in AdS/CFT. Detailed computations of three to five-point correlators are conducted, ultimately distilling what were previously complex results for five-point correlators into a more succinct and comprehensible form. We then proceed to derive a recursion relation applicable to a specific class of $n$-point gluon amplitudes. This relation is instrumental in systematically constructing amplitudes for a range of topologies. We illustrate its efficacy by specifically computing six to eight-point functions. Despite the complexity encountered in the intermediate steps of the recursion, the higher-point correlator is succinctly expressed as a polynomial in boundary coordinates, upon which a specific differential operator acts. Remarkably, we observe that these amplitudes strikingly mirror their counterparts in flat space, traditionally computed using standard Feynman rules. This intriguing similarity has led us to propose a novel dictionary: comprehensive rules that bridge AdS Mellin amplitudes with flat-space gluon amplitudes.
Submitted 29 December, 2023;
originally announced January 2024.
-
Positive scalar curvature metrics and aspherical summands
Authors:
Shuli Chen,
Jianchun Chu,
Jintian Zhu
Abstract:
We prove for $n\in\{3,4,5\}$ that the connected sum of a closed aspherical $n$-manifold with an arbitrary non-compact manifold does not admit a complete metric with nonnegative scalar curvature. In particular, a special case of our result answers a question of Gromov.
More generally, we generalize the partial classification result of Chodosh, Li, and Liokumovich to the non-compact domination case with our newly-developed technique.
Our result unifies all previous results of this type, and confirms the validity of Gromov's non-compact domination conjecture for closed aspherical manifolds of dimensions 3, 4, and 5.
Submitted 31 March, 2024; v1 submitted 7 December, 2023;
originally announced December 2023.
-
On Time-Dependent Backgrounds In 1+1 Dimensional String Theory
Authors:
Bruno Balthazar,
Jinwei Chu,
David Kutasov
Abstract:
In perturbative string theory, one is generally interested in asymptotic observables, such as the S-matrix in flat spacetime, and boundary correlation functions in anti-de Sitter spacetime. However, there are backgrounds in which such observables do not exist. We study examples of such backgrounds in 1+1 dimensional string theory. In these examples, the Liouville wall accelerates and can become spacelike in the past and/or future. When that happens, the corresponding null infinity, at which the standard scattering states are defined, is shielded by the Liouville wall. We compute scattering and particle production amplitudes in these backgrounds in the region in parameter space where the wall remains timelike, and discuss the continuation of this picture to the spacelike regime. We also discuss the physics from the point of view of the dynamics of free fermions in backgrounds with a time-dependent Fermi surface.
Submitted 29 November, 2023;
originally announced November 2023.
-
The Graph Convolutional Network with Multi-representation Alignment for Drug Synergy Prediction
Authors:
Xinxing Yang,
Genke Yang,
Jian Chu
Abstract:
Drug combination refers to the use of two or more drugs to treat a specific disease at the same time. It is currently the mainstream way to treat complex diseases. Compared with single drugs, drug combinations have better efficacy and can better inhibit toxicity and drug resistance. The computational model based on deep learning concatenates the representation of multiple drugs and the corresponding cell line feature as input, and the output is whether the drug combination can have an inhibitory effect on the cell line. However, this strategy of concatenating multiple representations has the following defects: the alignment of drug representation and cell line representation is ignored, resulting in the synergistic relationship not being reflected positionally in the embedding space. Moreover, the alignment measurement function in deep learning cannot be suitable for drug synergy prediction tasks due to differences in input types. Therefore, in this work, we propose a graph convolutional network with multi-representation alignment (GCNMRA) for predicting drug synergy. In the GCNMRA model, we designed a multi-representation alignment function suitable for the drug synergy prediction task so that the positional relationship between drug representations and cell line representation is reflected in the embedding space. In addition, the vector modulus of drug representations and cell line representation is considered to improve the accuracy of calculation results and accelerate model convergence. Finally, many relevant experiments were run on multiple drug synergy datasets to verify the effectiveness of the above innovative elements and the excellence of the GCNMRA model.
Submitted 27 November, 2023;
originally announced November 2023.
-
Galaxy stellar and total mass estimation using machine learning
Authors:
Jiani Chu,
Hongming Tang,
Dandan Xu,
Shengdong Lu,
Richard Long
Abstract:
Conventional galaxy mass estimation methods suffer from model assumptions and degeneracies. Machine learning, which reduces the reliance on such assumptions, can be used to determine how well present-day observations can yield predictions for the distributions of stellar and dark matter. In this work, we use a general sample of galaxies from the TNG100 simulation to investigate the ability of multi-branch convolutional neural network (CNN) based machine learning methods to predict the central (i.e., within $1-2$ effective radii) stellar and total masses, and the stellar mass-to-light ratio $M_*/L$. These models take galaxy images and spatially-resolved mean velocity and velocity dispersion maps as inputs. Such CNN-based models can in general break the degeneracy between baryonic and dark matter in the sense that the model can make reliable predictions on the individual contributions of each component. For example, with $r$-band images and two galaxy kinematic maps as inputs, our model predicting $M_*/L$ has a prediction uncertainty of 0.04 dex. Moreover, to investigate which (global) features significantly contribute to the correct predictions of the properties above, we utilize a gradient boosting machine. We find that galaxy luminosity dominates the prediction of all masses in the central regions, with stellar velocity dispersion coming next. We also investigate the main contributing features when predicting stellar and dark matter mass fractions ($f_*$, $f_{\rm DM}$) and the dark matter mass $M_{DM}$, and discuss the underlying astrophysics.
Submitted 26 February, 2024; v1 submitted 17 November, 2023;
originally announced November 2023.
-
Slow Passage through a Saddle-Node Bifurcation in Discrete Dynamical Systems
Authors:
Jay Chu,
Jun-Jie Lin,
Je-Chiang Tsai
Abstract:
We study a discrete non-autonomous system whose autonomous counterpart (with the frozen bifurcation parameter) admits a saddle-node bifurcation, and in which the bifurcation parameter slowly changes in time and is characterized by a sweep rate constant $ε$. The discrete system is more appropriate for modeling realistic systems since only time series data is available. We show that in contrast to its autonomous counterpart, when the time mesh size $Δt$ is less than the order $O(ε)$, there is a bifurcation delay as the bifurcation time-varying parameter is varied through the bifurcation point, and the delay is proportional to the two-thirds power of the sweep rate constant $ε$. This bifurcation delay is significant in various realistic systems since it allows one to take necessary action promptly before a sudden collapse or shift to different states. On the other hand, when the time mesh size $Δt$ is larger than the order $o(ε)$, the dynamical behavior of the solution is dramatically changed before the bifurcation point. This behavior is not observed in the autonomous counterpart. Therefore, the dynamical behavior of the system strongly depends on the time mesh size. Finally, due to the discrete nature of the system, there are no efficient tools for the analytical study of the system. Our approach is elementary and analytical.
Submitted 13 November, 2023;
originally announced November 2023.
-
Mellin Amplitude for $n$-Gluon Scattering in Anti-de Sitter
Authors:
Jinwei Chu,
Savan Kharel
Abstract:
In AdS/CFT, we introduce a robust method for computing $n$-point gluon Mellin amplitudes, applicable in various spacetime dimensions. Using the Mellin transform and a recursive algorithm, we efficiently calculate tree-level gluon amplitudes. Our approach simplifies the representation of higher-point amplitudes, eliminating the need for complicated integrations. Crucially, the resulting amplitudes closely mirror those in flat space, allowing a straightforward dictionary between the two settings that circumvents explicit calculations.
Submitted 10 November, 2023;
originally announced November 2023.
-
Non-Fermi liquid behavior in a correlated flatband pyrochlore lattice
Authors:
Jianwei Huang,
Lei Chen,
Yuefei Huang,
Chandan Setty,
Bin Gao,
Yue Shi,
Zhaoyu Liu,
Yichen Zhang,
Turgut Yilmaz,
Elio Vescovo,
Makoto Hashimoto,
Donghui Lu,
Boris I. Yakobson,
Pengcheng Dai,
Jiun-Haw Chu,
Qimiao Si,
Ming Yi
Abstract:
Electronic correlation effects are manifested in quantum materials when either the onsite Coulomb repulsion is large or the electron kinetic energy is small. The former is the dominant effect in the cuprate superconductors or heavy fermion systems while the latter dominates in twisted bilayer graphene or geometrically frustrated metals. However, the simultaneous cooperation of both effects in the same quantum material--the design principle for producing correlated topological flat bands pinned at the Fermi level--remains rare. Here, using angle-resolved photoemission spectroscopy, we report the observation of a flat band at the Fermi level in a 3$d$ pyrochlore metal CuV$_2$S$_4$. From a combination of first-principles calculations and slave-spin calculations, we understand the origin of this band to be a destructive quantum-interference effect associated with the V pyrochlore sublattice and further renormalization to the Fermi level by electron interactions in the partially filled V $t_{2g}$ orbitals. As a result, we find transport behavior that indicates a deviation from Fermi-liquid behavior as well as a large Sommerfeld coefficient. Our work demonstrates the pathway into correlated topology by constructing and pinning correlated flat bands near the Fermi level out of a pure $d$-electron system by the combined cooperation of local Coulomb interactions and geometric frustration in a pyrochlore lattice system.
Submitted 2 November, 2023;
originally announced November 2023.
-
Advancing Bayesian Optimization via Learning Correlated Latent Space
Authors:
Seunghun Lee,
Jaewon Chu,
Sihyeon Kim,
Juyeon Ko,
Hyunwoo J. Kim
Abstract:
Bayesian optimization is a powerful method for optimizing black-box functions with limited function evaluations. Recent works have shown that optimization in a latent space through deep generative models such as variational autoencoders leads to effective and efficient Bayesian optimization for structured or discrete data. However, as the optimization does not take place in the input space, it leads to an inherent gap that results in potentially suboptimal solutions. To alleviate the discrepancy, we propose Correlated latent space Bayesian Optimization (CoBO), which focuses on learning correlated latent spaces characterized by a strong correlation between the distances in the latent space and the distances within the objective function. Specifically, our method introduces Lipschitz regularization, loss weighting, and trust region recoordination to minimize the inherent gap around the promising areas. We demonstrate the effectiveness of our approach on several optimization tasks in discrete data, such as molecule design and arithmetic expression fitting, and achieve high performance within a small budget.
Submitted 19 November, 2023; v1 submitted 31 October, 2023;
originally announced October 2023.
-
NuTrea: Neural Tree Search for Context-guided Multi-hop KGQA
Authors:
Hyeong Kyu Choi,
Seunghun Lee,
Jaewon Chu,
Hyunwoo J. Kim
Abstract:
Multi-hop Knowledge Graph Question Answering (KGQA) is a task that involves retrieving nodes from a knowledge graph (KG) to answer natural language questions. Recent GNN-based approaches formulate this task as a KG path searching problem, where messages are sequentially propagated from the seed node towards the answer nodes. However, these messages are past-oriented, and they do not consider the full KG context. To make matters worse, KG nodes often represent proper noun entities and are sometimes encrypted, being uninformative in selecting between paths. To address these problems, we propose Neural Tree Search (NuTrea), a tree search-based GNN model that incorporates the broader KG context. Our model adopts a message-passing scheme that probes the unreached subtree regions to boost the past-oriented embeddings. In addition, we introduce the Relation Frequency-Inverse Entity Frequency (RF-IEF) node embedding that considers the global KG context to better characterize ambiguous KG nodes. The general effectiveness of our approach is demonstrated through experiments on three major multi-hop KGQA benchmark datasets, and our extensive analyses further validate its expressiveness and robustness. Overall, NuTrea provides a powerful means to query the KG with complex natural language questions. Code is available at https://github.com/mlvlab/NuTrea.
Submitted 23 October, 2023;
originally announced October 2023.
-
UniParser: Multi-Human Parsing with Unified Correlation Representation Learning
Authors:
Jiaming Chu,
Lei Jin,
Junliang Xing,
Jian Zhao
Abstract:
Multi-human parsing is an image segmentation task necessitating both instance-level and fine-grained category-level information. However, prior research has typically processed these two types of information through separate branches and distinct output formats, leading to inefficient and redundant frameworks. This paper introduces UniParser, which integrates instance-level and category-level representations in three key aspects: 1) we propose a unified correlation representation learning approach, allowing our network to learn instance and category features within the cosine space; 2) we unify the form of outputs of each module as pixel-level segmentation results while supervising instance and category features using a homogeneous label accompanied by an auxiliary loss; and 3) we design a joint optimization procedure to fuse instance and category representations. By virtue of unifying instance-level and category-level output, UniParser circumvents manually designed post-processing techniques and surpasses state-of-the-art methods, achieving 49.3% AP on MHPv2.0 and 60.4% AP on CIHP. We will release our source code, pretrained models, and online demos to facilitate future studies.
Submitted 19 May, 2024; v1 submitted 13 October, 2023;
originally announced October 2023.
-
Characterization and Absolute Calibration of the Far Infrared Field Integral Line Spectrometer for SOFIA
Authors:
Dario Fadda,
Sebastian Colditz,
Christian Fischer,
William D. Vacca,
Jason Chu,
Melanie Clarke,
Randolf Klein,
Alfred Krabbe,
Robert Minchin,
Albrecht Poglitsch
Abstract:
We present the characterization and definitive flux calibration of the Far-Infrared Field Integral Line Spectrometer (FIFI-LS) instrument on-board SOFIA. The work is based on measurements made in the laboratory with an internal calibrator and on observations of planets, moons, and asteroids as absolute flux calibrators made during the entire lifetime of the instrument. We describe the techniques used to derive flat-fields, water vapor column estimates, detector linearity, spectral and spatial resolutions, and absolute flux calibration. Two sets of responses are presented, before and after the entrance filter window was changed in 2018 to improve the sensitivity at 52um, a wavelength range previously not covered by PACS on Herschel. The relative spectral response of each detector and the illumination pattern of the FIFI-LS arrays are derived using the internal calibrator before each observational series. The linearity of the array response is estimated by considering observations of bright sources. We find that the deviation from linearity of the FIFI-LS arrays affects the flux estimations by less than 1%. The flux calibration accuracy is estimated to be 15% or better across the entire wavelength range of the instrument. The limited availability of sky calibrators during each observational series is the major limiting factor of the flux calibration accuracy.
Submitted 2 October, 2023;
originally announced October 2023.
-
Absence of $E_{2g}$ nematic instability and dominant $A_{1g}$ response in the kagome metal CsV$_3$Sb$_5$
Authors:
Zhaoyu Liu,
Yue Shi,
Qianni Jiang,
Elliott W. Rosenberg,
Jonathan M. DeStefano,
Jinjin Liu,
Chaowei Hu,
Yuzhou Zhao,
Zhiwei Wang,
Yugui Yao,
David Graf,
Pengcheng Dai,
Jihui Yang,
Xiaodong Xu,
Jiun-Haw Chu
Abstract:
Ever since the discovery of the charge density wave (CDW) transition in the kagome metal CsV$_3$Sb$_5$, the nature of its symmetry breaking has been under intense debate. While evidence suggests that the rotational symmetry is already broken at the CDW transition temperature ($T_{\rm CDW}$), an additional electronic nematic instability well below $T_{\rm CDW}$ has been reported based on the diverging elastoresistivity coefficient in the anisotropic channel ($m_{E_{2g}}$). Verifying the existence of a nematic transition below $T_{\rm CDW}$ is not only critical for establishing the correct description of the CDW order parameter, but also important for understanding low-temperature superconductivity. Here, we report elastoresistivity measurements of CsV$_3$Sb$_5$ using three different techniques probing both isotropic and anisotropic symmetry channels. Contrary to previous reports, we find the anisotropic elastoresistivity coefficient $m_{E_{2g}}$ is temperature-independent, except for a step jump at $T_{\rm CDW}$. The absence of nematic fluctuations is further substantiated by measurements of the elastocaloric effect, which show no enhancement associated with nematic susceptibility. On the other hand, the symmetric elastoresistivity coefficient $m_{A_{1g}}$ increases below $T_{\rm CDW}$, reaching a peak value of 90 at $T^* = 20$ K. Our results strongly indicate that the phase transition at $T^*$ is not nematic in nature and the previously reported diverging elastoresistivity is due to the contamination from the $A_{1g}$ channel.
Submitted 1 July, 2024; v1 submitted 25 September, 2023;
originally announced September 2023.
-
Enhanced C-V2X Mode 4 to Optimize Age of Information and Reliability for IoV
Authors:
Jiahou Chu,
Qiong Wu,
Qiang Fan,
Zhengquan Li
Abstract:
The Internet of vehicles (IoV) has emerged as a key technology for realizing real-time vehicular applications. In IoV, vehicles adopt the cellular vehicle-to-everything (C-V2X) standard to support direct communication among them. C-V2X mode 4 controls resource allocation without the assistance of a cellular network, hence it is widely used for IoV. However, C-V2X mode 4 has two drawbacks. First, in some cases vehicles cannot communicate with each other for a period, which increases the age of information (AoI); second, vehicles may select resources already occupied by others, which degrades reliability. To address the two drawbacks, we propose an enhanced C-V2X mode 4 to optimize AoI and reliability. In addition, we consider the fact that for most vehicular applications, each vehicle periodically requires fresh information from vehicles within a certain distance, and propose a new performance metric to evaluate the system AoI for IoV. Furthermore, we construct a platform by integrating SUMO and NS3. We demonstrate the superiority of the enhanced C-V2X mode 4 based on this simulation platform.
Submitted 19 September, 2023;
originally announced September 2023.
-
Observation of flat and weakly dispersing bands in a van der Waals semiconductor Nb3Br8 with breathing kagome lattice
Authors:
Sabin Regmi,
Anup Pradhan Sakhya,
Tharindu Fernando,
Yuzhou Zhao,
Dylan Jeff,
Milo Sprague,
Favian Gonzalez,
Iftakhar Bin Elius,
Mazharul Islam Mondal,
Nathan Valadez,
Damani Jarrett,
Alexis Agosto,
Jihui Yang,
Jiun-Haw Chu,
Saiful I. Khondaker,
Xiaodong Xu,
Ting Cao,
Madhab Neupane
Abstract:
Niobium halides, Nb3X8 (X = Cl,Br,I), which are predicted two-dimensional magnets, have recently gotten attention due to their breathing kagome geometry. Here, we have studied the electronic structure of Nb3Br8 by using angle-resolved photoemission spectroscopy (ARPES) and first-principles calculations. ARPES results depict the presence of multiple flat and weakly dispersing bands. These bands are well explained by the theoretical calculations, which show they have Nb d character indicating their origination from the Nb atoms forming the breathing kagome plane. This van der Waals material can be easily thinned down via mechanical exfoliation to the ultrathin limit and such ultrathin samples are stable as depicted from the time-dependent Raman spectroscopy measurements at room temperature. These results demonstrate that Nb3Br8 is an excellent material not only for studying breathing kagome induced flat band physics and its connection with magnetism, but also for heterostructure fabrication for application purposes.
Submitted 9 September, 2023;
originally announced September 2023.
-
Learning Spiking Neural Network from Easy to Hard task
Authors:
Lingling Tang,
Jiangtao Hu,
Hua Yu,
Surui Liu,
Jielei Chu
Abstract:
Starting with small and simple concepts, and gradually introducing complex and difficult concepts is the natural process of human learning. Spiking Neural Networks (SNNs) aim to mimic the way humans process information, but current SNN models treat all samples equally, which does not align with the principles of human learning and overlooks the biological plausibility of SNNs. To address this, we propose a CL-SNN model that introduces Curriculum Learning (CL) into SNNs, making SNNs learn more like humans and providing higher biological interpretability. CL is a training strategy that advocates presenting easier data to models before gradually introducing more challenging data, mimicking the human learning process. We use a confidence-aware loss to measure and process the samples with different difficulty levels. By learning the confidence of different samples, the model reduces the contribution of difficult samples to parameter optimization automatically. We conducted experiments on static image datasets MNIST, Fashion-MNIST, CIFAR10, and neuromorphic datasets N-MNIST, CIFAR10-DVS, DVS-Gesture. The results are promising. To the best of our knowledge, this is the first proposal to enhance the biological plausibility of SNNs by introducing CL.
Submitted 25 September, 2023; v1 submitted 9 September, 2023;
originally announced September 2023.
-
Open-vocabulary Video Question Answering: A New Benchmark for Evaluating the Generalizability of Video Question Answering Models
Authors:
Dohwan Ko,
Ji Soo Lee,
Miso Choi,
Jaewon Chu,
Jihwan Park,
Hyunwoo J. Kim
Abstract:
Video Question Answering (VideoQA) is a challenging task that entails complex multi-modal reasoning. In contrast to multiple-choice VideoQA which aims to predict the answer given several options, the goal of open-ended VideoQA is to answer questions without restricting candidate answers. However, the majority of previous VideoQA models formulate open-ended VideoQA as a classification task to classify the video-question pairs into a fixed answer set, i.e., closed-vocabulary, which contains only frequent answers (e.g., top-1000 answers). This leads the model to be biased toward only frequent answers and fail to generalize on out-of-vocabulary answers. We hence propose a new benchmark, Open-vocabulary Video Question Answering (OVQA), to measure the generalizability of VideoQA models by considering rare and unseen answers. In addition, in order to improve the model's generalization power, we introduce a novel GNN-based soft verbalizer that enhances the prediction on rare and unseen answers by aggregating the information from their similar words. For evaluation, we introduce new baselines by modifying the existing (closed-vocabulary) open-ended VideoQA models and improve their performances by further taking into account rare and unseen answers. Our ablation studies and qualitative analyses demonstrate that our GNN-based soft verbalizer further improves the model performance, especially on rare and unseen answers. We hope that our benchmark OVQA can serve as a guide for evaluating the generalizability of VideoQA models and inspire future research. Code is available at https://github.com/mlvlab/OVQA.
Submitted 18 August, 2023;
originally announced August 2023.
-
Strain Tuning Three-state Potts Nematicity in a Correlated Antiferromagnet
Authors:
Kyle Hwangbo,
Elliott Rosenberg,
John Cenker,
Qianni Jiang,
Haidan Wen,
Di Xiao,
Jiun-Haw Chu,
Xiaodong Xu
Abstract:
Electronic nematicity, a state in which rotational symmetry is spontaneously broken, has become a familiar characteristic of many strongly correlated materials. One widely studied example is the discovered Ising-nematicity and its interplay with superconductivity in tetragonal iron pnictides. Since nematic directors in crystalline solids are restricted by the underlying crystal symmetry, recently identified quantum material systems with three-fold rotational (C$_3$) symmetry offer a new platform to investigate nematic order with three-state Potts character. Here, we report reversible strain control of the three-state Potts nematicity in a zigzag antiferromagnetic insulator, FePSe$_3$. Probing the nematicity via optical linear dichroism, we demonstrate either $2π/3$ or $π/2$ rotation of the nematic director by uniaxial strain. The nature of the nematic phase transition can also be controlled such that it undergoes a smooth crossover transition, a Potts nematic transition, or an Ising nematic flop transition. Further elastocaloric measurements demonstrate signatures of two coupled phase transitions, indicating that the nematic phase is a vestigial order arising from the antiferromagnetism. The ability to tune the nematic order with in-situ strain further enables the extraction of nematic susceptibility, which exhibits a divergent behavior near the magnetic ordering temperature that is corroborated with both linear dichroism and elastocaloric measurements. Our work points to an active control approach to manipulate and explore nematicity in three-state Potts correlated materials.
Submitted 17 January, 2024; v1 submitted 16 August, 2023;
originally announced August 2023.
-
Observation of Fractionally Quantized Anomalous Hall Effect
Authors:
Heonjoon Park,
Jiaqi Cai,
Eric Anderson,
Yinong Zhang,
Jiayi Zhu,
Xiaoyu Liu,
Chong Wang,
William Holtzmann,
Chaowei Hu,
Zhaoyu Liu,
Takashi Taniguchi,
Kenji Watanabe,
Jiun-haw Chu,
Ting Cao,
Liang Fu,
Wang Yao,
Cui-Zu Chang,
David Cobden,
Di Xiao,
Xiaodong Xu
Abstract:
The integer quantum anomalous Hall (QAH) effect is a lattice analog of the quantum Hall effect at zero magnetic field. This striking transport phenomenon occurs in electronic systems with topologically nontrivial bands and spontaneous time-reversal symmetry breaking. Discovery of its putative fractional counterpart in the presence of strong electron correlations, i.e., the fractional quantum anomalous Hall (FQAH) effect, would open a new chapter in condensed matter physics. Here, we report the direct observation of both integer and fractional QAH effects in electrical measurements on twisted bilayer MoTe$_2$. At zero magnetic field, near filling factor $ν= -1$ (one hole per moiré unit cell) we see an extended integer QAH plateau in the Hall resistance $R_\text{xy}$ that is quantized to $h/e^2 \pm 0.1 \%$ while the longitudinal resistance $R_\text{xx}$ vanishes. Remarkably, at $ν=-2/3$ and $-3/5$ we see plateau features in $R_\text{xy}$ at $3h/2e^2 \pm 1\%$ and $5h/3e^2 \pm 3\%$, respectively, while $R_\text{xx}$ remains small. All these features shift linearly in an applied magnetic field with slopes matching the corresponding Chern numbers $-1$, $-2/3$, and $-3/5$, precisely as expected for integer and fractional QAH states. In addition, at zero magnetic field, $R_\text{xy}$ is approximately $2h/e^2$ near half filling ($ν= -1/2$) and varies linearly as $ν$ is tuned. This behavior resembles that of the composite Fermi liquid in the half-filled lowest Landau level of a two-dimensional electron gas at high magnetic field. Direct observation of the FQAH and associated effects paves the way for researching charge fractionalization and anyonic statistics at zero magnetic field.
Submitted 4 August, 2023;
originally announced August 2023.
-
GraphCL-DTA: a graph contrastive learning with molecular semantics for drug-target binding affinity prediction
Authors:
Xinxing Yang,
Genke Yang,
Jian Chu
Abstract:
Drug-target binding affinity prediction plays an important role in the early stages of drug discovery, as it can infer the strength of interactions between new drugs and new targets. However, the performance of previous computational models is limited by the following drawbacks. The learning of drug representation relies only on supervised data, without taking into account the information contained in the molecular graph itself. Moreover, most previous studies tended to design complicated representation learning modules, while uniformity, which is used to measure representation quality, is ignored. In this study, we propose GraphCL-DTA, a graph contrastive learning with molecular semantics for drug-target binding affinity prediction. In GraphCL-DTA, we design a graph contrastive learning framework for molecular graphs to learn drug representations, so that the semantics of molecular graphs are preserved. Through this graph contrastive framework, a more essential and effective drug representation can be learned without additional supervised data. Next, we design a new loss function that can be directly used to smoothly adjust the uniformity of drug and target representations. By directly optimizing the uniformity of representations, the representation quality of drugs and targets can be improved. The effectiveness of the above innovative elements is verified on two real datasets, KIBA and Davis. The excellent performance of GraphCL-DTA on the above datasets suggests its superiority to the state-of-the-art model.
Submitted 18 July, 2023;
originally announced July 2023.