Showing 1–50 of 821 results for author: Wu, R

  1. arXiv:2407.10737  [pdf, other]

    cs.CV cs.AI

    Aligning Neuronal Coding of Dynamic Visual Scenes with Foundation Vision Models

    Authors: Rining Wu, Feixiang Zhou, Ziwei Yin, Jian K. Liu

    Abstract: Our brains represent the ever-changing environment with neurons in a highly dynamic fashion. The temporal features of visual pixels in dynamic natural scenes are entrapped in the neuronal responses of the retina. It is crucial to establish the intrinsic temporal relationship between visual pixels and neuronal responses. Recent foundation vision models have paved an advanced way of understanding im…

    Submitted 15 July, 2024; originally announced July 2024.

    Comments: This article was accepted by ECCV 2024 (paper ID 12149). Accepted paper IDs can be found at: https://eccv2024.ecva.net/Conferences/2024/AcceptedPapers

  2. arXiv:2407.06172  [pdf, other]

    cs.AI cs.CL

    On Speeding Up Language Model Evaluation

    Authors: Jin Peng Zhou, Christian K. Belardi, Ruihan Wu, Travis Zhang, Carla P. Gomes, Wen Sun, Kilian Q. Weinberger

    Abstract: Large language models (LLMs) currently dominate the field of natural language processing (NLP), representing the state-of-the-art across a diverse array of tasks. Developing a model of this nature, from training to inference, requires making numerous decisions which define a combinatorial search problem. For example, selecting the optimal pre-trained LLM, prompt, or hyperparameters to attain the b…

    Submitted 8 July, 2024; originally announced July 2024.

  3. arXiv:2407.05282  [pdf, other]

    cs.CV

    UltraEdit: Instruction-based Fine-Grained Image Editing at Scale

    Authors: Haozhe Zhao, Xiaojian Ma, Liang Chen, Shuzheng Si, Rujie Wu, Kaikai An, Peiyu Yu, Minjia Zhang, Qing Li, Baobao Chang

    Abstract: This paper presents UltraEdit, a large-scale (approximately 4 million editing samples), automatically generated dataset for instruction-based image editing. Our key idea is to address the drawbacks in existing image editing datasets like InstructPix2Pix and MagicBrush, and provide a systematic approach to producing massive and high-quality image editing samples. UltraEdit offers several distinct a…

    Submitted 7 July, 2024; originally announced July 2024.

    Comments: 32 pages, 14 figures

  4. arXiv:2407.05149  [pdf]

    physics.bio-ph physics.app-ph physics.chem-ph physics.optics

    Quantized Acoustic Phonons Map the Dynamics of a Single Virus

    Authors: Yaqing Zhang, Rihan Wu, Md Shahjahan, Canchai Yang, Dohun Pyeon, Elad Harel

    Abstract: The natural vibrational frequencies of biological particles such as viruses and bacteria encode critical information about their mechanical and biological states as they interact with their local environment and undergo structural evolution. However, detecting and tracking these vibrations within a biological context at the single particle level has remained elusive. In this study, we track the vi…

    Submitted 6 July, 2024; originally announced July 2024.

    Comments: Main Manuscript: 19 pages, 4 figures; Supplementary Information: 29 pages, 17 figures

  5. arXiv:2407.04346  [pdf]

    cs.CV

    MobileFlow: A Multimodal LLM For Mobile GUI Agent

    Authors: Songqin Nong, Jiali Zhu, Rui Wu, Jiongchao Jin, Shuo Shan, Xiutian Huang, Wenhao Xu

    Abstract: Currently, the integration of mobile Graphical User Interfaces (GUIs) is ubiquitous in most people's daily lives. The ongoing evolution of multimodal large-scale models, such as GPT-4v and Qwen-VL-Max, has significantly bolstered the capabilities of GUI comprehension and user action analysis, showcasing the potential of intelligent GUI assistants. However, current GUI Agents often need to acce…

    Submitted 5 July, 2024; originally announced July 2024.

  6. arXiv:2407.03951  [pdf, other]

    cs.LG

    Uncertainty-Guided Optimization on Large Language Model Search Trees

    Authors: Julia Grosse, Ruotian Wu, Ahmad Rashid, Philipp Hennig, Pascal Poupart, Agustinus Kristiadi

    Abstract: Beam search is a standard tree search algorithm when it comes to finding sequences of maximum likelihood, for example, in the decoding processes of large language models. However, it is myopic since it does not take the whole path from the root to a leaf into account. Moreover, it is agnostic to prior knowledge available about the process: For example, it does not consider that the objective being…

    Submitted 4 July, 2024; originally announced July 2024.

    Comments: 10 pages

  7. arXiv:2407.01649  [pdf, other]

    q-bio.QM cs.LG

    FAFE: Immune Complex Modeling with Geodesic Distance Loss on Noisy Group Frames

    Authors: Ruidong Wu, Ruihan Guo, Rui Wang, Shitong Luo, Yue Xu, Jiahan Li, Jianzhu Ma, Qiang Liu, Yunan Luo, Jian Peng

    Abstract: Despite the striking success of general protein folding models such as AlphaFold2 (AF2; Jumper et al., 2021), the accurate computational modeling of antibody-antigen complexes remains a challenging task. In this paper, we first analyze AF2's primary loss function, known as the Frame Aligned Point Error (FAPE), and raise a previously overlooked issue that FAPE tends to face gradient vanishing probl…

    Submitted 1 July, 2024; originally announced July 2024.

  8. arXiv:2406.17248  [pdf, other]

    quant-ph

    MindSpore Quantum: A User-Friendly, High-Performance, and AI-Compatible Quantum Computing Framework

    Authors: Xusheng Xu, Jiangyu Cui, Zidong Cui, Runhong He, Qingyu Li, Xiaowei Li, Yanling Lin, Jiale Liu, Wuxin Liu, Jiale Lu, Maolin Luo, Chufan Lyu, Shijie Pan, Mosharev Pavel, Runqiu Shu, Jialiang Tang, Ruoqian Xu, Shu Xu, Kang Yang, Fan Yu, Qingguo Zeng, Haiying Zhao, Qiang Zheng, Junyuan Zhou, Xu Zhou, et al. (14 additional authors not shown)

    Abstract: We introduce MindSpore Quantum, a pioneering hybrid quantum-classical framework with a primary focus on the design and implementation of noisy intermediate-scale quantum (NISQ) algorithms. Leveraging the robust support of MindSpore, an advanced open-source deep learning training/inference framework, MindSpore Quantum exhibits exceptional efficiency in the design and training of variational quantum…

    Submitted 10 July, 2024; v1 submitted 24 June, 2024; originally announced June 2024.

  9. arXiv:2406.15774  [pdf, other]

    cs.RO

    Observation Time Difference: an Online Dynamic Objects Removal Method for Ground Vehicles

    Authors: Rongguang Wu, Chenglin Pang, Xuankang Wu, Zheng Fang

    Abstract: In the process of urban environment mapping, the sequential accumulation of dynamic objects leaves a large number of traces in the map. These traces usually degrade the localization accuracy and navigation performance of the robot. Therefore, dynamic object removal plays an important role in creating a clean map. However, conventional dynamic object removal methods usuall…

    Submitted 22 June, 2024; originally announced June 2024.

  10. arXiv:2406.14288  [pdf, other]

    cs.LG cs.AI

    Revisiting Modularity Maximization for Graph Clustering: A Contrastive Learning Perspective

    Authors: Yunfei Liu, Jintang Li, Yuehe Chen, Ruofan Wu, Ericbk Wang, Jing Zhou, Sheng Tian, Shuheng Shen, Xing Fu, Changhua Meng, Weiqiang Wang, Liang Chen

    Abstract: Graph clustering, a fundamental and challenging task in graph mining, aims to classify nodes in a graph into several disjoint clusters. In recent years, graph contrastive learning (GCL) has emerged as a dominant line of research in graph clustering and advances the new state-of-the-art. However, GCL-based methods heavily rely on graph augmentations and contrastive schemes, which may potentially in…

    Submitted 20 June, 2024; originally announced June 2024.

    Comments: KDD 2024 research track. Code available at https://github.com/EdisonLeeeee/MAGI

  11. arXiv:2406.12316  [pdf, other]

    cs.CV cs.AI cs.MM

    Enhancing Visible-Infrared Person Re-identification with Modality- and Instance-aware Visual Prompt Learning

    Authors: Ruiqi Wu, Bingliang Jiao, Wenxuan Wang, Meng Liu, Peng Wang

    Abstract: The Visible-Infrared Person Re-identification (VI ReID) aims to match visible and infrared images of the same pedestrians across non-overlapped camera views. These two input modalities contain both invariant information, such as shape, and modality-specific details, such as color. An ideal model should utilize valuable information from both modalities during training for enhanced representational…

    Submitted 18 June, 2024; originally announced June 2024.

    Comments: Accepted by ACM International Conference on Multimedia Retrieval (ICMR'24)

    Journal ref: ICMR'24: Proceedings of the 2024 International Conference on Multimedia Retrieval (2024) 579 - 588

  12. arXiv:2406.11810  [pdf, ps, other]

    cs.LG cs.RO eess.SY

    Computationally Efficient RL under Linear Bellman Completeness for Deterministic Dynamics

    Authors: Runzhe Wu, Ayush Sekhari, Akshay Krishnamurthy, Wen Sun

    Abstract: We study computationally and statistically efficient Reinforcement Learning algorithms for the linear Bellman Complete setting, a setting that uses linear function approximation to capture value functions and unifies existing models like linear Markov Decision Processes (MDP) and Linear Quadratic Regulators (LQR). While it is known from the prior works that this setting is statistically tractable,…

    Submitted 17 June, 2024; originally announced June 2024.

  13. arXiv:2406.08698  [pdf, other]

    astro-ph.HE hep-ph

    Constraints on Ultra Heavy Dark Matter Properties from Dwarf Spheroidal Galaxies with LHAASO Observations

    Authors: Zhen Cao, F. Aharonian, Q. An, Axikegu, Y. X. Bai, Y. W. Bao, D. Bastieri, X. J. Bi, Y. J. Bi, J. T. Cai, Q. Cao, W. Y. Cao, Zhe Cao, J. Chang, J. F. Chang, A. M. Chen, E. S. Chen, Liang Chen, Lin Chen, Long Chen, M. J. Chen, M. L. Chen, Q. H. Chen, S. H. Chen, S. Z. Chen, et al. (255 additional authors not shown)

    Abstract: In this work we search for signals generated by ultra-heavy dark matter in data from the Large High Altitude Air Shower Observatory (LHAASO). We look for possible gamma rays from dark matter annihilation or decay in 16 dwarf spheroidal galaxies in the field of view of LHAASO. Dwarf spheroidal galaxies are among the most promising targets for indirect detection of dark matter which have low fluxes…

    Submitted 12 June, 2024; originally announced June 2024.

    Comments: 17 pages, 12 figures, accepted by PRL

  14. arXiv:2406.08177  [pdf, other]

    eess.IV cs.CV

    One-Step Effective Diffusion Network for Real-World Image Super-Resolution

    Authors: Rongyuan Wu, Lingchen Sun, Zhiyuan Ma, Lei Zhang

    Abstract: The pre-trained text-to-image diffusion models have been increasingly employed to tackle the real-world image super-resolution (Real-ISR) problem due to their powerful generative image priors. Most of the existing methods start from random noise to reconstruct the high-quality (HQ) image under the guidance of the given low-quality (LQ) image. While promising results have been achieved, such Real-…

    Submitted 14 June, 2024; v1 submitted 12 June, 2024; originally announced June 2024.

  15. arXiv:2406.07928  [pdf, other]

    cs.RO

    Undergraduate Robotics Education with General Instructors using a Student-Centered Personalized Learning Framework

    Authors: Rui Wu, David J Feil-Seifer, Ponkoj C Shill, Hossein Jamali, Sergiu Dascalu, Fred Harris, Laura Rosof, Bryan Hutchins, Marjorie Campo Ringler, Zhen Zhu

    Abstract: Recent advancements in robotics, including applications like self-driving cars, unmanned systems, and medical robots, have had a significant impact on the job market. On one hand, big robotics companies offer training programs based on the job requirements. However, these training programs may not be as beneficial as general robotics programs offered by universities or community colleges. On the o…

    Submitted 12 June, 2024; originally announced June 2024.

    Comments: 11 pages, 3 figures, 1 table, 2024 ASEE Conference

  16. arXiv:2406.07780  [pdf, other]

    cs.LG cs.CL

    A Critical Look At Tokenwise Reward-Guided Text Generation

    Authors: Ahmad Rashid, Ruotian Wu, Julia Grosse, Agustinus Kristiadi, Pascal Poupart

    Abstract: Large language models (LLMs) can be significantly improved by aligning to human preferences -- the so-called reinforcement learning from human feedback (RLHF). However, the cost of fine-tuning an LLM is prohibitive for many users. Due to their ability to bypass LLM finetuning, tokenwise reward-guided text generation (RGTG) methods have recently been proposed. They use a reward model trained on ful…

    Submitted 11 June, 2024; originally announced June 2024.

  17. arXiv:2406.06612  [pdf, other]

    cs.CV cs.LG cs.SD eess.AS

    SEE-2-SOUND: Zero-Shot Spatial Environment-to-Spatial Sound

    Authors: Rishit Dagli, Shivesh Prakash, Robert Wu, Houman Khosravani

    Abstract: Generating combined visual and auditory sensory experiences is critical for the consumption of immersive content. Recent advances in neural generative models have enabled the creation of high-resolution content across multiple modalities such as images, text, speech, and videos. Despite these successes, there remains a significant gap in the generation of high-quality spatial audio that complement…

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: Project Page: https://see2sound.github.io/

  18. arXiv:2406.05917  [pdf, other]

    econ.GN

    China's Rising Leadership in Global Science

    Authors: Renli Wu, Christopher Esposito, James Evans

    Abstract: Major shifts in the global system of science and technology are destabilizing the global status order and demonstrating the capacity for emerging countries like China and India to exert greater influence. In order to measure changes in the global scientific system, we develop a framework to assess the hierarchical position of countries in the international scientific collaboration network. Using a…

    Submitted 9 June, 2024; originally announced June 2024.

  19. arXiv:2406.04002  [pdf, other]

    cs.CV

    3rd Place Solution for PVUW Challenge 2024: Video Panoptic Segmentation

    Authors: Ruipu Wu, Jifei Che, Han Li, Chengjing Wu, Ting Liu, Luoqi Liu

    Abstract: Video panoptic segmentation is an advanced task that extends panoptic segmentation by applying its concept to video sequences. In the hope of addressing the challenge of video panoptic segmentation in diverse conditions, we utilize DVIS++ as our baseline model and enhance it by introducing a comprehensive approach centered on the query-wise ensemble, supplemented by additional techniques. Our prop…

    Submitted 6 June, 2024; v1 submitted 6 June, 2024; originally announced June 2024.

    Comments: 3rd Place Solution for CVPR 2024 PVUW VPS Track

  20. arXiv:2406.03835  [pdf, other]

    cs.CV cs.RO

    Monocular Localization with Semantics Map for Autonomous Vehicles

    Authors: Jixiang Wan, Xudong Zhang, Shuzhou Dong, Yuwei Zhang, Yuchen Yang, Ruoxi Wu, Ye Jiang, Jijunnan Li, Jinquan Lin, Ming Yang

    Abstract: Accurate and robust localization remains a significant challenge for autonomous vehicles. The cost of sensors and limitations in local computational efficiency make it difficult to scale to large commercial applications. Traditional vision-based approaches focus on texture features that are susceptible to changes in lighting, season, perspective, and appearance. Additionally, the large storage siz…

    Submitted 6 June, 2024; originally announced June 2024.

  21. arXiv:2406.02976  [pdf, other]

    cs.CV cs.AI

    DA-Flow: Dual Attention Normalizing Flow for Skeleton-based Video Anomaly Detection

    Authors: Ruituo Wu, Yang Chen, Jian Xiao, Bing Li, Jicong Fan, Frédéric Dufaux, Ce Zhu, Yipeng Liu

    Abstract: Cooperation between temporal convolutional networks (TCN) and graph convolutional networks (GCN) as a processing module has shown promising results in skeleton-based video anomaly detection (SVAD). However, to maintain a lightweight model with low computational and storage complexity, shallow GCN and TCN blocks are constrained by small receptive fields and a lack of cross-dimension interaction cap…

    Submitted 5 June, 2024; originally announced June 2024.

  22. arXiv:2406.02283  [pdf, other]

    cs.RO

    Broadcasting Support Relations Recursively from Local Dynamics for Object Retrieval in Clutters

    Authors: Yitong Li, Ruihai Wu, Haoran Lu, Chuanruo Ning, Yan Shen, Guanqi Zhan, Hao Dong

    Abstract: In our daily life, cluttered objects are everywhere, from scattered stationery and books cluttering the table to bowls and plates filling the kitchen sink. Retrieving a target object from clutter is an essential yet challenging skill for robots, owing to the difficulty of safely manipulating an object without disturbing others, which requires the robot to plan a manipulation sequence and first move…

    Submitted 4 June, 2024; originally announced June 2024.

    Comments: RSS 2024

  23. arXiv:2406.01007  [pdf, other]

    hep-ex

    Measurement of Electron Antineutrino Oscillation Amplitude and Frequency via Neutron Capture on Hydrogen at Daya Bay

    Authors: Daya Bay collaboration, F. P. An, W. D. Bai, A. B. Balantekin, M. Bishai, S. Blyth, G. F. Cao, J. Cao, J. F. Chang, Y. Chang, H. S. Chen, H. Y. Chen, S. M. Chen, Y. Chen, Y. X. Chen, Z. Y. Chen, J. Cheng, J. Cheng, Y. -C. Cheng, Z. K. Cheng, J. J. Cherwinka, M. C. Chu, J. P. Cummings, O. Dalager, F. S. Deng, et al. (177 additional authors not shown)

    Abstract: This Letter reports the first measurement of the oscillation amplitude and frequency of reactor antineutrinos at Daya Bay via neutron capture on hydrogen using 1958 days of data. With over 3.6 million signal candidates, an optimized candidate selection, improved treatment of backgrounds and efficiencies, refined energy calibration, and an energy response model for the capture-on-hydrogen sensitive…

    Submitted 3 June, 2024; originally announced June 2024.

  24. arXiv:2406.00943  [pdf, other]

    cs.LG cs.AI

    State Space Models on Temporal Graphs: A First-Principles Study

    Authors: Jintang Li, Ruofan Wu, Xinzhou Jin, Boqun Ma, Liang Chen, Zibin Zheng

    Abstract: Over the past few years, research on deep graph learning has shifted from static graphs to temporal graphs in response to real-world complex systems that exhibit dynamic behaviors. In practice, temporal graphs are formalized as an ordered sequence of static graph snapshots observed at discrete time points. Sequence models such as RNNs or Transformers have long been the predominant backbone network…

    Submitted 2 June, 2024; originally announced June 2024.

    Comments: Preprint; Code will be made available at https://github.com/EdisonLeeeee/GraphSSM

  25. arXiv:2405.19207  [pdf]

    cs.IR cs.AI

    A Multi-Source Retrieval Question Answering Framework Based on RAG

    Authors: Ridong Wu, Shuhong Chen, Xiangbiao Su, Yuankai Zhu, Yifei Liao, Jianming Wu

    Abstract: With the rapid development of large-scale language models, Retrieval-Augmented Generation (RAG) has been widely adopted. However, existing RAG paradigms are inevitably influenced by erroneous retrieval information, thereby reducing the reliability and correctness of generated results. Therefore, to improve the relevance of retrieval information, this study proposes a method that replaces tradition…

    Submitted 29 May, 2024; originally announced May 2024.

    Comments: 4 pages, 3 figures

  26. arXiv:2405.19005  [pdf, other]

    cs.CV

    Auto-selected Knowledge Adapters for Lifelong Person Re-identification

    Authors: Xuelin Qian, Ruiqi Wu, Gong Cheng, Junwei Han

    Abstract: Lifelong Person Re-Identification (LReID) extends traditional ReID by requiring systems to continually learn from non-overlapping datasets across different times and locations, adapting to new identities while preserving knowledge of previous ones. Existing approaches, either rehearsal-free or rehearsal-based, still suffer from the problem of catastrophic forgetting since they try to cram diverse…

    Submitted 30 May, 2024; v1 submitted 29 May, 2024; originally announced May 2024.

  27. arXiv:2405.18334  [pdf, other]

    cs.DB cs.CV cs.LG

    SketchQL Demonstration: Zero-shot Video Moment Querying with Sketches

    Authors: Renzhi Wu, Pramod Chunduri, Dristi J Shah, Ashmitha Julius Aravind, Ali Payani, Xu Chu, Joy Arulraj, Kexin Rong

    Abstract: In this paper, we will present SketchQL, a video database management system (VDBMS) for retrieving video moments with a sketch-based query interface. This novel interface allows users to specify object trajectory events with simple mouse drag-and-drop operations. Users can use trajectories of single objects as building blocks to compose complex events. Using a pre-trained model that encodes trajec…

    Submitted 30 June, 2024; v1 submitted 28 May, 2024; originally announced May 2024.

    Journal ref: Published on International Conference on Very Large Databases 2024

  28. arXiv:2405.17767  [pdf, other]

    cs.LG cs.CL stat.ML

    Linguistic Collapse: Neural Collapse in (Large) Language Models

    Authors: Robert Wu, Vardan Papyan

    Abstract: Neural collapse ($\mathcal{NC}$) is a phenomenon observed in classification tasks where top-layer representations collapse into their class means, which become equinorm, equiangular and aligned with the classifiers. These behaviors -- associated with generalization and robustness -- would manifest under specific conditions: models are trained towards zero loss, with noise-free labels belonging to…

    Submitted 27 May, 2024; originally announced May 2024.

    Comments: 29 pages, 27 figures

    MSC Class: 68T07 (Primary) 68T50 (Secondary); ACM Class: I.2.6; I.2.7

  29. arXiv:2405.16989  [pdf, other]

    stat.ME

    Uncertainty Learning for High-dimensional Mean-variance Portfolio

    Authors: Han Lin Shang, Ruike Wu, Yanrong Yang

    Abstract: Accounting for uncertainty in data quality is important for accurate statistical inference. We aim for an optimal conservative allocation for a large universe of assets in mean-variance portfolio (MVP), which is the worst choice within uncertainty in data distribution. Unlike the low dimensional MVP studied in Blanchet et al. (2022, Management Science), the large number of assets raises a challengi…

    Submitted 27 May, 2024; originally announced May 2024.

    Comments: 28 pages, 2 figures, 4 tables

    MSC Class: 91G10; 62P05

  30. arXiv:2405.16886  [pdf, other]

    cs.CV

    Hawk: Learning to Understand Open-World Video Anomalies

    Authors: Jiaqi Tang, Hao Lu, Ruizheng Wu, Xiaogang Xu, Ke Ma, Cheng Fang, Bin Guo, Jiangbo Lu, Qifeng Chen, Ying-Cong Chen

    Abstract: Video Anomaly Detection (VAD) systems can autonomously monitor and identify disturbances, reducing the need for manual labor and associated costs. However, current VAD systems are often limited by their superficial semantic understanding of scenes and minimal user interaction. Additionally, the prevalent data scarcity in existing datasets restricts their applicability in open-world scenarios. In t…

    Submitted 27 May, 2024; originally announced May 2024.

  31. arXiv:2405.16720  [pdf, other]

    cs.CL

    Large Scale Knowledge Washing

    Authors: Yu Wang, Ruihan Wu, Zexue He, Xiusi Chen, Julian McAuley

    Abstract: Large language models show impressive abilities in memorizing world knowledge, which leads to concerns regarding memorization of private information, toxic or sensitive knowledge, and copyrighted content. We introduce the problem of Large Scale Knowledge Washing, focusing on unlearning an extensive amount of factual knowledge. Previous unlearning methods usually define the reverse loss and update…

    Submitted 28 May, 2024; v1 submitted 26 May, 2024; originally announced May 2024.

  32. arXiv:2405.16583  [pdf]

    physics.optics

    An erbium-doped waveguide amplifier on thin film lithium niobate with an output power exceeding 100 mW

    Authors: Rui Bao, Zhiwei Fang, Jian Liu, Zhaoxiang Liu, Jinming Chen, Min Wang, Rongbo Wu, Haisu Zhang, Ya Cheng

    Abstract: We demonstrate a high-power thin film lithium niobate (TFLN) erbium-doped waveguide amplifier (EDWA) with a maximum on-chip output power of 113 mW and a gain of 16 dB. The on-chip integrated EDWA is composed of large mode area (LMA) waveguide structures with a total length of 7 cm and a footprint of 1 × 1 cm². Particularly, we connect segmented LMA waveguides with waveguide tapers to achieve on-chip m…

    Submitted 26 May, 2024; originally announced May 2024.

    Comments: 13 pages, 4 figures

  33. Development of a Virtual Reality Application for Oculomotor Examination Education Based on Student-Centered Pedagogy

    Authors: Austin Finlayson, Rui Wu, Chia-Cheng Lin, Brian Sylcott

    Abstract: This work-in-progress paper discusses the use of student-centered pedagogy to teach clinical oculomotor examination via Virtual Reality (VR). Traditional methods, such as PowerPoint slides and lab activities, are often insufficient for providing hands-on experience due to the high cost of clinical equipment. To address this, a VR-based application was developed using Unity and the HTC Vive Pro hea…

    Submitted 25 May, 2024; originally announced May 2024.

  34. arXiv:2405.15140  [pdf, other]

    cs.LG

    Better Membership Inference Privacy Measurement through Discrepancy

    Authors: Ruihan Wu, Pengrun Huang, Kamalika Chaudhuri

    Abstract: Membership Inference Attacks have emerged as a dominant method for empirically measuring privacy leakage from machine learning models. Here, privacy is measured by the {\em{advantage}} or gap between a score or a function computed on the training and the test data. A major barrier to the practical deployment of these attacks is that they do not scale to large well-generalized models -- either the…

    Submitted 23 May, 2024; originally announced May 2024.

    Comments: 9 pages

  35. arXiv:2405.14868  [pdf, other]

    cs.CV cs.AI cs.LG cs.RO

    Generative Camera Dolly: Extreme Monocular Dynamic Novel View Synthesis

    Authors: Basile Van Hoorick, Rundi Wu, Ege Ozguroglu, Kyle Sargent, Ruoshi Liu, Pavel Tokmakov, Achal Dave, Changxi Zheng, Carl Vondrick

    Abstract: Accurate reconstruction of complex dynamic scenes from just a single viewpoint continues to be a challenging task in computer vision. Current dynamic novel view synthesis methods typically require videos from many different camera viewpoints, necessitating careful recording setups, and significantly restricting their utility in the wild as well as in terms of embodied AI applications. In this pape…

    Submitted 5 July, 2024; v1 submitted 23 May, 2024; originally announced May 2024.

    Comments: Accepted to ECCV 2024. Project webpage is available at: https://gcd.cs.columbia.edu/

  36. arXiv:2405.11826  [pdf, other]

    astro-ph.IM hep-ex physics.ins-det

    Data quality control system and long-term performance monitor of the LHAASO-KM2A

    Authors: Zhen Cao, F. Aharonian, Axikegu, Y. X. Bai, Y. W. Bao, D. Bastieri, X. J. Bi, Y. J. Bi, W. Bian, A. V. Bukevich, Q. Cao, W. Y. Cao, Zhe Cao, J. Chang, J. F. Chang, A. M. Chen, E. S. Chen, H. X. Chen, Liang Chen, Lin Chen, Long Chen, M. J. Chen, M. L. Chen, Q. H. Chen, S. Chen, et al. (263 additional authors not shown)

    Abstract: The KM2A is the largest sub-array of the Large High Altitude Air Shower Observatory (LHAASO). It consists of 5216 electromagnetic particle detectors (EDs) and 1188 muon detectors (MDs). The data recorded by the EDs and MDs are used to reconstruct primary information of cosmic ray and gamma-ray showers. This information is used for physical analysis in gamma-ray astronomy and cosmic ray physics. To…

    Submitted 13 June, 2024; v1 submitted 20 May, 2024; originally announced May 2024.

    Comments: 15 pages, 9 figures

  37. arXiv:2405.11793  [pdf, other]

    cs.CV

    MM-Retinal: Knowledge-Enhanced Foundational Pretraining with Fundus Image-Text Expertise

    Authors: Ruiqi Wu, Chenran Zhang, Jianle Zhang, Yi Zhou, Tao Zhou, Huazhu Fu

    Abstract: Current fundus image analysis models are predominantly built for specific tasks relying on individual datasets. The learning process is usually based on data-driven paradigm without prior knowledge, resulting in poor transferability and generalizability. To address this issue, we propose MM-Retinal, a multi-modal dataset that encompasses high-quality image-text pairs collected from professional fu…

    Submitted 20 May, 2024; originally announced May 2024.

    Comments: Early Accepted by The International Conference on Medical Image Computing and Computer Assisted Intervention (MICCAI) 2024

  38. arXiv:2405.11130  [pdf, other]

    cs.RO cs.HC cs.SE

    WIP: A Unit Testing Framework for Self-Guided Personalized Online Robotics Learning

    Authors: Ponkoj Chandra Shill, David Feil-Seifer, Jiullian-Lee Vargas Ruiz, Rui Wu

    Abstract: Our ongoing development and deployment of an online robotics education platform highlighted a gap in providing an interactive, feedback-rich learning environment essential for mastering programming concepts in robotics, which students were not getting with the traditional code-simulate-turn-in workflow. Since teaching resources are limited, students would benefit from feedback in real-time to find and…

    Submitted 17 May, 2024; originally announced May 2024.

    Comments: 5 pages, 4 figures, IEEE FIE 2024

  39. arXiv:2405.09923  [pdf, other]

    cs.CV eess.IV

    NTIRE 2024 Restore Any Image Model (RAIM) in the Wild Challenge

    Authors: Jie Liang, Radu Timofte, Qiaosi Yi, Shuaizheng Liu, Lingchen Sun, Rongyuan Wu, Xindong Zhang, Hui Zeng, Lei Zhang

    Abstract: In this paper, we review the NTIRE 2024 challenge on Restore Any Image Model (RAIM) in the Wild. The RAIM challenge constructed a benchmark for image restoration in the wild, including real-world images with/without reference ground truth in various scenarios from real applications. The participants were required to restore the real-captured images from complex and unknown degradation, where gener…

    Submitted 16 May, 2024; originally announced May 2024.

  40. arXiv:2405.09842  [pdf]

    cond-mat.mtrl-sci

    Why Superconducting Ta Qubits Have Fewer Tunneling Two-Level Systems at the Air-Oxide Interface Than Nb Qubits

    Authors: Zhe Wang, Clare C. Yu, Ruqian Wu

    Abstract: Superconducting qubits are a key contender for quantum computing elements, but they often face challenges like noise and decoherence from two-level systems (TLS). Tantalum (Ta) qubits are notable for their long T$_1$ coherence times nearing milliseconds, mainly due to fewer TLS, though the cause was unclear. Our research explored this by analyzing the air-oxide interface with density functional th…

    Submitted 16 May, 2024; originally announced May 2024.

  41. arXiv:2405.09135  [pdf, other]

    quant-ph

    On the Role of Controllability in Pulse-based Quantum Machine Learning Models

    Authors: Han-Xiao Tao, Re-Bing Wu

    Abstract: Pulse-based quantum machine learning (QML) models possess full expressivity when they are ensemble controllable. However, it has also been shown that barren plateaus emerge in such models, rendering training intractable for systems with large dimension. In this paper, we show that the trade-off is closely related to the controllability of the underlying pulse-based models. We first apply the Flies…

    Submitted 15 May, 2024; originally announced May 2024.

  42. arXiv:2405.08310  [pdf, other]

    cs.RO

    Cross-Category Functional Grasp Transfer

    Authors: Rina Wu, Tianqiang Zhu, Xiangbo Lin, Yi Sun

    Abstract: Generating grasps for a dexterous hand often requires numerous grasping annotations. However, annotating high-DoF dexterous hand poses is quite challenging. Especially for functional grasps, the grasp pose must be convenient for subsequent manipulation tasks. This prompts us to explore how people achieve manipulations on new objects based on past grasp experiences. We find that when grasping new it…

    Submitted 20 May, 2024; v1 submitted 14 May, 2024; originally announced May 2024.

  43. arXiv:2405.07691  [pdf, other]

    astro-ph.HE

    Discovery of Very-high-energy Gamma-ray Emissions from the Low Luminosity AGN NGC 4278 by LHAASO

    Authors: Zhen Cao, F. Aharonian, Q. An, Axikegu, Y. X. Bai, Y. W. Bao, D. Bastieri, X. J. Bi, Y. J. Bi, J. T. Cai, Q. Cao, W. Y. Cao, Zhe Cao, J. Chang, J. F. Chang, A. M. Chen, E. S. Chen, Liang Chen, Lin Chen, Long Chen, M. J. Chen, M. L. Chen, Q. H. Chen, S. H. Chen, S. Z. Chen, et al. (255 additional authors not shown)

    Abstract: The first source catalog of the Large High Altitude Air Shower Observatory reported the detection of a very-high-energy gamma-ray source, 1LHAASO J1219+2915. In this paper, a further detailed study of the spectral and temporal behavior of this point-like source has been carried out. The best-fit position of the TeV source ($\rm{RA}=185.05^{\circ}\pm0.04^{\circ}$, $\rm{Dec}=29.25^{\circ}\pm0.03^{\circ}$) i…

    Submitted 13 May, 2024; originally announced May 2024.

    Comments: 11 pages, 5 figures

  44. arXiv:2405.07464  [pdf]

    cond-mat.mes-hall cond-mat.mtrl-sci

    Atomic-scale tunable phonon transport at tailored grain boundaries

    Authors: Xiaowang Wang, Chaitanya A. Gadre, Runqing Yang, Wanjuan Zou, Xing Bin, Christopher Addiego, Toshihiro Aoki, Yujie Quan, Wei-Tao Peng, Yifeng Huang, Chaojie Du, Mingjie Xu, Xingxu Yan, Ruqian Wu, Shyue Ping Ong, Bolin Liao, Penghui Cao, Xiaoqing Pan

    Abstract: Manipulating thermal properties in materials has been of fundamental importance for advancing innovative technologies. Heat carriers such as phonons are impeded by breaking crystal symmetry or periodicity. Notable methods of impeding the phonon propagation include varying the density of defects, interfaces, and nanostructures, as well as changing composition. However, a robust link between the ind…

    Submitted 13 May, 2024; originally announced May 2024.

  45. arXiv:2405.06903  [pdf, other]

    cs.CV

    UniGarmentManip: A Unified Framework for Category-Level Garment Manipulation via Dense Visual Correspondence

    Authors: Ruihai Wu, Haoran Lu, Yiyan Wang, Yubo Wang, Hao Dong

    Abstract: Garment manipulation (e.g., unfolding, folding and hanging clothes) is essential for future robots to accomplish home-assistant tasks, yet it is highly challenging due to the diversity of garment configurations, geometries and deformations. Although able to manipulate similarly shaped garments in a certain task, previous works mostly have to design different policies for different tasks, could not gener…

    Submitted 11 May, 2024; originally announced May 2024.

    Comments: CVPR 2024

  46. arXiv:2405.03203  [pdf, ps, other]

    math.AP

    Sharp estimates, uniqueness and spikes condensation for superlinear free boundary problems arising in plasma physics

    Authors: Daniele Bartolucci, Aleks Jevnikar, Ruijun Wu

    Abstract: We are concerned with Grad-Shafranov type equations, describing in dimension $N=2$ the equilibrium configurations of a plasma in a Tokamak. We obtain a sharp superlinear generalization of the result of Temam (1977) about the linear case, implying the first general uniqueness result ever for superlinear free boundary problems arising in plasma physics. Previous general uniqueness results of Beresti…

    Submitted 6 May, 2024; originally announced May 2024.

    Comments: 54 pages

    MSC Class: 35J61; 35B32; 35R35; 82D10

  47. arXiv:2405.03159  [pdf, other]

    cs.CV

    DeepMpMRI: Tensor-decomposition Regularized Learning for Fast and High-Fidelity Multi-Parametric Microstructural MR Imaging

    Authors: Wenxin Fan, Jian Cheng, Cheng Li, Xinrui Ma, Jing Yang, Juan Zou, Ruoyou Wu, Zan Chen, Yuanjing Feng, Hairong Zheng, Shanshan Wang

    Abstract: Deep learning has emerged as a promising approach for learning the nonlinear mapping between diffusion-weighted MR images and tissue parameters, which enables automatic and deep understanding of the brain microstructures. However, the efficiency and accuracy of multi-parametric estimation are still limited since previous studies tend to estimate multi-parametric maps with dense sampling and i…

    Submitted 6 May, 2024; originally announced May 2024.

  48. arXiv:2405.02928  [pdf, other]

    math.PR math.ST

    Probabilistic cellular automata with local transition matrices: synchronization, ergodicity, and inference

    Authors: Erhan Bayraktar, Fei Lu, Mauro Maggioni, Ruoyu Wu, Sichen Yang

    Abstract: We introduce a new class of probabilistic cellular automata that are capable of exhibiting rich dynamics such as synchronization and ergodicity and can be easily inferred from data. The system is a finite-state locally interacting Markov chain on a circular graph. Each site's subsequent state is random, with a distribution determined by its neighborhood's empirical distribution multiplied by a loc…

    Submitted 23 June, 2024; v1 submitted 5 May, 2024; originally announced May 2024.

    Comments: 30 pages, 3 figures

    MSC Class: 60J10; 62F12

  49. arXiv:2405.02516  [pdf, other]

    physics.optics cond-mat.mes-hall cond-mat.mtrl-sci quant-ph

    Site-Controlled Purcell-Induced Bright Single Photon Emitters in Hexagonal Boron Nitride

    Authors: Mashnoon Alam Sakib, Brandon Triplett, William Harris, Naveed Hussain, Alexander Senichev, Melika Momenzadeh, Joshua Bocanegra, Ruqian Wu, Alexandra Boltasseva, Vladimir M. Shalaev, Maxim R. Shcherbakov

    Abstract: Single photon emitters (SPEs) hosted in hexagonal boron nitride (hBN) are essential elementary building blocks for enabling future on-chip quantum photonic technologies that operate at room temperature. However, fundamental challenges, such as managing non-radiative decay, competing incoherent processes, as well as engineering difficulties in achieving deterministic placement and scaling of the em…

    Submitted 3 May, 2024; originally announced May 2024.

    Comments: 41 pages, 18 figures, supplementary information

  50. arXiv:2404.17879  [pdf, other]

    quant-ph physics.atm-clus

    Trapping polar molecules by surface acoustic waves

    Authors: Haijin Ding, Re-Bing Wu, Yu-xi Liu

    Abstract: We propose a method to trap polar molecules with the electrical force induced by the surface acoustic wave (SAW) on piezoelectric materials. In this approach, the electrical force is perpendicular to the moving direction of the polar molecules, and is used to control the positions of trapped polar molecules in the direction orthogonal to the acoustic transmission. By virtue of an external electric…

    Submitted 7 June, 2024; v1 submitted 27 April, 2024; originally announced April 2024.

    Comments: 18 pages, 10 figures