
Showing 1–50 of 441 results for author: Feng, B

  1. arXiv:2407.11382  [pdf, other]

    cs.CV cs.AI cs.RO

    Segment, Lift and Fit: Automatic 3D Shape Labeling from 2D Prompts

    Authors: Jianhao Li, Tianyu Sun, Zhongdao Wang, Enze Xie, Bailan Feng, Hongbo Zhang, Ze Yuan, Ke Xu, Jiaheng Liu, Ping Luo

    Abstract: This paper proposes an algorithm for automatically labeling 3D objects from 2D point or box prompts, especially focusing on applications in autonomous driving. Unlike previous arts, our auto-labeler predicts 3D shapes instead of bounding boxes and does not require training on a specific dataset. We propose a Segment, Lift, and Fit (SLF) paradigm to achieve this goal. Firstly, we segment high-quali… ▽ More

    Submitted 16 July, 2024; originally announced July 2024.

    Comments: Accepted to ECCV 2024

  2. arXiv:2407.08353  [pdf]

    cond-mat.mtrl-sci

    One-dimensional flat bands in phosphorene nanoribbons with pentagonal nature

    Authors: Shuo Sun, Jing-Yang You, Zhihao Cai, Jie Su, Tong Yang, Xinnan Peng, Yihe Wang, Daiyu Geng, Jian Gou, Yuli Huang, Sisheng Duan, Lan Chen, Kehui Wu, Andrew T. S. Wee, Yuan Ping Feng, Jia Lin Zhang, Jiong Lu, Baojie Feng, Wei Chen

    Abstract: Materials with topological flat bands can serve as a promising platform to investigate strongly interacting phenomena. However, experimental realization of ideal flat bands is mostly limited to artificial lattices or moiré systems. Here we report a general way to construct one-dimensional (1D) flat bands in phosphorene nanoribbons (PNRs) with pentagonal nature: penta-hexa-PNRs and penta-dodeca-PNR… ▽ More

    Submitted 11 July, 2024; originally announced July 2024.

    Comments: 13 pages, 4 figures

  3. arXiv:2407.01029  [pdf, other]

    cs.CV

    EndoSparse: Real-Time Sparse View Synthesis of Endoscopic Scenes using Gaussian Splatting

    Authors: Chenxin Li, Brandon Y. Feng, Yifan Liu, Hengyu Liu, Cheng Wang, Weihao Yu, Yixuan Yuan

    Abstract: 3D reconstruction of biological tissues from a collection of endoscopic images is a key to unlock various important downstream surgical applications with 3D capabilities. Existing methods employ various advanced neural rendering techniques for photorealistic view synthesis, but they often struggle to recover accurate 3D representations when only sparse observations are available, which is usually… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

    Comments: Accepted by MICCAI 2024

  4. arXiv:2406.14746  [pdf, other]

    cs.LG cs.RO

    Relational Reasoning On Graphs Using Opinion Dynamics

    Authors: Yulong Yang, Bowen Feng, Keqin Wang, Naomi Leonard, Adji Bousso Dieng, Christine Allen-Blanchette

    Abstract: From pedestrians to Kuramoto oscillators, interactions between agents govern how a multitude of dynamical systems evolve in space and time. Discovering how these agents relate to each other can improve our understanding of the often complex dynamics that underlie these systems. Recent works learn to categorize relationships between agents based on observations of their physical behavior. These app… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

    Comments: 14 pages, 7 figures

  5. arXiv:2406.12816  [pdf, other]

    cs.LG cs.CV eess.IV

    Neural Approximate Mirror Maps for Constrained Diffusion Models

    Authors: Berthy T. Feng, Ricardo Baptista, Katherine L. Bouman

    Abstract: Diffusion models excel at creating visually-convincing images, but they often struggle to meet subtle constraints inherent in the training data. Such constraints could be physics-based (e.g., satisfying a PDE), geometric (e.g., respecting symmetry), or semantic (e.g., including a particular number of objects). When the training data all satisfy a certain constraint, enforcing this constraint on a… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

  6. arXiv:2406.12355  [pdf, other]

    cs.CV

    LiCAF: LiDAR-Camera Asymmetric Fusion for Gait Recognition

    Authors: Yunze Deng, Haijun Xiong, Bin Feng

    Abstract: Gait recognition is a biometric technology that identifies individuals by using walking patterns. Due to the significant achievements of multimodal fusion in gait recognition, we consider employing LiDAR-camera fusion to obtain robust gait representations. However, existing methods often overlook intrinsic characteristics of modalities, and lack fine-grained fusion and temporal modeling. In this p… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

    Comments: Accepted by ICIP2024

  7. arXiv:2406.08814  [pdf, other]

    cs.CV

    Skim then Focus: Integrating Contextual and Fine-grained Views for Repetitive Action Counting

    Authors: Zhengqi Zhao, Xiaohu Huang, Hao Zhou, Kun Yao, Errui Ding, Jingdong Wang, Xinggang Wang, Wenyu Liu, Bin Feng

    Abstract: The key to action counting is accurately locating each video's repetitive actions. Instead of estimating the probability of each frame belonging to an action directly, we propose a dual-branch network, i.e., SkimFocusNet, working in a two-step manner. The model draws inspiration from empirical observations indicating that humans typically engage in coarse skimming of entire sequences to grasp the… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

    Comments: 13 pages, 9 figures

  8. arXiv:2406.02785  [pdf, other]

    astro-ph.IM cs.LG eess.IV

    Event-horizon-scale Imaging of M87* under Different Assumptions via Deep Generative Image Priors

    Authors: Berthy T. Feng, Katherine L. Bouman, William T. Freeman

    Abstract: Reconstructing images from the Event Horizon Telescope (EHT) observations of M87*, the supermassive black hole at the center of the galaxy M87, depends on a prior to impose desired image statistics. However, given the impossibility of directly observing black holes, there is no clear choice for a prior. We present a framework for flexibly designing a range of priors, each bringing different biases… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

  9. arXiv:2406.00948  [pdf]

    cond-mat.mtrl-sci

    Real-space tilting method for atomic resolution STEM imaging of nanocrystalline materials

    Authors: Jiake Wei, Zhangze Xu, Wenjie Shen, Bin Feng, Ryo Ishikawa, Naoya Shibata, Yuichi Ikuhara, Xuedong Bai

    Abstract: Atomic-resolution scanning transmission electron microscopy (STEM) characterization requires precise tilting of the specimen to high symmetric zone axis, which is usually processed in reciprocal space by following the diffraction patterns. However, for small-sized nanocrystalline materials, their diffraction patterns are too faint to guide the tilting process. Here, a simple and effective tilting… ▽ More

    Submitted 2 June, 2024; originally announced June 2024.

  10. arXiv:2405.20334  [pdf, other]

    cs.CV cs.GR

    VividDream: Generating 3D Scene with Ambient Dynamics

    Authors: Yao-Chih Lee, Yi-Ting Chen, Andrew Wang, Ting-Hsuan Liao, Brandon Y. Feng, Jia-Bin Huang

    Abstract: We introduce VividDream, a method for generating explorable 4D scenes with ambient dynamics from a single input image or text prompt. VividDream first expands an input image into a static 3D point cloud through iterative inpainting and geometry merging. An ensemble of animated videos is then generated using video diffusion models with quality refinement techniques and conditioned on renderings of… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

    Comments: Project page: https://vivid-dream-4d.github.io

  11. arXiv:2405.11485  [pdf]

    cond-mat.mtrl-sci

    Evidence for Multiferroicity in Single-Layer CuCrSe$_2$

    Authors: Zhenyu Sun, Yueqi Su, Aomiao Zhi, Zhicheng Gao, Xu Han, Kang Wu, Lihong Bao, Yuan Huang, Youguo Shi, Xuedong Bai, Peng Cheng, Lan Chen, Kehui Wu, Xuezeng Tian, Changzheng Wu, Baojie Feng

    Abstract: Multiferroic materials, which simultaneously exhibit ferroelectricity and magnetism, have attracted substantial attention due to their fascinating physical properties and potential technological applications. With the trends towards device miniaturization, there is an increasing demand for the persistence of multiferroicity in single-layer materials at elevated temperatures. Here, we report high-t… ▽ More

    Submitted 19 May, 2024; originally announced May 2024.

    Journal ref: Nature Communications 15, 4252 (2024)

  12. arXiv:2405.10463  [pdf, other]

    physics.optics eess.IV physics.bio-ph

    Single-shot volumetric fluorescence imaging with neural fields

    Authors: Oumeng Zhang, Haowen Zhou, Brandon Y. Feng, Elin M. Larsson, Reinaldo E. Alcalde, Siyuan Yin, Catherine Deng, Changhuei Yang

    Abstract: Single-shot volumetric fluorescence (SVF) imaging offers a significant advantage over traditional imaging methods that require scanning across multiple axial planes as it can capture biological processes with high temporal resolution across a large field of view. The key challenges in SVF imaging include requiring sparsity constraints to meet the multiplexing requirements of compressed sensing, el… ▽ More

    Submitted 4 June, 2024; v1 submitted 16 May, 2024; originally announced May 2024.

  13. arXiv:2405.04531  [pdf, other]

    stat.ME stat.CO

    Stochastic Gradient MCMC for Massive Geostatistical Data

    Authors: Mohamed A. Abba, Brian J. Reich, Reetam Majumder, Brandon Feng

    Abstract: Gaussian processes (GPs) are commonly used for prediction and inference for spatial data analyses. However, since estimation and prediction tasks have cubic time and quadratic memory complexity in number of locations, GPs are difficult to scale to large spatial datasets. The Vecchia approximation induces sparsity in the dependence structure and is one of several methods proposed to scale GP infere… ▽ More

    Submitted 3 June, 2024; v1 submitted 7 May, 2024; originally announced May 2024.

  14. arXiv:2405.04434  [pdf, other]

    cs.CL cs.AI

    DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model

    Authors: DeepSeek-AI, Aixin Liu, Bei Feng, Bin Wang, Bingxuan Wang, Bo Liu, Chenggang Zhao, Chengqi Dengr, Chong Ruan, Damai Dai, Daya Guo, Dejian Yang, Deli Chen, Dongjie Ji, Erhang Li, Fangyun Lin, Fuli Luo, Guangbo Hao, Guanting Chen, Guowei Li, H. Zhang, Hanwei Xu, Hao Yang, Haowei Zhang, Honghui Ding , et al. (132 additional authors not shown)

    Abstract: We present DeepSeek-V2, a strong Mixture-of-Experts (MoE) language model characterized by economical training and efficient inference. It comprises 236B total parameters, of which 21B are activated for each token, and supports a context length of 128K tokens. DeepSeek-V2 adopts innovative architectures including Multi-head Latent Attention (MLA) and DeepSeekMoE. MLA guarantees efficient inference… ▽ More

    Submitted 19 June, 2024; v1 submitted 7 May, 2024; originally announced May 2024.

  15. arXiv:2405.04185  [pdf]

    physics.app-ph

    Research on signalized intersection mixed traffic flow platoon control method considering Backward-looking effect

    Authors: Binghao Feng, Hui Guo, Minghui Ma, Yuepeng Wu, Shidong Liang, Yansong Wang

    Abstract: Connected and Autonomous Vehicles (CAVs) technology facilitates the advancement of intelligent transportation. However, intelligent control techniques for mixed traffic flow at signalized intersections involving both CAVs and Human-Driven Vehicles (HDVs) require further investigation into the impact of backward-looking effect. This paper proposes the concept of 1+n+1 mixed platoon considering the… ▽ More

    Submitted 7 May, 2024; originally announced May 2024.

  16. arXiv:2404.19385  [pdf]

    cond-mat.mtrl-sci

    Cerium Oxide-based Solid-State Thermal Transistors with Wide Switching Width of 9.5 W/mK

    Authors: Ahrong Jeong, Mitsuki Yoshimura, Zhiping Bian, Jason Tam, Bin Feng, Yuichi Ikuhara, Yusaku Magari, Takashi Endo, Yasutaka Matsuo, Hiromichi Ohta

    Abstract: Thermal transistors that electrically switch heat flow on and off have attracted attention as thermal management devices. Electrochemical reduction/oxidation switches the thermal conductivity (κ) of active metal oxide layers. The κ-switching width (difference between on-state and off-state κ) of the previously proposed electrochemical thermal transistors is narrow, less than 5… ▽ More

    Submitted 30 April, 2024; originally announced April 2024.

    Comments: 17 pages, 6 figures with supporting information (13 pages, 11 figures, 1 table)

  17. arXiv:2404.18372  [pdf, other]

    nlin.SI math-ph

    Integrable semi-discretization for a modified Camassa-Holm equation with cubic nonlinearity

    Authors: Bao-Feng Feng, Heng-Chun Hu, Han-Han Sheng, Wei Yin, Guo-Fu Yu

    Abstract: In the present paper, an integrable semi-discretization of the modified Camassa-Holm (mCH) equation with cubic nonlinearity is presented. The key points of the construction are based on the discrete Kadomtsev-Petviashvili (KP) equation and appropriate definition of discrete reciprocal transformations. First, we demonstrate that these bilinear equations and their determinant solutions can be derive… ▽ More

    Submitted 28 April, 2024; originally announced April 2024.

  18. arXiv:2404.15761  [pdf, other]

    eess.SP

    Rechargeable UAV Trajectory Optimization for Real-Time Persistent Data Collection of Large-Scale Sensor Networks

    Authors: Rui Wang, Deshi Li, Qingqing Wu, Kaitao Meng, Boning Feng, Lele Cong

    Abstract: Unmanned aerial vehicles (UAVs) have received plenty of attention due to their high flexibility and enhanced communication ability, nonetheless, the limited onboard energy restricts UAVs' application on persistent data collection missions in large areas. In this paper, we propose a rechargeable UAV-assisted periodic data collection scheme, where a UAV is dispatched to periodically collect data fro… ▽ More

    Submitted 6 June, 2024; v1 submitted 24 April, 2024; originally announced April 2024.

    Comments: 13 pages, 17 figures, submitted to IEEE for possible publication

  19. arXiv:2404.15014  [pdf, other]

    cs.CV

    OccGen: Generative Multi-modal 3D Occupancy Prediction for Autonomous Driving

    Authors: Guoqing Wang, Zhongdao Wang, Pin Tang, Jilai Zheng, Xiangxuan Ren, Bailan Feng, Chao Ma

    Abstract: Existing solutions for 3D semantic occupancy prediction typically treat the task as a one-shot 3D voxel-wise segmentation perception problem. These discriminative methods focus on learning the mapping between the inputs and occupancy map in a single step, lacking the ability to gradually refine the occupancy map and the reasonable scene imaginative capacity to complete the local regions somewhere.… ▽ More

    Submitted 23 April, 2024; originally announced April 2024.

  20. arXiv:2404.13026  [pdf, other]

    cs.CV cs.AI

    PhysDreamer: Physics-Based Interaction with 3D Objects via Video Generation

    Authors: Tianyuan Zhang, Hong-Xing Yu, Rundi Wu, Brandon Y. Feng, Changxi Zheng, Noah Snavely, Jiajun Wu, William T. Freeman

    Abstract: Realistic object interactions are crucial for creating immersive virtual experiences, yet synthesizing realistic 3D object dynamics in response to novel interactions remains a significant challenge. Unlike unconditional or text-conditioned dynamics generation, action-conditioned dynamics requires perceiving the physical material properties of objects and grounding the 3D motion prediction on these… ▽ More

    Submitted 19 April, 2024; originally announced April 2024.

    Comments: Project website at: https://physdreamer.github.io/

  21. arXiv:2404.09734  [pdf, other]

    cs.IT eess.SP

    Weighted Sum-Rate Maximization for Movable Antenna-Enhanced Wireless Networks

    Authors: Biqian Feng, Yongpeng Wu, Xiang-Gen Xia, Chengshan Xiao

    Abstract: This letter investigates the weighted sum rate maximization problem in movable antenna (MA)-enhanced systems. To reduce the computational complexity, we transform it into a more tractable weighted minimum mean square error (WMMSE) problem well-suited for MA. We then adopt the WMMSE algorithm and majorization-minimization algorithm to optimize the beamforming and antenna positions, respectively. Mo… ▽ More

    Submitted 15 April, 2024; originally announced April 2024.

    Comments: Accepted by IEEE Wireless Communications Letters

  22. arXiv:2404.09502  [pdf, other]

    cs.CV

    SparseOcc: Rethinking Sparse Latent Representation for Vision-Based Semantic Occupancy Prediction

    Authors: Pin Tang, Zhongdao Wang, Guoqing Wang, Jilai Zheng, Xiangxuan Ren, Bailan Feng, Chao Ma

    Abstract: Vision-based perception for autonomous driving requires an explicit modeling of a 3D space, where 2D latent representations are mapped and subsequent 3D operators are applied. However, operating on dense latent spaces introduces a cubic time and space complexity, which limits scalability in terms of perception range or spatial resolution. Existing approaches compress the dense representation using… ▽ More

    Submitted 15 April, 2024; originally announced April 2024.

    Comments: 10 pages, 4 figures, accepted by CVPR 2024

    Journal ref: IEEE Conference on Computer Vision and Pattern Recognition 2024 (CVPR 2024)

  23. arXiv:2404.07985  [pdf, other]

    cs.CV eess.IV

    WaveMo: Learning Wavefront Modulations to See Through Scattering

    Authors: Mingyang Xie, Haiyun Guo, Brandon Y. Feng, Lingbo Jin, Ashok Veeraraghavan, Christopher A. Metzler

    Abstract: Imaging through scattering media is a fundamental and pervasive challenge in fields ranging from medical diagnostics to astronomy. A promising strategy to overcome this challenge is wavefront modulation, which induces measurement diversity during image acquisition. Despite its importance, designing optimal wavefront modulations to image through scattering remains under-explored. This paper introdu… ▽ More

    Submitted 11 April, 2024; originally announced April 2024.

  24. arXiv:2404.00471  [pdf, other]

    physics.med-ph cs.CV cs.LG eess.IV

    Score-Based Diffusion Models for Photoacoustic Tomography Image Reconstruction

    Authors: Sreemanti Dey, Snigdha Saha, Berthy T. Feng, Manxiu Cui, Laure Delisle, Oscar Leong, Lihong V. Wang, Katherine L. Bouman

    Abstract: Photoacoustic tomography (PAT) is a rapidly-evolving medical imaging modality that combines optical absorption contrast with ultrasound imaging depth. One challenge in PAT is image reconstruction with inadequate acoustic signals due to limited sensor coverage or due to the density of the transducer array. Such cases call for solving an ill-posed inverse reconstruction problem. In this work, we use… ▽ More

    Submitted 30 March, 2024; originally announced April 2024.

    Comments: 5 pages

    Journal ref: ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Seoul, Korea, Republic of, 2024, pp. 2470-2474

  25. arXiv:2403.16095  [pdf, other]

    cs.CV cs.RO

    CG-SLAM: Efficient Dense RGB-D SLAM in a Consistent Uncertainty-aware 3D Gaussian Field

    Authors: Jiarui Hu, Xianhao Chen, Boyin Feng, Guanglin Li, Liangjing Yang, Hujun Bao, Guofeng Zhang, Zhaopeng Cui

    Abstract: Recently neural radiance fields (NeRF) have been widely exploited as 3D representations for dense simultaneous localization and mapping (SLAM). Despite their notable successes in surface modeling and novel view synthesis, existing NeRF-based methods are hindered by their computationally intensive and time-consuming volume rendering pipeline. This paper presents an efficient dense RGB-D SLAM system… ▽ More

    Submitted 24 March, 2024; originally announced March 2024.

    Comments: Project Page: https://zju3dv.github.io/cg-slam

  26. arXiv:2403.16040  [pdf, ps, other]

    hep-ph

    General One-loop Generating Function by IBP relations

    Authors: Bo Feng, Chang Hu, Jiyuan Shen, Yaobo Zhang

    Abstract: In this paper we have studied the most general generating function of reduction for one loop integrals with arbitrary tensor structure in numerator and arbitrary power distribution of propagators in denominator. Using IBP relations, we have established the partial differential equations for these generating functions and solved them analytically. These results provide useful guidance for applying… ▽ More

    Submitted 24 March, 2024; originally announced March 2024.

    Comments: 50 pages

  27. arXiv:2403.13800  [pdf, other]

    cs.CV

    TimeRewind: Rewinding Time with Image-and-Events Video Diffusion

    Authors: Jingxi Chen, Brandon Y. Feng, Haoming Cai, Mingyang Xie, Christopher Metzler, Cornelia Fermuller, Yiannis Aloimonos

    Abstract: This paper addresses the novel challenge of "rewinding" time from a single captured image to recover the fleeting moments missed just before the shutter button is pressed. This problem poses a significant challenge in computer vision and computational photography, as it requires predicting plausible pre-capture motion from a single static frame, an inherently ill-posed task due to the high degre… ▽ More

    Submitted 20 March, 2024; originally announced March 2024.

  28. arXiv:2403.11050  [pdf, other]

    cs.CV

    Endora: Video Generation Models as Endoscopy Simulators

    Authors: Chenxin Li, Hengyu Liu, Yifan Liu, Brandon Y. Feng, Wuyang Li, Xinyu Liu, Zhen Chen, Jing Shao, Yixuan Yuan

    Abstract: Generative models hold promise for revolutionizing medical education, robot-assisted surgery, and data augmentation for machine learning. Despite progress in generating 2D medical images, the complex domain of clinical video generation has largely remained untapped. This paper introduces Endora, an innovative approach to generate medical videos that simulate clinical endoscopy scenes. We present a… ▽ More

    Submitted 16 March, 2024; originally announced March 2024.

    Comments: Project page: https://endora-medvidgen.github.io/

  29. Layer-dependent Raman spectroscopy of ultrathin Ta$_2$Pd$_3$Te$_5$

    Authors: Zhenyu Sun, Zhaopeng Guo, Dayu Yan, Peng Cheng, Lan Chen, Youguo Shi, Yuan Huang, Zhijun Wang, Kehui Wu, Baojie Feng

    Abstract: Two-dimensional topological insulators (2DTIs) or quantum spin Hall insulators are attracting increasing attention due to their potential applications in next-generation spintronic devices. Despite their promising prospects, realizable 2DTIs are still limited. Recently, Ta2Pd3Te5, a semiconducting van der Waals material, has shown spectroscopic evidence of quantum spin Hall states. However, achiev… ▽ More

    Submitted 28 February, 2024; originally announced February 2024.

    Journal ref: Phys. Rev. Materials 7, 094004 (2023)

  30. arXiv:2401.17213  [pdf]

    physics.optics physics.ins-det

    Ptycho-endoscopy on a lensless ultrathin fiber bundle tip

    Authors: Pengming Song, Ruihai Wang, Lars Loetgering, Jia Liu, Peter Vouras, Yujin Lee, Shaowei Jiang, Bin Feng, Andrew Maiden, Changhuei Yang, Guoan Zheng

    Abstract: Synthetic aperture radar (SAR) utilizes an aircraft-carried antenna to emit electromagnetic pulses and detect the returning echoes. As the aircraft travels across a designated area, it synthesizes a large virtual aperture to improve image resolution. Inspired by SAR, we introduce synthetic aperture ptycho-endoscopy (SAPE) for micro-endoscopic imaging beyond the diffraction limit. SAPE operates by… ▽ More

    Submitted 6 July, 2024; v1 submitted 30 January, 2024; originally announced January 2024.

  31. arXiv:2401.06458  [pdf, other]

    math.AP math-ph

    Asymptotic behavior for a new higher-order nonlinear Schrödinger equation

    Authors: Hongyi Zhang, Yufeng Zhang, Binlu Feng

    Abstract: We investigate the Cauchy problem of a new higher-order nonlinear Schrödinger equation (NHNSE) with weighted Sobolev initial data which is derived by ourselves. By applying $\bar{\partial}$-steepest descent method, we derive the long-time asymptotics of the NHNSE. Explicit steps are as follows: first of all, based on the spectral analysis of a Lax pair and scattering matrice, the solution of the N… ▽ More

    Submitted 12 January, 2024; originally announced January 2024.

  32. arXiv:2401.00129  [pdf, other]

    hep-th astro-ph.CO gr-qc hep-ph

    Towards Systematic Evaluation of de Sitter Correlators via Generalized Integration-By-Parts Relations

    Authors: Jiaqi Chen, Bo Feng

    Abstract: We generalize Integration-By-Parts (IBP) and differential equations methods to de Sitter correlators related to inflation. While massive correlators in de Sitter spacetime are usually regarded as highly intricate, we find they have remarkably hidden concise structures from the perspective of IBP. We find the factorization of the IBP relations of each vertex integral family corresponding to… ▽ More

    Submitted 29 June, 2024; v1 submitted 29 December, 2023; originally announced January 2024.

    Comments: 26 pages, 2 figures. Important revision: corrected a mistake and fixed typos. Accepted version

    Journal ref: JHEP06(2024)199

  33. arXiv:2312.11805  [pdf, other]

    cs.CL cs.AI cs.CV

    Gemini: A Family of Highly Capable Multimodal Models

    Authors: Gemini Team, Rohan Anil, Sebastian Borgeaud, Jean-Baptiste Alayrac, Jiahui Yu, Radu Soricut, Johan Schalkwyk, Andrew M. Dai, Anja Hauth, Katie Millican, David Silver, Melvin Johnson, Ioannis Antonoglou, Julian Schrittwieser, Amelia Glaese, Jilin Chen, Emily Pitler, Timothy Lillicrap, Angeliki Lazaridou, Orhan Firat, James Molloy, Michael Isard, Paul R. Barham, Tom Hennigan, Benjamin Lee , et al. (1325 additional authors not shown)

    Abstract: This report introduces a new family of multimodal models, Gemini, that exhibit remarkable capabilities across image, audio, video, and text understanding. The Gemini family consists of Ultra, Pro, and Nano sizes, suitable for applications ranging from complex reasoning tasks to on-device memory-constrained use-cases. Evaluation on a broad range of benchmarks shows that our most-capable Gemini Ultr… ▽ More

    Submitted 17 June, 2024; v1 submitted 18 December, 2023; originally announced December 2023.

  34. arXiv:2312.04679  [pdf, other]

    eess.IV cs.CV

    ConVRT: Consistent Video Restoration Through Turbulence with Test-time Optimization of Neural Video Representations

    Authors: Haoming Cai, Jingxi Chen, Brandon Y. Feng, Weiyun Jiang, Mingyang Xie, Kevin Zhang, Ashok Veeraraghavan, Christopher Metzler

    Abstract: Atmospheric turbulence presents a significant challenge in long-range imaging. Current restoration algorithms often struggle with temporal inconsistency, as well as limited generalization ability across varying turbulence levels and scene content different than the training data. To tackle these issues, we introduce a self-supervised method, Consistent Video Restoration through Turbulence (ConVRT)… ▽ More

    Submitted 7 December, 2023; originally announced December 2023.

    Comments: https://convrt-2024.github.io/

  35. arXiv:2312.03788  [pdf, other]

    cs.LG cs.CL

    SmoothQuant+: Accurate and Efficient 4-bit Post-Training Weight Quantization for LLM

    Authors: Jiayi Pan, Chengcan Wang, Kaifu Zheng, Yangguang Li, Zhenyu Wang, Bin Feng

    Abstract: Large language models (LLMs) have shown remarkable capabilities in various tasks. However their huge model size and the consequent demand for computational and memory resources also pose challenges to model deployment. Currently, 4-bit post-training quantization (PTQ) has achieved some success in LLMs, reducing the memory footprint by approximately 75% compared to FP16 models, albeit with some acc… ▽ More

    Submitted 6 December, 2023; originally announced December 2023.

  36. arXiv:2312.01195  [pdf, other]

    cs.CR cs.SE

    AIM: Automatic Interrupt Modeling for Dynamic Firmware Analysis

    Authors: Bo Feng, Meng Luo, Changming Liu, Long Lu, Engin Kirda

    Abstract: The security of microcontrollers, which drive modern IoT and embedded devices, continues to raise major concerns. Within a microcontroller (MCU), the firmware is a monolithic piece of software that contains the whole software stack, whereas a variety of peripherals represent the hardware. As MCU firmware contains vulnerabilities, it is ideal to test firmware with off-the-shelf software testing tec… ▽ More

    Submitted 2 December, 2023; originally announced December 2023.

    Comments: This paper was accepted to IEEE Transactions on Dependable and Secure Computing on Oct 12, 2023

  37. arXiv:2311.11203  [pdf, ps, other]

    nlin.SI math-ph

    The general solutions for a non-isospectral integrable TD hierarchy via the inverse scattering transform

    Authors: Hongyi Zhang, Yufeng Zhang, Binlu Feng

    Abstract: A non-isospectral Lax pair is first introduced from which a kind of non-isospectral integrable TD hierarchy is derived, whose reduction is an integrable system called the non-isospectral integrable TD system. Then by using the inverse scattering transform (IST) method, new general soliton solutions for the non-isospectral integrable TD hierarchy are obtained. Because we investigate soliton solutio… ▽ More

    Submitted 18 November, 2023; originally announced November 2023.

  38. arXiv:2310.18529  [pdf, other]

    physics.optics eess.IV

    FPM-INR: Fourier ptychographic microscopy image stack reconstruction using implicit neural representations

    Authors: Haowen Zhou, Brandon Y. Feng, Haiyun Guo, Siyu Lin, Mingshu Liang, Christopher A. Metzler, Changhuei Yang

    Abstract: Image stacks provide invaluable 3D information in various biological and pathological imaging applications. Fourier ptychographic microscopy (FPM) enables reconstructing high-resolution, wide field-of-view image stacks without z-stack scanning, thus significantly accelerating image acquisition. However, existing FPM methods take tens of minutes to reconstruct and gigabytes of memory to store a hig… ▽ More

    Submitted 31 October, 2023; v1 submitted 27 October, 2023; originally announced October 2023.

    Comments: Project Page: https://hwzhou2020.github.io/FPM-INR-Web/

  39. arXiv:2310.10835  [pdf, other]

    eess.IV cs.CV cs.LG

    Provable Probabilistic Imaging using Score-Based Generative Priors

    Authors: Yu Sun, Zihui Wu, Yifan Chen, Berthy T. Feng, Katherine L. Bouman

    Abstract: Estimating high-quality images while also quantifying their uncertainty are two desired features in an image reconstruction algorithm for solving ill-posed inverse problems. In this paper, we propose plug-and-play Monte Carlo (PMC) as a principled framework for characterizing the space of possible solutions to a general inverse problem. PMC is able to incorporate expressive score-based generative… ▽ More

    Submitted 29 December, 2023; v1 submitted 16 October, 2023; originally announced October 2023.

  40. arXiv:2310.06504  [pdf, other]

    cs.CL cs.AI cs.LG

    Revisit Input Perturbation Problems for LLMs: A Unified Robustness Evaluation Framework for Noisy Slot Filling Task

    Authors: Guanting Dong, Jinxu Zhao, Tingfeng Hui, Daichi Guo, Wenlong Wan, Boqi Feng, Yueyan Qiu, Zhuoma Gongque, Keqing He, Zechen Wang, Weiran Xu

    Abstract: With the increasing capabilities of large language models (LLMs), these high-performance models have achieved state-of-the-art results on a wide range of natural language processing (NLP) tasks. However, the models' performance on commonly-used benchmark datasets often fails to accurately reflect their reliability and robustness when applied to real-world noisy data. To address these challenges, w… ▽ More

    Submitted 10 October, 2023; originally announced October 2023.

    Comments: Accepted at NLPCC 2023 (Oral Presentation)

  41. arXiv:2310.03125  [pdf, other]

    cs.CV

    Shielding the Unseen: Privacy Protection through Poisoning NeRF with Spatial Deformation

    Authors: Yihan Wu, Brandon Y. Feng, Heng Huang

    Abstract: In this paper, we introduce an innovative method of safeguarding user privacy against the generative capabilities of Neural Radiance Fields (NeRF) models. Our novel poisoning attack method induces changes to observed views that are imperceptible to the human eye, yet potent enough to disrupt NeRF's ability to accurately reconstruct a 3D scene. To achieve this, we devise a bi-level optimization alg…

    Submitted 4 October, 2023; originally announced October 2023.

  42. arXiv:2309.17293  [pdf, other]

    quant-ph cs.CR cs.ET

    Quantum Privacy-preserving Two-party Circle Intersection Protocol Based on Phase-encoded Query

    Authors: Zi-Xian Li, Qi Yang, Bao Feng, Wen-Jie Liu

    Abstract: Privacy-preserving geometric intersection (PGI) is an important issue in Secure multiparty computation (SMC). The existing quantum PGI protocols are mainly based on grid coding, which requires a lot of computational complexity. The phase-encoded query method which has been used in some Quantum SMC protocols is suitable to solve the decision problem, but it needs to apply high dimensional Oracle op…

    Submitted 29 September, 2023; originally announced September 2023.

    Comments: 16 pages, 2 figures

    Journal ref: International Journal of Theoretical Physics, 2023, 62(7): p. 138

  43. arXiv:2309.14349  [pdf, other]

    cs.LG cs.AI

    Corporate Credit Rating: A Survey

    Authors: Bojing Feng, Xi Cheng, Dan Li, Zeyu Liu, Wenfang Xue

    Abstract: Corporate credit rating (CCR) plays a very important role in the process of contemporary economic and social development. How to use credit rating methods for enterprises has always been a problem worthy of discussion. Through reading and studying the relevant literature at home and abroad, this paper makes a systematic survey of CCR. This paper combs the context of the development of CCR methods…

    Submitted 18 September, 2023; originally announced September 2023.

    Comments: 11 pages

  44. arXiv:2309.11591  [pdf, other]

    cs.CV cs.GR

    Continuous Levels of Detail for Light Field Networks

    Authors: David Li, Brandon Y. Feng, Amitabh Varshney

    Abstract: Recently, several approaches have emerged for generating neural representations with multiple levels of detail (LODs). LODs can improve the rendering by using lower resolutions and smaller model sizes when appropriate. However, existing methods generally focus on a few discrete LODs which suffer from aliasing and flicker artifacts as details are changed and limit their granularity for adapting to…

    Submitted 20 September, 2023; originally announced September 2023.

    Comments: Accepted to BMVC 2023. Webpage at https://augmentariumlab.github.io/continuous-lfn/

  45. arXiv:2309.01949  [pdf, other]

    cs.CV

    Efficient Bayesian Computational Imaging with a Surrogate Score-Based Prior

    Authors: Berthy T. Feng, Katherine L. Bouman

    Abstract: We propose a surrogate function for efficient use of score-based priors for Bayesian inverse imaging. Recent work turned score-based diffusion models into probabilistic priors for solving ill-posed imaging problems by appealing to an ODE-based log-probability function. However, evaluating this function is computationally inefficient and inhibits posterior estimation of high-dimensional images. Our…

    Submitted 5 September, 2023; originally announced September 2023.

  46. arXiv:2308.16861  [pdf, ps, other]

    cs.CR

    Facing Unknown: Open-World Encrypted Traffic Classification Based on Contrastive Pre-Training

    Authors: Xiang Li, Beibei Feng, Tianning Zang, Shuyuan Zhao, Jingrun Ma

    Abstract: Traditional Encrypted Traffic Classification (ETC) methods face a significant challenge in classifying large volumes of encrypted traffic in the open-world assumption, i.e., simultaneously classifying the known applications and detecting unknown applications. We propose a novel Open-World Contrastive Pre-training (OWCP) framework for this. OWCP performs contrastive pre-training to obtain a robust…

    Submitted 31 August, 2023; originally announced August 2023.

    Comments: Accepted by 2023 IEEE ISCC, 6 pages, 5 figures

  47. arXiv:2308.06720  [pdf, other]

    cs.IT eess.SP

    Joint Beamforming and Antenna Movement Design for Moveable Antenna Systems Based on Statistical CSI

    Authors: Xintai Chen, Biqian Feng, Yongpeng Wu, Derrick Wing Kwan Ng, Robert Schober

    Abstract: This paper studies a novel movable antenna (MA)-enhanced multiple-input multiple-output (MIMO) system to leverage the corresponding spatial degrees of freedom (DoFs) for improving the performance of wireless communications. We aim to maximize the achievable rate by jointly optimizing the MA positions and the transmit covariance matrix based on statistical channel state information (CSI). To solve…

    Submitted 18 August, 2023; v1 submitted 13 August, 2023; originally announced August 2023.

    Comments: Accepted by GLOBECOM 2023

  48. arXiv:2308.06707  [pdf, other]

    cs.CV

    Condition-Adaptive Graph Convolution Learning for Skeleton-Based Gait Recognition

    Authors: Xiaohu Huang, Xinggang Wang, Zhidianqiu Jin, Bo Yang, Botao He, Bin Feng, Wenyu Liu

    Abstract: Graph convolutional networks have been widely applied in skeleton-based gait recognition. A key challenge in this task is to distinguish the individual walking styles of different subjects across various views. Existing state-of-the-art methods employ uniform convolutions to extract features from diverse sequences and ignore the effects of viewpoint changes. To overcome these limitations, we propo…

    Submitted 13 August, 2023; originally announced August 2023.

    Comments: Accepted by TIP journal

  49. arXiv:2308.03757  [pdf, other]

    cs.CV

    3D Motion Magnification: Visualizing Subtle Motions with Time Varying Radiance Fields

    Authors: Brandon Y. Feng, Hadi Alzayer, Michael Rubinstein, William T. Freeman, Jia-Bin Huang

    Abstract: Motion magnification helps us visualize subtle, imperceptible motion. However, prior methods only work for 2D videos captured with a fixed camera. We present a 3D motion magnification method that can magnify subtle motions from scenes captured by a moving camera, while supporting novel view rendering. We represent the scene with time-varying radiance fields and leverage the Eulerian principle for…

    Submitted 7 August, 2023; originally announced August 2023.

    Comments: ICCV 2023. See the project page at https://3d-motion-magnification.github.io

  50. arXiv:2308.02855  [pdf, other]

    cond-mat.mtrl-sci

    Emergent electronic landscapes in a novel valence-ordered nickelate with tri-component nickel coordination

    Authors: Aravind Raji, Zhengang Dong, Victor Porée, Alaska Subedi, Xiaoyan Li, Bernat Mundet, Lucia Varbaro, Claribel Domínguez, Marios Hadjimichael, Bohan Feng, Alessandro Nicolaou, Jean-Pascal Rueff, Danfeng Li, Alexandre Gloter

    Abstract: The metal-hydride-based topochemical reduction process has produced novel thermodynamically unstable phases across various transition metal oxide series with unusual crystal structures and non-trivial ground states. Here, by such an oxygen (de-)intercalation method we synthesize a novel samarium nickelate with ordered nickel valences associated with tri-component coordination configurations. This…

    Submitted 5 August, 2023; originally announced August 2023.