Showing 1–14 of 14 results for author: Bi, N

  1. arXiv:2404.09918  [pdf, other]

    cs.CV

    EdgeRelight360: Text-Conditioned 360-Degree HDR Image Generation for Real-Time On-Device Video Portrait Relighting

    Authors: Min-Hui Lin, Mahesh Reddy, Guillaume Berger, Michel Sarkis, Fatih Porikli, Ning Bi

    Abstract: In this paper, we present EdgeRelight360, an approach for real-time video portrait relighting on mobile devices, utilizing text-conditioned generation of 360-degree high dynamic range image (HDRI) maps. Our method proposes a diffusion-based text-to-360-degree image generation in the HDR domain, taking advantage of the HDR10 standard. This technique facilitates the generation of high-quality, reali…

    Submitted 15 April, 2024; originally announced April 2024.

    Comments: Camera-ready version (CVPR workshop - EDGE'24)

  2. arXiv:2404.05063  [pdf, other]

    cs.CV

    AUEditNet: Dual-Branch Facial Action Unit Intensity Manipulation with Implicit Disentanglement

    Authors: Shiwei Jin, Zhen Wang, Lei Wang, Peng Liu, Ning Bi, Truong Nguyen

    Abstract: Facial action unit (AU) intensity plays a pivotal role in quantifying fine-grained expression behaviors, which is an effective condition for facial expression manipulation. However, publicly available datasets containing intensity annotations for multiple AUs remain severely limited, often featuring a restricted number of subjects. This limitation places challenges to the AU intensity manipulation…

    Submitted 10 April, 2024; v1 submitted 7 April, 2024; originally announced April 2024.

  3. arXiv:2312.16197  [pdf, other]

    cs.CV cs.LG

    INFAMOUS-NeRF: ImproviNg FAce MOdeling Using Semantically-Aligned Hypernetworks with Neural Radiance Fields

    Authors: Andrew Hou, Feng Liu, Zhiyuan Ren, Michel Sarkis, Ning Bi, Yiying Tong, Xiaoming Liu

    Abstract: We propose INFAMOUS-NeRF, an implicit morphable face model that introduces hypernetworks to NeRF to improve the representation power in the presence of many training subjects. At the same time, INFAMOUS-NeRF resolves the classic hypernetwork tradeoff of representation power and editability by learning semantically-aligned latent spaces despite the subject-specific models, all without requiring a l…

    Submitted 22 December, 2023; originally announced December 2023.

  4. arXiv:2306.17123  [pdf, other]

    cs.CV cs.GR

    PVP: Personalized Video Prior for Editable Dynamic Portraits using StyleGAN

    Authors: Kai-En Lin, Alex Trevithick, Keli Cheng, Michel Sarkis, Mohsen Ghafoorian, Ning Bi, Gerhard Reitmayr, Ravi Ramamoorthi

    Abstract: Portrait synthesis creates realistic digital avatars which enable users to interact with others in a compelling way. Recent advances in StyleGAN and its extensions have shown promising results in synthesizing photorealistic and accurate reconstruction of human faces. However, previous methods often focus on frontal face synthesis and most methods are not able to handle large head rotations due to…

    Submitted 29 June, 2023; originally announced June 2023.

    Comments: Project website: https://cseweb.ucsd.edu//~viscomp/projects/EGSR23PVP/

  5. arXiv:2306.14687  [pdf, other]

    eess.IV cs.CV

    GSMorph: Gradient Surgery for cine-MRI Cardiac Deformable Registration

    Authors: Haoran Dou, Ning Bi, Luyi Han, Yuhao Huang, Ritse Mann, Xin Yang, Dong Ni, Nishant Ravikumar, Alejandro F. Frangi, Yunzhi Huang

    Abstract: Deep learning-based deformable registration methods have been widely investigated in diverse medical applications. Learning-based deformable registration relies on weighted objective functions trading off registration accuracy and smoothness of the deformation field. Therefore, they inevitably require tuning the hyperparameter for optimal registration performance. Tuning the hyperparameters is hig…

    Submitted 20 July, 2023; v1 submitted 26 June, 2023; originally announced June 2023.

    Comments: Accepted at MICCAI 2023

  6. arXiv:2305.11452  [pdf, other]

    cs.CV

    ReDirTrans: Latent-to-Latent Translation for Gaze and Head Redirection

    Authors: Shiwei Jin, Zhen Wang, Lei Wang, Ning Bi, Truong Nguyen

    Abstract: Learning-based gaze estimation methods require large amounts of training data with accurate gaze annotations. Facing such demanding requirements of gaze data collection and annotation, several image synthesis methods were proposed, which successfully redirected gaze directions precisely given the assigned conditions. However, these methods focused on changing gaze directions of the images that onl…

    Submitted 19 May, 2023; originally announced May 2023.

  7. arXiv:2203.16681  [pdf, other]

    cs.CV

    Face Relighting with Geometrically Consistent Shadows

    Authors: Andrew Hou, Michel Sarkis, Ning Bi, Yiying Tong, Xiaoming Liu

    Abstract: Most face relighting methods are able to handle diffuse shadows, but struggle to handle hard shadows, such as those cast by the nose. Methods that propose techniques for handling hard shadows often do not produce geometrically consistent shadows since they do not directly leverage the estimated face geometry while synthesizing them. We propose a novel differentiable algorithm for synthesizing hard…

    Submitted 30 March, 2022; originally announced March 2022.

    Comments: Accepted to CVPR 2022

  8. arXiv:2110.12385  [pdf, other]

    cs.CV cs.AI cs.LG cs.MM

    Perceptual Consistency in Video Segmentation

    Authors: Yizhe Zhang, Shubhankar Borse, Hong Cai, Ying Wang, Ning Bi, Xiaoyun Jiang, Fatih Porikli

    Abstract: In this paper, we present a novel perceptual consistency perspective on video semantic segmentation, which can capture both temporal consistency and pixel-wise correctness. Given two nearby video frames, perceptual consistency measures how much the segmentation decisions agree with the pixel correspondences obtained via matching general perceptual features. More specifically, for each pixel in one…

    Submitted 24 October, 2021; originally announced October 2021.

    Comments: To appear in WACV 2022. Comments and questions are welcome

  9. arXiv:2104.00825  [pdf, other]

    cs.CV

    Towards High Fidelity Face Relighting with Realistic Shadows

    Authors: Andrew Hou, Ze Zhang, Michel Sarkis, Ning Bi, Yiying Tong, Xiaoming Liu

    Abstract: Existing face relighting methods often struggle with two problems: maintaining the local facial details of the subject and accurately removing and synthesizing shadows in the relit image, especially hard shadows. We propose a novel deep face relighting method that addresses both problems. Our method learns to predict the ratio (quotient) image between a source image and the target image with the d…

    Submitted 5 June, 2021; v1 submitted 1 April, 2021; originally announced April 2021.

    Comments: Accepted to CVPR 2021

  10. arXiv:2009.02007  [pdf, other]

    cs.CV

    Real-Time Selfie Video Stabilization

    Authors: Jiyang Yu, Ravi Ramamoorthi, Keli Cheng, Michel Sarkis, Ning Bi

    Abstract: We propose a novel real-time selfie video stabilization method. Our method is completely automatic and runs at 26 fps. We use a 1D linear convolutional network to directly infer the rigid moving least squares warping which implicitly balances between the global rigidity and local flexibility. Our network structure is specifically designed to stabilize the background and foreground at the same time…

    Submitted 16 June, 2021; v1 submitted 4 September, 2020; originally announced September 2020.

  11. arXiv:1910.10845  [pdf, other]

    cs.CV cs.LG

    Weakly-Supervised Degree of Eye-Closeness Estimation

    Authors: Eyasu Mequanint, Shuai Zhang, Bijan Forutanpour, Yingyong Qi, Ning Bi

    Abstract: Following recent technological advances there is a growing interest in building non-intrusive methods that help us communicate with computing devices. In this regard, accurate information from eye is a promising input medium between a user and computing devices. In this paper we propose a method that captures the degree of eye closeness. Although many methods exist for detection of eyelid openness…

    Submitted 23 October, 2019; originally announced October 2019.

  12. arXiv:1905.03415  [pdf, other]

    cs.CV

    PPGNet: Learning Point-Pair Graph for Line Segment Detection

    Authors: Ziheng Zhang, Zhengxin Li, Ning Bi, Jia Zheng, Jinlei Wang, Kun Huang, Weixin Luo, Yanyu Xu, Shenghua Gao

    Abstract: In this paper, we present a novel framework to detect line segments in man-made environments. Specifically, we propose to describe junctions, line segments and relationships between them with a simple graph, which is more structured and informative than end-point representation used in existing line segment detection methods. In order to extract a line segment graph from an image, we further intro…

    Submitted 16 May, 2019; v1 submitted 8 May, 2019; originally announced May 2019.

    Comments: To appear in CVPR 2019

  13. arXiv:1904.02553  [pdf, other]

    cs.CV

    Generic Multiview Visual Tracking

    Authors: Minye Wu, Haibin Ling, Ning Bi, Shenghua Gao, Hao Sheng, Jingyi Yu

    Abstract: Recent progresses in visual tracking have greatly improved the tracking performance. However, challenges such as occlusion and view change remain obstacles in real world deployment. A natural solution to these challenges is to use multiple cameras with multiview inputs, though existing systems are mostly limited to specific targets (e.g. human), static cameras, and/or camera calibration. To break…

    Submitted 4 April, 2019; originally announced April 2019.

  14. arXiv:1812.08374  [pdf, other]

    cs.CV cs.LG

    DAC: Data-free Automatic Acceleration of Convolutional Networks

    Authors: Xin Li, Shuai Zhang, Bolan Jiang, Yingyong Qi, Mooi Choo Chuah, Ning Bi

    Abstract: Deploying a deep learning model on mobile/IoT devices is a challenging task. The difficulty lies in the trade-off between computation speed and accuracy. A complex deep learning model with high accuracy runs slowly on resource-limited devices, while a light-weight model that runs much faster loses accuracy. In this paper, we propose a novel decomposition method, namely DAC, that is capable of fact…

    Submitted 27 December, 2018; v1 submitted 20 December, 2018; originally announced December 2018.

    Comments: Accepted by IEEE Winter Conference on Applications of Computer Vision (WACV 2019)