Skip to main content

Showing 1–50 of 74 results for author: Zou, C

  1. arXiv:2407.06305  [pdf, other

    cs.CV cs.GR

    SweepNet: Unsupervised Learning Shape Abstraction via Neural Sweepers

    Authors: Mingrui Zhao, Yizhi Wang, Fenggen Yu, Changqing Zou, Ali Mahdavi-Amiri

    Abstract: Shape abstraction is an important task for simplifying complex geometric structures while retaining essential features. Sweep surfaces, commonly found in human-made objects, aid in this process by effectively capturing and representing object geometry, thereby facilitating abstraction. In this paper, we introduce \papername, a novel approach to shape abstraction through sweep surfaces. We propose… ▽ More

    Submitted 8 July, 2024; originally announced July 2024.

    Comments: 14 pages,20 figures, ECCV 2024

  2. arXiv:2405.15305  [pdf, other

    cs.CV

    Diff3DS: Generating View-Consistent 3D Sketch via Differentiable Curve Rendering

    Authors: Yibo Zhang, Lihong Wang, Changqing Zou, Tieru Wu, Rui Ma

    Abstract: 3D sketches are widely used for visually representing the 3D shape and structure of objects or scenes. However, the creation of 3D sketch often requires users to possess professional artistic skills. Existing research efforts primarily focus on enhancing the ability of interactive sketch generation in 3D virtual systems. In this work, we propose Diff3DS, a novel differentiable rendering framework… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

    Comments: Project: https://yiboz2001.github.io/Diff3DS/

  3. arXiv:2405.00700  [pdf

    cs.NE cond-mat.str-el

    Oxygen vacancies modulated VO2 for neurons and Spiking Neural Network construction

    Authors: Liang Li, Ting Zhou, Tong Liu, Zhiwei Liu, Yaping Li, Shuo Wu, Shanguang Zhao, Jinglin Zhu, Meiling Liu, Zhihan Lin, Bowen Sun, Jianjun Li, Fangwen Sun, Chongwen Zou

    Abstract: Artificial neuronal devices are the basic building blocks for neuromorphic computing systems, which have been motivated by realistic brain emulation. Aiming for these applications, various device concepts have been proposed to mimic the neuronal dynamics and functions. While till now, the artificial neuron devices with high efficiency, high stability and low power consumption are still far from pr… ▽ More

    Submitted 16 April, 2024; originally announced May 2024.

    Comments: 18 pages,4 figures

  4. arXiv:2404.16452  [pdf, other

    cs.CV

    PAD: Patch-Agnostic Defense against Adversarial Patch Attacks

    Authors: Lihua Jing, Rui Wang, Wenqi Ren, Xin Dong, Cong Zou

    Abstract: Adversarial patch attacks present a significant threat to real-world object detectors due to their practical feasibility. Existing defense methods, which rely on attack data or prior knowledge, struggle to effectively address a wide range of adversarial patches. In this paper, we show two inherent characteristics of adversarial patches, semantic independence and spatial heterogeneity, independent… ▽ More

    Submitted 25 April, 2024; originally announced April 2024.

    Comments: Accepted by CVPR 2024

  5. The Victim and The Beneficiary: Exploiting a Poisoned Model to Train a Clean Model on Poisoned Data

    Authors: Zixuan Zhu, Rui Wang, Cong Zou, Lihua Jing

    Abstract: Recently, backdoor attacks have posed a serious security threat to the training process of deep neural networks (DNNs). The attacked model behaves normally on benign samples but outputs a specific result when the trigger is present. However, compared with the rocketing progress of backdoor attacks, existing defenses are difficult to deal with these threats effectively or require benign samples to… ▽ More

    Submitted 31 May, 2024; v1 submitted 17 April, 2024; originally announced April 2024.

    Comments: 13 pages, 6 figures, published to ICCV

    Journal ref: Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV). 2023: 155-164

  6. arXiv:2404.09499  [pdf, other

    cs.CV cs.GR

    Learning Human Motion from Monocular Videos via Cross-Modal Manifold Alignment

    Authors: Shuaiying Hou, Hongyu Tao, Junheng Fang, Changqing Zou, Hujun Bao, Weiwei Xu

    Abstract: Learning 3D human motion from 2D inputs is a fundamental task in the realms of computer vision and computer graphics. Many previous methods grapple with this inherently ambiguous task by introducing motion priors into the learning process. However, these approaches face difficulties in defining the complete configurations of such priors or training a robust model. In this paper, we present the Vid… ▽ More

    Submitted 15 April, 2024; originally announced April 2024.

  7. arXiv:2404.08252  [pdf, other

    cs.CV

    MonoPatchNeRF: Improving Neural Radiance Fields with Patch-based Monocular Guidance

    Authors: Yuqun Wu, Jae Yong Lee, Chuhang Zou, Shenlong Wang, Derek Hoiem

    Abstract: The latest regularized Neural Radiance Field (NeRF) approaches produce poor geometry and view extrapolation for multiview stereo (MVS) benchmarks such as ETH3D. In this paper, we aim to create 3D models that provide accurate geometry and view synthesis, partially closing the large geometric performance gap between NeRF and traditional MVS methods. We propose a patch-based approach that effectively… ▽ More

    Submitted 12 April, 2024; originally announced April 2024.

    Comments: 26 pages, 15 figures

  8. arXiv:2403.11077  [pdf, other

    cs.CV

    Zippo: Zipping Color and Transparency Distributions into a Single Diffusion Model

    Authors: Kangyang Xie, Binbin Yang, Hao Chen, Meng Wang, Cheng Zou, Hui Xue, Ming Yang, Chunhua Shen

    Abstract: Beyond the superiority of the text-to-image diffusion model in generating high-quality images, recent studies have attempted to uncover its potential for adapting the learned semantic knowledge to visual perception tasks. In this work, instead of translating a generative diffusion model into a visual perception model, we explore to retain the generative ability with the perceptive adaptation. To a… ▽ More

    Submitted 19 March, 2024; v1 submitted 17 March, 2024; originally announced March 2024.

  9. arXiv:2403.09439  [pdf, other

    cs.CV cs.AI

    3D-SceneDreamer: Text-Driven 3D-Consistent Scene Generation

    Authors: Frank Zhang, Yibo Zhang, Quan Zheng, Rui Ma, Wei Hua, Hujun Bao, Weiwei Xu, Changqing Zou

    Abstract: Text-driven 3D scene generation techniques have made rapid progress in recent years. Their success is mainly attributed to using existing generative models to iteratively perform image warping and inpainting to generate 3D scenes. However, these methods heavily rely on the outputs of existing models, leading to error accumulation in geometry and appearance that prevent the models from being used i… ▽ More

    Submitted 14 March, 2024; originally announced March 2024.

    Comments: 11 pages, 7 figures

  10. arXiv:2403.07728  [pdf, other

    stat.ML cs.LG stat.ME

    CAP: A General Algorithm for Online Selective Conformal Prediction with FCR Control

    Authors: Yajie Bao, Yuyang Huo, Haojie Ren, Changliang Zou

    Abstract: We study the problem of post-selection predictive inference in an online fashion. To avoid devoting resources to unimportant units, a preliminary selection of the current individual before reporting its prediction interval is common and meaningful in online predictive tasks. Since the online selection causes a temporal multiplicity in the selected prediction intervals, it is important to control t… ▽ More

    Submitted 28 March, 2024; v1 submitted 12 March, 2024; originally announced March 2024.

  11. arXiv:2311.12818  [pdf, other

    cs.CV cs.GR

    Manifold Path Guiding for Importance Sampling Specular Chains

    Authors: Zhimin Fan, Pengpei Hong, Jie Guo, Changqing Zou, Yanwen Guo, Ling-Qi Yan

    Abstract: Complex visual effects such as caustics are often produced by light paths containing multiple consecutive specular vertices (dubbed specular chains), which pose a challenge to unbiased estimation in Monte Carlo rendering. In this work, we study the light transport behavior within a sub-path that is comprised of a specular chain and two non-specular separators. We show that the specular manifolds f… ▽ More

    Submitted 24 September, 2023; originally announced November 2023.

    Comments: 14 pages, 19 figures

    ACM Class: I.3.6

  12. arXiv:2309.05941  [pdf

    cs.CR

    Random Segmentation: New Traffic Obfuscation against Packet-Size-Based Side-Channel Attacks

    Authors: Mnassar Alyami, Abdulmajeed Alghamdi, Mohammed Alkhowaiter, Cliff Zou, Yan Solihin

    Abstract: Despite encryption, the packet size is still visible, enabling observers to infer private information in the Internet of Things (IoT) environment (e.g., IoT device identification). Packet padding obfuscates packet-length characteristics with a high data overhead because it relies on adding noise to the data. This paper proposes a more data-efficient approach that randomizes packet sizes without ad… ▽ More

    Submitted 11 September, 2023; originally announced September 2023.

    Comments: 15 pages, 4 figures, to appear in Sensors 2023

  13. arXiv:2308.15902  [pdf

    physics.optics cs.ET

    Photonic time-delayed reservoir computing based on series coupled microring resonators with high memory capacity

    Authors: Yijia Li, Ming Li, MingYi Gao, Chang-Ling Zou, Chun-Hua Dong, Jin Lu, Yali Qin, XiaoNiu Yang, Qi Xuan, Hongliang Ren

    Abstract: On-chip microring resonators (MRRs) have been proposed to construct the time-delayed reservoir computing (RC), which offers promising configurations available for computation with high scalability, high-density computing, and easy fabrication. A single MRR, however, is inadequate to supply enough memory for the computational task with diverse memory requirements. Large memory needs are met by the… ▽ More

    Submitted 30 August, 2023; originally announced August 2023.

  14. arXiv:2308.13176  [pdf, other

    cs.SI cs.LG

    Using Adamic-Adar Index Algorithm to Predict Volunteer Collaboration: Less is More

    Authors: Chao Wu, Peng Chen, Baiqiao Yin, Zijuan Lin, Chen Jiang, Di Yu, Changhong Zou, Chunwang Lui

    Abstract: Social networks exhibit a complex graph-like structure due to the uncertainty surrounding potential collaborations among participants. Machine learning algorithms possess generic outstanding performance in multiple real-world prediction tasks. However, whether machine learning algorithms outperform specific algorithms designed for graph link prediction remains unknown to us. To address this issue,… ▽ More

    Submitted 25 August, 2023; originally announced August 2023.

  15. arXiv:2308.04669  [pdf, other

    cs.CV cs.GR cs.LG

    A General Implicit Framework for Fast NeRF Composition and Rendering

    Authors: Xinyu Gao, Ziyi Yang, Yunlu Zhao, Yuxiang Sun, Xiaogang Jin, Changqing Zou

    Abstract: A variety of Neural Radiance Fields (NeRF) methods have recently achieved remarkable success in high render speed. However, current accelerating methods are specialized and incompatible with various implicit methods, preventing real-time composition over various types of NeRF works. Because NeRF relies on sampling along rays, it is possible to provide general guidance for acceleration. To that end… ▽ More

    Submitted 4 January, 2024; v1 submitted 8 August, 2023; originally announced August 2023.

    Comments: AAAI 2024

  16. arXiv:2306.00314  [pdf

    cs.CR cs.AI

    Adversarial-Aware Deep Learning System based on a Secondary Classical Machine Learning Verification Approach

    Authors: Mohammed Alkhowaiter, Hisham Kholidy, Mnassar Alyami, Abdulmajeed Alghamdi, Cliff Zou

    Abstract: Deep learning models have been used in creating various effective image classification applications. However, they are vulnerable to adversarial attacks that seek to misguide the models into predicting incorrect classes. Our study of major adversarial attack models shows that they all specifically target and exploit the neural networking structures in their designs. This understanding makes us dev… ▽ More

    Submitted 31 May, 2023; originally announced June 2023.

    Comments: 17 pages, 3 figures

  17. Side-Channel VoIP Profiling Attack against Customer Service Automated Phone System

    Authors: Roy Laurens, Edo Christianto, Bruce Caulkins, Cliff C. Zou

    Abstract: In many VoIP systems, Voice Activity Detection (VAD) is often used on VoIP traffic to suppress packets of silence in order to reduce the bandwidth consumption of phone calls. Unfortunately, although VoIP traffic is fully encrypted and secured, traffic analysis of this suppression can reveal identifying information about calls made to customer service automated phone systems. Because different cust… ▽ More

    Submitted 31 May, 2023; originally announced June 2023.

    Comments: 6 pages, 12 figures. Published in IEEE Global Communications Conference (GLOBECOM), 2022

    Journal ref: 2022 IEEE Global Communications Conference, Rio de Janeiro, Brazil, 2022, pp. 6091-6096

  18. arXiv:2305.04685  [pdf, other

    cs.RO

    ARDIE: AR, Dialogue, and Eye Gaze Policies for Human-Robot Collaboration

    Authors: Chelsea Zou, Kishan Chandan, Yan Ding, Shiqi Zhang

    Abstract: Human-robot collaboration (HRC) has become increasingly relevant in industrial, household, and commercial settings. However, the effectiveness of such collaborations is highly dependent on the human and robots' situational awareness of the environment. Improving this awareness includes not only aligning perceptions in a shared workspace, but also bidirectionally communicating intent and visualizin… ▽ More

    Submitted 8 May, 2023; originally announced May 2023.

  19. arXiv:2304.14422  [pdf, other

    cs.LG

    MINN: Learning the dynamics of differential-algebraic equations and application to battery modeling

    Authors: Yicun Huang, Changfu Zou, Yang Li, Torsten Wik

    Abstract: The concept of integrating physics-based and data-driven approaches has become popular for modeling sustainable energy systems. However, the existing literature mainly focuses on the data-driven surrogates generated to replace physics-based models. These models often trade accuracy for speed but lack the generalisability, adaptability, and interpretability inherent in physics-based models, which a… ▽ More

    Submitted 27 April, 2023; originally announced April 2023.

  20. arXiv:2303.17867  [pdf, other

    cs.CV cs.LG eess.IV

    CAP-VSTNet: Content Affinity Preserved Versatile Style Transfer

    Authors: Linfeng Wen, Chengying Gao, Changqing Zou

    Abstract: Content affinity loss including feature and pixel affinity is a main problem which leads to artifacts in photorealistic and video style transfer. This paper proposes a new framework named CAP-VSTNet, which consists of a new reversible residual network and an unbiased linear transform module, for versatile style transfer. This reversible residual network can not only preserve content affinity but n… ▽ More

    Submitted 31 March, 2023; originally announced March 2023.

    Comments: CVPR 2023

  21. arXiv:2303.10839  [pdf, other

    cs.CV

    MXM-CLR: A Unified Framework for Contrastive Learning of Multifold Cross-Modal Representations

    Authors: Ye Wang, Bowei Jiang, Changqing Zou, Rui Ma

    Abstract: Multifold observations are common for different data modalities, e.g., a 3D shape can be represented by multi-view images and an image can be described with different captions. Existing cross-modal contrastive representation learning (XM-CLR) methods such as CLIP are not fully suitable for multifold data as they only consider one positive pair and treat other pairs as negative when computing the c… ▽ More

    Submitted 20 March, 2023; v1 submitted 19 March, 2023; originally announced March 2023.

    Comments: 16 pages, 14 figures

  22. arXiv:2302.14335  [pdf, other

    cs.CV

    DC-Former: Diverse and Compact Transformer for Person Re-Identification

    Authors: Wen Li, Cheng Zou, Meng Wang, Furong Xu, Jianan Zhao, Ruobing Zheng, Yuan Cheng, Wei Chu

    Abstract: In person re-identification (re-ID) task, it is still challenging to learn discriminative representation by deep learning, due to limited data. Generally speaking, the model will get better performance when increasing the amount of data. The addition of similar classes strengthens the ability of the classifier to identify similar identities, thereby improving the discrimination of representation.… ▽ More

    Submitted 28 February, 2023; originally announced February 2023.

    Comments: Accepted by AAAI23

  23. arXiv:2212.14670  [pdf, other

    q-fin.TR cs.AI cs.LG q-fin.ST

    Hierarchical Deep Reinforcement Learning for VWAP Strategy Optimization

    Authors: Xiaodong Li, Pangjing Wu, Chenxin Zou, Qing Li

    Abstract: Designing an intelligent volume-weighted average price (VWAP) strategy is a critical concern for brokers, since traditional rule-based strategies are relatively static that cannot achieve a lower transaction cost in a dynamic market. Many studies have tried to minimize the cost via reinforcement learning, but there are bottlenecks in improvement, especially for long-duration strategies such as the… ▽ More

    Submitted 11 December, 2022; originally announced December 2022.

  24. arXiv:2212.00994  [pdf, ps, other

    cs.AI

    Knowledge Graph Quality Evaluation under Incomplete Information

    Authors: Xiaodong Li, Chenxin Zou, Yi Cai, Yuelong Zhu

    Abstract: Knowledge graphs (KGs) have attracted more and more attentions because of their fundamental roles in many tasks. Quality evaluation for KGs is thus crucial and indispensable. Existing methods in this field evaluate KGs by either proposing new quality metrics from different dimensions or measuring performances at KG construction stages. However, there are two major issues with those methods. First,… ▽ More

    Submitted 12 April, 2023; v1 submitted 2 December, 2022; originally announced December 2022.

  25. arXiv:2212.00914  [pdf, other

    cs.CV

    QFF: Quantized Fourier Features for Neural Field Representations

    Authors: Jae Yong Lee, Yuqun Wu, Chuhang Zou, Shenlong Wang, Derek Hoiem

    Abstract: Multilayer perceptrons (MLPs) learn high frequencies slowly. Recent approaches encode features in spatial bins to improve speed of learning details, but at the cost of larger model size and loss of continuity. Instead, we propose to encode features in bins of Fourier features that are commonly used for positional encoding. We call these Quantized Fourier Features (QFF). As a naturally multiresolut… ▽ More

    Submitted 1 December, 2022; originally announced December 2022.

  26. arXiv:2210.07582  [pdf, other

    cs.CV

    Deep PatchMatch MVS with Learned Patch Coplanarity, Geometric Consistency and Adaptive Pixel Sampling

    Authors: Jae Yong Lee, Chuhang Zou, Derek Hoiem

    Abstract: Recent work in multi-view stereo (MVS) combines learnable photometric scores and regularization with PatchMatch-based optimization to achieve robust pixelwise estimates of depth, normals, and visibility. However, non-learning based methods still outperform for large scenes with sparse views, in part due to use of geometric consistency constraints and ability to optimize over many views at high res… ▽ More

    Submitted 14 October, 2022; originally announced October 2022.

  27. arXiv:2207.01216  [pdf, other

    cs.CV

    Solutions for Fine-grained and Long-tailed Snake Species Recognition in SnakeCLEF 2022

    Authors: Cheng Zou, Furong Xu, Meng Wang, Wen Li, Yuan Cheng

    Abstract: Automatic snake species recognition is important because it has vast potential to help lower deaths and disabilities caused by snakebites. We introduce our solution in SnakeCLEF 2022 for fine-grained snake species recognition on a heavy long-tailed class distribution. First, a network architecture is designed to extract and fuse features from multiple modalities, i.e. photograph from visual modali… ▽ More

    Submitted 4 July, 2022; originally announced July 2022.

    Comments: Top solutions for FGVC9, accepted to CLEF2022

  28. arXiv:2206.06741  [pdf, other

    cs.CV

    Recurrent Transformer Variational Autoencoders for Multi-Action Motion Synthesis

    Authors: Rania Briq, Chuhang Zou, Leonid Pishchulin, Chris Broaddus, Juergen Gall

    Abstract: We consider the problem of synthesizing multi-action human motion sequences of arbitrary lengths. Existing approaches have mastered motion sequence generation in single action scenarios, but fail to generalize to multi-action and arbitrary-length sequences. We fill this gap by proposing a novel efficient approach that leverages expressiveness of Recurrent Transformers and generative richness of co… ▽ More

    Submitted 27 June, 2022; v1 submitted 14 June, 2022; originally announced June 2022.

    Comments: accepted at Transformers for Vision workshop at CVPR 2022

  29. arXiv:2205.09335  [pdf, other

    cs.LG cs.SI

    A Simple Yet Effective SVD-GCN for Directed Graphs

    Authors: Chunya Zou, Andi Han, Lequan Lin, Junbin Gao

    Abstract: In this paper, we propose a simple yet effective graph neural network for directed graphs (digraph) based on the classic Singular Value Decomposition (SVD), named SVD-GCN. The new graph neural network is built upon the graph SVD-framelet to better decompose graph signals on the SVD ``frequency'' bands. Further the new framelet SVD-GCN is also scaled up for larger scale graphs via using Chebyshev p… ▽ More

    Submitted 19 May, 2022; originally announced May 2022.

    Comments: 14 pages

  30. arXiv:2202.06738  [pdf, other

    eess.SP cs.LG

    Attention-based Deep Neural Networks for Battery Discharge Capacity Forecasting

    Authors: Yadong Zhang, Chenye Zou, Xin Chen

    Abstract: Battery discharge capacity forecasting is critically essential for the applications of lithium-ion batteries. The capacity degeneration can be treated as the memory of the initial battery state of charge from the data point of view. The streaming sensor data collected by battery management systems (BMS) reflect the usable battery capacity degradation rates under various operational working conditi… ▽ More

    Submitted 14 February, 2022; originally announced February 2022.

  31. arXiv:2112.07383  [pdf, other

    cs.CV

    Improving Human-Object Interaction Detection via Phrase Learning and Label Composition

    Authors: Zhimin Li, Cheng Zou, Yu Zhao, Boxun Li, Sheng Zhong

    Abstract: Human-Object Interaction (HOI) detection is a fundamental task in high-level human-centric scene understanding. We propose PhraseHOI, containing a HOI branch and a novel phrase branch, to leverage language prior and improve relation expression. Specifically, the phrase branch is supervised by semantic embeddings, whose ground truths are automatically converted from the original HOI annotations wit… ▽ More

    Submitted 15 January, 2022; v1 submitted 14 December, 2021; originally announced December 2021.

    Comments: Accepted to AAAI2022

  32. arXiv:2112.04761  [pdf, other

    cs.CV

    HBReID: Harder Batch for Re-identification

    Authors: Wen Li, Furong Xu, Jianan Zhao, Ruobing Zheng, Cheng Zou, Meng Wang, Yuan Cheng

    Abstract: Triplet loss is a widely adopted loss function in ReID task which pulls the hardest positive pairs close and pushes the hardest negative pairs far away. However, the selected samples are not the hardest globally, but the hardest only in a mini-batch, which will affect the performance. In this report, a hard batch mining method is proposed to mine the hardest samples globally to make triplet harder… ▽ More

    Submitted 9 December, 2021; originally announced December 2021.

  33. arXiv:2112.02889  [pdf, other

    cs.CV cs.CL cs.LG eess.IV

    Joint Learning of Localized Representations from Medical Images and Reports

    Authors: Philip Müller, Georgios Kaissis, Congyu Zou, Daniel Rueckert

    Abstract: Contrastive learning has proven effective for pre-training image models on unlabeled data with promising results for tasks such as medical image classification. Using paired text (like radiological reports) during pre-training improves the results even further. Still, most existing methods target image classification downstream tasks and may not be optimal for localized tasks like semantic segment… ▽ More

    Submitted 31 August, 2022; v1 submitted 6 December, 2021; originally announced December 2021.

    Comments: Accepted at ECCV 2022

    Journal ref: Computer Vision - ECCV 2022, pp. 685-701

  34. arXiv:2109.11913  [pdf

    cs.MM

    Spatial Information Refinement for Chroma Intra Prediction in Video Coding

    Authors: Chengyi Zou, Shuai Wan, Tiannan Ji, Marta Mrak, Marc Gorriz Blanch, Luis Herranz

    Abstract: Video compression benefits from advanced chroma intra prediction methods, such as the Cross-Component Linear Model (CCLM) which uses linear models to approximate the relationship between the luma and chroma components. Recently it has been proven that advanced cross-component prediction methods based on Neural Networks (NN) can bring additional coding gains. In this paper, spatial information refi… ▽ More

    Submitted 24 September, 2021; originally announced September 2021.

  35. arXiv:2109.01971  [pdf, other

    cs.NI eess.SP

    Horizontal and Vertical Collaboration for VR Delivery in MEC-Enabled Small-Cell Networks

    Authors: Zhuojia Gu, Hancheng Lu, Chenkai Zou

    Abstract: Due to the large bandwidth, low latency and computationally intensive features of virtual reality (VR) video applications, the current resource-constrained wireless and edge networks cannot meet the requirements of on-demand VR delivery. In this letter, we propose a joint horizontal and vertical collaboration architecture in mobile edge computing (MEC)-enabled small-cell networks for VR delivery.… ▽ More

    Submitted 4 September, 2021; originally announced September 2021.

    Comments: 5 pages, 5 figures

  36. arXiv:2108.08943  [pdf, other

    cs.CV

    PatchMatch-RL: Deep MVS with Pixelwise Depth, Normal, and Visibility

    Authors: Jae Yong Lee, Joseph DeGol, Chuhang Zou, Derek Hoiem

    Abstract: Recent learning-based multi-view stereo (MVS) methods show excellent performance with dense cameras and small depth ranges. However, non-learning based approaches still outperform for scenes with large depth ranges and sparser wide-baseline views, in part due to their PatchMatch optimization over pixelwise estimates of depth, normals, and visibility. In this paper, we propose an end-to-end trainab… ▽ More

    Submitted 19 August, 2021; originally announced August 2021.

    Comments: Accepted to ICCV 2021 for oral presentation

  37. SADRNet: Self-Aligned Dual Face Regression Networks for Robust 3D Dense Face Alignment and Reconstruction

    Authors: Zeyu Ruan, Changqing Zou, Longhai Wu, Gangshan Wu, Limin Wang

    Abstract: Three-dimensional face dense alignment and reconstruction in the wild is a challenging problem as partial facial information is commonly missing in occluded and large pose face images. Large head pose variations also increase the solution space and make the modeling more difficult. Our key idea is to model occlusion and pose to decompose this challenging task into several relatively more manageabl… ▽ More

    Submitted 5 June, 2021; originally announced June 2021.

    Comments: To appear in IEEE Transactions on Image Processing. Code and model is available at https://github.com/MCG-NJU/SADRNet

  38. arXiv:2104.05666  [pdf, other

    cs.CV

    View-Guided Point Cloud Completion

    Authors: Xuancheng Zhang, Yutong Feng, Siqi Li, Changqing Zou, Hai Wan, Xibin Zhao, Yandong Guo, Yue Gao

    Abstract: This paper presents a view-guided solution for the task of point cloud completion. Unlike most existing methods directly inferring the missing points using shape priors, we address this task by introducing ViPC (view-guided point cloud completion) that takes the missing crucial global structure information from an extra single-view image. By leveraging a framework that sequentially performs effect… ▽ More

    Submitted 13 April, 2021; v1 submitted 12 April, 2021; originally announced April 2021.

    Comments: 10 pages, 8 figures, CVPR2021

  39. A Web Infrastructure for Certifying Multimedia News Content for Fake News Defense

    Authors: Edward L. Amoruso, Raghu Avula, Stephen P. Johnson, Cliff C. Zou

    Abstract: In dealing with altered multimedia news content, also referred to as fake news, we present a ready-to-deploy scheme based on existing public key infrastructure as a new fake news defense paradigm. This scheme enables news organizations to certify/endorse a newsworthy multimedia news content and securely and conveniently pass this trust information to end users. A news organization can use our prog… ▽ More

    Submitted 23 May, 2023; v1 submitted 9 April, 2021; originally announced April 2021.

    Comments: 7 pages, 6 figures

  40. arXiv:2103.04503  [pdf, other

    cs.CV

    End-to-End Human Object Interaction Detection with HOI Transformer

    Authors: Cheng Zou, Bohan Wang, Yue Hu, Junqi Liu, Qian Wu, Yu Zhao, Boxun Li, Chenguang Zhang, Chi Zhang, Yichen Wei, Jian Sun

    Abstract: We propose HOI Transformer to tackle human object interaction (HOI) detection in an end-to-end manner. Current approaches either decouple HOI task into separated stages of object detection and interaction classification or introduce surrogate interaction problem. In contrast, our method, named HOI Transformer, streamlines the HOI pipeline by eliminating the need for many hand-designed components.… ▽ More

    Submitted 7 March, 2021; originally announced March 2021.

    Comments: Accepted to CVPR2021

  41. GaNDLF: A Generally Nuanced Deep Learning Framework for Scalable End-to-End Clinical Workflows in Medical Imaging

    Authors: Sarthak Pati, Siddhesh P. Thakur, İbrahim Ethem Hamamcı, Ujjwal Baid, Bhakti Baheti, Megh Bhalerao, Orhun Güley, Sofia Mouchtaris, David Lang, Spyridon Thermos, Karol Gotkowski, Camila González, Caleb Grenko, Alexander Getka, Brandon Edwards, Micah Sheller, Junwen Wu, Deepthi Karkada, Ravi Panchumarthy, Vinayak Ahluwalia, Chunrui Zou, Vishnu Bashyam, Yuemeng Li, Babak Haghighi, Rhea Chitalia , et al. (17 additional authors not shown)

    Abstract: Deep Learning (DL) has the potential to optimize machine learning in both the scientific and clinical communities. However, greater expertise is required to develop DL algorithms, and the variability of implementations hinders their reproducibility, translation, and deployment. Here we present the community-driven Generally Nuanced Deep Learning Framework (GaNDLF), with the goal of lowering these… ▽ More

    Submitted 16 May, 2023; v1 submitted 25 February, 2021; originally announced March 2021.

    Comments: Deep Learning, Framework, Segmentation, Regression, Classification, Cross-validation, Data augmentation, Deployment, Clinical, Workflows

    Journal ref: Commun Eng 2, 23 (2023)

  42. arXiv:2007.05876  [pdf, other

    cs.CR

    On Runtime Software Security of TrustZone-M based IoT Devices

    Authors: Lan Luo, Yue Zhang, Cliff C. Zou, Xinhui Shao, Zhen Ling, Xinwen Fu

    Abstract: Internet of Things (IoT) devices have been increasingly integrated into our daily life. However, such smart devices suffer a broad attack surface. Particularly, attacks targeting the device software at runtime are challenging to defend against if IoT devices use resource-constrained microcontrollers (MCUs). TrustZone-M, a TrustZone extension for MCUs, is an emerging security technique fortifying M… ▽ More

    Submitted 11 July, 2020; originally announced July 2020.

    Comments: 6 pages, 3 figures

  43. arXiv:2003.13910  [pdf, other

    cs.CV

    Attention-based Multi-modal Fusion Network for Semantic Scene Completion

    Authors: Siqi Li, Changqing Zou, Yipeng Li, Xibin Zhao, Yue Gao

    Abstract: This paper presents an end-to-end 3D convolutional network named attention-based multi-modal fusion network (AMFNet) for the semantic scene completion (SSC) task of inferring the occupancy and semantic labels of a volumetric 3D scene from single-view RGB-D images. Compared with previous methods which use only the semantic features extracted from RGB-D images, the proposed AMFNet learns to perform… ▽ More

    Submitted 15 April, 2020; v1 submitted 30 March, 2020; originally announced March 2020.

    Comments: Accepted by AAAI 2020

  44. arXiv:2003.02683  [pdf, other

    cs.CV

    SketchyCOCO: Image Generation from Freehand Scene Sketches

    Authors: Chengying Gao, Qi Liu, Qi Xu, Limin Wang, Jianzhuang Liu, Changqing Zou

    Abstract: We introduce the first method for automatic image generation from scene-level freehand sketches. Our model allows for controllable image generation by specifying the synthesis goal via freehand sketches. The key contribution is an attribute vector bridged Generative Adversarial Network called EdgeGAN, which supports high visual-quality object-level image content generation without using freehand s… ▽ More

    Submitted 7 April, 2020; v1 submitted 5 March, 2020; originally announced March 2020.

  45. arXiv:2002.07362  [pdf, other

    cs.CV

    MILA: Multi-Task Learning from Videos via Efficient Inter-Frame Attention

    Authors: Donghyun Kim, Tian Lan, Chuhang Zou, Ning Xu, Bryan A. Plummer, Stan Sclaroff, Jayan Eledath, Gerard Medioni

    Abstract: Prior work in multi-task learning has mainly focused on predictions on a single image. In this work, we present a new approach for multi-task learning from videos via efficient inter-frame local attention (MILA). Our approach contains a novel inter-frame attention module which allows learning of task-specific attention across frames. We embed the attention module in a ``slow-fast'' architecture, w… ▽ More

    Submitted 10 October, 2021; v1 submitted 17 February, 2020; originally announced February 2020.

    Comments: Accepted in ICCV 2021 MTL Workshop

  46. arXiv:1910.09447  [pdf, other

    cs.CV

    Improving Style Transfer with Calibrated Metrics

    Authors: Mao-Chuang Yeh, Shuai Tang, Anand Bhattad, Chuhang Zou, David Forsyth

    Abstract: Style transfer methods produce a transferred image which is a rendering of a content image in the manner of a style image. We seek to understand how to improve style transfer. To do so requires quantitative evaluation procedures, but the current evaluation is qualitative, mostly involving user studies. We describe a novel quantitative evaluation procedure. Our procedure relies on two statistics:… ▽ More

    Submitted 13 February, 2020; v1 submitted 21 October, 2019; originally announced October 2019.

    Comments: updated conference camera ready version. arXiv admin note: text overlap with arXiv:1804.00118

  47. arXiv:1910.05786  [pdf, other

    cs.CL

    Progress Notes Classification and Keyword Extraction using Attention-based Deep Learning Models with BERT

    Authors: Matthew Tang, Priyanka Gandhi, Md Ahsanul Kabir, Christopher Zou, Jordyn Blakey, Xiao Luo

    Abstract: Various deep learning algorithms have been developed to analyze different types of clinical data including clinical text classification and extracting information from 'free text' and so on. However, automate the keyword extraction from the clinical notes is still challenging. The challenges include dealing with noisy clinical notes which contain various abbreviations, possible typos, and unstruct… ▽ More

    Submitted 24 October, 2019; v1 submitted 13 October, 2019; originally announced October 2019.

  48. arXiv:1910.04099  [pdf, other

    cs.CV cs.LG eess.IV

    Manhattan Room Layout Reconstruction from a Single 360 image: A Comparative Study of State-of-the-art Methods

    Authors: Chuhang Zou, Jheng-Wei Su, Chi-Han Peng, Alex Colburn, Qi Shan, Peter Wonka, Hung-Kuo Chu, Derek Hoiem

    Abstract: Recent approaches for predicting layouts from 360 panoramas produce excellent results. These approaches build on a common framework consisting of three steps: a pre-processing step based on edge-based alignment, prediction of layout elements, and a post-processing step by fitting a 3D layout to the layout elements. Until now, it has been difficult to compare the methods due to multiple different d… ▽ More

    Submitted 25 December, 2020; v1 submitted 9 October, 2019; originally announced October 2019.

    Comments: Accepted by International Journal of Computer Vision (IJCV), 2021

  49. arXiv:1909.04326  [pdf, other

    cs.CV

    Universal Physical Camouflage Attacks on Object Detectors

    Authors: Lifeng Huang, Chengying Gao, Yuyin Zhou, Cihang Xie, Alan Yuille, Changqing Zou, Ning Liu

    Abstract: In this paper, we study physical adversarial attacks on object detectors in the wild. Previous works mostly craft instance-dependent perturbations only for rigid or planar objects. To this end, we propose to learn an adversarial pattern to effectively attack all instances belonging to the same object category, referred to as Universal Physical Camouflage Attack (UPC). Concretely, UPC crafts camouf… ▽ More

    Submitted 21 April, 2020; v1 submitted 10 September, 2019; originally announced September 2019.

    Comments: CVPR 2020; codes, models, and demos are available at https://mesunhlf.github.io/index_physical.html

  50. arXiv:1909.00915  [pdf, other

    cs.CV

    Counterfactual Depth from a Single RGB Image

    Authors: Theerasit Issaranon, Chuhang Zou, David Forsyth

    Abstract: We describe a method that predicts, from a single RGB image, a depth map that describes the scene when a masked object is removed - we call this "counterfactual depth" that models hidden scene geometry together with the observations. Our method works for the same reason that scene completion works: the spatial structure of objects is simple. But we offer a much higher resolution representation of… ▽ More

    Submitted 2 September, 2019; originally announced September 2019.