Skip to main content

Showing 1–24 of 24 results for author: Pang, H

  1. arXiv:2402.12770  [pdf, other

    cs.CL

    Acknowledgment of Emotional States: Generating Validating Responses for Empathetic Dialogue

    Authors: Zi Haur Pang, Yahui Fu, Divesh Lala, Keiko Ochi, Koji Inoue, Tatsuya Kawahara

    Abstract: In the realm of human-AI dialogue, the facilitation of empathetic responses is important. Validation is one of the key communication techniques in psychology, which entails recognizing, understanding, and acknowledging others' emotional states, thoughts, and actions. This study introduces the first framework designed to engender empathetic dialogue with validating responses. Our approach incorpora… ▽ More

    Submitted 20 February, 2024; originally announced February 2024.

    Comments: This paper has been accepted for presentation at International Workshop on Spoken Dialogue Systems Technology 2024 (IWSDS 2024)

  2. arXiv:2402.01509  [pdf, other

    eess.IV cs.CV cs.LG

    Advancing Brain Tumor Inpainting with Generative Models

    Authors: Ruizhi Zhu, Xinru Zhang, Haowen Pang, Chundan Xu, Chuyang Ye

    Abstract: Synthesizing healthy brain scans from diseased brain scans offers a potential solution to address the limitations of general-purpose algorithms, such as tissue segmentation and brain extraction algorithms, which may not effectively handle diseased images. We consider this a 3D inpainting task and investigate the adaptation of 2D inpainting methods to meet the requirements of 3D magnetic resonance… ▽ More

    Submitted 2 February, 2024; originally announced February 2024.

  3. arXiv:2312.08704  [pdf, other

    cs.CV cs.GR

    PairingNet: A Learning-based Pair-searching and -matching Network for Image Fragments

    Authors: Rixin Zhou, Ding Xia, Yi Zhang, Honglin Pang, Xi Yang, Chuntao Li

    Abstract: In this paper, we propose a learning-based image fragment pair-searching and -matching approach to solve the challenging restoration problem. Existing works use rule-based methods to match similar contour shapes or textures, which are always difficult to tune hyperparameters for extensive data and computationally time-consuming. Therefore, we propose a neural network that can effectively utilize n… ▽ More

    Submitted 14 December, 2023; originally announced December 2023.

    Comments: 14 pages, 16 figures, 4 tables

  4. arXiv:2312.05941  [pdf, other

    cs.CV

    ASH: Animatable Gaussian Splats for Efficient and Photoreal Human Rendering

    Authors: Haokai Pang, Heming Zhu, Adam Kortylewski, Christian Theobalt, Marc Habermann

    Abstract: Real-time rendering of photorealistic and controllable human avatars stands as a cornerstone in Computer Vision and Graphics. While recent advances in neural implicit rendering have unlocked unprecedented photorealism for digital avatars, real-time performance has mostly been demonstrated for static scenes only. To address this, we propose ASH, an animatable Gaussian splatting approach for photore… ▽ More

    Submitted 15 April, 2024; v1 submitted 10 December, 2023; originally announced December 2023.

    Comments: For project page, see https://vcai.mpi-inf.mpg.de/projects/ash/

  5. arXiv:2310.17901  [pdf, other

    cs.LG stat.ML

    Improving the Knowledge Gradient Algorithm

    Authors: Yang Le, Gao Siyang, Ho Chin Pang

    Abstract: The knowledge gradient (KG) algorithm is a popular policy for the best arm identification (BAI) problem. It is built on the simple idea of always choosing the measurement that yields the greatest expected one-step improvement in the estimate of the best mean of the arms. In this research, we show that this policy has limitations, causing the algorithm not asymptotically optimal. We next provide a… ▽ More

    Submitted 27 October, 2023; originally announced October 2023.

    Comments: 32 pages, 42 figures

  6. arXiv:2309.17448  [pdf, other

    cs.CV

    SMPLer-X: Scaling Up Expressive Human Pose and Shape Estimation

    Authors: Zhongang Cai, Wanqi Yin, Ailing Zeng, Chen Wei, Qingping Sun, Yanjun Wang, Hui En Pang, Haiyi Mei, Mingyuan Zhang, Lei Zhang, Chen Change Loy, Lei Yang, Ziwei Liu

    Abstract: Expressive human pose and shape estimation (EHPS) unifies body, hands, and face motion capture with numerous applications. Despite encouraging progress, current state-of-the-art methods still depend largely on a confined set of training datasets. In this work, we investigate scaling up EHPS towards the first generalist foundation model (dubbed SMPLer-X), with up to ViT-Huge as the backbone and tra… ▽ More

    Submitted 30 October, 2023; v1 submitted 29 September, 2023; originally announced September 2023.

    Comments: Homepage: https://caizhongang.github.io/projects/SMPLer-X/

  7. arXiv:2309.10684  [pdf, other

    cs.CV cs.GR

    Locally Stylized Neural Radiance Fields

    Authors: Hong-Wing Pang, Binh-Son Hua, Sai-Kit Yeung

    Abstract: In recent years, there has been increasing interest in applying stylization on 3D scenes from a reference style image, in particular onto neural radiance fields (NeRF). While performing stylization directly on NeRF guarantees appearance consistency over arbitrary novel views, it is a challenging problem to guide the transfer of patterns from the style image onto different parts of the NeRF scene.… ▽ More

    Submitted 19 September, 2023; originally announced September 2023.

    Comments: ICCV 2023

  8. arXiv:2308.04322  [pdf, other

    cs.CV

    Domain Adaptive Person Search via GAN-based Scene Synthesis for Cross-scene Videos

    Authors: Huibing Wang, Tianxiang Cui, Mingze Yao, Huijuan Pang, Yushan Du

    Abstract: Person search has recently been a challenging task in the computer vision domain, which aims to search specific pedestrians from real cameras.Nevertheless, most surveillance videos comprise only a handful of images of each pedestrian, which often feature identical backgrounds and clothing. Hence, it is difficult to learn more discriminative features for person search in real scenes. To tackle this… ▽ More

    Submitted 8 August, 2023; originally announced August 2023.

  9. arXiv:2307.09621  [pdf, other

    cs.CV

    Conditional 360-degree Image Synthesis for Immersive Indoor Scene Decoration

    Authors: Ka Chun Shum, Hong-Wing Pang, Binh-Son Hua, Duc Thanh Nguyen, Sai-Kit Yeung

    Abstract: In this paper, we address the problem of conditional scene decoration for 360-degree images. Our method takes a 360-degree background photograph of an indoor scene and generates decorated images of the same scene in the panorama view. To do this, we develop a 360-aware object layout generator that learns latent object vectors in the 360-degree view to enable a variety of furniture arrangements for… ▽ More

    Submitted 18 July, 2023; originally announced July 2023.

    Comments: ICCV2023

  10. arXiv:2212.07651  [pdf, other

    eess.IV cs.CV cs.LG

    Two-stage Contextual Transformer-based Convolutional Neural Network for Airway Extraction from CT Images

    Authors: Yanan Wu, Shuiqing Zhao, Shouliang Qi, Jie Feng, Haowen Pang, Runsheng Chang, Long Bai, Mengqi Li, Shuyue Xia, Wei Qian, Hongliang Ren

    Abstract: Accurate airway extraction from computed tomography (CT) images is a critical step for planning navigation bronchoscopy and quantitative assessment of airway-related chronic obstructive pulmonary disease (COPD). The existing methods are challenging to sufficiently segment the airway, especially the high-generation airway, with the constraint of the limited label and cannot meet the clinical use in… ▽ More

    Submitted 15 December, 2022; originally announced December 2022.

  11. arXiv:2211.16544  [pdf, other

    cs.RO cs.HC

    Towards Transcervical Ultrasound Image Guidance for Transoral Robotic Surgery

    Authors: Wanwen Chen, Megha Kalia, Qi Zeng, Emily H. T. Pang, Razeyeh Bagherinasab, Thomas D. Milner, Farahna Sabiq, Eitan Prisman, Septimiu E. Salcudean

    Abstract: Purpose: Trans-oral robotic surgery (TORS) using the da Vinci surgical robot is a new minimally-invasive surgery method to treat oropharyngeal tumors, but it is a challenging operation. Augmented reality (AR) based on intra-operative ultrasound (US) has the potential to enhance the visualization of the anatomy and cancerous tumors to provide additional tools for decision-making in surgery. Methods… ▽ More

    Submitted 31 March, 2023; v1 submitted 29 November, 2022; originally announced November 2022.

    Comments: 12 pages, 8 figures. Accepted by Information Processing for Computer Assisted Interventions (IPCAI 2023)

  12. arXiv:2209.10529  [pdf, other

    cs.CV

    Benchmarking and Analyzing 3D Human Pose and Shape Estimation Beyond Algorithms

    Authors: Hui En Pang, Zhongang Cai, Lei Yang, Tianwei Zhang, Ziwei Liu

    Abstract: 3D human pose and shape estimation (a.k.a. "human mesh recovery") has achieved substantial progress. Researchers mainly focus on the development of novel algorithms, while less attention has been paid to other critical factors involved. This could lead to less optimal baselines, hindering the fair and faithful evaluations of newly designed methodologies. To address this problem, this work presents… ▽ More

    Submitted 21 September, 2022; originally announced September 2022.

    Comments: Submission to 36th Conference on Neural Information Processing Systems (NeurIPS 2022) Track on Datasets and Benchmarks

  13. arXiv:2204.05445  [pdf, other

    cs.SD eess.AS

    Small Footprint Multi-channel ConvMixer for Keyword Spotting with Centroid Based Awareness

    Authors: Dianwen Ng, Jin Hui Pang, Yang Xiao, Biao Tian, Qiang Fu, Eng Siong Chng

    Abstract: It is critical for a keyword spotting model to have a small footprint as it typically runs on-device with low computational resources. However, maintaining the previous SOTA performance with reduced model size is challenging. In addition, a far-field and noisy environment with multiple signals interference aggravates the problem causing the accuracy to degrade significantly. In this paper, we pres… ▽ More

    Submitted 11 April, 2022; originally announced April 2022.

    Comments: submitted to INTERSPEECH 2022

  14. arXiv:2108.01806  [pdf, other

    cs.CV cs.GR

    Neural Scene Decoration from a Single Photograph

    Authors: Hong-Wing Pang, Yingshu Chen, Phuoc-Hieu Le, Binh-Son Hua, Duc Thanh Nguyen, Sai-Kit Yeung

    Abstract: Furnishing and rendering indoor scenes has been a long-standing task for interior design, where artists create a conceptual design for the space, build a 3D model of the space, decorate, and then perform rendering. Although the task is important, it is tedious and requires tremendous effort. In this paper, we introduce a new problem of domain-specific indoor scene image synthesis, namely neural sc… ▽ More

    Submitted 25 July, 2022; v1 submitted 3 August, 2021; originally announced August 2021.

    Comments: ECCV 2022 paper. 14 pages of main content, 4 pages of references, and 11 pages of appendix

  15. arXiv:2105.02409  [pdf, other

    cs.MM

    Multimedia Edge Computing

    Authors: Zhi Wang, Wenwu Zhu, Lifeng Sun, Han Hu, Ge Ma, Ming Ma, Haitian Pang, Jiahui Ye, Hongshan Li

    Abstract: In this paper, we investigate the recent studies on multimedia edge computing, from sensing not only traditional visual/audio data but also individuals' geographical preference and mobility behaviors, to performing distributed machine learning over such data using the joint edge and cloud infrastructure and using evolutional strategies like reinforcement learning and online learning at edge device… ▽ More

    Submitted 5 May, 2021; originally announced May 2021.

    Comments: 20 pages, 9 figures. arXiv admin note: text overlap with arXiv:1702.07627

  16. arXiv:1909.07541  [pdf, other

    cs.CV cs.RO

    A*3D Dataset: Towards Autonomous Driving in Challenging Environments

    Authors: Quang-Hieu Pham, Pierre Sevestre, Ramanpreet Singh Pahwa, Huijing Zhan, Chun Ho Pang, Yuda Chen, Armin Mustafa, Vijay Chandrasekhar, Jie Lin

    Abstract: With the increasing global popularity of self-driving cars, there is an immediate need for challenging real-world datasets for benchmarking and training various computer vision tasks such as 3D object detection. Existing datasets either represent simple scenarios or provide only day-time data. In this paper, we introduce a new challenging A*3D dataset which consists of RGB images and LiDAR data wi… ▽ More

    Submitted 16 September, 2019; originally announced September 2019.

    Comments: A new 3D dataset by I2R, A*STAR for autonomous driving

  17. arXiv:1805.09249  [pdf, other

    cs.NI

    Multi-User Cooperative Mobile Video Streaming: Performance Analysis and Online Mechanism Design

    Authors: Lin Gao, Ming Tang, Haitian Pang, Jianwei Huang, Lifeng Sun

    Abstract: Adaptive bitrate streaming enables video users to adapt their playing bitrates to the real-time network conditions, hence achieving the desirable quality-of-experience (QoE). In a multi-user wireless scenario, however, existing single-user based bitrate adaptation methods may fail to provide the desirable QoE, due to lack of consideration of multi-user interactions (such as the multi-user interfer… ▽ More

    Submitted 21 May, 2018; originally announced May 2018.

    Comments: This manuscript serves as the online technical report for the paper published in IEEE Transactions on Mobile Computing

  18. arXiv:1805.08008  [pdf, other

    cs.MM cs.GT cs.NI

    Performance Bound Analysis for Crowdsourced Mobile Video Streaming

    Authors: Lin Gao, Ming Tang, Haitian Pang, Jianwei Huang, Lifeng Sun

    Abstract: Adaptive bitrate (ABR) streaming enables video users to adapt the playing bitrate to the real-time network conditions to achieve the desirable quality of experience (QoE). In this work, we propose a novel crowdsourced streaming framework for multi-user ABR video streaming over wireless networks. This framework enables the nearby mobile video users to crowdsource their radio links and resources for… ▽ More

    Submitted 21 May, 2018; originally announced May 2018.

    Comments: This manuscript serves as the online technical report for the paper published in the IEEE Conference on Information Sciences and Systems (CISS 2016)

  19. arXiv:1709.00273  [pdf, ps, other

    cs.NI

    When Data Sponsoring Meets Edge Caching: A Game-Theoretic Analysis

    Authors: Haitian Pang, Lin Gao, Qinghua Ding, Lifeng Sun

    Abstract: Data sponsoring is a widely-used incentive method in today's cellular networks, where video content providers (CPs) cover part or all of the cellular data cost for mobile users so as to attract more video users and increase data traffic. In the forthcoming 5G cellular networks, edge caching is emerging as a promising technique to deliver videos with lower cost and higher quality. The key idea is t… ▽ More

    Submitted 1 September, 2017; originally announced September 2017.

    Comments: 6 pages, accepted by GLOBECOM 2017

  20. arXiv:1704.01079  [pdf, other

    cs.LG math.OC stat.ML

    Homotopy Parametric Simplex Method for Sparse Learning

    Authors: Haotian Pang, Robert Vanderbei, Han Liu, Tuo Zhao

    Abstract: High dimensional sparse learning has imposed a great computational challenge to large scale data analysis. In this paper, we are interested in a broad class of sparse learning approaches formulated as linear programs parametrized by a {\em regularization factor}, and solve them by the parametric simplex method (PSM). Our parametric simplex method offers significant advantages over other competing… ▽ More

    Submitted 27 November, 2017; v1 submitted 4 April, 2017; originally announced April 2017.

    Comments: Accepted by NIPS 2017

  21. arXiv:1703.06648  [pdf, other

    cs.NI

    Multi-Dimensional Auction Mechanisms for Crowdsourced Mobile Video Streaming

    Authors: Ming Tang, Haitian Pang, Shou Wang, Lin Gao, Jianwei Huang, Lifeng Sun

    Abstract: Crowdsourced mobile video streaming enables nearby mobile video users to aggregate network resources to improve their video streaming performances. However, users are often selfish and may not be willing to cooperate without proper incentives. Designing an incentive mechanism for such a scenario is challenging due to the users' asynchronous downloading behaviors and their private valuations for mu… ▽ More

    Submitted 7 July, 2018; v1 submitted 20 March, 2017; originally announced March 2017.

  22. arXiv:1611.00211  [pdf, ps, other

    cs.NI

    Joint Optimization of Data Sponsoring and Edge Caching for Mobile Video Delivery

    Authors: Haitian Pang, Lin Gao, Lifeng Sun

    Abstract: In this work, we study the joint optimization of edge caching and data sponsoring for a video content provider (CP), aiming at reducing the content delivery cost and increasing the CP's revenue. Specifically, we formulate the joint optimization problem as a two-stage decision problem for the CP. In Stage I, the CP determines the edge caching policy (for a relatively long time period). In Stage II,… ▽ More

    Submitted 1 November, 2016; originally announced November 2016.

    Comments: accepted by GLOBECOM 2016

  23. arXiv:1606.04195  [pdf, other

    cs.MM cs.NI cs.SI

    Social- and Mobility-Aware Device-to-Device Content Delivery

    Authors: Zhi Wang, Lifeng Sun, Miao Zhang, Haitian Pang, Erfang Tian, Wenwu Zhu

    Abstract: Mobile online social network services have seen a rapid increase, in which the huge amount of user-generated social media contents propagating between users via social connections has significantly challenged the traditional content delivery paradigm: First, replicating all of the contents generated by users to edge servers that well "fit" the receivers becomes difficult due to the limited bandwid… ▽ More

    Submitted 13 June, 2016; originally announced June 2016.

  24. arXiv:1207.4129  [pdf

    cs.CV

    Recovering Articulated Object Models from 3D Range Data

    Authors: Dragomir Anguelov, Daphne Koller, Hoi-Cheung Pang, Praveen Srinivasan, Sebastian Thrun

    Abstract: We address the problem of unsupervised learning of complex articulated object models from 3D range data. We describe an algorithm whose input is a set of meshes corresponding to different configurations of an articulated object. The algorithm automatically recovers a decomposition of the object into approximately rigid parts, the location of the parts in the different object instances, and the art… ▽ More

    Submitted 11 July, 2012; originally announced July 2012.

    Comments: Appears in Proceedings of the Twentieth Conference on Uncertainty in Artificial Intelligence (UAI2004)

    Report number: UAI-P-2004-PG-18-26