Showing 1–50 of 57 results for author: Gao, N

  1. arXiv:2406.10513  [pdf, other]

    cs.LG q-bio.BM

    Lift Your Molecules: Molecular Graph Generation in Latent Euclidean Space

    Authors: Mohamed Amine Ketata, Nicholas Gao, Johanna Sommer, Tom Wollschläger, Stephan Günnemann

    Abstract: We introduce a new framework for molecular graph generation with 3D molecular generative models. Our Synthetic Coordinate Embedding (SyCo) framework maps molecular graphs to Euclidean point clouds via synthetic conformer coordinates and learns the inverse map using an E(n)-Equivariant Graph Neural Network (EGNN). The induced point cloud-structured latent space is well-suited to apply existing 3D m…

    Submitted 15 June, 2024; originally announced June 2024.

  2. arXiv:2405.14762  [pdf, other]

    cs.LG physics.chem-ph physics.comp-ph quant-ph

    Neural Pfaffians: Solving Many Many-Electron Schrödinger Equations

    Authors: Nicholas Gao, Stephan Günnemann

    Abstract: Neural wave functions accomplished unprecedented accuracies in approximating the ground state of many-electron systems, though at a high computational cost. Recent works proposed amortizing the cost by learning generalized wave functions across different structures and compounds instead of solving each problem independently. Enforcing the permutation antisymmetry of electrons in such generalized n…

    Submitted 6 June, 2024; v1 submitted 23 May, 2024; originally announced May 2024.

  3. arXiv:2404.14665  [pdf, other]

    cs.HC

    Illuminating the Unseen: Investigating the Context-induced Harms in Behavioral Sensing

    Authors: Han Zhang, Vedant Das Swain, Leijie Wang, Nan Gao, Yilun Sheng, Xuhai Xu, Flora D. Salim, Koustuv Saha, Anind K. Dey, Jennifer Mankoff

    Abstract: Behavioral sensing technologies are rapidly evolving across a range of well-being applications. Despite its potential, concerns about the responsible use of such technology are escalating. In response, recent research within the sensing technology has started to address these issues. While promising, they primarily focus on broad demographic categories and overlook more nuanced, context-specific i…

    Submitted 5 May, 2024; v1 submitted 22 April, 2024; originally announced April 2024.

    Comments: 26 pages, 8 tables, and 1 figure (excluding appendix)

    MSC Class: 68U35 ACM Class: H.5.0; I.2.m

  4. arXiv:2404.07200  [pdf, other]

    cs.LG

    Toward a Better Understanding of Fourier Neural Operators: Analysis and Improvement from a Spectral Perspective

    Authors: Shaoxiang Qin, Fuyuan Lyu, Wenhui Peng, Dingyang Geng, Ju Wang, Naiping Gao, Xue Liu, Liangzhu Leon Wang

    Abstract: In solving partial differential equations (PDEs), Fourier Neural Operators (FNOs) have exhibited notable effectiveness compared to Convolutional Neural Networks (CNNs). This paper presents clear empirical evidence through spectral analysis to elucidate the superiority of FNO over CNNs: FNO is significantly more capable of learning low-frequencies. This empirical evidence also unveils FNO's distinc…

    Submitted 10 April, 2024; originally announced April 2024.

  5. arXiv:2404.02411  [pdf, other]

    cs.HC

    A Unified Editing Method for Co-Speech Gesture Generation via Diffusion Inversion

    Authors: Zeyu Zhao, Nan Gao, Zhi Zeng, Guixuan Zhang, Jie Liu, Shuwu Zhang

    Abstract: Diffusion models have shown great success in generating high-quality co-speech gestures for interactive humanoid robots or digital avatars from noisy input with the speech audio or text as conditions. However, they rarely focus on providing rich editing capabilities for content creators other than high-level specialized measures like style conditioning. To resolve this, we propose a unified framew…

    Submitted 2 April, 2024; originally announced April 2024.

  6. arXiv:2403.10098  [pdf, other]

    cs.CV

    DiffMAC: Diffusion Manifold Hallucination Correction for High Generalization Blind Face Restoration

    Authors: Nan Gao, Jia Li, Huaibo Huang, Zhi Zeng, Ke Shang, Shuwu Zhang, Ran He

    Abstract: Blind face restoration (BFR) is a highly challenging problem due to the uncertainty of degradation patterns. Current methods have low generalization across photorealistic and heterogeneous domains. In this paper, we propose a Diffusion-Information-Diffusion (DID) framework to tackle diffusion manifold hallucination correction (DiffMAC), which achieves high-generalization face restoration in divers…

    Submitted 15 March, 2024; originally announced March 2024.

    Comments: 15 pages, 12 figures

  7. arXiv:2403.05249  [pdf, other]

    quant-ph cs.LG physics.chem-ph physics.comp-ph

    On Representing Electronic Wave Functions with Sign Equivariant Neural Networks

    Authors: Nicholas Gao, Stephan Günnemann

    Abstract: Recent neural networks demonstrated impressively accurate approximations of electronic ground-state wave functions. Such neural networks typically consist of a permutation-equivariant neural network followed by a permutation-antisymmetric operation to enforce the electronic exchange symmetry. While accurate, such neural networks are computationally expensive. In this work, we explore the flipped a…

    Submitted 8 March, 2024; originally announced March 2024.

    Comments: Published at Workshop on AI4DifferentialEquations in Science at ICLR 2024

  8. arXiv:2401.13221  [pdf, other]

    cs.CV

    Unified-Width Adaptive Dynamic Network for All-In-One Image Restoration

    Authors: Yimin Xu, Nanxi Gao, Zhongyun Shan, Fei Chao, Rongrong Ji

    Abstract: In contrast to traditional image restoration methods, all-in-one image restoration techniques are gaining increased attention for their ability to restore images affected by diverse and unknown corruption types and levels. However, contemporary all-in-one image restoration methods omit task-wise difficulties and employ the same networks to reconstruct images afflicted by diverse degradations. This…

    Submitted 23 January, 2024; originally announced January 2024.

  9. arXiv:2311.15496  [pdf, ps, other]

    cs.HC

    Critiquing Self-report Practices for Human Mental and Wellbeing Computing at Ubicomp

    Authors: Nan Gao, Soundariya Ananthan, Chun Yu, Yuntao Wang, Flora D. Salim

    Abstract: Computing human mental and wellbeing is crucial to various domains, including health, education, and entertainment. However, the reliance on self-reporting in traditional research to establish ground truth often leads to methodological inconsistencies and susceptibility to response biases, thus hindering the effectiveness of modelling. This paper presents the first systematic methodological review…

    Submitted 26 November, 2023; originally announced November 2023.

  10. arXiv:2311.05457  [pdf, other]

    cs.HC

    Automated Mobile Sensing Strategies Generation for Human Behaviour Understanding

    Authors: Nan Gao, Zhuolei Yu, Chun Yu, Yuntao Wang, Flora D. Salim, Yuanchun Shi

    Abstract: Mobile sensing plays a crucial role in generating digital traces to understand human daily lives. However, studying behaviours like mood or sleep quality in smartphone users requires carefully designed mobile sensing strategies such as sensor selection and feature construction. This process is time-consuming, burdensome, and requires expertise in multiple domains. Furthermore, the resulting sensin…

    Submitted 9 November, 2023; originally announced November 2023.

  11. arXiv:2310.13304  [pdf, other]

    cs.HC

    "Living Within Four Walls": Exploring Emotional and Social Dynamics in Mobile Usage During Home Confinement

    Authors: Nan Gao, Sam Nolan, Kaixin Ji, Shakila Khan Rumi, Judith Simone Heinisch, Christoph Anderson, Klaus David, Flora D. Salim

    Abstract: Home confinement, a situation experienced by individuals for reasons ranging from medical quarantines, rehabilitation needs, disability accommodations, and remote working, is a common yet impactful aspect of modern life. While essential in various scenarios, confinement within the home environment can profoundly influence psychological well-being and digital device usage. In this study, we delve i…

    Submitted 8 June, 2024; v1 submitted 20 October, 2023; originally announced October 2023.

  12. arXiv:2309.07736  [pdf, other]

    cs.CR eess.SP

    RIS-Assisted Wireless Link Signatures for Specific Emitter Identification

    Authors: Ning Gao, Shuchen Meng, Cen Li, Shengguo Meng, Wankai Tang, Shi Jin, Michail Matthaiou

    Abstract: The physical layer authentication (PLA) is a promising technology which can enhance the access security of a massive number of devices in the near future. In this paper, we propose a reconfigurable intelligent surface (RIS)-assisted PLA system, in which the legitimate transmitter can customize the channel fingerprints during PLA by controlling the ON-OFF state of the RIS. Without loss of generalit…

    Submitted 7 March, 2024; v1 submitted 14 September, 2023; originally announced September 2023.

  13. arXiv:2308.16528  [pdf, other]

    cs.CV cs.LG cs.RO

    SA6D: Self-Adaptive Few-Shot 6D Pose Estimator for Novel and Occluded Objects

    Authors: Ning Gao, Ngo Anh Vien, Hanna Ziesche, Gerhard Neumann

    Abstract: To enable meaningful robotic manipulation of objects in the real-world, 6D pose estimation is one of the critical aspects. Most existing approaches have difficulties to extend predictions to scenarios where novel object instances are continuously introduced, especially with heavy occlusions. In this work, we propose a few-shot pose estimation (FSPE) approach called SA6D, which uses a self-adaptive…

    Submitted 31 August, 2023; originally announced August 2023.

    Journal ref: Conference on Robot Learning (CoRL), 2023

  14. arXiv:2308.11369  [pdf, other]

    cs.CV

    Enhancing Interpretable Object Abstraction via Clustering-based Slot Initialization

    Authors: Ning Gao, Bernard Hohmann, Gerhard Neumann

    Abstract: Object-centric representations using slots have shown the advances towards efficient, flexible and interpretable abstraction from low-level perceptual features in a compositional scene. Current approaches randomize the initial state of slots followed by an iterative refinement. As we show in this paper, the random slot initialization significantly affects the accuracy of the final slot prediction.…

    Submitted 22 August, 2023; originally announced August 2023.

    Journal ref: The 34th British Machine Vision Conference (BMVC), 2023

  15. arXiv:2307.08946  [pdf, other]

    cs.CR eess.SP

    EsaNet: Environment Semantics Enabled Physical Layer Authentication

    Authors: Ning Gao, Qiying Huang, Cen Li, Shi Jin, Michail Matthaiou

    Abstract: Wireless networks are vulnerable to physical layer spoofing attacks due to the wireless broadcast nature, thus, integrating communications and security (ICAS) is urgently needed for 6G endogenous security. In this letter, we propose an environment semantics enabled physical layer authentication network based on deep learning, namely EsaNet, to authenticate the spoofing from the underlying wireless…

    Submitted 17 July, 2023; originally announced July 2023.

  16. arXiv:2307.08423  [pdf, other]

    cs.LG physics.comp-ph

    Artificial Intelligence for Science in Quantum, Atomistic, and Continuum Systems

    Authors: Xuan Zhang, Limei Wang, Jacob Helwig, Youzhi Luo, Cong Fu, Yaochen Xie, Meng Liu, Yuchao Lin, Zhao Xu, Keqiang Yan, Keir Adams, Maurice Weiler, Xiner Li, Tianfan Fu, Yucheng Wang, Haiyang Yu, YuQing Xie, Xiang Fu, Alex Strasser, Shenglong Xu, Yi Liu, Yuanqi Du, Alexandra Saxton, Hongyi Ling, Hannah Lawrence, et al. (38 additional authors not shown)

    Abstract: Advances in artificial intelligence (AI) are fueling a new paradigm of discoveries in natural sciences. Today, AI has started to advance natural sciences by improving, accelerating, and enabling our understanding of natural phenomena at a wide range of spatial and temporal scales, giving rise to a new area of research known as AI for science (AI4Science). Being an emerging research paradigm, AI4Sc…

    Submitted 15 November, 2023; v1 submitted 17 July, 2023; originally announced July 2023.

  17. arXiv:2306.15670  [pdf, other]

    cs.CV cs.RO

    Symphonize 3D Semantic Scene Completion with Contextual Instance Queries

    Authors: Haoyi Jiang, Tianheng Cheng, Naiyu Gao, Haoyang Zhang, Tianwei Lin, Wenyu Liu, Xinggang Wang

    Abstract: 3D Semantic Scene Completion (SSC) has emerged as a nascent and pivotal undertaking in autonomous driving, aiming to predict voxel occupancy within volumetric scenes. However, prevailing methodologies primarily focus on voxel-wise feature aggregation, while neglecting instance semantics and scene context. In this paper, we present a novel paradigm termed Symphonies (Scene-from-Insts), that delves…

    Submitted 22 November, 2023; v1 submitted 27 June, 2023; originally announced June 2023.

    Comments: Technical report. Code and models at: https://github.com/hustvl/Symphonies

  18. arXiv:2306.14916  [pdf, other]

    physics.chem-ph cs.LG physics.comp-ph stat.ML

    Uncertainty Estimation for Molecules: Desiderata and Methods

    Authors: Tom Wollschläger, Nicholas Gao, Bertrand Charpentier, Mohamed Amine Ketata, Stephan Günnemann

    Abstract: Graph Neural Networks (GNNs) are promising surrogates for quantum mechanical calculations as they establish unprecedented low errors on collections of molecular dynamics (MD) trajectories. Thanks to their fast inference times they promise to accelerate computational chemistry applications. Unfortunately, despite low in-distribution (ID) errors, such GNNs might be horribly wrong for out-of-distribu…

    Submitted 20 June, 2023; originally announced June 2023.

    Comments: Published as conference paper at ICML 2023

  19. arXiv:2305.15817  [pdf, other]

    cs.LG

    Sharpness-Aware Minimization Revisited: Weighted Sharpness as a Regularization Term

    Authors: Yun Yue, Jiadi Jiang, Zhiling Ye, Ning Gao, Yongchao Liu, Ke Zhang

    Abstract: Deep Neural Networks (DNNs) generalization is known to be closely related to the flatness of minima, leading to the development of Sharpness-Aware Minimization (SAM) for seeking flatter minima and better generalization. In this paper, we revisit the loss of SAM and propose a more general method, called WSAM, by incorporating sharpness as a regularization term. We prove its generalization bound thr…

    Submitted 9 June, 2023; v1 submitted 25 May, 2023; originally announced May 2023.

    Comments: 10 pages. Accepted as a conference paper at KDD '23

  20. arXiv:2305.08604  [pdf, other]

    cs.IT eess.SP

    A Survey of Blockchain and Artificial Intelligence for 6G Wireless Communications

    Authors: Yiping Zuo, Jiajia Guo, Ning Gao, Yongxu Zhu, Shi Jin, Xiao Li

    Abstract: The research on the sixth-generation (6G) wireless communications for the development of future mobile communication networks has been officially launched around the world. 6G networks face multifarious challenges, such as resource-constrained mobile devices, difficult wireless resource management, high complexity of heterogeneous network architectures, explosive computing and storage requirements…

    Submitted 7 September, 2023; v1 submitted 15 May, 2023; originally announced May 2023.

  21. arXiv:2304.03708  [pdf, other]

    eess.IV cs.CV

    Efficient automatic segmentation for multi-level pulmonary arteries: The PARSE challenge

    Authors: Gongning Luo, Kuanquan Wang, Jun Liu, Shuo Li, Xinjie Liang, Xiangyu Li, Shaowei Gan, Wei Wang, Suyu Dong, Wenyi Wang, Pengxin Yu, Enyou Liu, Hongrong Wei, Na Wang, Jia Guo, Huiqi Li, Zhao Zhang, Ziwei Zhao, Na Gao, Nan An, Ashkan Pakzad, Bojidar Rangelov, Jiaqi Dou, Song Tian, Zeyu Liu, et al. (5 additional authors not shown)

    Abstract: Efficient automatic segmentation of multi-level (i.e. main and branch) pulmonary arteries (PA) in CTPA images plays a significant role in clinical applications. However, most existing methods concentrate only on main PA or branch PA segmentation separately and ignore segmentation efficiency. Besides, there is no public large-scale dataset focused on PA segmentation, which makes it highly challengi…

    Submitted 7 April, 2023; originally announced April 2023.

  22. arXiv:2303.13013  [pdf, other]

    cs.CL cs.CV cs.HC

    GesGPT: Speech Gesture Synthesis With Text Parsing from ChatGPT

    Authors: Nan Gao, Zeyu Zhao, Zhi Zeng, Shuwu Zhang, Dongdong Weng, Yihua Bao

    Abstract: Gesture synthesis has gained significant attention as a critical research field, aiming to produce contextually appropriate and natural gestures corresponding to speech or textual input. Although deep learning-based approaches have achieved remarkable progress, they often overlook the rich semantic information present in the text, leading to less expressive and meaningful gestures. In this letter,…

    Submitted 27 May, 2024; v1 submitted 22 March, 2023; originally announced March 2023.

    Journal ref: IEEE Robotics and Automation Letters 9 (2024) 3

  23. arXiv:2303.04791  [pdf, other]

    cs.LG cond-mat.mtrl-sci physics.chem-ph physics.comp-ph

    Ewald-based Long-Range Message Passing for Molecular Graphs

    Authors: Arthur Kosmala, Johannes Gasteiger, Nicholas Gao, Stephan Günnemann

    Abstract: Neural architectures that learn potential energy surfaces from molecular data have undergone fast improvement in recent years. A key driver of this success is the Message Passing Neural Network (MPNN) paradigm. Its favorable scaling with system size partly relies upon a spatial distance limit on messages. While this focus on locality is a useful inductive bias, it also impedes the learning of long…

    Submitted 6 June, 2023; v1 submitted 8 March, 2023; originally announced March 2023.

    Comments: Published at the 40th International Conference on Machine Learning (ICML 2023)

  24. arXiv:2302.04168  [pdf, other]

    cs.LG physics.chem-ph physics.comp-ph quant-ph

    Generalizing Neural Wave Functions

    Authors: Nicholas Gao, Stephan Günnemann

    Abstract: Recent neural network-based wave functions have achieved state-of-the-art accuracies in modeling ab-initio ground-state potential energy surface. However, these networks can only solve different spatial arrangements of the same set of atoms. To overcome this limitation, we present Graph-learned orbital embeddings (Globe), a neural network-based reparametrization method that can adapt neural wave f…

    Submitted 31 May, 2023; v1 submitted 8 February, 2023; originally announced February 2023.

    Comments: Published at the 40th International Conference on Machine Learning (ICML 2023)

  25. arXiv:2301.01882  [pdf, other]

    cs.CV

    InsPro: Propagating Instance Query and Proposal for Online Video Instance Segmentation

    Authors: Fei He, Haoyang Zhang, Naiyu Gao, Jian Jia, Yanhu Shan, Xin Zhao, Kaiqi Huang

    Abstract: Video instance segmentation (VIS) aims at segmenting and tracking objects in videos. Prior methods typically generate frame-level or clip-level object instances first and then associate them by either additional tracking heads or complex instance matching algorithms. This explicit instance association approach increases system complexity and fails to fully exploit temporal cues in videos. In this…

    Submitted 4 January, 2023; originally announced January 2023.

    Comments: NeurIPS 2022

  26. arXiv:2212.03396  [pdf, other]

    cs.LG cs.AI

    Learning to Select Prototypical Parts for Interpretable Sequential Data Modeling

    Authors: Yifei Zhang, Neng Gao, Cunqing Ma

    Abstract: Prototype-based interpretability methods provide intuitive explanations of model prediction by comparing samples to a reference set of memorized exemplars or typical representatives in terms of similarity. In the field of sequential data modeling, similarity calculations of prototypes are usually based on encoded representation vectors. However, due to highly recursive functions, there is usually…

    Submitted 16 March, 2023; v1 submitted 6 December, 2022; originally announced December 2022.

    Comments: To appear in AAAI 2023

  27. arXiv:2212.01461  [pdf, other]

    cs.CV

    Learning Disentangled Label Representations for Multi-label Classification

    Authors: Jian Jia, Fei He, Naiyu Gao, Xiaotang Chen, Kaiqi Huang

    Abstract: Although various methods have been proposed for multi-label classification, most approaches still follow the feature learning mechanism of the single-label (multi-class) classification, namely, learning a shared image feature to classify multiple labels. However, we find this One-shared-Feature-for-Multiple-Labels (OFML) mechanism is not conducive to learning discriminative label features and make…

    Submitted 2 December, 2022; originally announced December 2022.

    Comments: 17 pages, 9 figures

  28. arXiv:2210.04747  [pdf, other]

    cs.IT eess.SP

    An NLoS-based Enhanced Sensing Method for MmWave Communication System

    Authors: Shiwen He, Kangli Cai, Shiyue Huang, Zhenyu Anz, Wei Huang, Ning Gao

    Abstract: The millimeter-wave (mmWave)-based Wi-Fi sensing technology has recently attracted extensive attention since it provides a possibility to realize higher sensing accuracy. However, current works mainly concentrate on sensing scenarios where the line-of-sight (LoS) path exists, which significantly limits their applications. To address the problem, we propose an enhanced mmWave sensing algorithm in t…

    Submitted 10 October, 2022; originally announced October 2022.

  29. arXiv:2210.02337  [pdf, other]

    cs.CR

    When Physical Layer Key Generation Meets RIS: Opportunities, Challenges, and Road Ahead

    Authors: Ning Gao, Yu Han, Nannan Li, Shi Jin, Michail Matthaiou

    Abstract: Physical layer key generation (PLKG) is a promising technology to obtain symmetric keys between a pair of wireless communication users in a plug-and-play manner. The shared entropy source almost entirely comes from the intrinsic randomness of the radio channel, which is highly dependent on the wireless environments. However, in some static/block fading wireless environments, the intrinsic randomne…

    Submitted 3 July, 2023; v1 submitted 5 October, 2022; originally announced October 2022.

  30. Measuring incompatibility and clustering quantum observables with a quantum switch

    Authors: Ning Gao, Dantong Li, Anchit Mishra, Junchen Yan, Kyrylo Simonov, Giulio Chiribella

    Abstract: The existence of incompatible observables is a cornerstone of quantum mechanics and a valuable resource in quantum technologies. Here we introduce a measure of incompatibility, called the mutual eigenspace disturbance (MED), which quantifies the amount of disturbance induced by the measurement of a sharp observable on the eigenspaces of another. The MED provides a metric on the space of von Neuman…

    Submitted 9 May, 2023; v1 submitted 12 August, 2022; originally announced August 2022.

    Comments: 13 pages, 2 figures

    Journal ref: Phys. Rev. Lett. 130, 170201 (2023)

  31. QueryProp: Object Query Propagation for High-Performance Video Object Detection

    Authors: Fei He, Naiyu Gao, Jian Jia, Xin Zhao, Kaiqi Huang

    Abstract: Video object detection has been an important yet challenging topic in computer vision. Traditional methods mainly focus on designing the image-level or box-level feature propagation strategies to exploit temporal information. This paper argues that with a more effective and efficient feature propagation framework, video object detectors can gain improvement in terms of both accuracy and speed. For…

    Submitted 22 July, 2022; originally announced July 2022.

    Comments: This paper is accepted to AAAI2022

  32. arXiv:2207.03405  [pdf, other]

    cs.HC

    Investigating the Effects of Mood & Usage Behaviour on Notification Response Time

    Authors: Judith S. Heinisch, Nan Gao, Christoph Anderson, Shohreh Deldari, Klaus David, Flora Salim

    Abstract: Notifications are one of the most prevailing mechanisms on smartphones and personal computers to convey timely and important information. Despite these benefits, smartphone notifications demand individuals' attention and can cause stress and frustration when delivered at inopportune timings. This paper investigates the effect of individuals' smartphone usage behavior and mood on notification respo…

    Submitted 7 July, 2022; originally announced July 2022.

  33. arXiv:2206.07162  [pdf, other]

    cs.CV cs.AI cs.LG cs.RO

    Category-Agnostic 6D Pose Estimation with Conditional Neural Processes

    Authors: Yumeng Li, Ning Gao, Hanna Ziesche, Gerhard Neumann

    Abstract: We present a novel meta-learning approach for 6D pose estimation on unknown objects. In contrast to "instance-level" and "category-level" pose estimation methods, our algorithm learns object representation in a category-agnostic way, which endows it with strong generalization capabilities across object categories. Specifically, we employ a neural process-based meta-learning approach to train an…

    Submitted 19 October, 2023; v1 submitted 14 June, 2022; originally announced June 2022.

    Comments: Accepted at CVPR2022 workshop: Women in Computer Vision (WiCV)

    Journal ref: CVPR2022 workshop: Women in Computer Vision (WiCV)

  34. arXiv:2206.00468  [pdf, other]

    cs.CV

    PanopticDepth: A Unified Framework for Depth-aware Panoptic Segmentation

    Authors: Naiyu Gao, Fei He, Jian Jia, Yanhu Shan, Haoyang Zhang, Xin Zhao, Kaiqi Huang

    Abstract: This paper presents a unified framework for depth-aware panoptic segmentation (DPS), which aims to reconstruct 3D scene with instance-level semantics from one single image. Prior works address this problem by simply adding a dense depth regression head to panoptic segmentation (PS) networks, resulting in two independent task branches. This neglects the mutually-beneficial relations between these t…

    Submitted 1 June, 2022; originally announced June 2022.

    Comments: CVPR2022

  35. arXiv:2205.14962  [pdf, other]

    cs.LG physics.chem-ph physics.comp-ph

    Sampling-free Inference for Ab-Initio Potential Energy Surface Networks

    Authors: Nicholas Gao, Stephan Günnemann

    Abstract: Recently, it has been shown that neural networks not only approximate the ground-state wave functions of a single molecular system well but can also generalize to multiple geometries. While such generalization significantly speeds up training, each energy evaluation still requires Monte Carlo integration which limits the evaluation to a few geometries. In this work, we address the inference shortc…

    Submitted 6 March, 2023; v1 submitted 30 May, 2022; originally announced May 2022.

    Comments: Published as a conference paper at ICLR 2023

  36. arXiv:2205.11110  [pdf, other]

    cs.RO cs.AI cs.CV cs.LG

    Meta-Learning Regrasping Strategies for Physical-Agnostic Objects

    Authors: Ning Gao, Jingyu Zhang, Ruijie Chen, Ngo Anh Vien, Hanna Ziesche, Gerhard Neumann

    Abstract: Grasping inhomogeneous objects in real-world applications remains a challenging task due to the unknown physical properties such as mass distribution and coefficient of friction. In this study, we propose a meta-learning algorithm called ConDex, which incorporates Conditional Neural Processes (CNP) with DexNet-2.0 to autonomously discern the underlying physical properties of objects using depth im…

    Submitted 14 September, 2023; v1 submitted 23 May, 2022; originally announced May 2022.

    Comments: Accepted as spotlight in ICRA 2022 Workshop: Scaling Robot Learning

  37. arXiv:2205.07646  [pdf, other]

    cs.CL cs.SD eess.AS

    A Fast Attention Network for Joint Intent Detection and Slot Filling on Edge Devices

    Authors: Liang Huang, Senjie Liang, Feiyang Ye, Nan Gao

    Abstract: Intent detection and slot filling are two main tasks in natural language understanding and play an essential role in task-oriented dialogue systems. The joint learning of both tasks can improve inference accuracy and is popular in recent works. However, most joint models ignore the inference latency and cannot meet the need to deploy dialogue systems at the edge. In this paper, we propose a Fast A…

    Submitted 16 May, 2022; originally announced May 2022.

    Comments: 9 pages, 4 figures

  38. arXiv:2203.04905  [pdf, other]

    cs.CV cs.AI cs.LG

    What Matters For Meta-Learning Vision Regression Tasks?

    Authors: Ning Gao, Hanna Ziesche, Ngo Anh Vien, Michael Volpp, Gerhard Neumann

    Abstract: Meta-learning is widely used in few-shot classification and function regression due to its ability to quickly adapt to unseen tasks. However, it has not yet been well explored on regression tasks with high dimensional inputs such as images. This paper makes two main contributions that help understand this barely explored area. First, we design two new types of cross-category level vision re…

    Submitted 9 March, 2022; originally announced March 2022.

    Comments: Accepted at CVPR 2022

  39. Individual and Group-wise Classroom Seating Experience: Effects on Student Engagement in Different Courses

    Authors: Nan Gao, Mohammad Saiedur Rahaman, Wei Shao, Kaixin Ji, Flora D. Salim

    Abstract: Seating location in the classroom can affect student engagement, attention and academic performance by providing better visibility, improved movement, and participation in discussions. Existing studies typically explore how traditional seating arrangements (e.g. grouped tables or traditional rows) influence students' perceived engagement, without considering group seating behaviours under more fle…

    Submitted 23 July, 2022; v1 submitted 22 December, 2021; originally announced December 2021.

    Comments: The manuscript has been accepted by IMWUT

    Journal ref: IMWUT. 6(3), 1-23 (2022)

  40. arXiv:2110.05064  [pdf, other]

    cs.LG physics.chem-ph physics.comp-ph

    Ab-Initio Potential Energy Surfaces by Pairing GNNs with Neural Wave Functions

    Authors: Nicholas Gao, Stephan Günnemann

    Abstract: Solving the Schrödinger equation is key to many quantum mechanical properties. However, an analytical solution is only tractable for single-electron systems. Recently, neural networks succeeded at modeling wave functions of many-electron systems. Together with the variational Monte-Carlo (VMC) framework, this led to solutions on par with the best known classical methods. Still, these neural method…

    Submitted 29 March, 2022; v1 submitted 11 October, 2021; originally announced October 2021.

    Comments: Published as a conference paper at ICLR 2022

  41. arXiv:2107.00389  [pdf, other]

    cs.HC

    Investigating the Reliability of Self-report Data in the Wild: The Quest for Ground Truth

    Authors: Nan Gao, Mohammad Saiedur Rahaman, Wei Shao, Flora D. Salim

    Abstract: Inferring human mental state (e.g., emotion, depression, engagement) with sensing technology is one of the most valuable challenges in the affective computing area, which has a profound impact in all industries interacting with humans. The self-report survey is the most common way to quantify how people think, but prone to subjectivity and various responses bias. It is usually used as the ground t…

    Submitted 29 November, 2021; v1 submitted 1 July, 2021; originally announced July 2021.

  42. arXiv:2105.06637  [pdf, other]

    cs.HC cs.CY cs.LG

    Understanding occupants' behaviour, engagement, emotion, and comfort indoors with heterogeneous sensors and wearables

    Authors: Nan Gao, Max Marschall, Jane Burry, Simon Watkins, Flora D. Salim

    Abstract: We conducted a field study at a K-12 private school in the suburbs of Melbourne, Australia. The data capture contained two elements: First, a 5-month longitudinal field study In-Gauge using two outdoor weather stations, as well as indoor weather stations in 17 classrooms and temperature sensors on the vents of occupant-controlled room air-conditioners; these were collated into individual datasets…

    Submitted 22 April, 2022; v1 submitted 14 May, 2021; originally announced May 2021.

    Comments: This paper has been accepted by Nature Scientific Data. The link for the datasets: https://rmit.figshare.com/articles/dataset/In-Gauge_and_En-Gage_Datasets/14578908

  43. arXiv:2105.00950  [pdf, other]

    cs.IT eess.SP

    3-D Deployment of UAV Swarm for Massive MIMO Communications

    Authors: Ning Gao, Xiao Li, Shi Jin, Michail Matthaiou

    Abstract: We consider the uplink transmission between a multi-antenna ground station and an unmanned aerial vehicle (UAV) swarm. The UAVs are assumed as intelligent agents, which can explore their optimal three dimensional (3-D) deployment to maximize the channel capacity of the multiple input multiple output (MIMO) system. Specifically, considering the limitations of each UAV in accessing the global inform…

    Submitted 3 May, 2021; originally announced May 2021.

  44. Learning Category- and Instance-Aware Pixel Embedding for Fast Panoptic Segmentation

    Authors: Naiyu Gao, Yanhu Shan, Xin Zhao, Kaiqi Huang

    Abstract: Panoptic segmentation (PS) is a complex scene understanding task that requires providing high-quality segmentation for both thing objects and stuff regions. Previous methods handle these two classes with semantic and instance segmentation modules separately, following with heuristic fusion or additional modules to resolve the conflicts between the two outputs. This work simplifies this pipeline of…

    Submitted 15 June, 2021; v1 submitted 28 September, 2020; originally announced September 2020.

  45. arXiv:2008.08903  [pdf, other]

    cs.LG cs.IR eess.IV

    Generative Adversarial Networks for Spatio-temporal Data: A Survey

    Authors: Nan Gao, Hao Xue, Wei Shao, Sichen Zhao, Kyle Kai Qin, Arian Prabowo, Mohammad Saiedur Rahaman, Flora D. Salim

    Abstract: Generative Adversarial Networks (GANs) have shown remarkable success in producing realistic-looking images in the computer vision area. Recently, GAN-based techniques are shown to be promising for spatio-temporal-based applications such as trajectory prediction, events generation and time-series data imputation. While several reviews for GANs in computer vision have been presented, no one has cons…

    Submitted 29 July, 2021; v1 submitted 18 August, 2020; originally announced August 2020.

    Comments: This paper has been accepted by ACM Transactions on Intelligent Systems and Technology (TIST)

  46. arXiv:2007.04831  [pdf, other]

    cs.HC

    n-Gage: Predicting in-class Emotional, Behavioural and Cognitive Engagement in the Wild

    Authors: Nan Gao, Wei Shao, Mohammad Saiedur Rahaman, Flora D. Salim

    Abstract: The study of student engagement has attracted growing interests to address problems such as low academic performance, disaffection, and high dropout rates. Existing approaches to measuring student engagement typically rely on survey-based instruments. While effective, those approaches are time-consuming and labour-intensive. Meanwhile, both the response rate and quality of the survey are usually p…

    Submitted 22 July, 2020; v1 submitted 9 July, 2020; originally announced July 2020.

    Comments: This paper has been accepted by the Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies (IMWUT) volume 4 issue 3, 2020

  47. arXiv:2006.12631  [pdf, other]

    cs.LG stat.ML

    Fast and Flexible Temporal Point Processes with Triangular Maps

    Authors: Oleksandr Shchur, Nicholas Gao, Marin Biloš, Stephan Günnemann

    Abstract: Temporal point process (TPP) models combined with recurrent neural networks provide a powerful framework for modeling continuous-time event data. While such models are flexible, they are inherently sequential and therefore cannot benefit from the parallelism of modern hardware. By exploiting the recent developments in the field of normalizing flows, we design TriTPP -- a new class of non-recurrent…

    Submitted 10 November, 2020; v1 submitted 22 June, 2020; originally announced June 2020.

  48. arXiv:2006.07680  [pdf, other]

    cs.LG quant-ph stat.ML

    High-Dimensional Similarity Search with Quantum-Assisted Variational Autoencoder

    Authors: Nicholas Gao, Max Wilson, Thomas Vandal, Walter Vinci, Ramakrishna Nemani, Eleanor Rieffel

    Abstract: Recent progress in quantum algorithms and hardware indicates the potential importance of quantum computing in the near future. However, finding suitable application areas remains an active area of research. Quantum machine learning is touted as a potential approach to demonstrate quantum advantage within both the gate-model and the adiabatic schemes. For instance, the Quantum-assisted Variational…

    Submitted 13 June, 2020; originally announced June 2020.

  49. arXiv:2005.14260  [pdf]

    cs.CV cond-mat.mtrl-sci

    Overview: Computer vision and machine learning for microstructural characterization and analysis

    Authors: Elizabeth A. Holm, Ryan Cohn, Nan Gao, Andrew R. Kitahara, Thomas P. Matson, Bo Lei, Srujana Rao Yarasi

    Abstract: The characterization and analysis of microstructure is the foundation of microstructural science, connecting the materials structure to its composition, process history, and properties. Microstructural quantification traditionally involves a human deciding a priori what to measure and then devising a purpose-built method for doing so. However, recent advances in data science, including computer vi…

    Submitted 28 May, 2020; originally announced May 2020.

    Comments: submitted to Materials and Metallurgical Transactions A

  50. arXiv:2004.14382  [pdf, other]

    cs.LG

    Transfer Learning for Thermal Comfort Prediction in Multiple Cities

    Authors: Nan Gao, Wei Shao, Mohammad Saiedur Rahaman, Jun Zhai, Klaus David, Flora D. Salim

    Abstract: HVAC (Heating, Ventilation and Air Conditioning) system is an important part of a building, which constitutes up to 40% of building energy usage. The main purpose of HVAC, maintaining appropriate thermal comfort, is crucial for the best utilisation of energy usage. Besides, thermal comfort is also crucial for well-being, health, and work productivity. Recently, data-driven thermal comfort models h…

    Submitted 20 October, 2020; v1 submitted 29 April, 2020; originally announced April 2020.