Skip to main content

Showing 1–50 of 50 results for author: Zang, Z

  1. arXiv:2406.05766  [pdf, other

    cs.LG cs.AI cs.CL cs.CV

    Gentle-CLIP: Exploring Aligned Semantic In Low-Quality Multimodal Data With Soft Alignment

    Authors: Zijia Song, Zelin Zang, Yelin Wang, Guozheng Yang, Jiangbin Zheng, Kaicheng yu, Wanyu Chen, Stan Z. Li

    Abstract: Multimodal fusion breaks through the barriers between diverse modalities and has already yielded numerous impressive performances. However, in various specialized fields, it is struggling to obtain sufficient alignment data for the training process, which seriously limits the use of previously elegant models. Thus, semi-supervised learning attempts to achieve multimodal alignment with fewer matche… ▽ More

    Submitted 9 June, 2024; originally announced June 2024.

  2. arXiv:2406.01627  [pdf, other

    q-bio.GN cs.LG

    GenBench: A Benchmarking Suite for Systematic Evaluation of Genomic Foundation Models

    Authors: Zicheng Liu, Jiahui Li, Siyuan Li, Zelin Zang, Cheng Tan, Yufei Huang, Yajing Bai, Stan Z. Li

    Abstract: The Genomic Foundation Model (GFM) paradigm is expected to facilitate the extraction of generalizable representations from massive genomic data, thereby enabling their application across a spectrum of downstream applications. Despite advancements, a lack of evaluation framework makes it difficult to ensure equitable assessment due to experimental settings, model intricacy, benchmark datasets, and… ▽ More

    Submitted 5 June, 2024; v1 submitted 1 June, 2024; originally announced June 2024.

  3. arXiv:2405.16258  [pdf, other

    cs.LG cs.AI eess.SY

    USD: Unsupervised Soft Contrastive Learning for Fault Detection in Multivariate Time Series

    Authors: Hong Liu, Xiuxiu Qiu, Yiming Shi, Zelin Zang

    Abstract: Unsupervised fault detection in multivariate time series is critical for maintaining the integrity and efficiency of complex systems, with current methodologies largely focusing on statistical and machine learning techniques. However, these approaches often rest on the assumption that data distributions conform to Gaussian models, overlooking the diversity of patterns that can manifest in both nor… ▽ More

    Submitted 25 May, 2024; originally announced May 2024.

    Comments: 19 pages, 7 figures, under review

  4. arXiv:2404.13288  [pdf, other

    cs.RO cs.CV

    PoseINN: Realtime Visual-based Pose Regression and Localization with Invertible Neural Networks

    Authors: Zirui Zang, Ahmad Amine, Rahul Mangharam

    Abstract: Estimating ego-pose from cameras is an important problem in robotics with applications ranging from mobile robotics to augmented reality. While SOTA models are becoming increasingly accurate, they can still be unwieldy due to high computational costs. In this paper, we propose to solve the problem by using invertible neural networks (INN) to find the mapping between the latent space of images and… ▽ More

    Submitted 7 May, 2024; v1 submitted 20 April, 2024; originally announced April 2024.

  5. arXiv:2404.10667  [pdf, other

    cs.CV

    VASA-1: Lifelike Audio-Driven Talking Faces Generated in Real Time

    Authors: Sicheng Xu, Guojun Chen, Yu-Xiao Guo, Jiaolong Yang, Chong Li, Zhenyu Zang, Yizhong Zhang, Xin Tong, Baining Guo

    Abstract: We introduce VASA, a framework for generating lifelike talking faces with appealing visual affective skills (VAS) given a single static image and a speech audio clip. Our premiere model, VASA-1, is capable of not only producing lip movements that are exquisitely synchronized with the audio, but also capturing a large spectrum of facial nuances and natural head motions that contribute to the percep… ▽ More

    Submitted 16 April, 2024; originally announced April 2024.

    Comments: Tech Report. Project webpage: https://www.microsoft.com/en-us/research/project/vasa-1/

  6. Pattern-Based Peephole Optimizations with Java JIT Tests

    Authors: Zhiqiang Zang, Aditya Thimmaiah, Milos Gligoric

    Abstract: We present JOG, a framework that facilitates developing Java JIT peephole optimizations alongside JIT tests. JOG enables developers to write a pattern, in Java itself, that specifies desired code transformations by writing code before and after the optimization, as well as any necessary preconditions. Such patterns can be written in the same way that tests of the optimization are already written i… ▽ More

    Submitted 17 March, 2024; originally announced March 2024.

    Comments: 12 pages, 9 figures, 3 tables, published in ISSTA 2023 (Research Papers track)

  7. Java JIT Testing with Template Extraction

    Authors: Zhiqiang Zang, Fu-Yao Yu, Aditya Thimmaiah, August Shi, Milos Gligoric

    Abstract: We present LeJit, a template-based framework for testing Java just-in-time (JIT) compilers. Like recent template-based frameworks, LeJit executes a template -- a program with holes to be filled -- to generate concrete programs given as inputs to Java JIT compilers. LeJit automatically generates template programs from existing Java code by converting expressions to holes, as well as generating nece… ▽ More

    Submitted 7 July, 2024; v1 submitted 17 March, 2024; originally announced March 2024.

    Comments: 23 pages, 6 figures, 8 tables, accepted in FSE 2024 (Research Papers track)

  8. arXiv:2402.18233  [pdf, other

    cs.CV

    Zero-Shot Aerial Object Detection with Visual Description Regularization

    Authors: Zhengqing Zang, Chenyu Lin, Chenwei Tang, Tao Wang, Jiancheng Lv

    Abstract: Existing object detection models are mainly trained on large-scale labeled datasets. However, annotating data for novel aerial object classes is expensive since it is time-consuming and may require expert knowledge. Thus, it is desirable to study label-efficient object detection methods on aerial images. In this work, we propose a zero-shot method for aerial object detection named visual Descripti… ▽ More

    Submitted 1 March, 2024; v1 submitted 28 February, 2024; originally announced February 2024.

    Comments: 13 pages, 3 figures

  9. arXiv:2402.16901  [pdf, other

    q-bio.GN cs.AI cs.LG

    FGBERT: Function-Driven Pre-trained Gene Language Model for Metagenomics

    Authors: ChenRui Duan, Zelin Zang, Yongjie Xu, Hang He, Zihan Liu, Zijia Song, Ju-Sheng Zheng, Stan Z. Li

    Abstract: Metagenomic data, comprising mixed multi-species genomes, are prevalent in diverse environments like oceans and soils, significantly impacting human health and ecological functions. However, current research relies on K-mer representations, limiting the capture of structurally relevant gene contexts. To address these limitations and further our understanding of complex relationships between metage… ▽ More

    Submitted 24 February, 2024; originally announced February 2024.

  10. arXiv:2402.13144  [pdf, other

    cs.LG cs.CV

    Neural Network Parameter Diffusion

    Authors: Kai Wang, Zhaopan Xu, Yukun Zhou, Zelin Zang, Trevor Darrell, Zhuang Liu, Yang You

    Abstract: Diffusion models have achieved remarkable success in image and video generation. In this work, we demonstrate that diffusion models can also \textit{generate high-performing neural network parameters}. Our approach is simple, utilizing an autoencoder and a standard latent diffusion model. The autoencoder extracts latent representations of a subset of the trained network parameters. A diffusion mod… ▽ More

    Submitted 28 May, 2024; v1 submitted 20 February, 2024; originally announced February 2024.

    Comments: We introduce a novel approach for parameter generation, named neural network parameter diffusion (\textbf{p-diff}), which employs a standard latent diffusion model to synthesize a new set of parameters

  11. arXiv:2402.09416  [pdf, other

    q-bio.BM cs.LG

    Deep Manifold Transformation for Protein Representation Learning

    Authors: Bozhen Hu, Zelin Zang, Cheng Tan, Stan Z. Li

    Abstract: Protein representation learning is critical in various tasks in biology, such as drug design and protein structure or function prediction, which has primarily benefited from protein language models and graph neural networks. These models can capture intrinsic patterns from protein sequences and structures through masking and task-related losses. However, the learned protein representations are usu… ▽ More

    Submitted 12 January, 2024; originally announced February 2024.

    Comments: This work has been accepted by ICASSP 2024

  12. arXiv:2402.09325  [pdf, other

    cs.CV cs.RO

    PC-NeRF: Parent-Child Neural Radiance Fields Using Sparse LiDAR Frames in Autonomous Driving Environments

    Authors: Xiuzhong Hu, Guangming Xiong, Zheng Zang, Peng Jia, Yuxuan Han, Junyi Ma

    Abstract: Large-scale 3D scene reconstruction and novel view synthesis are vital for autonomous vehicles, especially utilizing temporally sparse LiDAR frames. However, conventional explicit representations remain a significant bottleneck towards representing the reconstructed and synthetic scenes at unlimited resolution. Although the recently developed neural radiance fields (NeRF) have shown compelling res… ▽ More

    Submitted 14 February, 2024; originally announced February 2024.

    Comments: arXiv admin note: substantial text overlap with arXiv:2310.00874

  13. arXiv:2402.05441  [pdf

    cs.CV eess.IV

    Spiking Neural Network Enhanced Hand Gesture Recognition Using Low-Cost Single-photon Avalanche Diode Array

    Authors: Zhenya Zang, Xingda Li, David Day Uei Li

    Abstract: We present a compact spiking convolutional neural network (SCNN) and spiking multilayer perceptron (SMLP) to recognize ten different gestures in dark and bright light environments, using a $9.6 single-photon avalanche diode (SPAD) array. In our hand gesture recognition (HGR) system, photon intensity data was leveraged to train and test the network. A vanilla convolutional neural network (CNN) was… ▽ More

    Submitted 8 February, 2024; originally announced February 2024.

    Comments: 9 pages, 5 figures

  14. arXiv:2401.10973  [pdf, other

    cs.MA cs.LG

    T2MAC: Targeted and Trusted Multi-Agent Communication through Selective Engagement and Evidence-Driven Integration

    Authors: Chuxiong Sun, Zehua Zang, Jiabao Li, Jiangmeng Li, Xiao Xu, Rui Wang, Changwen Zheng

    Abstract: Communication stands as a potent mechanism to harmonize the behaviors of multiple agents. However, existing works primarily concentrate on broadcast communication, which not only lacks practicality, but also leads to information redundancy. This surplus, one-fits-all information could adversely impact the communication efficiency. Furthermore, existing works often resort to basic mechanisms to int… ▽ More

    Submitted 19 January, 2024; originally announced January 2024.

    Comments: AAAI24

  15. Building a Reusable and Extensible Automatic Compiler Infrastructure for Reconfigurable Devices

    Authors: Zhenya Zang, Uwe Dolinsky, Pietro Ghiglio, Stefano Cherubin, Mehdi Goli, Shufan Yang

    Abstract: Multi-Level Intermediate Representation (MLIR) is gaining increasing attention in reconfigurable hardware communities due to its capability to represent various abstract levels for software compilers. This project aims to be the first to provide an end-to-end framework that leverages open-source, cross-platform compilation technology to generate MLIR from SYCL. Additionally, it aims to explore a l… ▽ More

    Submitted 14 December, 2023; originally announced January 2024.

    Comments: 2023 33rd International Conference on Field-Programmable Logic and Applications (FPL)

  16. arXiv:2401.07543  [pdf, other

    cs.CE cs.AI

    Must: Maximizing Latent Capacity of Spatial Transcriptomics Data

    Authors: Zelin Zang, Liangyu Li, Yongjie Xu, Chenrui Duan, Kai Wang, Yang You, Yi Sun, Stan Z. Li

    Abstract: Spatial transcriptomics (ST) technologies have revolutionized the study of gene expression patterns in tissues by providing multimodality data in transcriptomic, spatial, and morphological, offering opportunities for understanding tissue biology beyond transcriptomics. However, we identify the modality bias phenomenon in ST data species, i.e., the inconsistent contribution of different modalities… ▽ More

    Submitted 15 January, 2024; originally announced January 2024.

    Comments: 30 pages and 6 figures, plus 27 pages and 14 figures in appendices

  17. arXiv:2401.06727  [pdf, other

    cs.LG

    Deep Manifold Graph Auto-Encoder for Attributed Graph Embedding

    Authors: Bozhen Hu, Zelin Zang, Jun Xia, Lirong Wu, Cheng Tan, Stan Z. Li

    Abstract: Representing graph data in a low-dimensional space for subsequent tasks is the purpose of attributed graph embedding. Most existing neural network approaches learn latent representations by minimizing reconstruction errors. Rare work considers the data distribution and the topological structure of latent codes simultaneously, which often results in inferior embeddings in real-world graph data. Thi… ▽ More

    Submitted 12 January, 2024; originally announced January 2024.

    Comments: This work has been accepted by ICASSP2023, due to download limitations, we upload this work here

    Journal ref: In ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (pp. 1-5). IEEE

  18. arXiv:2401.05578   

    cs.LG

    Fast Cerebral Blood Flow Analysis via Extreme Learning Machine

    Authors: Xi Chen, Zhenya Zang, Xingda Li

    Abstract: We introduce a rapid and precise analytical approach for analyzing cerebral blood flow (CBF) using Diffuse Correlation Spectroscopy (DCS) with the application of the Extreme Learning Machine (ELM). Our evaluation of ELM and existing algorithms involves a comprehensive set of metrics. We assess these algorithms using synthetic datasets for both semi-infinite and multi-layer models. The results demo… ▽ More

    Submitted 1 February, 2024; v1 submitted 10 January, 2024; originally announced January 2024.

    Comments: Not ready to submission. Need further correction

  19. arXiv:2401.02713  [pdf, other

    cs.LG cs.AI q-bio.BM

    Graph-level Protein Representation Learning by Structure Knowledge Refinement

    Authors: Ge Wang, Zelin Zang, Jiangbin Zheng, Jun Xia, Stan Z. Li

    Abstract: This paper focuses on learning representation on the whole graph level in an unsupervised manner. Learning graph-level representation plays an important role in a variety of real-world issues such as molecule property prediction, protein structure feature extraction, and social network analysis. The mainstream method is utilizing contrastive learning to facilitate graph feature extraction, known a… ▽ More

    Submitted 5 January, 2024; originally announced January 2024.

  20. Unsupervised Social Event Detection via Hybrid Graph Contrastive Learning and Reinforced Incremental Clustering

    Authors: Yuanyuan Guo, Zehua Zang, Hang Gao, Xiao Xu, Rui Wang, Lixiang Liu, Jiangmeng Li

    Abstract: Detecting events from social media data streams is gradually attracting researchers. The innate challenge for detecting events is to extract discriminative information from social media data thereby assigning the data into different events. Due to the excessive diversity and high updating frequency of social data, using supervised approaches to detect events from social messages is hardly achieved… ▽ More

    Submitted 15 December, 2023; v1 submitted 8 December, 2023; originally announced December 2023.

    Comments: Accepted by Knowledge-Based Systems

  21. arXiv:2310.08986  [pdf, other

    cs.CV

    VCL Challenges 2023 at ICCV 2023 Technical Report: Bi-level Adaptation Method for Test-time Adaptive Object Detection

    Authors: Chenyu Lin, Yusheng He, Zhengqing Zang, Chenwei Tang, Tao Wang, Jiancheng Lv

    Abstract: This report outlines our team's participation in VCL Challenges B Continual Test_time Adaptation, focusing on the technical details of our approach. Our primary focus is Testtime Adaptation using bi_level adaptations, encompassing image_level and detector_level adaptations. At the image level, we employ adjustable parameterbased image filters, while at the detector level, we leverage adjustable pa… ▽ More

    Submitted 13 October, 2023; originally announced October 2023.

  22. arXiv:2310.00874  [pdf, other

    cs.CV cs.RO

    PC-NeRF: Parent-Child Neural Radiance Fields under Partial Sensor Data Loss in Autonomous Driving Environments

    Authors: Xiuzhong Hu, Guangming Xiong, Zheng Zang, Peng Jia, Yuxuan Han, Junyi Ma

    Abstract: Reconstructing large-scale 3D scenes is essential for autonomous vehicles, especially when partial sensor data is lost. Although the recently developed neural radiance fields (NeRF) have shown compelling results in implicit representations, the large-scale 3D scene reconstruction using partially lost LiDAR point cloud data still needs to be explored. To bridge this gap, we propose a novel 3D scene… ▽ More

    Submitted 1 October, 2023; originally announced October 2023.

  23. arXiv:2309.07909  [pdf, other

    cs.LG cs.CE cs.CV

    DiffAug: Enhance Unsupervised Contrastive Learning with Domain-Knowledge-Free Diffusion-based Data Augmentation

    Authors: Zelin Zang, Hao Luo, Kai Wang, Panpan Zhang, Fan Wang, Stan. Z Li, Yang You

    Abstract: Unsupervised Contrastive learning has gained prominence in fields such as vision, and biology, leveraging predefined positive/negative samples for representation learning. Data augmentation, categorized into hand-designed and model-based methods, has been identified as a crucial component for enhancing contrastive learning. However, hand-designed methods require human expertise in domain-specific… ▽ More

    Submitted 25 May, 2024; v1 submitted 10 September, 2023; originally announced September 2023.

    Comments: accepted by ICML24

  24. arXiv:2303.17108  [pdf, other

    physics.optics cs.GR

    B-spline freeform surface tailoring for prescribed irradiance based on differentiable ray-tracing

    Authors: Haoqiang Wang, Zihan Zang, Yunpeng Xu, Yanjun Han, Hongtao Li, Yi Luo

    Abstract: A universal and flexible design method for freeform surface that can modulate the distribution of an zero-étendue source to an arbitrary irradiance distribution is a significant challenge in the field of non-imaging optics. Current design methods typically formulate the problem as a partial differential equation and solve it through sophisticated numerical methods, especially for off-axis situatio… ▽ More

    Submitted 29 March, 2023; originally announced March 2023.

  25. arXiv:2303.13694  [pdf, other

    cs.RO eess.SY

    Ensemble Gaussian Processes for Adaptive Autonomous Driving on Multi-friction Surfaces

    Authors: Tomáš Nagy, Ahmad Amine, Truong X. Nghiem, Ugo Rosolia, Zirui Zang, Rahul Mangharam

    Abstract: Driving under varying road conditions is challenging, especially for autonomous vehicles that must adapt in real-time to changes in the environment, e.g., rain, snow, etc. It is difficult to apply offline learning-based methods in these time-varying settings, as the controller should be trained on datasets representing all conditions it might encounter in the future. While online learning may adap… ▽ More

    Submitted 26 May, 2023; v1 submitted 23 March, 2023; originally announced March 2023.

    Comments: 8 pages, 12 figures, accepted for publication in IFAC World Congress 2023

  26. arXiv:2212.00544  [pdf, other

    cs.RO

    Towards Explainability in Modular Autonomous Vehicle Software

    Authors: Hongrui Zheng, Zirui Zang, Shuo Yang, Rahul Mangharam

    Abstract: Safety-critical Autonomous Systems require trustworthy and transparent decision-making process to be deployable in the real world. The advancement of Machine Learning introduces high performance but largely through black-box algorithms. We focus the discussion of explainability specifically with Autonomous Vehicles (AVs). As a safety-critical system, AVs provide the unique opportunity to utilize c… ▽ More

    Submitted 1 December, 2022; originally announced December 2022.

  27. arXiv:2211.15478  [pdf, other

    cs.LG cs.AI

    EVNet: An Explainable Deep Network for Dimension Reduction

    Authors: Zelin Zang, Shenghui Cheng, Linyan Lu, Hanchen Xia, Liangyu Li, Yaoting Sun, Yongjie Xu, Lei Shang, Baigui Sun, Stan Z. Li

    Abstract: Dimension reduction (DR) is commonly utilized to capture the intrinsic structure and transform high-dimensional data into low-dimensional space while retaining meaningful properties of the original data. It is used in various applications, such as image recognition, single-cell sequencing analysis, and biomarker discovery. However, contemporary parametric-free and parametric DR techniques suffer f… ▽ More

    Submitted 21 November, 2022; originally announced November 2022.

    Comments: 18 pages, 15 figures, accepted by TVCG

  28. arXiv:2211.11262  [pdf, other

    cs.CV

    Boosting Novel Category Discovery Over Domains with Soft Contrastive Learning and All-in-One Classifier

    Authors: Zelin Zang, Lei Shang, Senqiao Yang, Fei Wang, Baigui Sun, Xuansong Xie, Stan Z. Li

    Abstract: Unsupervised domain adaptation (UDA) has proven to be highly effective in transferring knowledge from a label-rich source domain to a label-scarce target domain. However, the presence of additional novel categories in the target domain has led to the development of open-set domain adaptation (ODA) and universal domain adaptation (UNDA). Existing ODA and UNDA methods treat all novel categories as a… ▽ More

    Submitted 23 July, 2023; v1 submitted 21 November, 2022; originally announced November 2022.

    Comments: Accepted by ICCV

  29. arXiv:2211.03576  [pdf, ps, other

    cs.CV eess.IV physics.optics

    DAD vision: opto-electronic co-designed computer vision with division adjoint method

    Authors: Zihan Zang, Haoqiang Wang, Yunpeng Xu

    Abstract: The miniaturization and mobility of computer vision systems are limited by the heavy computational burden and the size of optical lenses. Here, we propose to use a ultra-thin diffractive optical element to implement passive optical convolution. A division adjoint opto-electronic co-design method is also proposed. In our simulation experiments, the first few convolutional layers of the neural netwo… ▽ More

    Submitted 4 November, 2022; originally announced November 2022.

  30. arXiv:2209.11925  [pdf, other

    cs.RO

    Local_INN: Implicit Map Representation and Localization with Invertible Neural Networks

    Authors: Zirui Zang, Hongrui Zheng, Johannes Betz, Rahul Mangharam

    Abstract: Robot localization is an inverse problem of finding a robot's pose using a map and sensor measurements. In recent years, Invertible Neural Networks (INNs) have successfully solved ambiguous inverse problems in various fields. This paper proposes a framework that solves the localization problem with INN. We design an INN that provides implicit map representation in the forward path and localization… ▽ More

    Submitted 24 September, 2022; originally announced September 2022.

  31. arXiv:2209.11181  [pdf, other

    cs.RO eess.SY

    Teaching Autonomous Systems Hands-On: Leveraging Modular Small-Scale Hardware in the Robotics Classroom

    Authors: Johannes Betz, Hongrui Zheng, Zirui Zang, Florian Sauerbeck, Krzysztof Walas, Velin Dimitrov, Madhur Behl, Rosa Zheng, Joydeep Biswas, Venkat Krovi, Rahul Mangharam

    Abstract: Although robotics courses are well established in higher education, the courses often focus on theory and sometimes lack the systematic coverage of the techniques involved in developing, deploying, and applying software to real hardware. Additionally, most hardware platforms for robotics teaching are low-level toys aimed at younger students at middle-school levels. To address this gap, an autonomo… ▽ More

    Submitted 20 September, 2022; originally announced September 2022.

    Comments: 15 pages, 12 figures, 3 tables

  32. Compiler Testing using Template Java Programs

    Authors: Zhiqiang Zang, Nathan Wiatrek, Milos Gligoric, August Shi

    Abstract: We present JAttack, a framework that enables template-based testing for compilers. Using JAttack, a developer writes a template program that describes a set of programs to be generated and given as test inputs to a compiler. Such a framework enables developers to incorporate their domain knowledge on testing compilers, giving a basic program structure that allows for exploring complex programs tha… ▽ More

    Submitted 9 September, 2022; originally announced September 2022.

    Comments: 13 pages, 6 figures, 2 tables, accepted in ASE 2022 (Research Papers track)

  33. arXiv:2207.03809  [pdf, other

    cs.LG

    UDRN: Unified Dimensional Reduction Neural Network for Feature Selection and Feature Projection

    Authors: Zelin Zang, Yongjie Xu, Linyan Lu, Yulan Geng, Senqiao Yang, Stan Z. Li

    Abstract: Dimensional reduction~(DR) maps high-dimensional data into a lower dimensions latent space with minimized defined optimization objectives. The DR method usually falls into feature selection~(FS) and feature projection~(FP). FS focuses on selecting a critical subset of dimensions but risks destroying the data distribution (structure). On the other hand, FP combines all the input features into lower… ▽ More

    Submitted 22 November, 2022; v1 submitted 8 July, 2022; originally announced July 2022.

    Comments: 14 pages, 7 figures

  34. arXiv:2207.03160  [pdf, other

    cs.LG

    DLME: Deep Local-flatness Manifold Embedding

    Authors: Zelin Zang, Siyuan Li, Di Wu, Ge Wang, Lei Shang, Baigui Sun, Hao Li, Stan Z. Li

    Abstract: Manifold learning (ML) aims to seek low-dimensional embedding from high-dimensional data. The problem is challenging on real-world datasets, especially with under-sampling data, and we find that previous methods perform poorly in this case. Generally, ML methods first transform input data into a low-dimensional embedding space to maintain the data's geometric structure and subsequently perform dow… ▽ More

    Submitted 25 July, 2022; v1 submitted 7 July, 2022; originally announced July 2022.

    Comments: 16 pages, 7 figures

  35. Winning the 3rd Japan Automotive AI Challenge -- Autonomous Racing with the Autoware.Auto Open Source Software Stack

    Authors: Zirui Zang, Renukanandan Tumu, Johannes Betz, Hongrui Zheng, Rahul Mangharam

    Abstract: The 3rd Japan Automotive AI Challenge was an international online autonomous racing challenge where 164 teams competed in December 2021. This paper outlines the winning strategy to this competition, and the advantages and challenges of using the Autoware.Auto open source autonomous driving platform for multi-agent racing. Our winning approach includes a lane-switching opponent overtaking strategy,… ▽ More

    Submitted 4 June, 2022; v1 submitted 1 June, 2022; originally announced June 2022.

    Comments: Accepted at Autoware Workshop at IV 2022

  36. arXiv:2205.13943  [pdf, other

    cs.CV cs.AI

    Architecture-Agnostic Masked Image Modeling -- From ViT back to CNN

    Authors: Siyuan Li, Di Wu, Fang Wu, Zelin Zang, Stan. Z. Li

    Abstract: Masked image modeling, an emerging self-supervised pre-training method, has shown impressive success across numerous downstream vision tasks with Vision transformers. Its underlying idea is simple: a portion of the input image is masked out and then reconstructed via a pre-text task. However, the working principle behind MIM is not well explained, and previous studies insist that MIM primarily wor… ▽ More

    Submitted 2 June, 2023; v1 submitted 27 May, 2022; originally announced May 2022.

    Comments: ICML 2023 (poster). The source code will be released in https://github.com/Westlake-AI/A2MIM

  37. arXiv:2203.13754  [pdf

    physics.bio-ph cs.LG

    Fast fluorescence lifetime imaging analysis via extreme learning machine

    Authors: Zhenya Zang, Dong Xiao, Quan Wang, Zinuo Li, Wujun Xie, Yu Chen, David Day Uei Li

    Abstract: We present a fast and accurate analytical method for fluorescence lifetime imaging microscopy (FLIM) using the extreme learning machine (ELM). We used extensive metrics to evaluate ELM and existing algorithms. First, we compared these algorithms using synthetic datasets. Results indicate that ELM can obtain higher fidelity, even in low-photon conditions. Afterwards, we used ELM to retrieve lifetim… ▽ More

    Submitted 25 March, 2022; originally announced March 2022.

    Comments: 14 pages, 9 figures

  38. arXiv:2110.14553  [pdf, other

    cs.LG cs.AI cs.CV

    GenURL: A General Framework for Unsupervised Representation Learning

    Authors: Siyuan Li, Zicheng Liu, Zelin Zang, Di Wu, Zhiyuan Chen, Stan Z. Li

    Abstract: Unsupervised representation learning (URL), which learns compact embeddings of high-dimensional data without supervision, has made remarkable progress recently. However, the development of URLs for different requirements is independent, which limits the generalization of the algorithms, especially prohibitive as the number of tasks grows. For example, dimension reduction methods, t-SNE, and UMAP o… ▽ More

    Submitted 16 April, 2024; v1 submitted 27 October, 2021; originally announced October 2021.

    Comments: TNNLS 2024 version with 13 pages and 14 figures

  39. Surrogate Representation Learning with Isometric Mapping for Gray-box Graph Adversarial Attacks

    Authors: Zihan Liu, Yun Luo, Zelin Zang, Stan Z. Li

    Abstract: Gray-box graph attacks aim at disrupting the performance of the victim model by using inconspicuous attacks with limited knowledge of the victim model. The parameters of the victim model and the labels of the test nodes are invisible to the attacker. To obtain the gradient on the node attributes or graph structure, the attacker constructs an imaginary surrogate model trained under supervision. How… ▽ More

    Submitted 22 February, 2022; v1 submitted 20 October, 2021; originally announced October 2021.

    Journal ref: WSDM22: Proceedings of the Fifteenth ACM International Conference on Web Search and Data Mining February 2022

  40. arXiv:2106.15788  [pdf, other

    cs.CV

    Exploring Localization for Self-supervised Fine-grained Contrastive Learning

    Authors: Di Wu, Siyuan Li, Zelin Zang, Stan Z. Li

    Abstract: Self-supervised contrastive learning has demonstrated great potential in learning visual representations. Despite their success in various downstream tasks such as image classification and object detection, self-supervised pre-training for fine-grained scenarios is not fully explored. We point out that current contrastive methods are prone to memorizing background/foreground texture and therefore… ▽ More

    Submitted 11 October, 2022; v1 submitted 29 June, 2021; originally announced June 2021.

    Comments: BMVC 2022 camera-ready. 15 pages (main) with 5 pages appendix

  41. arXiv:2104.13048  [pdf, other

    cs.LG cs.AI

    Unsupervised Deep Manifold Attributed Graph Embedding

    Authors: Zelin Zang, Siyuan Li, Di Wu, Jianzhu Guo, Yongjie Xu, Stan Z. Li

    Abstract: Unsupervised attributed graph representation learning is challenging since both structural and feature information are required to be represented in the latent space. Existing methods concentrate on learning latent representation via reconstruction tasks, but cannot directly optimize representation and are prone to oversmoothing, thus limiting the applications on downstream tasks. To alleviate the… ▽ More

    Submitted 27 April, 2021; originally announced April 2021.

    Comments: arXiv admin note: text overlap with arXiv:2007.01594 by other authors

  42. arXiv:2012.00481  [pdf, other

    cs.LG cs.AI stat.ML

    Consistent Representation Learning for High Dimensional Data Analysis

    Authors: Stan Z. Li, Lirong Wu, Zelin Zang

    Abstract: High dimensional data analysis for exploration and discovery includes three fundamental tasks: dimensionality reduction, clustering, and visualization. When the three associated tasks are done separately, as is often the case thus far, inconsistencies can occur among the tasks in terms of data geometry and others. This can lead to confusing or misleading data interpretation. In this paper, we prop… ▽ More

    Submitted 1 December, 2020; originally announced December 2020.

  43. arXiv:2010.14831  [pdf, other

    cs.LG cs.CV cs.HC stat.ML

    Deep Manifold Transformation for Nonlinear Dimensionality Reduction

    Authors: Stan Z. Li, Zelin Zang, Lirong Wu

    Abstract: Manifold learning-based encoders have been playing important roles in nonlinear dimensionality reduction (NLDR) for data exploration. However, existing methods can often fail to preserve geometric, topological and/or distributional structures of data. In this paper, we propose a deep manifold learning framework, called deep manifold transformation (DMT) for unsupervised NLDR and embedding learning… ▽ More

    Submitted 3 May, 2021; v1 submitted 28 October, 2020; originally announced October 2020.

  44. arXiv:2010.04012  [pdf, other

    cs.LG cs.AI

    Invertible Manifold Learning for Dimension Reduction

    Authors: Siyuan Li, Haitao Lin, Zelin Zang, Lirong Wu, Jun Xia, Stan Z. Li

    Abstract: Dimension reduction (DR) aims to learn low-dimensional representations of high-dimensional data with the preservation of essential information. In the context of manifold learning, we define that the representation after information-lossless DR preserves the topological and geometric properties of data manifolds formally, and propose a novel two-stage DR method, called invertible manifold learning… ▽ More

    Submitted 30 June, 2021; v1 submitted 7 October, 2020; originally announced October 2020.

    Comments: ECML-PKDD 2021 camera-ready. 15 pages (main) with 10 pages appendix

  45. arXiv:2009.09590  [pdf, other

    cs.LG cs.AI stat.ML

    Generalized Clustering and Multi-Manifold Learning with Geometric Structure Preservation

    Authors: Lirong Wu, Zicheng Liu, Zelin Zang, Jun Xia, Siyuan Li, Stan. Z Li

    Abstract: Though manifold-based clustering has become a popular research topic, we observe that one important factor has been omitted by these works, namely that the defined clustering loss may corrupt the local and global structure of the latent space. In this paper, we propose a novel Generalized Clustering and Multi-manifold Learning (GCML) framework with geometric structure preservation for generalized… ▽ More

    Submitted 8 October, 2021; v1 submitted 20 September, 2020; originally announced September 2020.

  46. arXiv:2006.08256  [pdf, other

    cs.LG stat.ML

    Markov-Lipschitz Deep Learning

    Authors: Stan Z. Li, Zelin Zang, Lirong Wu

    Abstract: We propose a novel framework, called Markov-Lipschitz deep learning (MLDL), to tackle geometric deterioration caused by collapse, twisting, or crossing in vector-based neural network transformations for manifold-based representation learning and manifold data generation. A prior constraint, called locally isometric smoothness (LIS), is imposed across-layers and encoded into a Markov random field (… ▽ More

    Submitted 30 September, 2020; v1 submitted 15 June, 2020; originally announced June 2020.

  47. arXiv:2003.02631  [pdf, other

    eess.SP cs.LG stat.ML

    Machine Learning for Predictive Deployment of UAVs with Multiple Access

    Authors: Linyan Lu, Zhaohui Yang, Mingzhe Chen, Zelin Zang, Mohammad Shikh-Bahaei

    Abstract: In this paper, a machine learning based deployment framework of unmanned aerial vehicles (UAVs) is studied. In the considered model, UAVs are deployed as flying base stations (BS) to offload heavy traffic from ground BSs. Due to time-varying traffic distribution, a long short-term memory (LSTM) based prediction algorithm is introduced to predict the future cellular traffic. To predict the user ser… ▽ More

    Submitted 30 July, 2020; v1 submitted 1 March, 2020; originally announced March 2020.

  48. arXiv:1605.09458  [pdf, other

    cs.LG

    Training Auto-encoders Effectively via Eliminating Task-irrelevant Input Variables

    Authors: Hui Shen, Dehua Li, Hong Wu, Zhaoxiang Zang

    Abstract: Auto-encoders are often used as building blocks of deep network classifier to learn feature extractors, but task-irrelevant information in the input data may lead to bad extractors and result in poor generalization performance of the network. In this paper,via dropping the task-irrelevant input variables the performance of auto-encoders can be obviously improved .Specifically, an importance-based… ▽ More

    Submitted 30 May, 2016; originally announced May 2016.

  49. arXiv:1604.07704  [pdf

    cs.AI cs.NE

    Tournament selection in zeroth-level classifier systems based on average reward reinforcement learning

    Authors: Zhaoxiang Zang, Zhao Li, Junying Wang, Zhiping Dan

    Abstract: As a genetics-based machine learning technique, zeroth-level classifier system (ZCS) is based on a discounted reward reinforcement learning algorithm, bucket-brigade algorithm, which optimizes the discounted total reward received by an agent but is not suitable for all multi-step problems, especially large-size ones. There are some undiscounted reinforcement learning methods available, such as R-l… ▽ More

    Submitted 26 April, 2016; originally announced April 2016.

    Comments: 14 pages, 3 figures

    ACM Class: I.2

  50. Learning classifier systems with memory condition to solve non-Markov problems

    Authors: Zhaoxiang Zang, Dehua Li, Junying Wang

    Abstract: In the family of Learning Classifier Systems, the classifier system XCS has been successfully used for many applications. However, the standard XCS has no memory mechanism and can only learn optimal policy in Markov environments, where the optimal action is determined solely by the state of current sensory input. In practice, most environments are partially observable environments on agent's sensa… ▽ More

    Submitted 2 November, 2012; originally announced November 2012.

    Comments: 34 pages, 15 figures, 1 table