Skip to main content

Showing 1–50 of 74 results for author: Ke, Z

  1. arXiv:2407.07364  [pdf, other

    cs.LG cs.AI eess.SY

    Real-time system optimal traffic routing under uncertainties -- Can physics models boost reinforcement learning?

    Authors: Zemian Ke, Qiling Zou, Jiachao Liu, Sean Qian

    Abstract: System optimal traffic routing can mitigate congestion by assigning routes for a portion of vehicles so that the total travel time of all vehicles in the transportation system can be reduced. However, achieving real-time optimal routing poses challenges due to uncertain demands and unknown system dynamics, particularly in expansive transportation networks. While physics model-based methods are sen… ▽ More

    Submitted 10 July, 2024; originally announced July 2024.

  2. arXiv:2406.05391  [pdf, other

    cs.LG

    DUPLEX: Dual GAT for Complex Embedding of Directed Graphs

    Authors: Zhaoru Ke, Hang Yu, Jianguo Li, Haipeng Zhang

    Abstract: Current directed graph embedding methods build upon undirected techniques but often inadequately capture directed edge information, leading to challenges such as: (1) Suboptimal representations for nodes with low in/out-degrees, due to the insufficient neighbor interactions; (2) Limited inductive ability for representing new nodes post-training; (3) Narrow generalizability, as training is overly c… ▽ More

    Submitted 8 June, 2024; originally announced June 2024.

  3. arXiv:2406.01598  [pdf

    cs.CV cs.DB cs.RO

    D2E-An Autonomous Decision-making Dataset involving Driver States and Human Evaluation

    Authors: Zehong Ke, Yanbo Jiang, Yuning Wang, Hao Cheng, Jinhao Li, Jianqiang Wang

    Abstract: With the advancement of deep learning technology, data-driven methods are increasingly used in the decision-making of autonomous driving, and the quality of datasets greatly influenced the model performance. Although current datasets have made significant progress in the collection of vehicle and environment data, emphasis on human-end data including the driver states and human evaluation is not s… ▽ More

    Submitted 12 April, 2024; originally announced June 2024.

    Comments: Submit for ITSC 2024

  4. arXiv:2405.04900  [pdf, other

    cs.CV

    Self-supervised Gait-based Emotion Representation Learning from Selective Strongly Augmented Skeleton Sequences

    Authors: Cheng Song, Lu Lu, Zhen Ke, Long Gao, Shuai Ding

    Abstract: Emotion recognition is an important part of affective computing. Extracting emotional cues from human gaits yields benefits such as natural interaction, a nonintrusive nature, and remote detection. Recently, the introduction of self-supervised learning techniques offers a practical solution to the issues arising from the scarcity of labeled data in the field of gait-based emotion recognition. Howe… ▽ More

    Submitted 8 May, 2024; originally announced May 2024.

  5. arXiv:2405.04017  [pdf, other

    cs.LG cs.AI math.OC

    An Improved Finite-time Analysis of Temporal Difference Learning with Deep Neural Networks

    Authors: Zhifa Ke, Zaiwen Wen, Junyu Zhang

    Abstract: Temporal difference (TD) learning algorithms with neural network function parameterization have well-established empirical success in many practical large-scale reinforcement learning tasks. However, theoretical understanding of these algorithms remains challenging due to the nonlinearity of the action-value approximation. In this paper, we develop an improved non-asymptotic analysis of the neural… ▽ More

    Submitted 7 May, 2024; originally announced May 2024.

  6. arXiv:2403.15679  [pdf, other

    cs.CV cs.MM

    DS-NeRV: Implicit Neural Video Representation with Decomposed Static and Dynamic Codes

    Authors: Hao Yan, Zhihui Ke, Xiaobo Zhou, Tie Qiu, Xidong Shi, Dadong Jiang

    Abstract: Implicit neural representations for video (NeRV) have recently become a novel way for high-quality video representation. However, existing works employ a single network to represent the entire video, which implicitly confuse static and dynamic information. This leads to an inability to effectively compress the redundant static information and lack the explicitly modeling of global temporal-coheren… ▽ More

    Submitted 22 March, 2024; originally announced March 2024.

    Comments: CVPR 2024. Project page at https://haoyan14.github.io/DS-NeRV

  7. arXiv:2403.11013  [pdf, other

    cs.LG math.ST

    Improved Algorithm and Bounds for Successive Projection

    Authors: Jiashun Jin, Zheng Tracy Ke, Gabriel Moryoussef, Jiajun Tang, Jingming Wang

    Abstract: Given a $K$-vertex simplex in a $d$-dimensional space, suppose we measure $n$ points on the simplex with noise (hence, some of the observed points fall outside the simplex). Vertex hunting is the problem of estimating the $K$ vertices of the simplex. A popular vertex hunting algorithm is successive projection algorithm (SPA). However, SPA is observed to perform unsatisfactorily under strong noise… ▽ More

    Submitted 16 March, 2024; originally announced March 2024.

    Comments: 32 pages, 5 figures

  8. arXiv:2403.00644  [pdf, other

    cs.CV

    Diff-Plugin: Revitalizing Details for Diffusion-based Low-level Tasks

    Authors: Yuhao Liu, Zhanghan Ke, Fang Liu, Nanxuan Zhao, Rynson W. H. Lau

    Abstract: Diffusion models trained on large-scale datasets have achieved remarkable progress in image synthesis. However, due to the randomness in the diffusion process, they often struggle with handling diverse low-level tasks that require details preservation. To overcome this limitation, we present a new Diff-Plugin framework to enable a single pre-trained diffusion model to generate high-fidelity result… ▽ More

    Submitted 28 May, 2024; v1 submitted 1 March, 2024; originally announced March 2024.

    Comments: Accepted to CVPR2024. Replaced some celebrity images to avoid copyright disputes

  9. arXiv:2402.00341  [pdf, other

    cs.CV

    Recasting Regional Lighting for Shadow Removal

    Authors: Yuhao Liu, Zhanghan Ke, Ke Xu, Fang Liu, Zhenwei Wang, Rynson W. H. Lau

    Abstract: Removing shadows requires an understanding of both lighting conditions and object textures in a scene. Existing methods typically learn pixel-level color mappings between shadow and non-shadow images, in which the joint modeling of lighting and object textures is implicit and inadequate. We observe that in a shadow region, the degradation degree of object textures depends on the local illumination… ▽ More

    Submitted 1 February, 2024; originally announced February 2024.

    Comments: AAAI 2024 (Oral)

  10. arXiv:2401.06954  [pdf, other

    cs.CL

    Bridging the Preference Gap between Retrievers and LLMs

    Authors: Zixuan Ke, Weize Kong, Cheng Li, Mingyang Zhang, Qiaozhu Mei, Michael Bendersky

    Abstract: Large Language Models (LLMs) have demonstrated superior results across a wide range of tasks, and Retrieval-augmented Generation (RAG) is an effective way to enhance the performance by locating relevant information and placing it into the context window of the LLM. However, the relationship between retrievers and LLMs in a RAG is still under-investigated. Most existing work treats the retriever an… ▽ More

    Submitted 20 February, 2024; v1 submitted 12 January, 2024; originally announced January 2024.

  11. Recent Advances in Text Analysis

    Authors: Zheng Tracy Ke, Pengsheng Ji, Jiashun Jin, Wanshan Li

    Abstract: Text analysis is an interesting research area in data science and has various applications, such as in artificial intelligence, biomedical research, and engineering. We review popular methods for text analysis, ranging from topic modeling to the recent neural language models. In particular, we review Topic-SCORE, a statistical approach to topic modeling, and discuss how to use it to analyze MADSta… ▽ More

    Submitted 7 February, 2024; v1 submitted 1 January, 2024; originally announced January 2024.

    Journal ref: Annual Review of Statistics and Its Application 2024 11:1

  12. arXiv:2310.16858  [pdf, other

    cs.CV

    4D-Editor: Interactive Object-level Editing in Dynamic Neural Radiance Fields via Semantic Distillation

    Authors: Dadong Jiang, Zhihui Ke, Xiaobo Zhou, Xidong Shi

    Abstract: This paper targets interactive object-level editing (e.g., deletion, recoloring, transformation, composition) in dynamic scenes. Recently, some methods aiming for flexible editing static scenes represented by neural radiance field (NeRF) have shown impressive synthesis quality, while similar capabilities in time-variant dynamic scenes remain limited. To solve this problem, we propose 4D-Editor, an… ▽ More

    Submitted 5 November, 2023; v1 submitted 24 October, 2023; originally announced October 2023.

    Comments: Project page: https://patrickddj.github.io/4D-Editor

  13. arXiv:2310.09436  [pdf, other

    cs.CL cs.AI cs.LG cs.NE

    Sub-network Discovery and Soft-masking for Continual Learning of Mixed Tasks

    Authors: Zixuan Ke, Bing Liu, Wenhan Xiong, Asli Celikyilmaz, Haoran Li

    Abstract: Continual learning (CL) has two main objectives: preventing catastrophic forgetting (CF) and encouraging knowledge transfer (KT). The existing literature mainly focused on overcoming CF. Some work has also been done on KT when the tasks are similar. To our knowledge, only one method has been proposed to learn a sequence of mixed tasks. However, these techniques still suffer from CF and/or limited… ▽ More

    Submitted 13 October, 2023; originally announced October 2023.

    Comments: https://github.com/ZixuanKe/PyContinual

    Journal ref: EMNLP 2023 (findings)

  14. arXiv:2309.09774  [pdf, other

    cs.LG cs.CV

    Towards Self-Adaptive Pseudo-Label Filtering for Semi-Supervised Learning

    Authors: Lei Zhu, Zhanghan Ke, Rynson Lau

    Abstract: Recent semi-supervised learning (SSL) methods typically include a filtering strategy to improve the quality of pseudo labels. However, these filtering strategies are usually hand-crafted and do not change as the model is updated, resulting in a lot of correct pseudo labels being discarded and incorrect pseudo labels being selected during the training process. In this work, we observe that the dist… ▽ More

    Submitted 18 September, 2023; originally announced September 2023.

    Comments: This paper was first submitted to NeurIPS 2021

  15. Where Did the President Visit Last Week? Detecting Celebrity Trips from News Articles

    Authors: Kai Peng, Ying Zhang, Shuai Ling, Zhaoru Ke, Haipeng Zhang

    Abstract: Celebrities' whereabouts are of pervasive importance. For instance, where politicians go, how often they visit, and who they meet, come with profound geopolitical and economic implications. Although news articles contain travel information of celebrities, it is not possible to perform large-scale and network-wise analysis due to the lack of automatic itinerary detection tools. To design such tools… ▽ More

    Submitted 9 October, 2023; v1 submitted 17 July, 2023; originally announced July 2023.

    Comments: Accepted to ICWSM 2024, 12 pages

  16. arXiv:2306.16643  [pdf

    cs.DL cs.SI physics.soc-ph

    Cautious explorers generate more future academic impact

    Authors: Xingsheng Yang, Zhaoru Ke, Qing Ke, Haipeng Zhang, Fengnan Gao

    Abstract: Some scientists are more likely to explore unfamiliar research topics while others tend to exploit existing ones. In previous work, correlations have been found between scientists' topic choices and their career performances. However, literature has yet to untangle the intricate interplay between scientific impact and research topic choices, where scientific exploration and exploitation intertwine… ▽ More

    Submitted 29 June, 2023; v1 submitted 28 June, 2023; originally announced June 2023.

    Comments: 16 pages of main text and 94 pages of supplementary information. v2: Added page number and fixed typo in author list

  17. arXiv:2306.14775  [pdf, other

    cs.LG cs.CV

    Parameter-Level Soft-Masking for Continual Learning

    Authors: Tatsuya Konishi, Mori Kurokawa, Chihiro Ono, Zixuan Ke, Gyuhak Kim, Bing Liu

    Abstract: Existing research on task incremental learning in continual learning has primarily focused on preventing catastrophic forgetting (CF). Although several techniques have achieved learning with no CF, they attain it by letting each task monopolize a sub-network in a shared network, which seriously limits knowledge transfer (KT) and causes over-consumption of the network capacity, i.e., as more tasks… ▽ More

    Submitted 26 June, 2023; originally announced June 2023.

    Comments: ICML2023

  18. arXiv:2306.05363  [pdf, other

    stat.ME cs.LG math.ST stat.AP

    Subject clustering by IF-PCA and several recent methods

    Authors: Dieyi Chen, Jiashun Jin, Zheng Tracy Ke

    Abstract: Subject clustering (i.e., the use of measured features to cluster subjects, such as patients or cells, into multiple groups) is a problem of great interest. In recent years, many approaches were proposed, among which unsupervised deep learning (UDL) has received a great deal of attention. Two interesting questions are (a) how to combine the strengths of UDL and other approaches, and (b) how these… ▽ More

    Submitted 8 June, 2023; originally announced June 2023.

  19. arXiv:2304.10038  [pdf, other

    cs.LG cs.AI cs.CV

    Open-World Continual Learning: Unifying Novelty Detection and Continual Learning

    Authors: Gyuhak Kim, Changnan Xiao, Tatsuya Konishi, Zixuan Ke, Bing Liu

    Abstract: As AI agents are increasingly used in the real open world with unknowns or novelties, they need the ability to (1) recognize objects that (i) they have learned and (ii) detect items that they have not seen or learned before, and (2) learn the new items incrementally to become more and more knowledgeable and powerful. (1) is called novelty detection or out-of-distribution (OOD) detection and (2) is… ▽ More

    Submitted 19 April, 2023; originally announced April 2023.

    Comments: arXiv admin note: substantial text overlap with arXiv:2211.02633, arXiv:2208.09734

  20. arXiv:2303.13511  [pdf, other

    cs.CV cs.AI cs.LG

    Neural Preset for Color Style Transfer

    Authors: Zhanghan Ke, Yuhao Liu, Lei Zhu, Nanxuan Zhao, Rynson W. H. Lau

    Abstract: In this paper, we present a Neural Preset technique to address the limitations of existing color style transfer methods, including visual artifacts, vast memory requirement, and slow style switching speed. Our method is based on two core designs. First, we propose Deterministic Neural Color Mapping (DNCM) to consistently operate on each pixel via an image-adaptive color mapping matrix, avoiding ar… ▽ More

    Submitted 24 March, 2023; v1 submitted 23 March, 2023; originally announced March 2023.

    Comments: Project page with demos: https://zhkkke.github.io/NeuralPreset . Artifact-free real-time 4K color style transfer via AI-generated presets. CVPR 2023

  21. arXiv:2303.08810  [pdf, other

    cs.CV

    BiFormer: Vision Transformer with Bi-Level Routing Attention

    Authors: Lei Zhu, Xinjiang Wang, Zhanghan Ke, Wayne Zhang, Rynson Lau

    Abstract: As the core building block of vision transformers, attention is a powerful tool to capture long-range dependency. However, such power comes at a cost: it incurs a huge computation burden and heavy memory footprint as pairwise token interaction across all spatial locations is computed. A series of works attempt to alleviate this problem by introducing handcrafted and content-agnostic sparsity into… ▽ More

    Submitted 15 March, 2023; originally announced March 2023.

    Comments: CVPR 2023 camera-ready

  22. arXiv:2303.05024  [pdf, other

    math.ST cs.LG cs.SI stat.ML

    Phase transition for detecting a small community in a large network

    Authors: Jiashun Jin, Zheng Tracy Ke, Paxton Turner, Anru R. Zhang

    Abstract: How to detect a small community in a large network is an interesting problem, including clique detection as a special case, where a naive degree-based $χ^2$-test was shown to be powerful in the presence of an Erdős-Renyi background. Using Sinkhorn's theorem, we show that the signal captured by the $χ^2$-test may be a modeling artifact, and it may disappear once we replace the Erdős-Renyi model by… ▽ More

    Submitted 8 March, 2023; originally announced March 2023.

  23. arXiv:2302.13087  [pdf, other

    math.OC cs.LG

    Gauss-Newton Temporal Difference Learning with Nonlinear Function Approximation

    Authors: Zhifa Ke, Junyu Zhang, Zaiwen Wen

    Abstract: In this paper, a Gauss-Newton Temporal Difference (GNTD) learning method is proposed to solve the Q-learning problem with nonlinear function approximation. In each iteration, our method takes one Gauss-Newton (GN) step to optimize a variant of Mean-Squared Bellman Error (MSBE), where target networks are adopted to avoid double sampling. Inexact GN steps are analyzed so that one can safely and effi… ▽ More

    Submitted 31 March, 2024; v1 submitted 25 February, 2023; originally announced February 2023.

  24. arXiv:2302.03241  [pdf, other

    cs.CL cs.AI cs.LG cs.NE

    Continual Pre-training of Language Models

    Authors: Zixuan Ke, Yijia Shao, Haowei Lin, Tatsuya Konishi, Gyuhak Kim, Bing Liu

    Abstract: Language models (LMs) have been instrumental for the rapid advance of natural language processing. This paper studies continual pre-training of LMs, in particular, continual domain-adaptive pre-training (or continual DAP-training). Existing research has shown that further pre-training an LM using a domain corpus to adapt the LM to the domain can improve the end-task performance in the domain. This… ▽ More

    Submitted 12 April, 2023; v1 submitted 6 February, 2023; originally announced February 2023.

    Comments: https://github.com/UIC-Liu-Lab/ContinualLM

    Journal ref: ICLR 2023

  25. arXiv:2301.08986  [pdf, other

    cs.CL cs.AI cs.LG cs.NE

    Adapting a Language Model While Preserving its General Knowledge

    Authors: Zixuan Ke, Yijia Shao, Haowei Lin, Hu Xu, Lei Shu, Bing Liu

    Abstract: Domain-adaptive pre-training (or DA-training for short), also known as post-training, aims to train a pre-trained general-purpose language model (LM) using an unlabeled corpus of a particular domain to adapt the LM so that end-tasks in the domain can give improved performances. However, existing DA-training methods are in some sense blind as they do not explicitly identify what knowledge in the LM… ▽ More

    Submitted 21 January, 2023; originally announced January 2023.

    Comments: EMNLP 2022

  26. arXiv:2301.05586  [pdf, other

    cs.CV

    YOLOv6 v3.0: A Full-Scale Reloading

    Authors: Chuyi Li, Lulu Li, Yifei Geng, Hongliang Jiang, Meng Cheng, Bo Zhang, Zaidan Ke, Xiaoming Xu, Xiangxiang Chu

    Abstract: The YOLO community has been in high spirits since our first two releases! By the advent of Chinese New Year 2023, which sees the Year of the Rabbit, we refurnish YOLOv6 with numerous novel enhancements on the network architecture and the training scheme. This release is identified as YOLOv6 v3.0. For a glimpse of performance, our YOLOv6-N hits 37.5% AP on the COCO dataset at a throughput of 1187 F… ▽ More

    Submitted 13 January, 2023; originally announced January 2023.

    Comments: Tech Report. arXiv admin note: text overlap with arXiv:2209.02976

  27. arXiv:2301.03182  [pdf, other

    cs.CV

    Structure-Informed Shadow Removal Networks

    Authors: Yuhao Liu, Qing Guo, Lan Fu, Zhanghan Ke, Ke Xu, Wei Feng, Ivor W. Tsang, Rynson W. H. Lau

    Abstract: Existing deep learning-based shadow removal methods still produce images with shadow remnants. These shadow remnants typically exist in homogeneous regions with low-intensity values, making them untraceable in the existing image-to-image mapping paradigm. We observe that shadows mainly degrade images at the image-structure level (in which humans perceive object shapes and continuous colors). Hence… ▽ More

    Submitted 1 February, 2024; v1 submitted 9 January, 2023; originally announced January 2023.

    Comments: IEEE TIP

  28. arXiv:2212.07599  [pdf

    eess.IV cs.CV

    Universal Generative Modeling in Dual-domain for Dynamic MR Imaging

    Authors: Chuanming Yu, Yu Guan, Ziwen Ke, Dong Liang, Qiegen Liu

    Abstract: Dynamic magnetic resonance image reconstruction from incomplete k-space data has generated great research interest due to its capability to reduce scan time. Never-theless, the reconstruction problem is still challenging due to its ill-posed nature. Recently, diffusion models espe-cially score-based generative models have exhibited great potential in algorithm robustness and usage flexi-bility. Mo… ▽ More

    Submitted 14 December, 2022; originally announced December 2022.

    Comments: 12 pages, 11 figures

  29. arXiv:2212.00942  [pdf, other

    cs.CV

    A Geometric-Relational Deep Learning Framework for BIM Object Classification

    Authors: Hairong Luo, Ge Gao, Han Huang, Ziyi Ke, Cheng Peng, Ming Gu

    Abstract: Interoperability issue is a significant problem in Building Information Modeling (BIM). Object type, as a kind of critical semantic information needed in multiple BIM applications like scan-to-BIM and code compliance checking, also suffers when exchanging BIM data or creating models using software of other domains. It can be supplemented using deep learning. Current deep learning methods mainly le… ▽ More

    Submitted 1 December, 2022; originally announced December 2022.

    Comments: Computer Vision for Civil and Infrastructure Engineering Workshop (CVCIE @ ECCV2022)

  30. arXiv:2211.12701  [pdf, ps, other

    cs.CL cs.AI cs.LG cs.NE

    Continual Learning of Natural Language Processing Tasks: A Survey

    Authors: Zixuan Ke, Bing Liu

    Abstract: Continual learning (CL) is a learning paradigm that emulates the human capability of learning and accumulating knowledge continually without forgetting the previously learned knowledge and also transferring the learned knowledge to help learn new tasks better. This survey presents a comprehensive review and analysis of the recent progress of CL in NLP, which has significant differences from CL in… ▽ More

    Submitted 11 May, 2023; v1 submitted 22 November, 2022; originally announced November 2022.

    Comments: Preprint. Work in Progress

  31. arXiv:2211.02633  [pdf, other

    cs.LG cs.AI cs.CV

    A Theoretical Study on Solving Continual Learning

    Authors: Gyuhak Kim, Changnan Xiao, Tatsuya Konishi, Zixuan Ke, Bing Liu

    Abstract: Continual learning (CL) learns a sequence of tasks incrementally. There are two popular CL settings, class incremental learning (CIL) and task incremental learning (TIL). A major challenge of CL is catastrophic forgetting (CF). While a number of techniques are already available to effectively overcome CF for TIL, CIL remains to be highly challenging. So far, little theoretical study has been done… ▽ More

    Submitted 4 November, 2022; originally announced November 2022.

    Comments: NeurIPS 2022

  32. arXiv:2210.05549  [pdf, other

    cs.CL cs.AI cs.LG cs.NE

    Continual Training of Language Models for Few-Shot Learning

    Authors: Zixuan Ke, Haowei Lin, Yijia Shao, Hu Xu, Lei Shu, Bing Liu

    Abstract: Recent work on applying large language models (LMs) achieves impressive performance in many NLP applications. Adapting or posttraining an LM using an unlabeled domain corpus can produce even better performance for end-tasks in the domain. This paper proposes the problem of continually extending an LM by incrementally post-train the LM with a sequence of unlabeled domain corpora to expand its knowl… ▽ More

    Submitted 11 October, 2022; originally announced October 2022.

    Journal ref: EMNLP 2022

  33. arXiv:2209.02976  [pdf, other

    cs.CV

    YOLOv6: A Single-Stage Object Detection Framework for Industrial Applications

    Authors: Chuyi Li, Lulu Li, Hongliang Jiang, Kaiheng Weng, Yifei Geng, Liang Li, Zaidan Ke, Qingyuan Li, Meng Cheng, Weiqiang Nie, Yiduo Li, Bo Zhang, Yufei Liang, Linyuan Zhou, Xiaoming Xu, Xiangxiang Chu, Xiaoming Wei, Xiaolin Wei

    Abstract: For years, the YOLO series has been the de facto industry-level standard for efficient object detection. The YOLO community has prospered overwhelmingly to enrich its use in a multitude of hardware platforms and abundant scenarios. In this technical report, we strive to push its limits to the next level, stepping forward with an unwavering mindset for industry application. Considering the divers… ▽ More

    Submitted 7 September, 2022; originally announced September 2022.

    Comments: technical report

  34. arXiv:2208.09734  [pdf, other

    cs.LG cs.CV

    A Multi-Head Model for Continual Learning via Out-of-Distribution Replay

    Authors: Gyuhak Kim, Zixuan Ke, Bing Liu

    Abstract: This paper studies class incremental learning (CIL) of continual learning (CL). Many approaches have been proposed to deal with catastrophic forgetting (CF) in CIL. Most methods incrementally construct a single classifier for all classes of all tasks in a single head network. To prevent CF, a popular approach is to memorize a small number of samples from previous tasks and replay them during train… ▽ More

    Submitted 20 August, 2022; originally announced August 2022.

  35. arXiv:2207.01322  [pdf, other

    cs.CV

    Harmonizer: Learning to Perform White-Box Image and Video Harmonization

    Authors: Zhanghan Ke, Chunyi Sun, Lei Zhu, Ke Xu, Rynson W. H. Lau

    Abstract: Recent works on image harmonization solve the problem as a pixel-wise image translation task via large autoencoders. They have unsatisfactory performances and slow inference speeds when dealing with high-resolution images. In this work, we observe that adjusting the input arguments of basic image filters, e.g., brightness and contrast, is sufficient for humans to produce realistic images from the… ▽ More

    Submitted 20 July, 2022; v1 submitted 4 July, 2022; originally announced July 2022.

  36. arXiv:2206.07465  [pdf

    math.OC cs.IT physics.optics

    High-fidelity quantitative differential phase contrast deconvolution using dark-field sparse prior

    Authors: Shuhe Zhang, Tao Peng, Zeyu Ke, Meng Shao, Tos T. J. M. Berendschot, Jinhua Zhou

    Abstract: Differential phase contrast (DPC) imaging plays an important role in the family of quantitative phase measurement. However, the reconstruction algorithm for quantitative DPC (qDPC) imaging is not yet optimized, as it does not incorporate the inborn properties of qDPC imaging. In this research, we propose a simple but effective image prior, the dark-field sparse prior (DSP), to facilitate the phase… ▽ More

    Submitted 15 June, 2022; originally announced June 2022.

  37. arXiv:2204.11194  [pdf, other

    cs.DL

    Co-citation and Co-authorship Networks of Statisticians

    Authors: Pengsheng Ji, Jiashun Jin, Zheng Tracy Ke, Wanshan Li

    Abstract: We collected and cleaned a large data set on publications in statistics. The data set consists of the coauthor relationships and citation relationships of 83, 331 papers published in 36 representative journals in statistics, probability, and machine learning, spanning 41 years. The data set allows us to construct many different networks, and motivates a number of research problems about the resear… ▽ More

    Submitted 24 April, 2022; originally announced April 2022.

    Comments: 61 pages, 16 figures

  38. arXiv:2204.11097  [pdf, other

    cs.SI stat.ME

    The SCORE normalization, especially for highly heterogeneous network and text data

    Authors: Zheng Tracy Ke, Jiashun Jin

    Abstract: SCORE was introduced as a spectral approach to network community detection. Since many networks have severe degree heterogeneity, the ordinary spectral clustering (OSC) approach to community detection may perform unsatisfactorily. SCORE alleviates the effect of degree heterogeneity by introducing a new normalization idea in the spectral domain and makes OSC more effective. SCORE is easy to use and… ▽ More

    Submitted 23 April, 2022; originally announced April 2022.

    Comments: 34 pages, 5 figures, 7 tables

  39. arXiv:2204.01916  [pdf, other

    cs.LG cs.AI cs.NE

    Domain-Aware Contrastive Knowledge Transfer for Multi-domain Imbalanced Data

    Authors: Zixuan Ke, Mohammad Kachuee, Sungjin Lee

    Abstract: In many real-world machine learning applications, samples belong to a set of domains e.g., for product reviews each review belongs to a product category. In this paper, we study multi-domain imbalanced learning (MIL), the scenario that there is imbalance not only in classes but also in domains. In the MIL setting, different domains exhibit different patterns and there is a varying degree of simila… ▽ More

    Submitted 4 April, 2022; originally announced April 2022.

    Comments: ACL WASSA 2022

  40. arXiv:2112.10021  [pdf, other

    cs.CL cs.AI cs.LG cs.NE

    Continual Learning with Knowledge Transfer for Sentiment Classification

    Authors: Zixuan Ke, Bing Liu, Hao Wang, Lei Shu

    Abstract: This paper studies continual learning (CL) for sentiment classification (SC). In this setting, the CL system learns a sequence of SC tasks incrementally in a neural network, where each task builds a classifier to classify the sentiment of reviews of a particular product category or domain. Two natural questions are: Can the system transfer the knowledge learned in the past from the previous tasks… ▽ More

    Submitted 18 December, 2021; originally announced December 2021.

    Journal ref: ECML-PKDD 2020

  41. arXiv:2112.10017  [pdf, other

    cs.LG cs.AI cs.CV cs.NE

    Continual Learning of a Mixed Sequence of Similar and Dissimilar Tasks

    Authors: Zixuan Ke, Bing Liu, Xingchang Huang

    Abstract: Existing research on continual learning of a sequence of tasks focused on dealing with catastrophic forgetting, where the tasks are assumed to be dissimilar and have little shared knowledge. Some work has also been done to transfer previously learned knowledge to the new task when the tasks are similar and have shared knowledge. To the best of our knowledge, no technique has been proposed to learn… ▽ More

    Submitted 18 December, 2021; originally announced December 2021.

    Journal ref: NeurIPS 2020

  42. arXiv:2112.09891  [pdf, other

    cs.LG eess.IV

    Equilibrated Zeroth-Order Unrolled Deep Networks for Accelerated MRI

    Authors: Zhuo-Xu Cui, Jing Cheng, Qingyong Zhu, Yuanyuan Liu, Sen Jia, Kankan Zhao, Ziwen Ke, Wenqi Huang, Haifeng Wang, Yanjie Zhu, Dong Liang

    Abstract: Recently, model-driven deep learning unrolls a certain iterative algorithm of a regularization model into a cascade network by replacing the first-order information (i.e., (sub)gradient or proximal operator) of the regularizer with a network module, which appears more explainable and predictable compared to common data-driven networks. Conversely, in theory, there is not necessarily such a functio… ▽ More

    Submitted 22 December, 2021; v1 submitted 18 December, 2021; originally announced December 2021.

    Comments: 11 figures

  43. arXiv:2112.03271  [pdf, other

    cs.CL cs.AI cs.LG cs.NE

    Adapting BERT for Continual Learning of a Sequence of Aspect Sentiment Classification Tasks

    Authors: Zixuan Ke, Hu Xu, Bing Liu

    Abstract: This paper studies continual learning (CL) of a sequence of aspect sentiment classification (ASC) tasks. Although some CL techniques have been proposed for document sentiment classification, we are not aware of any CL work on ASC. A CL system that incrementally learns a sequence of ASC tasks should address the following two issues: (1) transfer knowledge learned from previous tasks to the new task… ▽ More

    Submitted 5 December, 2021; originally announced December 2021.

    Comments: arXiv admin note: text overlap with arXiv:2112.02714, arXiv:2112.02706

    Journal ref: NAACL 2021

  44. arXiv:2112.02714  [pdf, other

    cs.CL cs.AI cs.LG cs.NE

    CLASSIC: Continual and Contrastive Learning of Aspect Sentiment Classification Tasks

    Authors: Zixuan Ke, Bing Liu, Hu Xu, Lei Shu

    Abstract: This paper studies continual learning (CL) of a sequence of aspect sentiment classification(ASC) tasks in a particular CL setting called domain incremental learning (DIL). Each task is from a different domain or product. The DIL setting is particularly suited to ASC because in testing the system needs not know the task/domain to which the test data belongs. To our knowledge, this setting has not b… ▽ More

    Submitted 5 December, 2021; originally announced December 2021.

    Journal ref: EMNLP 2021

  45. arXiv:2112.02706  [pdf, other

    cs.CL cs.AI cs.LG cs.NE

    Achieving Forgetting Prevention and Knowledge Transfer in Continual Learning

    Authors: Zixuan Ke, Bing Liu, Nianzu Ma, Hu Xu, Lei Shu

    Abstract: Continual learning (CL) learns a sequence of tasks incrementally with the goal of achieving two main objectives: overcoming catastrophic forgetting (CF) and encouraging knowledge transfer (KT) across tasks. However, most existing techniques focus only on overcoming CF and have no mechanism to encourage KT, and thus do not do well in KT. Although several papers have tried to deal with both CF and K… ▽ More

    Submitted 5 December, 2021; originally announced December 2021.

    Journal ref: NeurIPS 2021

  46. arXiv:2109.11818  [pdf, other

    cs.CV

    MODNet-V: Improving Portrait Video Matting via Background Restoration

    Authors: Jiayu Sun, Zhanghan Ke, Lihe Zhang, Huchuan Lu, Rynson W. H. Lau

    Abstract: To address the challenging portrait video matting problem more precisely, existing works typically apply some matting priors that require additional user efforts to obtain, such as annotated trimaps or background images. In this work, we observe that instead of asking the user to explicitly provide a background image, we may recover it from the input video itself. To this end, we first propose a n… ▽ More

    Submitted 24 September, 2021; originally announced September 2021.

  47. arXiv:2104.05901  [pdf, other

    cs.CV eess.IV

    SRR-Net: A Super-Resolution-Involved Reconstruction Method for High Resolution MR Imaging

    Authors: Wenqi Huang, Sen Jia, Ziwen Ke, Zhuo-Xu Cui, Jing Cheng, Yanjie Zhu, Dong Liang

    Abstract: Improving the image resolution and acquisition speed of magnetic resonance imaging (MRI) is a challenging problem. There are mainly two strategies dealing with the speed-resolution trade-off: (1) $k$-space undersampling with high-resolution acquisition, and (2) a pipeline of lower resolution image reconstruction and image super-resolution. However, these approaches either have limited performance… ▽ More

    Submitted 12 April, 2021; originally announced April 2021.

  48. arXiv:2104.01102  [pdf, other

    eess.IV cs.CV

    Deep Manifold Learning for Dynamic MR Imaging

    Authors: Ziwen Ke, Zhuo-Xu Cui, Wenqi Huang, Jing Cheng, Sen Jia, Haifeng Wang, Xin Liu, Hairong Zheng, Leslie Ying, Yanjie Zhu, Dong Liang

    Abstract: Purpose: To develop a deep learning method on a nonlinear manifold to explore the temporal redundancy of dynamic signals to reconstruct cardiac MRI data from highly undersampled measurements. Methods: Cardiac MR image reconstruction is modeled as general compressed sensing (CS) based optimization on a low-rank tensor manifold. The nonlinear manifold is designed to characterize the temporal corre… ▽ More

    Submitted 8 March, 2021; originally announced April 2021.

    Comments: 17 pages, 7 figures

  49. arXiv:2011.11961  [pdf, other

    cs.CV

    MODNet: Real-Time Trimap-Free Portrait Matting via Objective Decomposition

    Authors: Zhanghan Ke, Jiayu Sun, Kaican Li, Qiong Yan, Rynson W. H. Lau

    Abstract: Existing portrait matting methods either require auxiliary inputs that are costly to obtain or involve multiple stages that are computationally expensive, making them less suitable for real-time applications. In this work, we present a light-weight matting objective decomposition network (MODNet) for portrait matting in real-time with a single input image. The key idea behind our efficient design… ▽ More

    Submitted 18 March, 2022; v1 submitted 24 November, 2020; originally announced November 2020.

  50. arXiv:2011.08295  [pdf, other

    eess.SP cs.LG

    Real-Time Radio Technology and Modulation Classification via an LSTM Auto-Encoder

    Authors: Ziqi Ke, Haris Vikalo

    Abstract: Identification of the type of communication technology and/or modulation scheme based on detected radio signal are challenging problems encountered in a variety of applications including spectrum allocation and radio interference mitigation. They are rendered difficult due to a growing number of emitter types and varied effects of real-world channels upon the radio signal. Existing spectrum monito… ▽ More

    Submitted 16 November, 2020; originally announced November 2020.