Skip to main content

Showing 1–9 of 9 results for author: Zhai, K

  1. arXiv:2405.11811  [pdf, other

    cs.LG cs.DC

    FedCAda: Adaptive Client-Side Optimization for Accelerated and Stable Federated Learning

    Authors: Liuzhi Zhou, Yu He, Kun Zhai, Xiang Liu, Sen Liu, Xingjun Ma, Guangnan Ye, Yu-Gang Jiang, Hongfeng Chai

    Abstract: Federated learning (FL) has emerged as a prominent approach for collaborative training of machine learning models across distributed clients while preserving data privacy. However, the quest to balance acceleration and stability becomes a significant challenge in FL, especially on the client-side. In this paper, we introduce FedCAda, an innovative federated client adaptive algorithm designed to ta… ▽ More

    Submitted 20 May, 2024; originally announced May 2024.

  2. arXiv:2404.11888  [pdf, other

    cs.LG cs.AI

    The Dog Walking Theory: Rethinking Convergence in Federated Learning

    Authors: Kun Zhai, Yifeng Gao, Xingjun Ma, Difan Zou, Guangnan Ye, Yu-Gang Jiang

    Abstract: Federated learning (FL) is a collaborative learning paradigm that allows different clients to train one powerful global model without sharing their private data. Although FL has demonstrated promising results in various applications, it is known to suffer from convergence issues caused by the data distribution shift across different clients, especially on non-independent and identically distribute… ▽ More

    Submitted 18 April, 2024; originally announced April 2024.

  3. arXiv:2302.14581  [pdf, other

    cs.CV

    HopFIR: Hop-wise GraphFormer with Intragroup Joint Refinement for 3D Human Pose Estimation

    Authors: Kai Zhai, Qiang Nie, Bo Ouyang, Xiang Li, Shanlin Yang

    Abstract: 2D-to-3D human pose lifting is fundamental for 3D human pose estimation (HPE), for which graph convolutional networks (GCNs) have proven inherently suitable for modeling the human skeletal topology. However, the current GCN-based 3D HPE methods update the node features by aggregating their neighbors' information without considering the interaction of joints in different joint synergies. Although s… ▽ More

    Submitted 19 August, 2023; v1 submitted 28 February, 2023; originally announced February 2023.

    Comments: Accepted by ICCV 2023

  4. arXiv:2109.02396  [pdf, other

    cs.LG cs.DC

    Byzantine-Robust Federated Learning via Credibility Assessment on Non-IID Data

    Authors: Kun Zhai, Qiang Ren, Junli Wang, Chungang Yan

    Abstract: Federated learning is a novel framework that enables resource-constrained edge devices to jointly learn a model, which solves the problem of data protection and data islands. However, standard federated learning is vulnerable to Byzantine attacks, which will cause the global model to be manipulated by the attacker or fail to converge. On non-iid data, the current methods are not effective in defen… ▽ More

    Submitted 6 September, 2021; originally announced September 2021.

  5. arXiv:2101.09961  [pdf, ps, other

    cs.RO cs.LG

    Scaffolded Learning of In-place Trotting Gait for a Quadruped Robot with Bayesian Optimization

    Authors: Keyan Zhai, Chu'an Li, Andre Rosendo

    Abstract: During learning trials, systems are exposed to different failure conditions which may break robotic parts before a safe behavior is discovered. Humans contour this problem by grounding their learning to a safer structure/control first and gradually increasing its difficulty. This paper presents the impact of a similar supports in the learning of a stable gait on a quadruped robot. Based on the psy… ▽ More

    Submitted 3 April, 2021; v1 submitted 25 January, 2021; originally announced January 2021.

    Comments: 9 pages, 6 figures, 16-th International Conference on Intelligent Autonomous System (IAS-16)

  6. arXiv:2012.13846  [pdf, other

    cs.CV cs.DC

    SparsePipe: Parallel Deep Learning for 3D Point Clouds

    Authors: Keke Zhai, Pan He, Tania Banerjee, Anand Rangarajan, Sanjay Ranka

    Abstract: We propose SparsePipe, an efficient and asynchronous parallelism approach for handling 3D point clouds with multi-GPU training. SparsePipe is built to support 3D sparse data such as point clouds. It achieves this by adopting generalized convolutions with sparse tensor representation to build expressive high-dimensional convolutional neural networks. Compared to dense solutions, the new models can… ▽ More

    Submitted 26 December, 2020; originally announced December 2020.

    Comments: Accepted in 2020 IEEE 27th International Conference on High Performance Computing, Data, and Analytics (HiPC)

  7. Dynamic Load Balancing for Compressible Multiphase Turbulence

    Authors: Keke Zhai, Tania Banerjee, David Zwick, Jason Hackl, Sanjay Ranka

    Abstract: CMT-nek is a new scientific application for performing high fidelity predictive simulations of particle laden explosively dispersed turbulent flows. CMT-nek involves detailed simulations, is compute intensive and is targeted to be deployed on exascale platforms. The moving particles are the main source of load imbalance as the application is executed on parallel processors. In a demonstration prob… ▽ More

    Submitted 6 July, 2018; originally announced July 2018.

    Comments: This paper has been accepted by ACM International Conference on Supercomputing (ICS) 2018

  8. arXiv:1206.6482  [pdf

    cs.CV cs.LG stat.ML

    Modeling Images using Transformed Indian Buffet Processes

    Authors: Ke Zhai, Yuening Hu, Sinead Williamson, Jordan Boyd-Graber

    Abstract: Latent feature models are attractive for image modeling, since images generally contain multiple objects. However, many latent feature models ignore that objects can appear at different locations or require pre-segmentation of images. While the transformed Indian buffet process (tIBP) provides a method for modeling transformation-invariant features in unsegmented binary images, its current form is… ▽ More

    Submitted 27 June, 2012; originally announced June 2012.

    Comments: Appears in Proceedings of the 29th International Conference on Machine Learning (ICML 2012)

  9. arXiv:1107.3765  [pdf, other

    cs.AI cs.DC

    Using Variational Inference and MapReduce to Scale Topic Modeling

    Authors: Ke Zhai, Jordan Boyd-Graber, Nima Asadi

    Abstract: Latent Dirichlet Allocation (LDA) is a popular topic modeling technique for exploring document collections. Because of the increasing prevalence of large datasets, there is a need to improve the scalability of inference of LDA. In this paper, we propose a technique called ~\emph{MapReduce LDA} (Mr. LDA) to accommodate very large corpus collections in the MapReduce framework. In contrast to other t… ▽ More

    Submitted 19 July, 2011; originally announced July 2011.