Skip to main content

Showing 1–50 of 260 results for author: Sugiyama, M

  1. arXiv:2406.09179  [pdf, other

    cs.LG

    Unlearning with Control: Assessing Real-world Utility for Large Language Model Unlearning

    Authors: Qizhou Wang, Bo Han, Puning Yang, Jianing Zhu, Tongliang Liu, Masashi Sugiyama

    Abstract: The compelling goal of eradicating undesirable data behaviors, while preserving usual model functioning, underscores the significance of machine unlearning within the domain of large language models (LLMs). Recent research has begun to approach LLM unlearning via gradient ascent (GA) -- increasing the prediction risk for those training strings targeted to be unlearned, thereby erasing their parame… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

  2. arXiv:2406.08288  [pdf, other

    cs.LG

    Decoupling the Class Label and the Target Concept in Machine Unlearning

    Authors: Jianing Zhu, Bo Han, Jiangchao Yao, Jianliang Xu, Gang Niu, Masashi Sugiyama

    Abstract: Machine unlearning as an emerging research topic for data regulations, aims to adjust a trained model to approximate a retrained one that excludes a portion of training data. Previous studies showed that class-wise unlearning is successful in forgetting the knowledge of a target class, through gradient ascent on the forgetting data or fine-tuning with the remaining data. However, while these metho… ▽ More

    Submitted 16 June, 2024; v1 submitted 12 June, 2024; originally announced June 2024.

  3. arXiv:2405.20494  [pdf, other

    cs.CV cs.AI cs.LG

    Slight Corruption in Pre-training Data Makes Better Diffusion Models

    Authors: Hao Chen, Yujin Han, Diganta Misra, Xiang Li, Kai Hu, Difan Zou, Masashi Sugiyama, Jindong Wang, Bhiksha Raj

    Abstract: Diffusion models (DMs) have shown remarkable capabilities in generating realistic high-quality images, audios, and videos. They benefit significantly from extensive pre-training on large-scale datasets, including web-crawled data with paired data and conditions, such as image-text and image-class pairs. Despite rigorous filtering, these pre-training datasets often inevitably contain corrupted pair… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

    Comments: 50 pages, 33 figures, 4 tables

  4. arXiv:2405.18890  [pdf, other

    cs.LG cs.DC

    Locally Estimated Global Perturbations are Better than Local Perturbations for Federated Sharpness-aware Minimization

    Authors: Ziqing Fan, Shengchao Hu, Jiangchao Yao, Gang Niu, Ya Zhang, Masashi Sugiyama, Yanfeng Wang

    Abstract: In federated learning (FL), the multi-step update and data heterogeneity among clients often lead to a loss landscape with sharper minima, degenerating the performance of the resulted global model. Prevalent federated approaches incorporate sharpness-aware minimization (SAM) into local training to mitigate this problem. However, the local loss landscapes may not accurately reflect the flatness of… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

  5. arXiv:2405.16168  [pdf, other

    cs.LG stat.ML

    Multi-Player Approaches for Dueling Bandits

    Authors: Or Raveh, Junya Honda, Masashi Sugiyama

    Abstract: Various approaches have emerged for multi-armed bandits in distributed systems. The multiplayer dueling bandit problem, common in scenarios with only preference-based information like human feedback, introduces challenges related to controlling collaborative exploration of non-informative arm pairs, but has received little attention. To fill this gap, we demonstrate that the direct use of a Follow… ▽ More

    Submitted 25 May, 2024; originally announced May 2024.

  6. arXiv:2405.14596  [pdf, other

    cs.LG

    Linear Mode Connectivity in Differentiable Tree Ensembles

    Authors: Ryuichi Kanoh, Mahito Sugiyama

    Abstract: Linear Mode Connectivity (LMC) refers to the phenomenon that performance remains consistent for linearly interpolated models in the parameter space. For independently optimized model pairs from different random initializations, achieving LMC is considered crucial for validating the stable success of the non-convex optimization in modern machine learning models and for facilitating practical parame… ▽ More

    Submitted 23 May, 2024; originally announced May 2024.

  7. arXiv:2405.14114  [pdf, other

    cs.LG cs.AI

    Offline Reinforcement Learning from Datasets with Structured Non-Stationarity

    Authors: Johannes Ackermann, Takayuki Osa, Masashi Sugiyama

    Abstract: Current Reinforcement Learning (RL) is often limited by the large amount of data needed to learn a successful policy. Offline RL aims to solve this issue by using transitions collected by a different behavior policy. We address a novel Offline RL problem setting in which, while collecting the dataset, the transition and reward functions gradually change between episodes but stay constant within ea… ▽ More

    Submitted 27 May, 2024; v1 submitted 22 May, 2024; originally announced May 2024.

    Comments: Accepted for Reinforcement Learning Conference (RLC) 2024

  8. arXiv:2405.09892  [pdf, other

    cs.LG cs.DC

    Balancing Similarity and Complementarity for Federated Learning

    Authors: Kunda Yan, Sen Cui, Abudukelimu Wuerkaixi, Jingfeng Zhang, Bo Han, Gang Niu, Masashi Sugiyama, Changshui Zhang

    Abstract: In mobile and IoT systems, Federated Learning (FL) is increasingly important for effectively using data while maintaining user privacy. One key challenge in FL is managing statistical heterogeneity, such as non-i.i.d. data, arising from numerous clients and diverse data sources. This requires strategic cooperation, often with clients having similar characteristics. However, we are interested in a… ▽ More

    Submitted 16 May, 2024; originally announced May 2024.

  9. arXiv:2404.07465  [pdf, other

    cs.LG

    Leveraging Domain-Unlabeled Data in Offline Reinforcement Learning across Two Domains

    Authors: Soichiro Nishimori, Xin-Qiang Cai, Johannes Ackermann, Masashi Sugiyama

    Abstract: In this paper, we investigate an offline reinforcement learning (RL) problem where datasets are collected from two domains. In this scenario, having datasets with domain labels facilitates efficient policy training. However, in practice, the task of assigning domain labels can be resource-intensive or infeasible at a large scale, leading to a prevalence of domain-unlabeled data. To formalize this… ▽ More

    Submitted 11 April, 2024; originally announced April 2024.

  10. arXiv:2404.06287  [pdf, other

    cs.CV cs.LG

    Counterfactual Reasoning for Multi-Label Image Classification via Patching-Based Training

    Authors: Ming-Kun Xie, Jia-Hao Xiao, Pei Peng, Gang Niu, Masashi Sugiyama, Sheng-Jun Huang

    Abstract: The key to multi-label image classification (MLC) is to improve model performance by leveraging label correlations. Unfortunately, it has been shown that overemphasizing co-occurrence relationships can cause the overfitting issue of the model, ultimately leading to performance degradation. In this paper, we provide a causal inference framework to show that the correlative features caused by the ta… ▽ More

    Submitted 12 June, 2024; v1 submitted 9 April, 2024; originally announced April 2024.

  11. arXiv:2403.10855  [pdf, other

    cs.LG cs.RO

    Reinforcement Learning with Options and State Representation

    Authors: Ayoub Ghriss, Masashi Sugiyama, Alessandro Lazaric

    Abstract: The current thesis aims to explore the reinforcement learning field and build on existing methods to produce improved ones to tackle the problem of learning in high-dimensional and complex environments. It addresses such goals by decomposing learning tasks in a hierarchical fashion known as Hierarchical Reinforcement Learning. We start in the first chapter by getting familiar with the Markov Dec… ▽ More

    Submitted 25 March, 2024; v1 submitted 16 March, 2024; originally announced March 2024.

    Comments: Master Thesis 2018, MVA ENS Paris-Saclay, Tokyo RIKEN AIP

  12. arXiv:2403.06869  [pdf, other

    cs.LG cs.AI cs.CL cs.CV

    Learning with Noisy Foundation Models

    Authors: Hao Chen, Jindong Wang, Zihan Wang, Ran Tao, Hongxin Wei, Xing Xie, Masashi Sugiyama, Bhiksha Raj

    Abstract: Foundation models are usually pre-trained on large-scale datasets and then adapted to downstream tasks through tuning. However, the large-scale pre-training datasets, often inaccessible or too expensive to handle, can contain label noise that may adversely affect the generalization of the model and pose unexpected risks. This paper stands out as the first work to comprehensively understand and ana… ▽ More

    Submitted 11 March, 2024; originally announced March 2024.

    Comments: 18 pages, 10 figures, 6 tables, preprint. arXiv admin note: substantial text overlap with arXiv:2309.17002

  13. arXiv:2402.19287  [pdf, other

    cs.LG

    StiefelGen: A Simple, Model Agnostic Approach for Time Series Data Augmentation over Riemannian Manifolds

    Authors: Prasad Cheema, Mahito Sugiyama

    Abstract: Data augmentation is an area of research which has seen active development in many machine learning fields, such as in image-based learning models, reinforcement learning for self driving vehicles, and general noise injection for point cloud data. However, convincing methods for general time series data augmentation still leaves much to be desired, especially since the methods developed for these… ▽ More

    Submitted 29 February, 2024; originally announced February 2024.

    Comments: 61 pages, 41 figures

  14. arXiv:2402.18805  [pdf, other

    cs.SI stat.ML

    VEC-SBM: Optimal Community Detection with Vectorial Edges Covariates

    Authors: Guillaume Braun, Masashi Sugiyama

    Abstract: Social networks are often associated with rich side information, such as texts and images. While numerous methods have been developed to identify communities from pairwise interactions, they usually ignore such side information. In this work, we study an extension of the Stochastic Block Model (SBM), a widely used statistical framework for community detection, that integrates vectorial edges covar… ▽ More

    Submitted 28 February, 2024; originally announced February 2024.

  15. arXiv:2402.06918  [pdf, other

    cs.LG cs.AI cs.CL

    Generating Chain-of-Thoughts with a Pairwise-Comparison Approach to Searching for the Most Promising Intermediate Thought

    Authors: Zhen-Yu Zhang, Siwei Han, Huaxiu Yao, Gang Niu, Masashi Sugiyama

    Abstract: To improve the ability of the large language model (LLMs) to tackle complex reasoning problems, chain-of-thoughts (CoT) methods were proposed to guide LLMs to reason step-by-step, enabling problem solving from simple to complex. State-of-the-art methods for generating such a chain involve interactive collaboration, where the learner generates candidate intermediate thoughts, evaluated by the LLM,… ▽ More

    Submitted 26 June, 2024; v1 submitted 10 February, 2024; originally announced February 2024.

    Comments: ICML 2024

  16. arXiv:2402.03771  [pdf, other

    cs.LG

    Reinforcement Learning from Bagged Reward

    Authors: Yuting Tang, Xin-Qiang Cai, Yao-Xiang Ding, Qiyu Wu, Guoqing Liu, Masashi Sugiyama

    Abstract: In Reinforcement Learning (RL), it is commonly assumed that an immediate reward signal is generated for each action taken by the agent, helping the agent maximize cumulative rewards to obtain the optimal policy. However, in many real-world scenarios, immediate reward signals are not obtainable; instead, agents receive a single reward that is contingent upon a partial sequence or a complete traject… ▽ More

    Submitted 27 May, 2024; v1 submitted 6 February, 2024; originally announced February 2024.

  17. arXiv:2402.01922  [pdf, other

    cs.LG cs.AI

    A General Framework for Learning from Weak Supervision

    Authors: Hao Chen, Jindong Wang, Lei Feng, Xiang Li, Yidong Wang, Xing Xie, Masashi Sugiyama, Rita Singh, Bhiksha Raj

    Abstract: Weakly supervised learning generally faces challenges in applicability to various scenarios with diverse weak supervision and in scalability due to the complexity of existing algorithms, thereby hindering the practical deployment. This paper introduces a general framework for learning from weak supervision (GLWS) with a novel algorithm. Central to GLWS is an Expectation-Maximization (EM) formulati… ▽ More

    Submitted 5 June, 2024; v1 submitted 2 February, 2024; originally announced February 2024.

    Comments: 24 pages, 20 tables, 9 figures

  18. arXiv:2401.06826  [pdf, other

    cs.LG cs.AI cs.CV

    Direct Distillation between Different Domains

    Authors: Jialiang Tang, Shuo Chen, Gang Niu, Hongyuan Zhu, Joey Tianyi Zhou, Chen Gong, Masashi Sugiyama

    Abstract: Knowledge Distillation (KD) aims to learn a compact student network using knowledge from a large pre-trained teacher network, where both networks are trained on data from the same distribution. However, in practical applications, the student network may be required to perform in a new scenario (i.e., the target domain), which usually exhibits significant differences from the known scenario of the… ▽ More

    Submitted 11 January, 2024; originally announced January 2024.

  19. arXiv:2311.15502  [pdf, other

    cs.LG

    Learning with Complementary Labels Revisited: The Selected-Completely-at-Random Setting Is More Practical

    Authors: Wei Wang, Takashi Ishida, Yu-Jie Zhang, Gang Niu, Masashi Sugiyama

    Abstract: Complementary-label learning is a weakly supervised learning problem in which each training example is associated with one or multiple complementary labels indicating the classes to which it does not belong. Existing consistent approaches have relied on the uniform distribution assumption to model the generation of complementary labels, or on an ordinary-label training set to estimate the transiti… ▽ More

    Submitted 12 May, 2024; v1 submitted 26 November, 2023; originally announced November 2023.

    Comments: ICML 2024

  20. arXiv:2310.15681  [pdf, other

    cs.LG

    Fixed-Budget Real-Valued Combinatorial Pure Exploration of Multi-Armed Bandit

    Authors: Shintaro Nakamura, Masashi Sugiyama

    Abstract: We study the real-valued combinatorial pure exploration of the multi-armed bandit in the fixed-budget setting. We first introduce the Combinatorial Successive Asign (CSA) algorithm, which is the first algorithm that can identify the best action even when the size of the action class is exponentially large with respect to the number of arms. We show that the upper bound of the probability of error… ▽ More

    Submitted 15 November, 2023; v1 submitted 24 October, 2023; originally announced October 2023.

  21. arXiv:2310.13923  [pdf, other

    cs.LG

    Diversified Outlier Exposure for Out-of-Distribution Detection via Informative Extrapolation

    Authors: Jianing Zhu, Geng Yu, Jiangchao Yao, Tongliang Liu, Gang Niu, Masashi Sugiyama, Bo Han

    Abstract: Out-of-distribution (OOD) detection is important for deploying reliable machine learning models on real-world applications. Recent advances in outlier exposure have shown promising results on OOD detection via fine-tuning model with informatively sampled auxiliary outliers. However, previous methods assume that the collected outliers can be sufficiently large and representative to cover the bounda… ▽ More

    Submitted 26 October, 2023; v1 submitted 21 October, 2023; originally announced October 2023.

    Comments: accepted by NeurIPS 2023

  22. arXiv:2310.07351  [pdf, other

    cs.LG

    Atom-Motif Contrastive Transformer for Molecular Property Prediction

    Authors: Wentao Yu, Shuo Chen, Chen Gong, Gang Niu, Masashi Sugiyama

    Abstract: Recently, Graph Transformer (GT) models have been widely used in the task of Molecular Property Prediction (MPP) due to their high reliability in characterizing the latent relationship among graph nodes (i.e., the atoms in a molecule). However, most existing GT-based methods usually explore the basic interactions between pairwise atoms, and thus they fail to consider the important interactions amo… ▽ More

    Submitted 11 October, 2023; originally announced October 2023.

    Comments: submit to AAAI-24

  23. arXiv:2310.05632  [pdf, other

    cs.LG

    Binary Classification with Confidence Difference

    Authors: Wei Wang, Lei Feng, Yuchen Jiang, Gang Niu, Min-Ling Zhang, Masashi Sugiyama

    Abstract: Recently, learning with soft labels has been shown to achieve better performance than learning with hard labels in terms of model generalization, calibration, and robustness. However, collecting pointwise labeling confidence for all training examples can be challenging and time-consuming in real-world scenarios. This paper delves into a novel weakly supervised binary classification problem called… ▽ More

    Submitted 9 October, 2023; originally announced October 2023.

    Comments: NeurIPS 2023

  24. arXiv:2310.00539  [pdf, other

    stat.ML cs.LG

    Thompson Exploration with Best Challenger Rule in Best Arm Identification

    Authors: Jongyeong Lee, Junya Honda, Masashi Sugiyama

    Abstract: This paper studies the fixed-confidence best arm identification (BAI) problem in the bandit framework in the canonical single-parameter exponential models. For this problem, many policies have been proposed, but most of them require solving an optimization problem at every round and/or are forced to explore an arm at least a certain number of times except those restricted to the Gaussian model. To… ▽ More

    Submitted 30 September, 2023; originally announced October 2023.

    Comments: TBA ACML2023, 49pages

  25. arXiv:2309.17002  [pdf, other

    cs.LG cs.AI cs.CV

    Understanding and Mitigating the Label Noise in Pre-training on Downstream Tasks

    Authors: Hao Chen, Jindong Wang, Ankit Shah, Ran Tao, Hongxin Wei, Xing Xie, Masashi Sugiyama, Bhiksha Raj

    Abstract: Pre-training on large-scale datasets and then fine-tuning on downstream tasks have become a standard practice in deep learning. However, pre-training data often contain label noise that may adversely affect the generalization of the model. This paper aims to understand the nature of noise in pre-training datasets and to mitigate its impact on downstream tasks. More specifically, through extensive… ▽ More

    Submitted 11 March, 2024; v1 submitted 29 September, 2023; originally announced September 2023.

    Comments: ICLR 2024 Spotlight

  26. arXiv:2309.08216  [pdf, other

    cs.LG

    Unified Risk Analysis for Weakly Supervised Learning

    Authors: Chao-Kai Chiang, Masashi Sugiyama

    Abstract: Among the flourishing research of weakly supervised learning (WSL), we recognize the lack of a unified interpretation of the mechanism behind the weakly supervised scenarios, let alone a systematic treatment of the risk rewrite problem, a crucial step in the empirical risk minimization approach. In this paper, we introduce a framework providing a comprehensive understanding and a unified methodolo… ▽ More

    Submitted 15 September, 2023; originally announced September 2023.

  27. arXiv:2308.10238  [pdf, other

    cs.LG stat.ML

    Thompson Sampling for Real-Valued Combinatorial Pure Exploration of Multi-Armed Bandit

    Authors: Shintaro Nakamura, Masashi Sugiyama

    Abstract: We study the real-valued combinatorial pure exploration of the multi-armed bandit (R-CPE-MAB) problem. In R-CPE-MAB, a player is given $d$ stochastic arms, and the reward of each arm $s\in\{1, \ldots, d\}$ follows an unknown distribution with mean $μ_s$. In each time step, a player pulls a single arm and observes its reward. The player's goal is to identify the optimal \emph{action}… ▽ More

    Submitted 15 November, 2023; v1 submitted 20 August, 2023; originally announced August 2023.

  28. arXiv:2308.06453  [pdf, other

    cs.LG cs.AI cs.CV

    Multi-Label Knowledge Distillation

    Authors: Penghui Yang, Ming-Kun Xie, Chen-Chen Zong, Lei Feng, Gang Niu, Masashi Sugiyama, Sheng-Jun Huang

    Abstract: Existing knowledge distillation methods typically work by imparting the knowledge of output logits or intermediate feature maps from the teacher network to the student network, which is very successful in multi-class single-label learning. However, these methods can hardly be extended to the multi-label learning scenario, where each instance is associated with multiple semantic labels, because the… ▽ More

    Submitted 11 August, 2023; originally announced August 2023.

    Comments: Accepted by ICCV 2023. The first two authors contributed equally to this work

  29. arXiv:2307.11469  [pdf, other

    cs.CV cs.AI

    Distribution Shift Matters for Knowledge Distillation with Webly Collected Images

    Authors: Jialiang Tang, Shuo Chen, Gang Niu, Masashi Sugiyama, Chen Gong

    Abstract: Knowledge distillation aims to learn a lightweight student network from a pre-trained teacher network. In practice, existing knowledge distillation methods are usually infeasible when the original training data is unavailable due to some privacy issues and data management considerations. Therefore, data-free knowledge distillation approaches proposed to collect training instances from the Internet… ▽ More

    Submitted 21 July, 2023; originally announced July 2023.

  30. arXiv:2307.06721  [pdf, other

    cs.CL cs.LG

    Why Guided Dialog Policy Learning performs well? Understanding the role of adversarial learning and its alternative

    Authors: Sho Shimoyama, Tetsuro Morimura, Kenshi Abe, Toda Takamichi, Yuta Tomomatsu, Masakazu Sugiyama, Asahi Hentona, Yuuki Azuma, Hirotaka Ninomiya

    Abstract: Dialog policies, which determine a system's action based on the current state at each dialog turn, are crucial to the success of the dialog. In recent years, reinforcement learning (RL) has emerged as a promising option for dialog policy learning (DPL). In RL-based DPL, dialog policies are updated according to rewards. The manual construction of fine-grained rewards, such as state-action-based one… ▽ More

    Submitted 13 July, 2023; originally announced July 2023.

  31. arXiv:2307.05948  [pdf, other

    cs.LG

    Diversity-enhancing Generative Network for Few-shot Hypothesis Adaptation

    Authors: Ruijiang Dong, Feng Liu, Haoang Chi, Tongliang Liu, Mingming Gong, Gang Niu, Masashi Sugiyama, Bo Han

    Abstract: Generating unlabeled data has been recently shown to help address the few-shot hypothesis adaptation (FHA) problem, where we aim to train a classifier for the target domain with a few labeled target-domain data and a well-trained source-domain classifier (i.e., a source hypothesis), for the additional information of the highly-compatible unlabeled data. However, the generated data of the existing… ▽ More

    Submitted 12 July, 2023; originally announced July 2023.

  32. arXiv:2306.09202  [pdf, other

    cs.LG

    An Optimal Algorithm for the Real-Valued Combinatorial Pure Exploration of Multi-Armed Bandit

    Authors: Shintaro Nakamura, Masashi Sugiyama

    Abstract: We study the real-valued combinatorial pure exploration problem in the stochastic multi-armed bandit (R-CPE-MAB). We study the case where the size of the action set is polynomial with respect to the number of arms. In such a case, the R-CPE-MAB can be seen as a special case of the so-called transductive linear bandits. Existing methods in the R-CPE-MAB and transductive linear bandits have a gap of… ▽ More

    Submitted 14 December, 2023; v1 submitted 15 June, 2023; originally announced June 2023.

  33. arXiv:2306.07036  [pdf, other

    cs.LG

    Making Binary Classification from Multiple Unlabeled Datasets Almost Free of Supervision

    Authors: Yuhao Wu, Xiaobo Xia, Jun Yu, Bo Han, Gang Niu, Masashi Sugiyama, Tongliang Liu

    Abstract: Training a classifier exploiting a huge amount of supervised data is expensive or even prohibited in a situation, where the labeling cost is high. The remarkable progress in working with weaker forms of supervision is binary classification from multiple unlabeled datasets which requires the knowledge of exact class priors for all unlabeled datasets. However, the availability of class priors is res… ▽ More

    Submitted 12 June, 2023; originally announced June 2023.

    Comments: 38 pages, 5 figures, 10 tables

  34. BadLabel: A Robust Perspective on Evaluating and Enhancing Label-noise Learning

    Authors: Jingfeng Zhang, Bo Song, Haohan Wang, Bo Han, Tongliang Liu, Lei Liu, Masashi Sugiyama

    Abstract: Label-noise learning (LNL) aims to increase the model's generalization given training data with noisy labels. To facilitate practical LNL algorithms, researchers have proposed different label noise types, ranging from class-conditional to instance-dependent noises. In this paper, we introduce a novel label noise type called BadLabel, which can significantly degrade the performance of existing LNL… ▽ More

    Submitted 12 February, 2024; v1 submitted 28 May, 2023; originally announced May 2023.

    Comments: IEEE T-PAMI 2024 Accept

  35. arXiv:2305.14690  [pdf, other

    cs.LG

    Generalizing Importance Weighting to A Universal Solver for Distribution Shift Problems

    Authors: Tongtong Fang, Nan Lu, Gang Niu, Masashi Sugiyama

    Abstract: Distribution shift (DS) may have two levels: the distribution itself changes, and the support (i.e., the set where the probability density is non-zero) also changes. When considering the support change between the training and test distributions, there can be four cases: (i) they exactly match; (ii) the training support is wider (and thus covers the test support); (iii) the test support is wider;… ▽ More

    Submitted 1 November, 2023; v1 submitted 23 May, 2023; originally announced May 2023.

    Comments: NeurIPS 2023 camera-ready version (this paper was selected for spotlight presentation)

  36. arXiv:2305.12715  [pdf, other

    cs.LG cs.AI cs.CV

    Imprecise Label Learning: A Unified Framework for Learning with Various Imprecise Label Configurations

    Authors: Hao Chen, Ankit Shah, Jindong Wang, Ran Tao, Yidong Wang, Xing Xie, Masashi Sugiyama, Rita Singh, Bhiksha Raj

    Abstract: Learning with reduced labeling standards, such as noisy label, partial label, and multiple label candidates, which we generically refer to as \textit{imprecise} labels, is a commonplace challenge in machine learning tasks. Previous methods tend to propose specific designs for every emerging imprecise label configuration, which is usually unsustainable when multiple configurations of imprecision co… ▽ More

    Submitted 29 September, 2023; v1 submitted 22 May, 2023; originally announced May 2023.

    Comments: 29 pages, 3 figures, 16 tables, preprint

  37. arXiv:2305.11512  [pdf, other

    cs.LG math.CT math.LO

    Enriching Disentanglement: From Logical Definitions to Quantitative Metrics

    Authors: Yivan Zhang, Masashi Sugiyama

    Abstract: Disentangling the explanatory factors in complex data is a promising approach for generalizable and data-efficient representation learning. While a variety of quantitative metrics for learning and evaluating disentangled representations have been proposed, it remains unclear what properties these metrics truly quantify. In this work, we establish a theoretical connection between logical definition… ▽ More

    Submitted 21 May, 2024; v1 submitted 19 May, 2023; originally announced May 2023.

  38. arXiv:2305.09412  [pdf, other

    cs.HC

    Analysis of Pleasantness Evoked by Various Airborne Ultrasound Tactile Stimuli Using Pairwise Comparisons and the Bradley-Terry Model

    Authors: Sora Satake, Yoshihiro Nagano, Masashi Sugiyama, Masahiro Fujiwara, Yasutoshi Makino, Hiroyuki Shinoda

    Abstract: The presentation of a moving tactile stimulus to a person's forearm evokes a pleasant sensation. The speed, intensity, and contact area of the strokes should be systematically changed to evaluate the relationship between pleasantness and tactile stimuli in more detail. Studies have examined the relationship between stroking stimulation and pleasant sensations using airborne ultrasound tactile disp… ▽ More

    Submitted 16 May, 2023; originally announced May 2023.

    Comments: 6 pages, 6 figures

  39. arXiv:2305.08344  [pdf, other

    cs.LG

    Enhancing Label Sharing Efficiency in Complementary-Label Learning with Label Augmentation

    Authors: Wei-I Lin, Gang Niu, Hsuan-Tien Lin, Masashi Sugiyama

    Abstract: Complementary-label Learning (CLL) is a form of weakly supervised learning that trains an ordinary classifier using only complementary labels, which are the classes that certain instances do not belong to. While existing CLL studies typically use novel loss functions or training techniques to solve this problem, few studies focus on how complementary labels collectively provide information to trai… ▽ More

    Submitted 15 May, 2023; originally announced May 2023.

  40. arXiv:2305.06886  [pdf, other

    cs.LG cs.CV math.CT

    A Category-theoretical Meta-analysis of Definitions of Disentanglement

    Authors: Yivan Zhang, Masashi Sugiyama

    Abstract: Disentangling the factors of variation in data is a fundamental concept in machine learning and has been studied in various ways by different researchers, leading to a multitude of definitions. Despite the numerous empirical studies, more theoretical research is needed to fully understand the defining properties of disentanglement and how different definitions relate to each other. This paper pres… ▽ More

    Submitted 29 May, 2023; v1 submitted 11 May, 2023; originally announced May 2023.

    Comments: International Conference on Machine Learning 2023

  41. arXiv:2305.02795  [pdf, other

    cs.LG

    Class-Distribution-Aware Pseudo Labeling for Semi-Supervised Multi-Label Learning

    Authors: Ming-Kun Xie, Jia-Hao Xiao, Hao-Zhe Liu, Gang Niu, Masashi Sugiyama, Sheng-Jun Huang

    Abstract: Pseudo-labeling has emerged as a popular and effective approach for utilizing unlabeled data. However, in the context of semi-supervised multi-label learning (SSMLL), conventional pseudo-labeling methods encounter difficulties when dealing with instances associated with multiple labels and an unknown label count. These limitations often result in the introduction of false positive labels or the ne… ▽ More

    Submitted 20 May, 2023; v1 submitted 4 May, 2023; originally announced May 2023.

  42. arXiv:2305.00399  [pdf, other

    cs.CR

    Assessing Vulnerabilities of Adversarial Learning Algorithm through Poisoning Attacks

    Authors: Jingfeng Zhang, Bo Song, Bo Han, Lei Liu, Gang Niu, Masashi Sugiyama

    Abstract: Adversarial training (AT) is a robust learning algorithm that can defend against adversarial attacks in the inference phase and mitigate the side effects of corrupted data in the training phase. As such, it has become an indispensable component of many artificial intelligence (AI) systems. However, in high-stake AI applications, it is crucial to understand AT's vulnerabilities to ensure reliable d… ▽ More

    Submitted 30 April, 2023; originally announced May 2023.

  43. arXiv:2305.00374  [pdf, other

    cs.LG cs.CR

    Enhancing Adversarial Contrastive Learning via Adversarial Invariant Regularization

    Authors: Xilie Xu, Jingfeng Zhang, Feng Liu, Masashi Sugiyama, Mohan Kankanhalli

    Abstract: Adversarial contrastive learning (ACL) is a technique that enhances standard contrastive learning (SCL) by incorporating adversarial data to learn a robust representation that can withstand adversarial attacks and common corruptions without requiring costly annotations. To improve transferability, the existing work introduced the standard invariant regularization (SIR) to impose style-independence… ▽ More

    Submitted 23 October, 2023; v1 submitted 29 April, 2023; originally announced May 2023.

    Comments: NeurIPS 2023

  44. arXiv:2303.12291  [pdf, other

    cs.LG

    Fairness Improves Learning from Noisily Labeled Long-Tailed Data

    Authors: Jiaheng Wei, Zhaowei Zhu, Gang Niu, Tongliang Liu, Sijia Liu, Masashi Sugiyama, Yang Liu

    Abstract: Both long-tailed and noisily labeled data frequently appear in real-world applications and impose significant challenges for learning. Most prior works treat either problem in an isolated way and do not explicitly consider the coupling effects of the two. Our empirical observation reveals that such solutions fail to consistently improve the learning when the dataset is long-tailed with label noise… ▽ More

    Submitted 21 March, 2023; originally announced March 2023.

    Comments: Paper under review

  45. arXiv:2302.14407  [pdf, other

    cs.LG math.ST stat.ML

    The Choice of Noninformative Priors for Thompson Sampling in Multiparameter Bandit Models

    Authors: Jongyeong Lee, Chao-Kai Chiang, Masashi Sugiyama

    Abstract: Thompson sampling (TS) has been known for its outstanding empirical performance supported by theoretical guarantees across various reward models in the classical stochastic multi-armed bandit problems. Nonetheless, its optimality is often restricted to specific priors due to the common observation that TS is fairly insensitive to the choice of the prior when it comes to asymptotic regret bounds. H… ▽ More

    Submitted 12 December, 2023; v1 submitted 28 February, 2023; originally announced February 2023.

    Comments: 55 pages, TBA AAAI2024

  46. arXiv:2302.03857  [pdf, other

    cs.LG cs.CR

    Efficient Adversarial Contrastive Learning via Robustness-Aware Coreset Selection

    Authors: Xilie Xu, Jingfeng Zhang, Feng Liu, Masashi Sugiyama, Mohan Kankanhalli

    Abstract: Adversarial contrastive learning (ACL) does not require expensive data annotations but outputs a robust representation that withstands adversarial attacks and also generalizes to a wide range of downstream tasks. However, ACL needs tremendous running time to generate the adversarial variants of all training data, which limits its scalability to large datasets. To speed up ACL, this paper proposes… ▽ More

    Submitted 26 October, 2023; v1 submitted 7 February, 2023; originally announced February 2023.

    Comments: NeurIPS 2023 Spotlight

  47. arXiv:2302.02907  [pdf, other

    cs.CV cs.CR cs.LG

    GAT: Guided Adversarial Training with Pareto-optimal Auxiliary Tasks

    Authors: Salah Ghamizi, Jingfeng Zhang, Maxime Cordy, Mike Papadakis, Masashi Sugiyama, Yves Le Traon

    Abstract: While leveraging additional training data is well established to improve adversarial robustness, it incurs the unavoidable cost of data collection and the heavy computation to train models. To mitigate the costs, we propose Guided Adversarial Training (GAT), a novel adversarial training technique that exploits auxiliary tasks under a limited set of training data. Our approach extends single-task m… ▽ More

    Submitted 25 May, 2023; v1 submitted 6 February, 2023; originally announced February 2023.

  48. arXiv:2302.02552  [pdf, other

    cs.LG stat.ML

    Adapting to Continuous Covariate Shift via Online Density Ratio Estimation

    Authors: Yu-Jie Zhang, Zhen-Yu Zhang, Peng Zhao, Masashi Sugiyama

    Abstract: Dealing with distribution shifts is one of the central challenges for modern machine learning. One fundamental situation is the covariate shift, where the input distributions of data change from training to testing stages while the input-conditional output distribution remains unchanged. In this paper, we initiate the study of a more challenging scenario -- continuous covariate shift -- in which t… ▽ More

    Submitted 27 May, 2024; v1 submitted 5 February, 2023; originally announced February 2023.

    Comments: NeurIPS 2023

  49. arXiv:2302.01544  [pdf, other

    cs.LG math.ST stat.ML

    Optimality of Thompson Sampling with Noninformative Priors for Pareto Bandits

    Authors: Jongyeong Lee, Junya Honda, Chao-Kai Chiang, Masashi Sugiyama

    Abstract: In the stochastic multi-armed bandit problem, a randomized probability matching policy called Thompson sampling (TS) has shown excellent performance in various reward models. In addition to the empirical performance, TS has been shown to achieve asymptotic problem-dependent lower bounds in several models. However, its optimality has been mainly addressed under light-tailed or one-parameter models… ▽ More

    Submitted 2 February, 2023; originally announced February 2023.

    Comments: 49 pages, a preprint

  50. arXiv:2302.00587  [pdf, other

    q-bio.BM cs.AI cs.LG

    Molecular Graph Generation by Decomposition and Reassembling

    Authors: Masatsugu Yamada, Mahito Sugiyama

    Abstract: Designing molecular structures with desired chemical properties is an essential task in drug discovery and material design. However, finding molecules with the optimized desired properties is still a challenging task due to combinatorial explosion of candidate space of molecules. Here we propose a novel \emph{decomposition-and-reassembling} based approach, which does not include any optimization i… ▽ More

    Submitted 11 December, 2022; originally announced February 2023.