Skip to main content

Showing 1–17 of 17 results for author: Kimura, M

  1. arXiv:2406.18806  [pdf, other

    stat.ML cs.LG

    Density Ratio Estimation via Sampling along Generalized Geodesics on Statistical Manifolds

    Authors: Masanari Kimura, Howard Bondell

    Abstract: The density ratio of two probability distributions is one of the fundamental tools in mathematical and computational statistics and machine learning, and it has a variety of known applications. Therefore, density ratio estimation from finite samples is a very important task, but it is known to be unstable when the distributions are distant from each other. One approach to address this problem is d… ▽ More

    Submitted 26 June, 2024; originally announced June 2024.

  2. arXiv:2406.16773  [pdf

    stat.AP

    Valuation methods for professional sports clubs: A historical review, a model development, and the application to Japanese football clubs

    Authors: Masaaki Kimura, Zen Walsh, Takuo Inoue, Toshiya Takahashi, Hideki Koizumi

    Abstract: In the trend towards the globalization of football and the increasing commercialization of professional football clubs, a methodology for calculating the firm value of clubs in non-western countries has yet to be established. This study reviews the valuation methods for the club firm values in Europe and North America and how values are calculated at the time of changing ownership of Japanese club… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

  3. arXiv:2405.14522  [pdf, other

    cs.LG cs.AI cs.CL cs.CV stat.ML

    Explaining Black-box Model Predictions via Two-level Nested Feature Attributions with Consistency Property

    Authors: Yuya Yoshikawa, Masanari Kimura, Ryotaro Shimizu, Yuki Saito

    Abstract: Techniques that explain the predictions of black-box machine learning models are crucial to make the models transparent, thereby increasing trust in AI systems. The input features to the models often have a nested structure that consists of high- and low-level features, and each high-level feature is decomposed into multiple low-level features. For such inputs, both high-level feature attributions… ▽ More

    Submitted 23 May, 2024; originally announced May 2024.

  4. arXiv:2405.00442  [pdf, other

    stat.ML cs.AI cs.LG

    Geometric Insights into Focal Loss: Reducing Curvature for Enhanced Model Calibration

    Authors: Masanari Kimura, Hiroki Naganuma

    Abstract: The key factor in implementing machine learning algorithms in decision-making situations is not only the accuracy of the model but also its confidence level. The confidence level of a model in a classification problem is often given by the output vector of a softmax function for convenience. However, these values are known to deviate significantly from the actual expected model confidence. This pr… ▽ More

    Submitted 1 May, 2024; originally announced May 2024.

    Comments: This paper is under consideration at Pattern Recognition Letters

  5. arXiv:2403.17410  [pdf, other

    cs.LG cs.AI stat.ML

    On permutation-invariant neural networks

    Authors: Masanari Kimura, Ryotaro Shimizu, Yuki Hirakawa, Ryosuke Goto, Yuki Saito

    Abstract: Conventional machine learning algorithms have traditionally been designed under the assumption that input data follows a vector-based format, with an emphasis on vector-centric paradigms. However, as the demand for tasks involving set-based inputs has grown, there has been a paradigm shift in the research community towards addressing these challenges. In recent years, the emergence of neural netwo… ▽ More

    Submitted 28 March, 2024; v1 submitted 26 March, 2024; originally announced March 2024.

  6. arXiv:2403.10175  [pdf, other

    cs.LG cs.AI stat.ML

    A Short Survey on Importance Weighting for Machine Learning

    Authors: Masanari Kimura, Hideitsu Hino

    Abstract: Importance weighting is a fundamental procedure in statistics and machine learning that weights the objective function or probability distribution based on the importance of the instance in some sense. The simplicity and usefulness of the idea has led to many applications of importance weighting. For example, it is known that supervised learning under an assumption about the difference between the… ▽ More

    Submitted 14 May, 2024; v1 submitted 15 March, 2024; originally announced March 2024.

  7. arXiv:2302.12991  [pdf, other

    stat.ML cs.LG

    Generalization Bounds for Set-to-Set Matching with Negative Sampling

    Authors: Masanari Kimura

    Abstract: The problem of matching two sets of multiple elements, namely set-to-set matching, has received a great deal of attention in recent years. In particular, it has been reported that good experimental results can be obtained by preparing a neural network as a matching function, especially in complex cases where, for example, each element of the set is an image. However, theoretical analysis of set-to… ▽ More

    Submitted 25 February, 2023; originally announced February 2023.

    Comments: This paper is accepted at the International Conference on Neural Information Processing (ICONIP2022)

  8. arXiv:2206.10936  [pdf, other

    stat.ML cs.IT cs.LG

    Information Geometry of Dropout Training

    Authors: Masanari Kimura, Hideitsu Hino

    Abstract: Dropout is one of the most popular regularization techniques in neural network training. Because of its power and simplicity of idea, dropout has been analyzed extensively and many variants have been proposed. In this paper, several properties of dropout are discussed in a unified manner from the viewpoint of information geometry. We showed that dropout flattens the model manifold and that their r… ▽ More

    Submitted 22 June, 2022; originally announced June 2022.

  9. arXiv:2103.17060  [pdf, other

    cs.IT math.ST stat.CO stat.ML

    $α$-Geodesical Skew Divergence

    Authors: Masanari Kimura, Hideitsu Hino

    Abstract: The asymmetric skew divergence smooths one of the distributions by mixing it, to a degree determined by the parameter $λ$, with the other distribution. Such divergence is an approximation of the KL divergence that does not require the target distribution to be absolutely continuous with respect to the source distribution. In this paper, an information geometric generalization of the skew divergenc… ▽ More

    Submitted 25 April, 2021; v1 submitted 31 March, 2021; originally announced March 2021.

    Journal ref: Entropy. 2021; 23(5):528

  10. arXiv:2101.10229  [pdf, other

    cs.LG cs.AI math.CA math.NA stat.ML

    Universal Approximation Properties for an ODENet and a ResNet: Mathematical Analysis and Numerical Experiments

    Authors: Yuto Aizawa, Masato Kimura, Kazunori Matsui

    Abstract: We prove a universal approximation property (UAP) for a class of ODENet and a class of ResNet, which are simplified mathematical models for deep learning systems with skip connections. The UAP can be stated as follows. Let $n$ and $m$ be the dimension of input and output data, and assume $m\leq n$. Then we show that ODENet of width $n+m$ with any non-polynomial continuous activation function can a… ▽ More

    Submitted 17 May, 2023; v1 submitted 22 December, 2020; originally announced January 2021.

  11. arXiv:2007.03899  [pdf, other

    cs.LG stat.ML

    Density Fixing: Simple yet Effective Regularization Method based on the Class Prior

    Authors: Masanari Kimura, Ryohei Izawa

    Abstract: Machine learning models suffer from overfitting, which is caused by a lack of labeled data. To tackle this problem, we proposed a framework of regularization methods, called density-fixing, that can be used commonly for supervised and semi-supervised learning. Our proposed regularization method improves the generalization performance by forcing the model to approximate the class's prior distributi… ▽ More

    Submitted 6 September, 2020; v1 submitted 8 July, 2020; originally announced July 2020.

  12. arXiv:2006.06231  [pdf, other

    stat.ML cs.LG

    Why Mixup Improves the Model Performance

    Authors: Masanari Kimura

    Abstract: Machine learning techniques are used in a wide range of domains. However, machine learning models often suffer from the problem of over-fitting. Many data augmentation methods have been proposed to tackle such a problem, and one of them is called mixup. Mixup is a recently proposed regularization procedure, which linearly interpolates a random pair of training examples. This regularization method… ▽ More

    Submitted 17 June, 2021; v1 submitted 11 June, 2020; originally announced June 2020.

  13. arXiv:1912.02945  [pdf

    cs.LG cs.MA cs.RO stat.ML

    A pedestrian path-planning model in accordance with obstacle's danger with reinforcement learning

    Authors: Thanh-Trung Trinh, Dinh-Minh Vu, Masaomi Kimura

    Abstract: Most microscopic pedestrian navigation models use the concept of "forces" applied to the pedestrian agents to replicate the navigation environment. While the approach could provide believable results in regular situations, it does not always resemble natural pedestrian navigation behaviour in many typical settings. In our research, we proposed a novel approach using reinforcement learning for simu… ▽ More

    Submitted 5 December, 2019; originally announced December 2019.

  14. arXiv:1909.07156  [pdf, other

    cs.LG cs.AI stat.ML

    New Perspective of Interpretability of Deep Neural Networks

    Authors: Masanari Kimura, Masayuki Tanaka

    Abstract: Deep neural networks (DNNs) are known as black-box models. In other words, it is difficult to interpret the internal state of the model. Improving the interpretability of DNNs is one of the hot research topics. However, at present, the definition of interpretability for DNNs is vague, and the question of what is a highly explanatory model is still controversial. To address this issue, we provide t… ▽ More

    Submitted 12 September, 2019; originally announced September 2019.

  15. arXiv:1906.10822  [pdf, other

    cs.LG stat.ML

    Gradient Noise Convolution (GNC): Smoothing Loss Function for Distributed Large-Batch SGD

    Authors: Kosuke Haruki, Taiji Suzuki, Yohei Hamakawa, Takeshi Toda, Ryuji Sakai, Masahiro Ozawa, Mitsuhiro Kimura

    Abstract: Large-batch stochastic gradient descent (SGD) is widely used for training in distributed deep learning because of its training-time efficiency, however, extremely large-batch SGD leads to poor generalization and easily converges to sharp minima, which prevents naive large-scale data-parallel SGD (DP-SGD) from converging to good minima. To overcome this difficulty, we propose gradient noise convolu… ▽ More

    Submitted 25 June, 2019; originally announced June 2019.

    Comments: 19 pages, 11 figures, 7 tables

  16. arXiv:1802.06368  [pdf, other

    cs.LG cs.SI stat.ML

    Node Centralities and Classification Performance for Characterizing Node Embedding Algorithms

    Authors: Kento Nozawa, Masanari Kimura, Atsunori Kanemura

    Abstract: Embedding graph nodes into a vector space can allow the use of machine learning to e.g. predict node classes, but the study of node embedding algorithms is immature compared to the natural language processing field because of a diverse nature of graphs. We examine the performance of node embedding algorithms with respect to graph centrality measures that characterize diverse graphs, through system… ▽ More

    Submitted 18 February, 2018; originally announced February 2018.

    Comments: Under review at ICLR 2018 workshop track

  17. arXiv:1112.0611  [pdf, ps, other

    stat.ML

    Information-Maximization Clustering based on Squared-Loss Mutual Information

    Authors: Masashi Sugiyama, Makoto Yamada, Manabu Kimura, Hirotaka Hachiya

    Abstract: Information-maximization clustering learns a probabilistic classifier in an unsupervised manner so that mutual information between feature vectors and cluster assignments is maximized. A notable advantage of this approach is that it only involves continuous optimization of model parameters, which is substantially easier to solve than discrete optimization of cluster assignments. However, existing… ▽ More

    Submitted 2 December, 2011; originally announced December 2011.