Showing 1–41 of 41 results for author: Hosseini, R

  1. arXiv:2405.16266  [pdf, other]

    cs.RO cs.LG eess.SY

    Deep Reinforcement Learning with Enhanced PPO for Safe Mobile Robot Navigation

    Authors: Hamid Taheri, Seyed Rasoul Hosseini

    Abstract: Collision-free motion is essential for mobile robots. Most approaches to collision-free and efficient navigation with wheeled robots require parameter tuning by experts to obtain good navigation behavior. This study investigates the application of deep reinforcement learning to train a mobile robot for autonomous navigation in a complex environment. The robot utilizes LiDAR sensor data and a deep…

    Submitted 25 May, 2024; originally announced May 2024.

  2. arXiv:2404.02348  [pdf, other]

    eess.IV cs.CV

    COVID-19 Detection Based on Blood Test Parameters using Various Artificial Intelligence Methods

    Authors: Kavian Khanjani, Seyed Rasoul Hosseini, Hamid Taheri, Shahrzad Shashaani, Mohammad Teshnehlab

    Abstract: In 2019, the world faced a new challenge: the COVID-19 disease caused by the novel coronavirus, SARS-CoV-2. The virus rapidly spread across the globe, leading to a high rate of mortality, which prompted health organizations to take measures to control its transmission. Early disease detection is crucial in the treatment process, and computer-based automatic detection systems have been developed to a…

    Submitted 28 May, 2024; v1 submitted 2 April, 2024; originally announced April 2024.

  3. arXiv:2403.19782  [pdf, other]

    cs.CV

    ENet-21: An Optimized light CNN Structure for Lane Detection

    Authors: Seyed Rasoul Hosseini, Mohammad Teshnehlab

    Abstract: Lane detection for autonomous vehicles is an important concept, yet it is a challenging issue of driver assistance systems in modern vehicles. The emergence of deep learning has led to significant progress in self-driving cars. Conventional deep learning-based methods handle lane detection problems as a binary segmentation task and determine whether a pixel belongs to a line. These methods rely on t…

    Submitted 28 March, 2024; originally announced March 2024.

    Comments: The paper is under review by Soft Computing journal

  4. arXiv:2402.18128  [pdf, other]

    cs.CV cs.LG

    Downstream Task Guided Masking Learning in Masked Autoencoders Using Multi-Level Optimization

    Authors: Han Guo, Ramtin Hosseini, Ruiyi Zhang, Sai Ashish Somayajula, Ranak Roy Chowdhury, Rajesh K. Gupta, Pengtao Xie

    Abstract: Masked Autoencoder (MAE) is a notable method for self-supervised pretraining in visual representation learning. It operates by randomly masking image patches and reconstructing these masked patches using the unmasked ones. A key limitation of MAE lies in its disregard for the varying informativeness of different patches, as it uniformly selects patches to mask. To overcome this, some approaches pr…

    Submitted 28 February, 2024; originally announced February 2024.

  5. arXiv:2308.13792  [pdf, other]

    cs.LG cs.CV

    Out-of-distribution detection using normalizing flows on the data manifold

    Authors: Seyedeh Fatemeh Razavi, Mohammad Mahdi Mehmanchi, Reshad Hosseini, Mostafa Tavassolipour

    Abstract: A common approach for out-of-distribution detection involves estimating an underlying data distribution, which assigns a lower likelihood value to out-of-distribution data. Normalizing flows are likelihood-based generative models providing a tractable density estimation via dimension-preserving invertible transformations. Conventional normalizing flows are prone to fail in out-of-distribution dete…

    Submitted 26 August, 2023; originally announced August 2023.

  6. arXiv:2304.00486  [pdf, other]

    cs.LG cs.CY

    Learning by Grouping: A Multilevel Optimization Framework for Improving Fairness in Classification without Losing Accuracy

    Authors: Ramtin Hosseini, Li Zhang, Bhanu Garg, Pengtao Xie

    Abstract: The integration of machine learning models in various real-world applications is becoming more prevalent to assist humans in their daily decision-making tasks as a result of recent advancements in this field. However, it has been discovered that there is a tradeoff between the accuracy and fairness of these decision-making tasks. In some cases, these AI systems can be unfair by exhibiting bias or…

    Submitted 2 April, 2023; originally announced April 2023.

  7. arXiv:2303.03212  [pdf, other]

    cs.CV eess.IV

    Combination of Single and Multi-frame Image Super-resolution: An Analytical Perspective

    Authors: Mohammad Mahdi Afrasiabi, Reshad Hosseini, Aliazam Abbasfar

    Abstract: Super-resolution is the process of obtaining a high-resolution image from one or more low-resolution images. Single image super-resolution (SISR) and multi-frame super-resolution (MFSR) methods have evolved almost independently for years. A neglected study in this field is the theoretical analysis of finding the optimum combination of SISR and MFSR. To fill this gap, we propose a novel theore…

    Submitted 6 March, 2023; originally announced March 2023.

  8. arXiv:2212.05581  [pdf, ps, other]

    cs.LG

    Efficient Relation-aware Neighborhood Aggregation in Graph Neural Networks via Tensor Decomposition

    Authors: Peyman Baghershahi, Reshad Hosseini, Hadi Moradi

    Abstract: Many Graph Neural Networks (GNNs) have been proposed for Knowledge Graph Embedding (KGE). However, many of these methods neglect the importance of the information of relations and combine it with the information of entities inefficiently, leading to low expressiveness. To address this issue, we introduce a general knowledge graph encoder incorporating tensor decomposition in the aggregation function of…

    Submitted 29 May, 2023; v1 submitted 11 December, 2022; originally announced December 2022.

    Comments: 13 pages, 5 Tables, 2 Figures

  9. arXiv:2212.05402  [pdf, ps, other]

    cs.LG

    Stochastic First-Order Learning for Large-Scale Flexibly Tied Gaussian Mixture Model

    Authors: Mohammad Pasande, Reshad Hosseini, Babak Nadjar Araabi

    Abstract: Gaussian Mixture Models (GMMs) are one of the most potent parametric density models used extensively in many applications. Flexibly-tied factorization of the covariance matrices in GMMs is a powerful approach for coping with the challenges of common GMMs when faced with high-dimensional data and complex densities which often demand a large number of Gaussian components. However, the expectation-ma…

    Submitted 11 November, 2023; v1 submitted 10 December, 2022; originally announced December 2022.

  10. arXiv:2211.02720  [pdf, other]

    cs.LG

    Deep Surrogate Docking: Accelerating Automated Drug Discovery with Graph Neural Networks

    Authors: Ryien Hosseini, Filippo Simini, Austin Clyde, Arvind Ramanathan

    Abstract: The process of screening molecules for desirable properties is a key step in several applications, ranging from drug discovery to material design. During the process of drug discovery specifically, protein-ligand docking, or chemical docking, is a standard in-silico scoring technique that estimates the binding affinity of molecules with a specific protein target. Recently, however, as the number o…

    Submitted 4 November, 2022; originally announced November 2022.

    Comments: Published as workshop paper at NeurIPS 2022 (AI for Science)

  11. arXiv:2208.03659  [pdf, other]

    cs.CV

    Fast Online and Relational Tracking

    Authors: Mohammad Hossein Nasseri, Mohammadreza Babaee, Hadi Moradi, Reshad Hosseini

    Abstract: To overcome challenges in the multiple object tracking task, recent algorithms use interaction cues alongside motion and appearance features. These algorithms use graph neural networks or transformers to extract interaction features that lead to high computation costs. In this paper, a novel interaction cue based on geometric features is presented aiming to detect occlusion and re-identify lost target…

    Submitted 7 August, 2022; originally announced August 2022.

  12. arXiv:2207.09955  [pdf, other]

    cs.LG cs.AI cs.AR cs.PF

    Operation-Level Performance Benchmarking of Graph Neural Networks for Scientific Applications

    Authors: Ryien Hosseini, Filippo Simini, Venkatram Vishwanath

    Abstract: As Graph Neural Networks (GNNs) increase in popularity for scientific machine learning, their training and inference efficiency is becoming increasingly critical. Additionally, the deep learning field as a whole is trending towards wider and deeper networks, and ever increasing data sizes, to the point where hard hardware bottlenecks are often encountered. Emerging specialty hardware platforms pro…

    Submitted 20 July, 2022; originally announced July 2022.

    Comments: Published as workshop paper at MLSys 2022 (MLBench)

  13. Greykite: Deploying Flexible Forecasting at Scale at LinkedIn

    Authors: Reza Hosseini, Albert Chen, Kaixu Yang, Sayan Patra, Yi Su, Saad Eddin Al Orjany, Sishi Tang, Parvez Ahammad

    Abstract: Forecasts help businesses allocate resources and achieve objectives. At LinkedIn, product owners use forecasts to set business targets, track outlook, and monitor health. Engineers use forecasts to efficiently provision hardware. Developing a forecasting solution to meet these needs requires accurate and interpretable forecasts on diverse time series with sub-hourly to quarterly frequencies. We pr…

    Submitted 15 July, 2022; originally announced July 2022.

    Comments: In Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD '22), August 14-18, 2022, Washington, DC, USA. ACM, New York, NY, USA, 11 pages

    ACM Class: G.3

  14. arXiv:2206.03293  [pdf, other]

    cs.LG stat.ML

    Joint Manifold Learning and Density Estimation Using Normalizing Flows

    Authors: Seyedeh Fatemeh Razavi, Mohammad Mahdi Mehmanchi, Reshad Hosseini, Mostafa Tavassolipour

    Abstract: Based on the manifold hypothesis, real-world data often lie on a low-dimensional manifold, while normalizing flows as a likelihood-based generative model are incapable of finding this manifold due to their structural constraints. So, one interesting question arises: $\textit{"Can we find sub-manifold(s) of data in normalizing flows and estimate the density of the data on the sub-manifold(s)?"}$. I…

    Submitted 7 June, 2022; originally announced June 2022.

  15. arXiv:2206.01002  [pdf, other]

    cs.LG cs.CV

    Introducing One Sided Margin Loss for Solving Classification Problems in Deep Networks

    Authors: Ali Karimi, Zahra Mousavi Kouzehkanan, Reshad Hosseini, Hadi Asheri

    Abstract: This paper introduces a new loss function, OSM (One-Sided Margin), to solve maximum-margin classification problems effectively. Unlike the hinge loss, in OSM the margin is explicitly determined with corresponding hyperparameters and then the classification problem is solved. In experiments, we observe that using OSM loss leads to faster training speeds and better accuracies than binary and categor…

    Submitted 2 June, 2022; originally announced June 2022.

  16. arXiv:2204.04823  [pdf, other]

    cs.RO

    ACuTE: Automatic Curriculum Transfer from Simple to Complex Environments

    Authors: Yash Shukla, Christopher Thierauf, Ramtin Hosseini, Gyan Tatiya, Jivko Sinapov

    Abstract: Despite recent advances in Reinforcement Learning (RL), many problems, especially real-world tasks, remain prohibitively expensive to learn. To address this issue, several lines of research have explored how tasks, or data samples themselves, can be sequenced into a curriculum to learn a problem that may otherwise be too difficult to learn from scratch. However, generating and optimizing a curricu…

    Submitted 10 April, 2022; originally announced April 2022.

  17. arXiv:2112.10644  [pdf, other]

    cs.LG cs.AI

    Self-attention Presents Low-dimensional Knowledge Graph Embeddings for Link Prediction

    Authors: Peyman Baghershahi, Reshad Hosseini, Hadi Moradi

    Abstract: A few models have tried to tackle the link prediction problem, also known as knowledge graph completion, by embedding knowledge graphs in comparably lower dimensions. However, the state-of-the-art results are attained at the cost of considerably increasing the dimensionality of embeddings which causes scalability issues in the case of huge knowledge bases. Transformers have been successfully used…

    Submitted 26 November, 2022; v1 submitted 20 December, 2021; originally announced December 2021.

    Comments: 14 pages, 3 figures, 6 tables

  18. arXiv:2111.06353  [pdf, other]

    cs.LG cs.AI

    Learning from Mistakes -- A Framework for Neural Architecture Search

    Authors: Bhanu Garg, Li Zhang, Pradyumna Sridhara, Ramtin Hosseini, Eric Xing, Pengtao Xie

    Abstract: Learning from one's mistakes is an effective human learning technique where the learners focus more on the topics where mistakes were made, so as to deepen their understanding. In this paper, we investigate if this human learning strategy can be applied in machine learning. We propose a novel machine learning method called Learning From Mistakes (LFM), wherein the learner improves its ability to l…

    Submitted 13 January, 2022; v1 submitted 11 November, 2021; originally announced November 2021.

  19. arXiv:2111.03470  [pdf, other]

    eess.AS cs.CL cs.LG

    ParsiNorm: A Persian Toolkit for Speech Processing Normalization

    Authors: Romina Oji, Seyedeh Fatemeh Razavi, Sajjad Abdi Dehsorkh, Alireza Hariri, Hadi Asheri, Reshad Hosseini

    Abstract: In general, speech processing models consist of a language model along with an acoustic model. Regardless of the language model's complexity and variants, three critical pre-processing steps are needed in language models: cleaning, normalization, and tokenization. Among the mentioned steps, the normalization step is essential to format unification in pure textual applications. However, for embedded…

    Submitted 15 December, 2021; v1 submitted 1 November, 2021; originally announced November 2021.

  20. arXiv:2109.07561  [pdf, other]

    cs.RO cs.CV

    A Framework for Multisensory Foresight for Embodied Agents

    Authors: Xiaohui Chen, Ramtin Hosseini, Karen Panetta, Jivko Sinapov

    Abstract: Predicting future sensory states is crucial for learning agents such as robots, drones, and autonomous vehicles. In this paper, we couple multiple sensory modalities with exploratory actions and propose a predictive neural network architecture to address this problem. Most existing approaches rely on large, manually annotated datasets, or only use visual data as a single modality. In contrast, the…

    Submitted 15 September, 2021; originally announced September 2021.

    Comments: ICRA 2021

  21. arXiv:2108.12876  [pdf, other]

    cs.CV

    Solving Viewing Graph Optimization for Simultaneous Position and Rotation Registration

    Authors: Seyed-Mahdi Nasiri, Reshad Hosseini, Hadi Moradi

    Abstract: A viewing graph is a set of unknown camera poses, as the vertices, and the observed relative motions, as the edges. Solving the viewing graph is an essential step in a Structure-from-Motion procedure, where a set of relative motions is obtained from a collection of 2D images. Almost all methods in the literature solve for the rotations separately, through a rotation averaging process, and use them f…

    Submitted 29 August, 2021; originally announced August 2021.

  22. arXiv:2108.11019  [pdf, other]

    math.OC cs.LG math.NA stat.ML

    Vector Transport Free Riemannian LBFGS for Optimization on Symmetric Positive Definite Matrix Manifolds

    Authors: Reza Godaz, Benyamin Ghojogh, Reshad Hosseini, Reza Monsefi, Fakhri Karray, Mark Crowley

    Abstract: This work concentrates on optimization on Riemannian manifolds. The Limited-memory Broyden-Fletcher-Goldfarb-Shanno (LBFGS) algorithm is a commonly used quasi-Newton method for numerical optimization in Euclidean spaces. Riemannian LBFGS (RLBFGS) is an extension of this method to Riemannian manifolds. RLBFGS involves computationally expensive vector transports as well as unfolding recursions using…

    Submitted 3 October, 2021; v1 submitted 24 August, 2021; originally announced August 2021.

    Comments: Published in the 13th Asian Conference on Machine Learning (ACML) 2021. The first two authors contributed equally to this work

    Report number: https://proceedings.mlr.press/v157/godaz21a.html

    Journal ref: Proceedings of The 13th Asian Conference on Machine Learning, PMLR, vol. 157, pp. 1-16, 2021

  23. arXiv:2107.04618  [pdf, other]

    cs.CV

    Optimal Triangulation Method is Not Really Optimal

    Authors: Seyed-Mahdi Nasiri, Reshad Hosseini, Hadi Moradi

    Abstract: Triangulation refers to the problem of finding a 3D point from its 2D projections on multiple camera images. For solving this problem, it is common practice to use the so-called optimal triangulation method, which we call the L2 method in this paper. However, the method can be optimal only if we assume no uncertainty in the camera parameters. Through extensive comparison on synthetic and real data, we…

    Submitted 9 July, 2021; originally announced July 2021.

    Comments: 9 pages, 13 figures

  24. arXiv:2104.10785  [pdf, other]

    stat.ML cs.CV cs.LG

    Accurate and fast matrix factorization for low-rank learning

    Authors: Reza Godaz, Reza Monsefi, Faezeh Toutounian, Reshad Hosseini

    Abstract: In this paper, we tackle two important problems in low-rank learning, which are partial singular value decomposition and numerical rank estimation of huge matrices. By using the concepts of Krylov subspaces such as Golub-Kahan bidiagonalization (GK-bidiagonalization) as well as Ritz vectors, we propose two methods for solving these problems in a fast and accurate way. Our experiments show the adva…

    Submitted 4 September, 2021; v1 submitted 21 April, 2021; originally announced April 2021.

  25. arXiv:2103.06869  [pdf]

    cs.LG

    Learning with partially separable data

    Authors: Aida Khozaei, Hadi Moradi, Reshad Hosseini

    Abstract: There are partially separable data types that make classification tasks very hard. In other words, only parts of the data are informative, meaning that looking at the rest of the data would not give any distinguishable hint for classification. In this situation, the typical assumption of having the whole labeled data as an informative unit set for classification does not work. Consequently, typical…

    Submitted 11 March, 2021; originally announced March 2021.

  26. arXiv:2103.04147  [pdf, other]

    cs.CV

    Simple online and real-time tracking with occlusion handling

    Authors: Mohammad Hossein Nasseri, Hadi Moradi, Reshad Hosseini, Mohammadreza Babaee

    Abstract: Multiple object tracking is a challenging problem in computer vision due to difficulty in dealing with motion prediction, occlusion handling, and object re-identification. Many recent algorithms use motion and appearance cues to overcome these challenges. But using appearance cues increases the computation cost notably and therefore the speed of the algorithm decreases significantly which makes th…

    Submitted 6 March, 2021; originally announced March 2021.

  27. arXiv:2012.12899  [pdf, other]

    cs.LG cs.AI cs.CV

    Learning by Self-Explanation, with Application to Neural Architecture Search

    Authors: Ramtin Hosseini, Pengtao Xie

    Abstract: Learning by self-explanation is an effective learning technique in human learning, where students explain a learned topic to themselves for deepening their understanding of this topic. It is interesting to investigate whether this explanation-driven learning methodology broadly used by humans is helpful for improving machine learning as well. Based on this inspiration, we propose a novel machine l…

    Submitted 10 March, 2021; v1 submitted 23 December, 2020; originally announced December 2020.

  28. arXiv:2012.07091  [pdf, other]

    cs.LG cs.IT

    Reinforcement Learning with Subspaces using Free Energy Paradigm

    Authors: Milad Ghorbani, Reshad Hosseini, Seyed Pooya Shariatpanahi, Majid Nili Ahmadabadi

    Abstract: In large-scale problems, standard reinforcement learning algorithms suffer from slow learning speed. In this paper, we follow the framework of using subspaces to tackle this problem. We propose a free-energy minimization framework for selecting the subspaces and integrate the policy of the state-space into the subspaces. Our proposed free-energy minimization framework rests upon Thompson sampling…

    Submitted 13 December, 2020; originally announced December 2020.

    Comments: 12 pages, preprint

    ACM Class: I.2.11; H.1.1

  29. arXiv:2012.06122  [pdf, other]

    cs.LG cs.CR cs.CV

    DSRNA: Differentiable Search of Robust Neural Architectures

    Authors: Ramtin Hosseini, Xingyi Yang, Pengtao Xie

    Abstract: In deep learning applications, the architectures of deep neural networks are crucial in achieving high accuracy. Many methods have been proposed to search for high-performance neural architectures automatically. However, these searched architectures are prone to adversarial attacks. A small perturbation of the input data can cause the architecture to change prediction outcomes significantly. To a…

    Submitted 10 December, 2020; originally announced December 2020.

    Comments: 10 pages

  30. arXiv:2010.14561  [pdf, other]

    cs.CV

    Contour Integration using Graph-Cut and Non-Classical Receptive Field

    Authors: Zahra Mousavi Kouzehkanan, Reshad Hosseini, Babak Nadjar Araabi

    Abstract: Many edge and contour detection algorithms give a soft-value as an output and the final binary map is commonly obtained by applying an optimal threshold. In this paper, we propose a novel method to detect image contours from the extracted edge segments of other algorithms. Our method is based on an undirected graphical model with the edge segments set as the vertices. The proposed energy functions…

    Submitted 10 May, 2021; v1 submitted 27 October, 2020; originally announced October 2020.

  31. arXiv:2008.02144  [pdf, other]

    cs.LG stat.ML

    FRMDN: Flow-based Recurrent Mixture Density Network

    Authors: Seyedeh Fatemeh Razavi, Reshad Hosseini, Tina Behzad

    Abstract: The class of recurrent mixture density networks is an important class of probabilistic models used extensively in sequence modeling and sequence-to-sequence mapping applications. In this class of models, the density of a target sequence in each time-step is modeled by a Gaussian mixture model with the parameters given by a recurrent neural network. In this paper, we generalize recurrent mixture de…

    Submitted 20 April, 2023; v1 submitted 5 August, 2020; originally announced August 2020.

  32. arXiv:1912.05753  [pdf]

    q-bio.QM cs.LG stat.ML

    Pathway-Activity Likelihood Analysis and Metabolite Annotation for Untargeted Metabolomics using Probabilistic Modeling

    Authors: Ramtin Hosseini, Neda Hassanpour, Li-Ping Liu, Soha Hassoun

    Abstract: Motivation: Untargeted metabolomics comprehensively characterizes small molecules and elucidates activities of biochemical pathways within a biological sample. Despite computational advances, interpreting collected measurements and determining their biological role remains a challenge. Results: To interpret measurements, we present an inference-based approach, termed Probabilistic modeling for Unt…

    Submitted 9 March, 2020; v1 submitted 11 December, 2019; originally announced December 2019.

    Comments: For more details, please visit my homepage at: https://www.eecs.tufts.edu/~ramtin/

  33. arXiv:1903.06255  [pdf]

    cs.CV cs.LG stat.ML

    Active Transfer Learning for Persian Offline Signature Verification

    Authors: Taraneh Younesian, Saeed Masoudnia, Reshad Hosseini, Babak N. Araabi

    Abstract: Offline Signature Verification (OSV) remains a challenging pattern recognition task, especially in the presence of skilled forgeries that are not available during training. This challenge is aggravated when only a small amount of labeled training data is available but with large intra-personal variations. In this study, we address this issue by employing an active learning approach, which selects the mo…

    Submitted 28 February, 2019; originally announced March 2019.

    Journal ref: 2019 4th International Conference on Pattern Recognition and Image Analysis (IPRIA)

  34. arXiv:1812.03190  [pdf, other]

    cs.LG stat.ML

    Deep-RBF Networks Revisited: Robust Classification with Rejection

    Authors: Pourya Habib Zadeh, Reshad Hosseini, Suvrit Sra

    Abstract: One of the main drawbacks of deep neural networks, like many other classifiers, is their vulnerability to adversarial attacks. An important reason for their vulnerability is assigning high confidence to regions with few or even no feature points. By feature points, we mean a nonlinear transformation of the input space extracting a meaningful representation of the input data. On the other hand, dee…

    Submitted 7 December, 2018; originally announced December 2018.

  35. arXiv:1806.00281  [pdf, other]

    cs.RO

    A Recursive Least Square Method for 3D Pose Graph Optimization Problem

    Authors: S. M. Nasiri, Reshad Hosseini, Hadi Moradi

    Abstract: Pose Graph Optimization (PGO) is an important non-convex optimization problem and is the state-of-the-art formulation for SLAM in robotics. It also has applications like camera motion estimation, structure from motion and 3D reconstruction in machine vision. Recent research has shown the importance of good initialization to bootstrap well-known iterative PGO solvers to converge to good solution…

    Submitted 1 June, 2018; originally announced June 2018.

  36. arXiv:1710.08012  [pdf, other]

    stat.ML cs.AI cs.LG

    Exploiting generalization in the subspaces for faster model-based learning

    Authors: Maryam Hashemzadeh, Reshad Hosseini, Majid Nili Ahmadabadi

    Abstract: Due to insufficient generalization in the state-space, common methods in Reinforcement Learning (RL) suffer from slow learning speed, especially in the early learning trials. This paper introduces a model-based method in discrete state-spaces for increasing learning speed in terms of required experience (but not required computational time) by exploiting generalization in the experiences of t…

    Submitted 25 October, 2017; v1 submitted 22 October, 2017; originally announced October 2017.

  37. arXiv:1706.03267  [pdf, other]

    stat.ML cs.LG

    An Alternative to EM for Gaussian Mixture Models: Batch and Stochastic Riemannian Optimization

    Authors: Reshad Hosseini, Suvrit Sra

    Abstract: We consider maximum likelihood estimation for Gaussian Mixture Models (GMMs). This task is almost invariably solved (in theory and practice) via the Expectation Maximization (EM) algorithm. EM owes its success to various factors, of which its ability to fulfill positive definiteness constraints in closed form is of key importance. We propose an alternative to EM by appealing to the rich Riemann…

    Submitted 10 June, 2017; originally announced June 2017.

    Comments: 21 pages, 6 figures

  38. arXiv:1607.05002  [pdf, ps, other]

    stat.ML cs.LG

    Geometric Mean Metric Learning

    Authors: Pourya Habib Zadeh, Reshad Hosseini, Suvrit Sra

    Abstract: We revisit the task of learning a Euclidean metric from data. We approach this problem from first principles and formulate it as a surprisingly simple optimization problem. Indeed, our formulation even admits a closed form solution. This solution possesses several very attractive properties: (i) an innate geometric appeal through the Riemannian geometry of positive definite matrices; (ii) ease of…

    Submitted 18 July, 2016; originally announced July 2016.

    Comments: 7 pages, 4 figures

  39. arXiv:1507.06065  [pdf, ps, other]

    stat.ML cs.LG

    MixEst: An Estimation Toolbox for Mixture Models

    Authors: Reshad Hosseini, Mohamadreza Mash'al

    Abstract: Mixture models are powerful statistical models used in many applications ranging from density estimation to clustering and classification. When dealing with mixture models, there are many issues that the experimenter should be aware of and needs to solve. The MixEst toolbox is a powerful and user-friendly package for MATLAB that implements several state-of-the-art approaches to address these probl…

    Submitted 22 July, 2015; originally announced July 2015.

    Comments: 5 pages

  40. arXiv:1506.07677  [pdf, other]

    stat.ML cs.LG math.OC

    Manifold Optimization for Gaussian Mixture Models

    Authors: Reshad Hosseini, Suvrit Sra

    Abstract: We take a new look at parameter estimation for Gaussian Mixture Models (GMMs). In particular, we propose using \emph{Riemannian manifold optimization} as a powerful counterpart to Expectation Maximization (EM). An out-of-the-box invocation of manifold optimization, however, fails spectacularly: it converges to the same solution but vastly slower. Driven by intuition from manifold convexity, we the…

    Submitted 25 June, 2015; originally announced June 2015.

    Comments: 19 pages

  41. arXiv:1101.0255  [pdf, ps, other]

    math.ST cs.LG

    Conditional information and definition of neighbor in categorical random fields

    Authors: Reza Hosseini

    Abstract: We show that the definition of neighbor in Markov random fields as defined by Besag (1974), when the joint distribution of the sites is not positive, is not well-defined. In a random field with a finite number of sites we study the conditions under which giving the value at extra sites will change the belief of an agent about one site. Also the conditions under which the information from some sites is…

    Submitted 31 December, 2010; originally announced January 2011.