Showing 1–16 of 16 results for author: Ruozzi, N

  1. arXiv:2405.17859  [pdf, other]

    cs.CV cs.RO

    Adapting Pre-Trained Vision Models for Novel Instance Detection and Segmentation

    Authors: Yangxiao Lu, Jishnu Jaykumar P, Yunhui Guo, Nicholas Ruozzi, Yu Xiang

    Abstract: Novel Instance Detection and Segmentation (NIDS) aims at detecting and segmenting novel object instances given a few examples of each instance. We propose a unified framework (NIDS-Net) comprising object proposal generation, embedding creation for both instance templates and proposal regions, and embedding matching for instance label assignment. Leveraging recent advancements in large vision metho…

    Submitted 28 May, 2024; originally announced May 2024.

    Comments: 22 pages, 9 figures, Code is available at: https://github.com/YoungSean/NIDS-Net

  2. arXiv:2312.14556  [pdf, other]

    cs.CV

    CaptainCook4D: A dataset for understanding errors in procedural activities

    Authors: Rohith Peddi, Shivvrat Arya, Bharath Challa, Likhitha Pallapothula, Akshay Vyas, Jikai Wang, Qifan Zhang, Vasundhara Komaragiri, Eric Ragan, Nicholas Ruozzi, Yu Xiang, Vibhav Gogate

    Abstract: Following step-by-step procedures is an essential component of various activities carried out by individuals in their daily lives. These procedures serve as a guiding framework that helps to achieve goals efficiently, whether it is assembling furniture or preparing a recipe. However, the complexity and duration of procedural activities inherently increase the likelihood of making errors. Understan…

    Submitted 22 December, 2023; originally announced December 2023.

    Comments: Accepted to the 2023 International Conference on Machine Learning (ICML) workshop on Data-centric Machine Learning Research (DMLR), Project Page: https://captaincook4d.github.io/captain-cook/

  3. arXiv:2302.03793  [pdf, other]

    cs.RO cs.CV cs.LG

    Self-Supervised Unseen Object Instance Segmentation via Long-Term Robot Interaction

    Authors: Yangxiao Lu, Ninad Khargonkar, Zesheng Xu, Charles Averill, Kamalesh Palanisamy, Kaiyu Hang, Yunhui Guo, Nicholas Ruozzi, Yu Xiang

    Abstract: We introduce a novel robotic system for improving unseen object instance segmentation in the real world by leveraging long-term robot interaction with objects. Previous approaches either grasp or push an object and then obtain the segmentation mask of the grasped or pushed object after one action. Instead, our system defers the decision on segmenting objects after a sequence of robot pushing actio…

    Submitted 7 February, 2023; originally announced February 2023.

    Comments: 11 pages, 7 figures, 5 tables

  4. arXiv:2211.11679  [pdf, other]

    cs.CV cs.AI cs.LG cs.RO

    Mean Shift Mask Transformer for Unseen Object Instance Segmentation

    Authors: Yangxiao Lu, Yuqiao Chen, Nicholas Ruozzi, Yu Xiang

    Abstract: Segmenting unseen objects from images is a critical perception skill that a robot needs to acquire. In robot manipulation, it can facilitate a robot to grasp and manipulate unseen objects. Mean shift clustering is a widely used method for image segmentation tasks. However, the traditional mean shift clustering algorithm is not differentiable, making it difficult to integrate it into an end-to-end…

    Submitted 21 September, 2023; v1 submitted 21 November, 2022; originally announced November 2022.

    Comments: Add pixel confidence maps

  5. arXiv:2110.09647  [pdf, other]

    cs.LG cs.AI

    Relational Neural Markov Random Fields

    Authors: Yuqiao Chen, Sriraam Natarajan, Nicholas Ruozzi

    Abstract: Statistical Relational Learning (SRL) models have attracted significant attention due to their ability to model complex data while handling uncertainty. However, most of these models have been limited to discrete domains due to their limited potential functions. We introduce Relational Neural Markov Random Fields (RN-MRFs) which allow for handling of complex relational hybrid domains. The key adva…

    Submitted 18 October, 2021; originally announced October 2021.

    Comments: StarAI 2021 workshop on IJCLR 2021

  6. arXiv:2005.02335  [pdf, other]

    cs.HC cs.AI cs.LG

    Don't Explain without Verifying Veracity: An Evaluation of Explainable AI with Video Activity Recognition

    Authors: Mahsan Nourani, Chiradeep Roy, Tahrima Rahman, Eric D. Ragan, Nicholas Ruozzi, Vibhav Gogate

    Abstract: Explainable machine learning and artificial intelligence models have been used to justify a model's decision-making process. This added transparency aims to help improve user performance and understanding of the underlying model. However, in practice, explainable systems face many open questions and challenges. Specifically, designers might reduce the complexity of deep learning models in order to…

    Submitted 5 May, 2020; originally announced May 2020.

    ACM Class: H.1.2

  7. arXiv:2001.02773  [pdf, other]

    cs.LG stat.ML

    Lifted Hybrid Variational Inference

    Authors: Yuqiao Chen, Yibo Yang, Sriraam Natarajan, Nicholas Ruozzi

    Abstract: A variety of lifted inference algorithms, which exploit model symmetry to reduce computational cost, have been proposed to render inference tractable in probabilistic relational models. Most existing lifted inference algorithms operate only over discrete domains or continuous domains with restricted potential functions, e.g., Gaussian. We investigate two approximate lifted variational approaches t…

    Submitted 7 February, 2020; v1 submitted 8 January, 2020; originally announced January 2020.

    Comments: AAAI 2020 Workshop on Statistical Relational AI (StarAI 2020)

  8. arXiv:1906.06419  [pdf, other]

    cs.LG stat.ML

    Learning Correlated Latent Representations with Adaptive Priors

    Authors: Da Tang, Dawen Liang, Nicholas Ruozzi, Tony Jebara

    Abstract: Variational Auto-Encoders (VAEs) have been widely applied for learning compact, low-dimensional latent representations of high-dimensional data. When the correlation structure among data points is available, previous work proposed Correlated Variational Auto-Encoders (CVAEs), which employ a structured mixture model as prior and a structured variational posterior for each mixture component to enfor…

    Submitted 18 December, 2019; v1 submitted 14 June, 2019; originally announced June 2019.

    Comments: 16 pages, 1 figure, 5 tables

  9. arXiv:1905.05335  [pdf, other]

    cs.LG stat.ML

    Correlated Variational Auto-Encoders

    Authors: Da Tang, Dawen Liang, Tony Jebara, Nicholas Ruozzi

    Abstract: Variational Auto-Encoders (VAEs) are capable of learning latent representations for high dimensional data. However, due to the i.i.d. assumption, VAEs only optimize the singleton variational distributions and fail to account for the correlations between data points, which might be crucial for learning latent representations from dataset where a priori we know correlations exist. We propose Correla…

    Submitted 17 April, 2020; v1 submitted 13 May, 2019; originally announced May 2019.

    Comments: International Conference on Machine Learning (ICML), 2019

  10. arXiv:1806.05355  [pdf, other]

    stat.ML cs.LG

    Scalable Neural Network Compression and Pruning Using Hard Clustering and L1 Regularization

    Authors: Yibo Yang, Nicholas Ruozzi, Vibhav Gogate

    Abstract: We propose a simple and easy to implement neural network compression algorithm that achieves results competitive with more complicated state-of-the-art methods. The key idea is to modify the original optimization problem by adding K independent Gaussian priors (corresponding to the k-means objective) over the network parameters to achieve parameter quantization, as well as an L1 penalty to achieve…

    Submitted 13 June, 2018; originally announced June 2018.

  11. arXiv:1503.01228  [pdf, other]

    cs.LG cs.CV stat.ML

    Bethe Learning of Conditional Random Fields via MAP Decoding

    Authors: Kui Tang, Nicholas Ruozzi, David Belanger, Tony Jebara

    Abstract: Many machine learning tasks can be formulated in terms of predicting structured outputs. In frameworks such as the structured support vector machine (SVM-Struct) and the structured perceptron, discriminative functions are learned by iteratively applying efficient maximum a posteriori (MAP) decoding. However, maximum likelihood estimation (MLE) of probabilistic models over these same structured spa…

    Submitted 4 March, 2015; originally announced March 2015.

    Comments: 19 pages (9 supplementary), 10 figures (3 supplementary)

  12. arXiv:1309.6859  [pdf]

    cs.DM math.CO

    Beyond Log-Supermodularity: Lower Bounds and the Bethe Partition Function

    Authors: Nicholas Ruozzi

    Abstract: A recent result has demonstrated that the Bethe partition function always lower bounds the true partition function of binary, log-supermodular graphical models. We demonstrate that these results can be extended to other interesting classes of graphical models that are not necessarily binary or log-supermodular: the ferromagnetic Potts model with a uniform external field and its generalizations and…

    Submitted 26 September, 2013; originally announced September 2013.

    Comments: Appears in Proceedings of the Twenty-Ninth Conference on Uncertainty in Artificial Intelligence (UAI2013)

    Report number: UAI-P-2013-PG-546-555

  13. arXiv:1212.0171  [pdf, ps, other]

    cs.IT cs.LG stat.ML

    Message-Passing Algorithms for Quadratic Minimization

    Authors: Nicholas Ruozzi, Sekhar Tatikonda

    Abstract: Gaussian belief propagation (GaBP) is an iterative algorithm for computing the mean of a multivariate Gaussian distribution, or equivalently, the minimum of a multivariate positive definite quadratic function. Sufficient conditions, such as walk-summability, that guarantee the convergence and correctness of GaBP are known, but GaBP may fail to converge to the correct solution given an arbitrary po…

    Submitted 1 December, 2012; originally announced December 2012.

    Journal ref: Journal of Machine Learning Research, 14(Aug):2287-2314, 2013

  14. arXiv:1202.6035  [pdf, ps, other]

    cs.DM math-ph math.CO

    The Bethe Partition Function of Log-supermodular Graphical Models

    Authors: Nicholas Ruozzi

    Abstract: Sudderth, Wainwright, and Willsky have conjectured that the Bethe approximation corresponding to any fixed point of the belief propagation algorithm over an attractive, pairwise binary graphical model provides a lower bound on the true partition function. In this work, we resolve this conjecture in the affirmative by demonstrating that, for any graphical model with binary variables whose potential…

    Submitted 16 April, 2012; v1 submitted 27 February, 2012; originally announced February 2012.

    Comments: Typo, bug fixes, and improved exposition

  15. Message-Passing Algorithms: Reparameterizations and Splittings

    Authors: Nicholas Ruozzi, Sekhar Tatikonda

    Abstract: The max-product algorithm, a local message-passing scheme that attempts to compute the most probable assignment (MAP) of a given probability distribution, has been successfully employed as a method of approximate inference for applications arising in coding theory, computer vision, and machine learning. However, the max-product algorithm is not guaranteed to converge to the MAP assignment, and if…

    Submitted 1 December, 2012; v1 submitted 17 February, 2010; originally announced February 2010.

    Comments: A complete rework and expansion of the previous versions

    Journal ref: Information Theory, IEEE Transactions on, vol. 59, no. 9, pp. 5860-5881, Sept. 2013

  16. Applications of Metric Coinduction

    Authors: Dexter Kozen, Nicholas Ruozzi

    Abstract: Metric coinduction is a form of coinduction that can be used to establish properties of objects constructed as a limit of finite approximations. One can prove a coinduction step showing that some property is preserved by one step of the approximation process, then automatically infer by the coinduction principle that the property holds of the limit object. This can often be used to avoid complic…

    Submitted 16 September, 2009; v1 submitted 19 August, 2009; originally announced August 2009.

    ACM Class: F.4.1; F.3.1; I.1.3; I.2.3

    Journal ref: Logical Methods in Computer Science, Volume 5, Issue 3 (September 16, 2009) lmcs:1168