Skip to main content

Showing 1–12 of 12 results for author: Vaculin, R

  1. arXiv:2311.12290  [pdf, other

    cs.LG

    A Supervised Contrastive Learning Pretrain-Finetune Approach for Time Series

    Authors: Trang H. Tran, Lam M. Nguyen, Kyongmin Yeo, Nam Nguyen, Roman Vaculin

    Abstract: Foundation models have recently gained attention within the field of machine learning thanks to its efficiency in broad data processing. While researchers had attempted to extend this success to time series models, the main challenge is effectively extracting representations and transferring knowledge from pretraining datasets to the target finetuning dataset. To tackle this issue, we introduce a… ▽ More

    Submitted 20 November, 2023; originally announced November 2023.

  2. arXiv:2306.00778  [pdf, other

    cs.LG stat.ML

    An End-to-End Time Series Model for Simultaneous Imputation and Forecast

    Authors: Trang H. Tran, Lam M. Nguyen, Kyongmin Yeo, Nam Nguyen, Dzung Phan, Roman Vaculin, Jayant Kalagnanam

    Abstract: Time series forecasting using historical data has been an interesting and challenging topic, especially when the data is corrupted by missing values. In many industrial problem, it is important to learn the inference function between the auxiliary observations and target variables as it provides additional knowledge when the data is not fully observed. We develop an end-to-end time series model th… ▽ More

    Submitted 1 June, 2023; originally announced June 2023.

  3. arXiv:2303.12316  [pdf, other

    cs.LG

    TsSHAP: Robust model agnostic feature-based explainability for time series forecasting

    Authors: Vikas C. Raykar, Arindam Jati, Sumanta Mukherjee, Nupur Aggarwal, Kanthi Sarpatwar, Giridhar Ganapavarapu, Roman Vaculin

    Abstract: A trustworthy machine learning model should be accurate as well as explainable. Understanding why a model makes a certain decision defines the notion of explainability. While various flavors of explainability have been well-studied in supervised learning paradigms like classification and regression, literature on explainability for time series forecasting is relatively scarce. In this paper, we… ▽ More

    Submitted 22 March, 2023; originally announced March 2023.

    Comments: 11 pages, 8 figures

  4. arXiv:2207.03384  [pdf, other

    cs.CR cs.LG

    HE-PEx: Efficient Machine Learning under Homomorphic Encryption using Pruning, Permutation and Expansion

    Authors: Ehud Aharoni, Moran Baruch, Pradip Bose, Alper Buyuktosunoglu, Nir Drucker, Subhankar Pal, Tomer Pelleg, Kanthi Sarpatwar, Hayim Shaul, Omri Soceanu, Roman Vaculin

    Abstract: Privacy-preserving neural network (NN) inference solutions have recently gained significant traction with several solutions that provide different latency-bandwidth trade-offs. Of these, many rely on homomorphic encryption (HE), a method of performing computations over encrypted data. However, HE operations even with state-of-the-art schemes are still considerably slow compared to their plaintext… ▽ More

    Submitted 7 July, 2022; originally announced July 2022.

  5. arXiv:2103.03411  [pdf, other

    cs.CR cs.AI cs.LG

    Efficient Encrypted Inference on Ensembles of Decision Trees

    Authors: Kanthi Sarpatwar, Karthik Nandakumar, Nalini Ratha, James Rayfield, Karthikeyan Shanmugam, Sharath Pankanti, Roman Vaculin

    Abstract: Data privacy concerns often prevent the use of cloud-based machine learning services for sensitive personal data. While homomorphic encryption (HE) offers a potential solution by enabling computations on encrypted data, the challenge is to obtain accurate machine learning models that work within the multiplicative depth constraints of a leveled HE scheme. Existing approaches for encrypted inferenc… ▽ More

    Submitted 4 March, 2021; originally announced March 2021.

    Comments: 9 pages, 6 figures

  6. arXiv:2102.12347  [pdf, other

    cs.LG cs.AI

    AutoAI-TS: AutoAI for Time Series Forecasting

    Authors: Syed Yousaf Shah, Dhaval Patel, Long Vu, Xuan-Hong Dang, Bei Chen, Peter Kirchner, Horst Samulowitz, David Wood, Gregory Bramble, Wesley M. Gifford, Giridhar Ganapavarapu, Roman Vaculin, Petros Zerfos

    Abstract: A large number of time series forecasting models including traditional statistical models, machine learning models and more recently deep learning have been proposed in the literature. However, choosing the right model along with good parameter values that performs well on a given data is still challenging. Automatically providing a good set of models to users for a given dataset saves both time a… ▽ More

    Submitted 8 March, 2021; v1 submitted 24 February, 2021; originally announced February 2021.

    Comments: Accepted for publication at ACM SIGMOD 2021 Industry Track

  7. arXiv:1910.12832  [pdf, other

    cs.LG cs.CR cs.IT stat.ML

    Differentially Private Distributed Data Summarization under Covariate Shift

    Authors: Kanthi Sarpatwar, Karthikeyan Shanmugam, Venkata Sitaramagiridharganesh Ganapavarapu, Ashish Jagmohan, Roman Vaculin

    Abstract: We envision AI marketplaces to be platforms where consumers, with very less data for a target task, can obtain a relevant model by accessing many private data sources with vast number of data samples. One of the key challenges is to construct a training dataset that matches a target task without compromising on privacy of the data sources. To this end, we consider the following distributed data su… ▽ More

    Submitted 9 January, 2020; v1 submitted 28 October, 2019; originally announced October 2019.

    Comments: To appear in the 33rd Conference on Neural Information Processing Systems (NeurIPS 2019), Vancouver, Canada

  8. arXiv:1810.11126  [pdf, other

    cs.DC

    Promoting Distributed Trust in Machine Learning and Computational Simulation via a Blockchain Network

    Authors: Nelson Kibichii Bore, Ravi Kiran Raman, Isaac M. Markus, Sekou L. Remy, Oliver Bent, Michael Hind, Eleftheria K. Pissadaki, Biplav Srivastava, Roman Vaculin, Kush R. Varshney, Komminist Weldemariam

    Abstract: Policy decisions are increasingly dependent on the outcomes of simulations and/or machine learning models. The ability to share and interact with these outcomes is relevant across multiple fields and is especially critical in the disease modeling community where models are often only accessible and workable to the researchers that generate them. This work presents a blockchain-enabled system that… ▽ More

    Submitted 25 October, 2018; originally announced October 2018.

  9. arXiv:1809.08529  [pdf

    cs.DC

    Permissioned Blockchain Technologies for Academic Publishing

    Authors: Petr Novotny, Qi Zhang, Richard Hull, Salman Baset, Jim Laredo, Roman Vaculin, Daniel L. Ford, Donna N. Dillenberger

    Abstract: Academic publishing is continuously evolving with the gradual adoption of new technologies. Blockchain is a new technology that promises to change how individuals and organizations interact across various boundaries. The adoption of blockchains is beginning to transform diverse industries such as finance, supply chain, international trade, as well as energy and resource management and many others.… ▽ More

    Submitted 23 September, 2018; originally announced September 2018.

  10. arXiv:1809.08438  [pdf, other

    cs.DC cs.IT eess.SY stat.ML

    Trusted Multi-Party Computation and Verifiable Simulations: A Scalable Blockchain Approach

    Authors: Ravi Kiran Raman, Roman Vaculin, Michael Hind, Sekou L. Remy, Eleftheria K. Pissadaki, Nelson Kibichii Bore, Roozbeh Daneshvar, Biplav Srivastava, Kush R. Varshney

    Abstract: Large-scale computational experiments, often running over weeks and over large datasets, are used extensively in fields such as epidemiology, meteorology, computational biology, and healthcare to understand phenomena, and design high-stakes policies affecting everyday health and economy. For instance, the OpenMalaria framework is a computationally-intensive simulation used by various non-governmen… ▽ More

    Submitted 22 September, 2018; originally announced September 2018.

    Comments: 16 pages, 8 figures

  11. arXiv:1702.03584  [pdf, other

    cs.AI cs.LG

    Similarity Preserving Representation Learning for Time Series Clustering

    Authors: Qi Lei, Jinfeng Yi, Roman Vaculin, Lingfei Wu, Inderjit S. Dhillon

    Abstract: A considerable amount of clustering algorithms take instance-feature matrices as their inputs. As such, they cannot directly analyze time series data due to its temporal nature, usually unequal lengths, and complex properties. This is a great pity since many of these algorithms are effective, robust, efficient, and easy to use. In this paper, we bridge this gap by proposing an efficient representa… ▽ More

    Submitted 2 June, 2019; v1 submitted 12 February, 2017; originally announced February 2017.

  12. arXiv:1507.06667  [pdf, ps, other

    cs.IR cs.CY cs.HC cs.SI

    Alexandria: Extensible Framework for Rapid Exploration of Social Media

    Authors: Fenno F. Heath III, Richard Hull, Elham Khabiri, Matthew Riemer, Noi Sukaviriya, Roman Vaculin

    Abstract: The Alexandria system under development at IBM Research provides an extensible framework and platform for supporting a variety of big-data analytics and visualizations. The system is currently focused on enabling rapid exploration of text-based social media data. The system provides tools to help with constructing "domain models" (i.e., families of keywords and extractors to enable focus on tweets… ▽ More

    Submitted 23 July, 2015; originally announced July 2015.

    Comments: 8 pages