Showing 1–45 of 45 results for author: Bae, D

  1. arXiv:2406.06947  [pdf, other]

    cs.AI cs.HC

    CAAP: Context-Aware Action Planning Prompting to Solve Computer Tasks with Front-End UI Only

    Authors: Junhee Cho, Jihoon Kim, Daseul Bae, Jinho Choo, Youngjune Gwon, Yeong-Dae Kwon

    Abstract: Software robots have long been deployed in Robotic Process Automation (RPA) to automate mundane and repetitive computer tasks. The advent of Large Language Models (LLMs) with advanced reasoning capabilities has set the stage for these agents to now undertake more complex and even previously unseen tasks. However, the LLM-based automation techniques in recent literature frequently rely on HTML sour…

    Submitted 11 June, 2024; originally announced June 2024.

    Comments: 10 pages, 5 figures; (19 pages and 6 figures more in appendix)

  2. arXiv:2403.03230  [pdf, other]

    q-bio.NC cs.AI

    Large language models surpass human experts in predicting neuroscience results

    Authors: Xiaoliang Luo, Akilles Rechardt, Guangzhi Sun, Kevin K. Nejad, Felipe Yáñez, Bati Yilmaz, Kangjoo Lee, Alexandra O. Cohen, Valentina Borghesani, Anton Pashkov, Daniele Marinazzo, Jonathan Nicholas, Alessandro Salatiello, Ilia Sucholutsky, Pasquale Minervini, Sepehr Razavi, Roberta Rocca, Elkhan Yusifov, Tereza Okalova, Nianlong Gu, Martin Ferianc, Mikail Khona, Kaustubh R. Patil, Pui-Shee Lee, Rui Mata, et al. (14 additional authors not shown)

    Abstract: Scientific discoveries often hinge on synthesizing decades of research, a task that potentially outstrips human information processing capacities. Large language models (LLMs) offer a solution. LLMs trained on the vast scientific literature could potentially integrate noisy yet interrelated findings to forecast novel results better than human experts. To evaluate this possibility, we created Brain…

    Submitted 21 June, 2024; v1 submitted 4 March, 2024; originally announced March 2024.

  3. arXiv:2402.12984  [pdf, other]

    cs.CL cs.AI

    Can GNN be Good Adapter for LLMs?

    Authors: Xuanwen Huang, Kaiqiao Han, Yang Yang, Dezheng Bao, Quanjin Tao, Ziwei Chai, Qi Zhu

    Abstract: Recently, large language models (LLMs) have demonstrated superior capabilities in understanding and zero-shot learning on textual data, promising significant advances for many text-related domains. In the graph domain, various real-world scenarios also involve textual data, where tasks and node features can be described by text. These text-attributed graphs (TAGs) have broad applications in social…

    Submitted 20 February, 2024; originally announced February 2024.

    Comments: Accepted by WWW'24

  4. arXiv:2402.10213  [pdf, other]

    q-bio.NC cs.AI cs.LG

    Clustering Inductive Biases with Unrolled Networks

    Authors: Jonathan Huml, Abiy Tasissa, Demba Ba

    Abstract: The classical sparse coding (SC) model represents visual stimuli as a linear combination of a handful of learned basis functions that are Gabor-like when trained on natural image data. However, the Gabor-like filters learned by classical sparse coding far overpredict well-tuned simple cell receptive field profiles observed empirically. While neurons fire sparsely, neuronal populations are also org…

    Submitted 29 November, 2023; originally announced February 2024.

  5. arXiv:2311.00447  [pdf, other]

    cs.AI

    On the Opportunities of Green Computing: A Survey

    Authors: You Zhou, Xiujing Lin, Xiang Zhang, Maolin Wang, Gangwei Jiang, Huakang Lu, Yupeng Wu, Kai Zhang, Zhe Yang, Kehang Wang, Yongduo Sui, Fengwei Jia, Zuoli Tang, Yao Zhao, Hongxuan Zhang, Tiannuo Yang, Weibo Chen, Yunong Mao, Yi Li, De Bao, Yu Li, Hongrui Liao, Ting Liu, Jingwen Liu, Jinchi Guo, et al. (16 additional authors not shown)

    Abstract: Artificial Intelligence (AI) has achieved significant advancements in technology and research over several decades of development, and is widely used in many areas including computer vision, natural language processing, time-series analysis, speech synthesis, etc. During the age of deep learning, especially with the rise of Large Language Models, a large majority of researchers' attention…

    Submitted 8 November, 2023; v1 submitted 1 November, 2023; originally announced November 2023.

    Comments: 113 pages, 18 figures

  6. arXiv:2310.00420  [pdf, other]

    eess.SP cs.LG stat.ML

    An Efficient Algorithm for Clustered Multi-Task Compressive Sensing

    Authors: Alexander Lin, Demba Ba

    Abstract: This paper considers clustered multi-task compressive sensing, a hierarchical model that solves multiple compressive sensing tasks by finding clusters of tasks that leverage shared information to mutually improve signal reconstruction. The existing inference algorithm for this model is computationally expensive and does not scale well in high dimensions. The main bottleneck involves repeated matri…

    Submitted 30 September, 2023; originally announced October 2023.

  7. arXiv:2309.02848  [pdf, other]

    cs.SI

    Prompt-based Node Feature Extractor for Few-shot Learning on Text-Attributed Graphs

    Authors: Xuanwen Huang, Kaiqiao Han, Dezheng Bao, Quanjin Tao, Zhisheng Zhang, Yang Yang, Qi Zhu

    Abstract: Text-attributed Graphs (TAGs) are commonly found in the real world, such as social networks and citation networks, and consist of nodes represented by textual descriptions. Currently, mainstream machine learning methods on TAGs involve a two-stage modeling approach: (1) unsupervised node feature extraction with pre-trained language models (PLMs); and (2) supervised learning using Graph Neural Netw…

    Submitted 6 September, 2023; originally announced September 2023.

    Comments: Under review

  8. arXiv:2306.03249  [pdf, other]

    cs.LG eess.SP stat.CO

    Probabilistic Unrolling: Scalable, Inverse-Free Maximum Likelihood Estimation for Latent Gaussian Models

    Authors: Alexander Lin, Bahareh Tolooshams, Yves Atchadé, Demba Ba

    Abstract: Latent Gaussian models have a rich history in statistics and machine learning, with applications ranging from factor analysis to compressed sensing to time series analysis. The classical method for maximizing the likelihood of these models is the expectation-maximization (EM) algorithm. For problems with high-dimensional latent variables and large datasets, EM scales poorly because it needs to inv…

    Submitted 5 June, 2023; originally announced June 2023.

    Comments: 29 pages, 4 figures

    Journal ref: International Conference on Machine Learning, 2023

  9. arXiv:2305.18552  [pdf, other]

    cs.LG cs.NE

    Learning Linear Groups in Neural Networks

    Authors: Emmanouil Theodosis, Karim Helwani, Demba Ba

    Abstract: Employing equivariance in neural networks leads to greater parameter efficiency and improved generalization performance through the encoding of domain knowledge in the architecture; however, the majority of existing approaches require an a priori specification of the desired symmetries. We present a neural network architecture, Linear Group Networks (LGNs), for learning linear groups acting on the…

    Submitted 29 May, 2023; originally announced May 2023.

  10. arXiv:2302.11162  [pdf, other]

    cs.AI cs.LG

    Sparse, Geometric Autoencoder Models of V1

    Authors: Jonathan Huml, Abiy Tasissa, Demba Ba

    Abstract: The classical sparse coding model represents visual stimuli as a linear combination of a handful of learned basis functions that are Gabor-like when trained on natural image data. However, the Gabor-like filters learned by classical sparse coding far overpredict well-tuned simple cell receptive field (SCRF) profiles. A number of subsequent models have either discarded the sparse dictionary learnin…

    Submitted 22 February, 2023; originally announced February 2023.

    Comments: Symmetry and Geometry in Neural Representations (NeurIPS) 2022

  11. arXiv:2211.09238  [pdf, other]

    cs.LG

    Learning unfolded networks with a cyclic group structure

    Authors: Emmanouil Theodosis, Demba Ba

    Abstract: Deep neural networks lack straightforward ways to incorporate domain knowledge and are notoriously considered black boxes. Prior works attempted to inject domain knowledge into architectures implicitly through data augmentation. Building on recent advances on equivariant neural networks, we propose networks that explicitly encode domain knowledge, specifically equivariance with respect to rotation…

    Submitted 16 November, 2022; originally announced November 2022.

    Comments: Accepted as an extended abstract in NeurIPS Workshop on Symmetry and Geometry in Neural Representations

  12. Unrolled Compressed Blind-Deconvolution

    Authors: Bahareh Tolooshams, Satish Mulleti, Demba Ba, Yonina C. Eldar

    Abstract: The problem of sparse multichannel blind deconvolution (S-MBD) arises frequently in many engineering applications such as radar/sonar/ultrasound imaging. To reduce its computational and implementation cost, we propose a compression method that enables blind recovery from much fewer measurements with respect to the full received signal in time. The proposed compression measures the signal through a…

    Submitted 18 May, 2023; v1 submitted 28 September, 2022; originally announced September 2022.

    Comments: Accepted to IEEE TSP

  13. arXiv:2205.08290  [pdf, other]

    cs.SE

    Literature Review to Collect Conceptual Variables of Scenario Methods for Establishing a Conceptual Scenario Framework

    Authors: Young-Min Baek, Esther Cho, Donghwan Shin, Doo-Hwan Bae

    Abstract: Over recent decades, scenarios and scenario-based software/system engineering have been actively employed as essential tools to handle intricate problems, validate requirements, and support stakeholders' communication. However, despite the widespread use of scenarios, there have been several challenges for engineers to more willingly utilize scenario-based engineering approaches (i.e., scenario me…

    Submitted 17 May, 2022; originally announced May 2022.

    Comments: 22 pages, 7 figures

    MSC Class: 68M99 ACM Class: D.2.1

  14. arXiv:2204.06799  [pdf, other]

    cs.SE

    Environment Imitation: Data-Driven Environment Model Generation Using Imitation Learning for Efficient CPS Goal Verification

    Authors: Yong-Jun Shin, Donghwan Shin, Doo-Hwan Bae

    Abstract: Cyber-Physical Systems (CPS) continuously interact with their physical environments through software controllers that observe the environments and determine actions. Engineers can verify to what extent the CPS under analysis can achieve given goals by analyzing its Field Operational Test (FOT) logs. However, it is challenging to repeat many FOTs to obtain statistically significant results due to i…

    Submitted 14 April, 2022; originally announced April 2022.

  15. BABD: A Bitcoin Address Behavior Dataset for Pattern Analysis

    Authors: Yuexin Xiang, Yuchen Lei, Ding Bao, Wei Ren, Tiantian Li, Qingqing Yang, Wenmao Liu, Tianqing Zhu, Kim-Kwang Raymond Choo

    Abstract: Cryptocurrencies are no longer just the preferred option for cybercriminal activities on darknets, due to the increasing adoption in mainstream applications. This is partly due to the transparency associated with the underpinning ledgers, where any individual can access the record of a transaction on the public ledger. In this paper, we build a dataset comprising Bitcoin transactions betwee…

    Submitted 5 May, 2022; v1 submitted 10 April, 2022; originally announced April 2022.

    Comments: 14 pages, 4 figures

    MSC Class: 68-11 ACM Class: H.2.8

    Journal ref: in IEEE Transactions on Information Forensics and Security, vol. 19, pp. 2171-2185, 2024

  16. arXiv:2202.12808  [pdf, other]

    eess.SP cs.LG stat.CO stat.ML

    High-Dimensional Sparse Bayesian Learning without Covariance Matrices

    Authors: Alexander Lin, Andrew H. Song, Berkin Bilgic, Demba Ba

    Abstract: Sparse Bayesian learning (SBL) is a powerful framework for tackling the sparse coding problem. However, the most popular inference algorithms for SBL become too expensive for high-dimensional settings, due to the need to store and compute a large covariance matrix. We introduce a new inference scheme that avoids explicit construction of the covariance matrix by solving multiple linear systems in p…

    Submitted 25 February, 2022; originally announced February 2022.

    Comments: 5 pages

    Journal ref: IEEE ICASSP 2022

  17. arXiv:2110.04683  [pdf, other]

    cs.LG eess.SP

    Mixture Model Auto-Encoders: Deep Clustering through Dictionary Learning

    Authors: Alexander Lin, Andrew H. Song, Demba Ba

    Abstract: State-of-the-art approaches for clustering high-dimensional data utilize deep auto-encoder architectures. Many of these networks require a large number of parameters and suffer from a lack of interpretability, due to the black-box nature of the auto-encoders. We introduce Mixture Model Auto-Encoders (MixMate), a novel architecture that clusters data by performing inference on a generative model. D…

    Submitted 25 February, 2022; v1 submitted 9 October, 2021; originally announced October 2021.

    Comments: 5 pages, 3 figures

    Journal ref: IEEE ICASSP 2022

  18. arXiv:2109.11066  [pdf, other]

    cs.CV cs.LG

    A two-step machine learning approach for crop disease detection: an application of GAN and UAV technology

    Authors: Aaditya Prasad, Nikhil Mehta, Matthew Horak, Wan D. Bae

    Abstract: Automated plant diagnosis is a technology that promises large increases in cost-efficiency for agriculture. However, multiple problems reduce the effectiveness of drones, including the inverse relationship between resolution and speed and the lack of adequate labeled training data. This paper presents a two-step machine learning approach that analyzes low-fidelity and high-fidelity images in seque…

    Submitted 18 September, 2021; originally announced September 2021.

    Comments: 13 pages, 5 figures. Preprint of an article submitted for consideration in the International Journal on Artificial Intelligence Tools, 2021, World Scientific Publishing Company, https://www.worldscientific.com/worldscinet/ijait

    ACM Class: I.2.6; I.2.10

  19. arXiv:2106.00058  [pdf, other]

    cs.LG eess.SP stat.ML

    Stable and Interpretable Unrolled Dictionary Learning

    Authors: Bahareh Tolooshams, Demba Ba

    Abstract: The dictionary learning problem, representing data as a combination of a few atoms, has long stood as a popular method for learning representations in statistics and signal processing. The most popular dictionary learning algorithm alternates between sparse coding and dictionary update steps, and a rich literature has studied its theoretical convergence. The success of dictionary learning relies o…

    Submitted 2 August, 2022; v1 submitted 31 May, 2021; originally announced June 2021.

    Comments: Published in Transactions on Machine Learning Research (TMLR) (08/2022)

  20. arXiv:2105.10439  [pdf, other]

    eess.SP cs.LG stat.ML

    Covariance-Free Sparse Bayesian Learning

    Authors: Alexander Lin, Andrew H. Song, Berkin Bilgic, Demba Ba

    Abstract: Sparse Bayesian learning (SBL) is a powerful framework for tackling the sparse coding problem while also providing uncertainty quantification. The most popular inference algorithms for SBL exhibit prohibitively large computational costs for high-dimensional problems due to the need to maintain a large covariance matrix. To resolve this issue, we introduce a new method for accelerating SBL inferenc…

    Submitted 8 April, 2022; v1 submitted 21 May, 2021; originally announced May 2021.

    Comments: 13 pages

  21. arXiv:2104.13894  [pdf, other]

    eess.SP cs.IT cs.LG math.OC

    Weighed $\ell_1$ on the simplex: Compressive sensing meets locality

    Authors: Abiy Tasissa, Pranay Tankala, Demba Ba

    Abstract: Sparse manifold learning algorithms combine techniques in manifold learning and sparse optimization to learn features that could be utilized for downstream tasks. The standard setting of compressive sensing cannot be immediately applied to this setup. Due to the intrinsic geometric structure of data, dictionary atoms might be redundant and do not satisfy the restricted isometry property or cohere…

    Submitted 28 April, 2021; originally announced April 2021.

    Comments: arXiv admin note: text overlap with arXiv:2012.02134

  22. arXiv:2104.00530  [pdf, other]

    cs.LG stat.AP stat.ML

    Gaussian Process Convolutional Dictionary Learning

    Authors: Andrew H. Song, Bahareh Tolooshams, Demba Ba

    Abstract: Convolutional dictionary learning (CDL), the problem of estimating shift-invariant templates from data, is typically conducted in the absence of a prior/structure on the templates. In data-scarce or low signal-to-noise ratio (SNR) regimes, learned templates overfit the data and lack smoothness, which can affect the predictive performance of downstream tasks. To address this limitation, we propose…

    Submitted 24 November, 2021; v1 submitted 28 March, 2021; originally announced April 2021.

    Comments: IEEE Signal Processing Letters (2021)

  23. arXiv:2102.07003  [pdf, other]

    cs.LG

    On the convergence of group-sparse autoencoders

    Authors: Emmanouil Theodosis, Bahareh Tolooshams, Pranay Tankala, Abiy Tasissa, Demba Ba

    Abstract: Recent approaches in the theoretical analysis of model-based deep learning architectures have studied the convergence of gradient descent in shallow ReLU networks that arise from generative models whose hidden layers are sparse. Motivated by the success of architectures that impose structured forms of sparsity, we introduce and study a group-sparse autoencoder that accounts for a variety of genera…

    Submitted 21 January, 2022; v1 submitted 13 February, 2021; originally announced February 2021.

  24. arXiv:2012.02134  [pdf, other]

    cs.LG cs.IT eess.SP math.OC

    K-Deep Simplex: Deep Manifold Learning via Local Dictionaries

    Authors: Pranay Tankala, Abiy Tasissa, James M. Murphy, Demba Ba

    Abstract: We propose K-Deep Simplex (KDS) which, given a set of data points, learns a dictionary comprising synthetic landmarks, along with representation coefficients supported on a simplex. KDS integrates manifold learning and sparse coding/dictionary learning: reconstruction term, as in classical dictionary learning, and a novel local weighted $\ell_1$ penalty that encourages each data point to represent…

    Submitted 14 January, 2023; v1 submitted 3 December, 2020; originally announced December 2020.

    Comments: 14 pages, 8 figures

  25. arXiv:2010.11391  [pdf, ps, other]

    eess.SP cs.LG

    Unfolding Neural Networks for Compressive Multichannel Blind Deconvolution

    Authors: Bahareh Tolooshams, Satish Mulleti, Demba Ba, Yonina C. Eldar

    Abstract: We propose a learned-structured unfolding neural network for the problem of compressive sparse multichannel blind-deconvolution. In this problem, each channel's measurements are given as convolution of a common source signal and sparse filter. Unlike prior works where the compression is achieved either through random projections or by applying a fixed structured compression matrix, this paper prop…

    Submitted 11 February, 2021; v1 submitted 21 October, 2020; originally announced October 2020.

    Comments: Accepted to 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2021)

  26. arXiv:2006.09534  [pdf, other]

    cs.IT cs.LG eess.SP

    Towards improving discriminative reconstruction via simultaneous dense and sparse coding

    Authors: Abiy Tasissa, Emmanouil Theodosis, Bahareh Tolooshams, Demba Ba

    Abstract: Discriminative features extracted from the sparse coding model have been shown to perform well for classification. Recent deep learning architectures have further improved reconstruction in inverse problems by considering new dense priors learned from data. We propose a novel dense and sparse coding model that integrates both representation capability and discriminative features. The model studies…

    Submitted 13 December, 2022; v1 submitted 16 June, 2020; originally announced June 2020.

    Comments: 24 pages

  27. arXiv:2005.02325  [pdf]

    cs.CL

    Digraph of Senegal's local languages: issues, challenges and prospects of their transliteration

    Authors: Elhadji Mamadou Nguer, Diop Sokhna Bao, Yacoub Ahmed Fall, Mouhamadou Khoule

    Abstract: The local languages in Senegal, like those of West African countries in general, are written based on two alphabets: supplemented Arabic alphabet (called Ajami) and Latin alphabet. Each writing has its own applications. Ajami writing is generally used by people educated in Koranic schools for communication, business, literature (religious texts, poetry, etc.), traditional religious medicine, etc.…

    Submitted 5 May, 2020; originally announced May 2020.

    Journal ref: LTC 2015

  28. arXiv:1910.14627  [pdf, other]

    cs.NE

    An Automatic Design Framework of Swarm Pattern Formation based on Multi-objective Genetic Programming

    Authors: Zhun Fan, Zhaojun Wang, Xiaomin Zhu, Bingliang Hu, Anmin Zou, Dongwei Bao

    Abstract: Most existing swarm pattern formation methods depend on a predefined gene regulatory network (GRN) structure that requires designers' prior knowledge, which is difficult to adapt to complex and changeable environments. To dynamically adapt to the complex and changeable environments, we propose an automatic design framework of swarm pattern formation based on multi-objective genetic programming. T…

    Submitted 1 November, 2019; v1 submitted 31 October, 2019; originally announced October 2019.

  29. arXiv:1910.12727  [pdf]

    cs.LG cs.CV

    Layer Pruning for Accelerating Very Deep Neural Networks

    Authors: Weiwei Zhang, Changsheng Chen, Xuechun Wu, Jialin Gao, Di Bao, Jiwei Li, Xi Zhou

    Abstract: In this paper, we propose an adaptive pruning method that cuts channels and layers adaptively; the proportion of layers and channels to cut is learned. The proposed method can remove half of the parameters while keeping accuracy at, or even above, the baseline.

    Submitted 28 October, 2019; originally announced October 2019.

    Comments: v2

  30. arXiv:1908.09258  [pdf, other]

    cs.LG stat.ML

    RandNet: deep learning with compressed measurements of images

    Authors: Thomas Chang, Bahareh Tolooshams, Demba Ba

    Abstract: Principal component analysis, dictionary learning, and auto-encoders are all unsupervised methods for learning representations from a large amount of training data. In all these methods, the higher the dimensions of the input data, the longer it takes to learn. We introduce a class of neural networks, termed RandNet, for learning representations using compressed random measurements of data of inte…

    Submitted 25 August, 2019; originally announced August 2019.

    Comments: The first two authors contributed equally to this work

  31. arXiv:1907.09881  [pdf, other]

    cs.LG stat.ML

    Convolutional Dictionary Learning in Hierarchical Networks

    Authors: Javier Zazo, Bahareh Tolooshams, Demba Ba

    Abstract: Filter banks are a popular tool for the analysis of piecewise smooth signals such as natural images. Motivated by the empirically observed properties of scale and detail coefficients of images in the wavelet domain, we propose a hierarchical deep generative model of piecewise smooth signals that is a recursion across scales: the low pass scale coefficients at one layer are obtained by filtering th…

    Submitted 23 July, 2019; originally announced July 2019.

  32. Fast Convolutional Dictionary Learning off the Grid

    Authors: Andrew H. Song, Francisco J. Flores, Demba Ba

    Abstract: Given a continuous-time signal that can be modeled as the superposition of localized, time-shifted events from multiple sources, the goal of Convolutional Dictionary Learning (CDL) is to identify the location of the events--by Convolutional Sparse Coding (CSC)--and learn the template for each source--by Convolutional Dictionary Update (CDU). In practice, because we observe samples of the continuou…

    Submitted 21 July, 2019; originally announced July 2019.

    Journal ref: IEEE Transactions on Signal Processing 2020

  33. arXiv:1907.03211  [pdf, other]

    cs.LG stat.AP stat.ML

    Convolutional dictionary learning based auto-encoders for natural exponential-family distributions

    Authors: Bahareh Tolooshams, Andrew H. Song, Simona Temereanca, Demba Ba

    Abstract: We introduce a class of auto-encoder neural networks tailored to data from the natural exponential family (e.g., count data). The architectures are inspired by the problem of learning the filters in a convolutional generative model with sparsity constraints, often referred to as convolutional dictionary learning (CDL). Our work is the first to combine ideas from convolutional generative models and…

    Submitted 28 June, 2020; v1 submitted 6 July, 2019; originally announced July 2019.

    Journal ref: International Conference on Machine Learning (ICML) 2020

  34. Deep Residual Autoencoders for Expectation Maximization-inspired Dictionary Learning

    Authors: Bahareh Tolooshams, Sourav Dey, Demba Ba

    Abstract: We introduce a neural-network architecture, termed the constrained recurrent sparse autoencoder (CRsAE), that solves convolutional dictionary learning problems, thus establishing a link between dictionary learning and neural networks. Specifically, we leverage the interpretation of the alternating-minimization algorithm for dictionary learning as an approximate Expectation-Maximization algorithm t…

    Submitted 18 October, 2020; v1 submitted 18 April, 2019; originally announced April 2019.

    Journal ref: in IEEE Transactions on Neural Networks and Learning Systems, pp. 1-15, 2020

  35. arXiv:1810.09920  [pdf, other]

    stat.ML cs.LG q-bio.NC stat.CO

    Clustering Time Series with Nonlinear Dynamics: A Bayesian Non-Parametric and Particle-Based Approach

    Authors: Alexander Lin, Yingzhuo Zhang, Jeremy Heng, Stephen A. Allsop, Kay M. Tye, Pierre E. Jacob, Demba Ba

    Abstract: We propose a general statistical framework for clustering multiple time series that exhibit nonlinear dynamics into an a-priori-unknown number of sub-groups. Our motivation comes from neuroscience, where an important problem is to identify, within a large assembly of neurons, subsets that respond similarly to a stimulus or contingency. Upon modeling the multiple time series as the output of a Diri…

    Submitted 4 March, 2019; v1 submitted 23 October, 2018; originally announced October 2018.

    Journal ref: International Conference on Artificial Intelligence and Statistics (AISTATS 2019)

  36. arXiv:1810.02906  [pdf, other]

    stat.ML cs.LG

    Network Distance Based on Laplacian Flows on Graphs

    Authors: Dianbin Bao, Kisung You, Lizhen Lin

    Abstract: Distance plays a fundamental role in measuring similarity between objects. Various visualization techniques and learning tasks in statistics and machine learning such as shape matching, classification, dimension reduction and clustering often rely on some distance or similarity measure. It is of tremendous importance to have a distance that can incorporate the underlying structure of the object. I…

    Submitted 5 October, 2018; originally announced October 2018.

  37. arXiv:1807.04734  [pdf, other]

    cs.LG stat.ML

    Scalable Convolutional Dictionary Learning with Constrained Recurrent Sparse Auto-encoders

    Authors: Bahareh Tolooshams, Sourav Dey, Demba Ba

    Abstract: Given a convolutional dictionary underlying a set of observed signals, can a carefully designed auto-encoder recover the dictionary in the presence of noise? We introduce an auto-encoder architecture, termed constrained recurrent sparse auto-encoder (CRsAE), that answers this question in the affirmative. Given an input signal and an approximate dictionary, the encoder finds a sparse approximation…

    Submitted 12 July, 2018; originally announced July 2018.

  38. arXiv:1807.01958  [pdf, other]

    eess.SP cs.LG

    Deeply-Sparse Signal rePresentations ($\text{D}\text{S}^2\text{P}$)

    Authors: Demba Ba

    Abstract: A recent line of work shows that a deep neural network with ReLU nonlinearities arises from a finite sequence of cascaded sparse coding models, the outputs of which, except for the last element in the cascade, are sparse and unobservable. That is, intermediate outputs deep in the cascade are sparse, hence the title of this manuscript. We show here, using techniques from the dictionary learning lit…

    Submitted 24 April, 2020; v1 submitted 5 July, 2018; originally announced July 2018.

  39. arXiv:1805.07300  [pdf, other]

    stat.ML cs.LG eess.SP stat.AP

    Multitaper Spectral Estimation HDP-HMMs for EEG Sleep Inference

    Authors: Leon Chlon, Andrew Song, Sandya Subramanian, Hugo Soulat, John Tauber, Demba Ba, Michael Prerau

    Abstract: Electroencephalographic (EEG) monitoring of neural activity is widely used for sleep disorder diagnostics and research. The standard of care is to manually classify 30-second epochs of EEG time-domain traces into 5 discrete sleep stages. Unfortunately, this scoring process is subjective and time-consuming, and the defined stages do not capture the heterogeneous landscape of healthy and clinical ne…

    Submitted 18 May, 2018; originally announced May 2018.

  40. arXiv:1710.01821  [pdf, other]

    stat.ME cs.IT

    Classification of Local Field Potentials using Gaussian Sequence Model

    Authors: Taposh Banerjee, John Choi, Bijan Pesaran, Demba Ba, Vahid Tarokh

    Abstract: A problem of classification of local field potentials (LFPs), recorded from the prefrontal cortex of a macaque monkey, is considered. An adult macaque monkey is trained to perform a memory-based saccade. The objective is to decode the eye movement goals from the LFP collected during a memory period. The LFP classification problem is modeled as that of classification of smooth functions embedded in…

    Submitted 27 November, 2017; v1 submitted 4 October, 2017; originally announced October 2017.

  41. arXiv:1709.09723  [pdf, other]

    stat.ME cs.CE

    Estimating a Separably-Markov Random Field (SMuRF) from Binary Observations

    Authors: Yingzhuo Zhang, Noa Malem-Shinitski, Stephen A Allsop, Kay Tye, Demba Ba

    Abstract: A fundamental problem in neuroscience is to characterize the dynamics of spiking from the neurons in a circuit that is involved in learning about a stimulus or a contingency. A key limitation of current methods to analyze neural spiking data is the need to collapse neural activity over time or trials, which may cause the loss of information pertinent to understanding the function of a neuron or ci…

    Submitted 27 September, 2017; originally announced September 2017.

  42. arXiv:1709.04631  [pdf, other]

    cs.SE

    Empirical Evaluation of Mutation-based Test Prioritization Techniques

    Authors: Donghwan Shin, Shin Yoo, Mike Papadakis, Doo-Hwan Bae

    Abstract: We propose a new test case prioritization technique that combines both mutation-based and diversity-based approaches. Our diversity-aware mutation-based technique relies on the notion of mutant distinguishment, which aims to distinguish one mutant's behavior from another, rather than from the original program. We empirically investigate the relative cost and effectiveness of the mutation-based pri…

    Submitted 23 January, 2018; v1 submitted 14 September, 2017; originally announced September 2017.

  43. arXiv:1601.06466  [pdf, other]

    cs.SE

    A Theoretical Framework for Understanding Mutation-Based Testing Methods

    Authors: Donghwan Shin, Doo-Hwan Bae

    Abstract: In the field of mutation analysis, mutation is the systematic generation of mutated programs (i.e., mutants) from an original program. The concept of mutation has been widely applied to various testing problems, including test set selection, fault localization, and program repair. However, surprisingly little focus has been given to the theoretical foundation of mutation-based testing methods, mak…

    Submitted 24 January, 2016; originally announced January 2016.

    Comments: To appear in ICST 2016

    ACM Class: D.2.5, F.1.0

  44. arXiv:1106.0365  [pdf, ps, other]

    cs.DS cs.IT

    Lower Bounds for Sparse Recovery

    Authors: Khanh Do Ba, Piotr Indyk, Eric Price, David P. Woodruff

    Abstract: We consider the following k-sparse recovery problem: design an m x n matrix A, such that for any signal x, given Ax we can efficiently recover x' satisfying ||x-x'||_1 <= C min_{k-sparse x"} ||x-x"||_1. It is known that there exist matrices A with this property that have only O(k log (n/k)) rows. In this paper we show that this bound is tight. Our bound holds even for the more general /rand…

    Submitted 2 June, 2011; v1 submitted 2 June, 2011; originally announced June 2011.

    Comments: 11 pages. Appeared at SODA 2010

  45. arXiv:0904.0292  [pdf, ps, other]

    cs.DS

    Sublinear Time Algorithms for Earth Mover's Distance

    Authors: Khanh Do Ba, Huy L Nguyen, Huy N Nguyen, Ronitt Rubinfeld

    Abstract: We study the problem of estimating the Earth Mover's Distance (EMD) between probability distributions when given access only to samples. We give closeness testers and additive-error estimators over domains in $[0, Δ]^d$, with sample complexities independent of domain size - permitting the testability even of continuous distributions over infinite domains. Instead, our algorithms depend on other…

    Submitted 1 April, 2009; originally announced April 2009.

    Comments: 12 pages