Showing 1–32 of 32 results for author: Siskind, J M

  1. arXiv:2406.16902  [pdf, other]

    eess.SP cs.LG

    Learning Exemplar Representations in Single-Trial EEG Category Decoding

    Authors: Jack Kilgallen, Barak Pearlmutter, Jeffrey Mark Siskind

    Abstract: Within neuroimaging studies it is a common practice to perform repetitions of trials in an experiment when working with a noisy class of data acquisition system, such as electroencephalography (EEG) or magnetoencephalography (MEG). While this approach can be useful in some experimental designs, it presents significant limitations for certain types of analyses, such as identifying the category of a…

    Submitted 31 May, 2024; originally announced June 2024.

  2. arXiv:2010.15033  [pdf, other]

    cs.RO

    The Amazing Race™: Robot Edition

    Authors: Jared Sigurd Johansen, Thomas Victor Ilyevsky, Jeffrey Mark Siskind

    Abstract: State-of-the-art natural-language-driven autonomous-navigation systems generally lack the ability to operate in real unknown environments without crutches, such as having a map of the environment in advance or requiring a strict syntactic structure for natural-language commands. Practical artificial-intelligent systems should not have to depend on such prior knowledge. To encourage effort towards…

    Submitted 28 October, 2020; originally announced October 2020.

    Comments: 20 pages

    ACM Class: I.2.9

  3. arXiv:2004.06046  [pdf, other]

    eess.SP cs.LG stat.ML

    Object classification from randomized EEG trials

    Authors: Hamad Ahmed, Ronnie B Wilbur, Hari M Bharadwaj, Jeffrey Mark Siskind

    Abstract: New results suggest strong limits to the feasibility of classifying human brain activity evoked from image stimuli, as measured through EEG. Considerable prior work suffers from a confound between the stimulus class and the time since the start of the experiment. A prior attempt to avoid this confound using randomized trials was unable to achieve results above chance in a statistically significant…

    Submitted 9 April, 2020; originally announced April 2020.

  4. arXiv:1812.07697  [pdf, other]

    cs.CV cs.LG q-bio.NC

    Training on the test set? An analysis of Spampinato et al. [31]

    Authors: Ren Li, Jared S. Johansen, Hamad Ahmed, Thomas V. Ilyevsky, Ronnie B Wilbur, Hari M Bharadwaj, Jeffrey Mark Siskind

    Abstract: A recent paper [31] claims to classify brain processing evoked in subjects watching ImageNet stimuli as measured with EEG and to use a representation derived from this processing to create a novel object classifier. That paper, together with a series of subsequent papers [8, 15, 17, 20, 21, 30, 35], claims to revolutionize the field by achieving extremely successful results on several computer-vis…

    Submitted 18 December, 2018; originally announced December 2018.

    Comments: 18 Pages, 4 Figures, 10 Tables

  5. arXiv:1809.09318  [pdf, other]

    cs.LG stat.ML

    Floyd-Warshall Reinforcement Learning: Learning from Past Experiences to Reach New Goals

    Authors: Vikas Dhiman, Shurjo Banerjee, Jeffrey M. Siskind, Jason J. Corso

    Abstract: Consider multi-goal tasks that involve static environments and dynamic goals. Examples of such tasks, such as goal-directed navigation and pick-and-place in robotics, abound. Two types of Reinforcement Learning (RL) algorithms are used for such tasks: model-free or model-based. Each of these approaches has limitations. Model-free RL struggles to transfer learned information when the goal location…

    Submitted 4 January, 2019; v1 submitted 25 September, 2018; originally announced September 2018.

  6. arXiv:1802.02274  [pdf, other]

    cs.RO cs.AI

    A Critical Investigation of Deep Reinforcement Learning for Navigation

    Authors: Vikas Dhiman, Shurjo Banerjee, Brent Griffin, Jeffrey M Siskind, Jason J Corso

    Abstract: The navigation problem is classically approached in two steps: an exploration step, where map-information about the environment is gathered; and an exploitation step, where this information is used to navigate efficiently. Deep reinforcement learning (DRL) algorithms, alternatively, approach the problem of navigation in an end-to-end fashion. Inspired by the classical approach, we ask whether DRL…

    Submitted 4 January, 2019; v1 submitted 6 February, 2018; originally announced February 2018.

  7. Divide-and-Conquer Checkpointing for Arbitrary Programs with No User Annotation

    Authors: Jeffrey Mark Siskind, Barak A. Pearlmutter

    Abstract: Classical reverse-mode automatic differentiation (AD) imposes only a small constant-factor overhead in operation count over the original computation, but has storage requirements that grow, in the worst case, in proportion to the time consumed by the original computation. This storage blowup can be ameliorated by checkpointing, a process that reorders application of classical reverse-mode AD over…

    Submitted 29 March, 2018; v1 submitted 22 August, 2017; originally announced August 2017.

    MSC Class: 68N20; 68N18; 65F50; 65D25; 46G05; 58C20

    Journal ref: Optimization Methods and Software 33(04-06):1288-1330, 2018

  8. arXiv:1611.03777  [pdf, ps, other]

    cs.LG stat.ML

    Tricks from Deep Learning

    Authors: Atılım Güneş Baydin, Barak A. Pearlmutter, Jeffrey Mark Siskind

    Abstract: The deep learning community has devised a diverse set of methods to make gradient optimization, using large datasets, of large and highly complex models with deeply cascaded nonlinearities, practical. Taken as a whole, these methods constitute a breakthrough, allowing computational structures which are quite wide, very deep, and with an enormous number and variety of free parameters to be effectiv…

    Submitted 10 November, 2016; originally announced November 2016.

    Comments: Extended abstract presented at the AD 2016 Conference, Sep 2016, Oxford UK

  9. arXiv:1611.03429  [pdf, ps, other]

    cs.PL cs.LO

    Evolving the Incremental λ Calculus into a Model of Forward Automatic Differentiation (AD)

    Authors: Robert Kelly, Barak A. Pearlmutter, Jeffrey Mark Siskind

    Abstract: Formal transformations somehow resembling the usual derivative are surprisingly common in computer science, with two notable examples being derivatives of regular expressions and derivatives of types. A newcomer to this list is the incremental $λ$-calculus, or ILC, a "theory of changes" that deploys a formal apparatus allowing the automatic generation of efficient update functions which perform in…

    Submitted 10 November, 2016; originally announced November 2016.

    Comments: Extended abstract presented at the AD 2016 Conference, Sep 2016, Oxford UK

  10. arXiv:1611.03423  [pdf, ps, other]

    cs.MS cs.LG

    DiffSharp: An AD Library for .NET Languages

    Authors: Atılım Güneş Baydin, Barak A. Pearlmutter, Jeffrey Mark Siskind

    Abstract: DiffSharp is an algorithmic differentiation or automatic differentiation (AD) library for the .NET ecosystem, which is targeted by the C# and F# languages, among others. The library has been designed with machine learning applications in mind, allowing very succinct implementations of models and optimization routines. DiffSharp is implemented in F# and exposes forward and reverse AD operators as g…

    Submitted 10 November, 2016; originally announced November 2016.

    Comments: Extended abstract presented at the AD 2016 Conference, Sep 2016, Oxford UK

  11. arXiv:1611.03416  [pdf, ps, other]

    cs.PL cs.MS

    Efficient Implementation of a Higher-Order Language with Built-In AD

    Authors: Jeffrey Mark Siskind, Barak A. Pearlmutter

    Abstract: We show that Automatic Differentiation (AD) operators can be provided in a dynamic language without sacrificing numeric performance. To achieve this, general forward and reverse AD functions are added to a simple high-level dynamic language, and support for them is included in an aggressive optimizing compiler. Novel technical mechanisms are discussed, which have the ability to migrate the AD tran…

    Submitted 10 November, 2016; originally announced November 2016.

    Comments: Extended abstract presented at the AD 2016 Conference, Sep 2016, Oxford UK

  12. arXiv:1611.03410  [pdf, other]

    cs.PL cs.LG cs.MS

    Binomial Checkpointing for Arbitrary Programs with No User Annotation

    Authors: Jeffrey Mark Siskind, Barak A. Pearlmutter

    Abstract: Heretofore, automatic checkpointing at procedure-call boundaries, to reduce the space complexity of reverse mode, has been provided by systems like Tapenade. However, binomial checkpointing, or treeverse, has only been provided in Automatic Differentiation (AD) systems in special cases, e.g., through user-provided pragmas on DO loops in Tapenade, or as the nested taping mechanism in adol-c for tim…

    Submitted 10 November, 2016; originally announced November 2016.

    Comments: Extended abstract presented at the AD 2016 Conference, Sep 2016, Oxford UK

  13. arXiv:1511.07727  [pdf, ps, other]

    cs.MS

    DiffSharp: Automatic Differentiation Library

    Authors: Atilim Gunes Baydin, Barak A. Pearlmutter, Jeffrey Mark Siskind

    Abstract: In this paper we introduce DiffSharp, an automatic differentiation (AD) library designed with machine learning in mind. AD is a family of techniques that evaluate derivatives at machine precision with only a small constant factor of overhead, by systematically applying the chain rule of calculus at the elementary operator level. DiffSharp aims to make an extensive array of AD techniques available,…

    Submitted 26 November, 2015; v1 submitted 24 November, 2015; originally announced November 2015.

    Comments: 5 pages, 1 figure, minor fixes, added coauthor

    MSC Class: 68T05; 68W30

    ACM Class: I.2.6; G.1.4

  14. arXiv:1511.05914  [pdf, other]

    cs.CV

    Collecting and Annotating the Large Continuous Action Dataset

    Authors: Daniel Paul Barrett, Ran Xu, Haonan Yu, Jeffrey Mark Siskind

    Abstract: We make available to the community a new dataset to support action-recognition research. This dataset is different from prior datasets in several key ways. It is significantly larger. It contains streaming video with long segments containing multiple action occurrences that often overlap in space and/or time. All actions were filmed in the same collection of backgrounds so that background gives li…

    Submitted 18 November, 2015; originally announced November 2015.

  15. arXiv:1508.06161  [pdf, other]

    cs.RO cs.AI cs.CL cs.HC cs.LG

    Robot Language Learning, Generation, and Comprehension

    Authors: Daniel Paul Barrett, Scott Alan Bronikowski, Haonan Yu, Jeffrey Mark Siskind

    Abstract: We present a unified framework which supports grounding natural-language semantics in robotic driving. This framework supports acquisition (learning grounded meanings of nouns and prepositions from human annotation of robotic driving paths), generation (using such acquired meanings to generate sentential description of new robotic driving paths), and comprehension (using such acquired meanings to…

    Submitted 25 August, 2015; originally announced August 2015.

  16. arXiv:1506.02059  [pdf, other]

    cs.CV

    Sentence Directed Video Object Codetection

    Authors: Haonan Yu, Jeffrey Mark Siskind

    Abstract: We tackle the problem of video object codetection by leveraging the weak semantic constraint implied by sentences that describe the video content. Unlike most existing work that focuses on codetecting large objects which are usually salient both in size and appearance, we can codetect objects that are small or medium sized. Our method assumes no human pose or depth information such as is required…

    Submitted 26 January, 2016; v1 submitted 5 June, 2015; originally announced June 2015.

  17. arXiv:1502.05767  [pdf, ps, other]

    cs.SC cs.LG stat.ML

    Automatic differentiation in machine learning: a survey

    Authors: Atilim Gunes Baydin, Barak A. Pearlmutter, Alexey Andreyevich Radul, Jeffrey Mark Siskind

    Abstract: Derivatives, mostly in the form of gradients and Hessians, are ubiquitous in machine learning. Automatic differentiation (AD), also called algorithmic differentiation or simply "autodiff", is a family of techniques similar to but more general than backpropagation for efficiently and accurately evaluating derivatives of numeric functions expressed as computer programs. AD is a small but established…

    Submitted 5 February, 2018; v1 submitted 19 February, 2015; originally announced February 2015.

    Comments: 43 pages, 5 figures

    MSC Class: 68W30; 65D25; 68T05

    ACM Class: G.1.4; I.2.6

    Journal ref: Atilim Gunes Baydin, Barak A. Pearlmutter, Alexey Andreyevich Radul, Jeffrey Mark Siskind. Automatic differentiation in machine learning: a survey. The Journal of Machine Learning Research, 18(153):1-43, 2018

  18. arXiv:1411.4064  [pdf, other]

    cs.CV

    A Faster Method for Tracking and Scoring Videos Corresponding to Sentences

    Authors: Haonan Yu, Daniel P. Barrett, Jeffrey Mark Siskind

    Abstract: Prior work presented the sentence tracker, a method for scoring how well a sentence describes a video clip or alternatively how well a video clip depicts a sentence. We present an improved method for optimizing the same cost function employed by this prior work, reducing the space complexity from exponential in the sentence length to polynomial, as well as producing a qualitatively identical resul…

    Submitted 14 November, 2014; originally announced November 2014.

  19. arXiv:1408.6418  [pdf]

    cs.CV cs.CL cs.IR

    Video In Sentences Out

    Authors: Andrei Barbu, Alexander Bridge, Zachary Burchill, Dan Coroian, Sven Dickinson, Sanja Fidler, Aaron Michaux, Sam Mussman, Siddharth Narayanaswamy, Dhaval Salvi, Lara Schmidt, Jiangnan Shangguan, Jeffrey Mark Siskind, Jarrell Waggoner, Song Wang, Jinlian Wei, Yifan Yin, Zhiqi Zhang

    Abstract: We present a system that produces sentential descriptions of video: who did what to whom, and where and how they did it. Action class is rendered as a verb, participant objects as noun phrases, properties of those objects as adjectival modifiers in those noun phrases, spatial relations between those participants as prepositional phrases, and characteristics of the event as prepositional-phrase adj…

    Submitted 9 August, 2014; originally announced August 2014.

    Comments: Appears in Proceedings of the Twenty-Eighth Conference on Uncertainty in Artificial Intelligence (UAI2012)

    Report number: UAI-P-2012-PG-102-112

  20. arXiv:1309.5174  [pdf, other]

    cs.CV cs.CL cs.IR

    Saying What You're Looking For: Linguistics Meets Video Search

    Authors: Andrei Barbu, N. Siddharth, Jeffrey Mark Siskind

    Abstract: We present an approach to searching large video corpora for video clips which depict a natural-language query in the form of a sentence. This approach uses compositional semantics to encode subtle meaning that is lost in other systems, such as the difference between two sentences which have identical words but entirely different meaning: "The person rode the horse" vs. "The horse rode the per…

    Submitted 20 September, 2013; originally announced September 2013.

    Comments: 13 pages, 8 figures

  21. arXiv:1308.4189  [pdf, other]

    cs.CV cs.AI cs.CL

    Seeing What You're Told: Sentence-Guided Activity Recognition In Video

    Authors: N. Siddharth, Andrei Barbu, Jeffrey Mark Siskind

    Abstract: We present a system that demonstrates how the compositional structure of events, in concert with the compositional structure of language, can interplay with the underlying focusing mechanisms in video action recognition, thereby providing a medium, not only for top-down and bottom-up integration, but also for multi-modal integration between vision and language. We show how the roles played by part…

    Submitted 28 May, 2014; v1 submitted 19 August, 2013; originally announced August 2013.

    Comments: To appear in CVPR 2014

  22. arXiv:1306.5263  [pdf, other]

    cs.CV cs.CL

    Discriminative Training: Learning to Describe Video with Sentences, from Video Described with Sentences

    Authors: Haonan Yu, Jeffrey Mark Siskind

    Abstract: We present a method for learning word meanings from complex and realistic video clips by discriminatively training (DT) positive sentential labels against negative ones, and then use the trained word models to generate sentential descriptions for new video. This new work is inspired by recent work which adopts a maximum likelihood (ML) framework to address the same problem using only positive sent…

    Submitted 21 June, 2013; originally announced June 2013.

  23. arXiv:1306.4746  [pdf, other]

    cs.CV

    Felzenszwalb-Baum-Welch: Event Detection by Changing Appearance

    Authors: Daniel Paul Barrett, Jeffrey Mark Siskind

    Abstract: We propose a method which can detect events in videos by modeling the change in appearance of the event participants over time. This method makes it possible to detect events which are characterized not by motion, but by the changing state of the people or objects involved. This is accomplished by using object detectors as output models for the states of a hidden Markov model (HMM). The method all…

    Submitted 19 June, 2013; originally announced June 2013.

  24. arXiv:1211.4892  [pdf, ps, other]

    cs.SC cs.MS math.DG

    Confusion of Tagged Perturbations in Forward Automatic Differentiation of Higher-Order Functions

    Authors: Oleksandr Manzyuk, Barak A. Pearlmutter, Alexey Andreyevich Radul, David R. Rush, Jeffrey Mark Siskind

    Abstract: Forward Automatic Differentiation (AD) is a technique for augmenting programs to compute derivatives. The essence of Forward AD is to attach perturbations to each number, and propagate these through the computation. When derivatives are nested, the distinct derivative calculations, and their associated perturbations, must be distinguished. This is typically accomplished by creating a unique tag fo…

    Submitted 29 June, 2019; v1 submitted 20 November, 2012; originally announced November 2012.

  25. arXiv:1204.3616  [pdf, other]

    cs.CV cs.AI

    Large-Scale Automatic Labeling of Video Events with Verbs Based on Event-Participant Interaction

    Authors: Andrei Barbu, Alexander Bridge, Dan Coroian, Sven Dickinson, Sam Mussman, Siddharth Narayanaswamy, Dhaval Salvi, Lara Schmidt, Jiangnan Shangguan, Jeffrey Mark Siskind, Jarrell Waggoner, Song Wang, Jinlian Wei, Yifan Yin, Zhiqi Zhang

    Abstract: We present an approach to labeling short video clips with English verbs as event descriptions. A key distinguishing aspect of this work is that it labels videos with verbs that describe the spatiotemporal interaction between event participants, humans and objects interacting with each other, abstracting away all object-class information and fine-grained image characteristics, and relying solely on…

    Submitted 16 April, 2012; originally announced April 2012.

  26. arXiv:1204.2801  [pdf, other]

    cs.CV cs.AI cs.RO

    Seeing Unseeability to See the Unseeable

    Authors: Siddharth Narayanaswamy, Andrei Barbu, Jeffrey Mark Siskind

    Abstract: We present a framework that allows an observer to determine occluded portions of a structure by finding the maximum-likelihood estimate of those occluded portions consistent with visible image evidence and a consistency model. Doing this requires determining which portions of the structure are occluded in the first place. Since each process relies on the other, we determine a solution to both prob…

    Submitted 12 April, 2012; originally announced April 2012.

    Journal ref: Advances in Cognitive Systems, Vol. 2, pp. 77-94, 2012

  27. arXiv:1204.2742  [pdf, other]

    cs.CV cs.AI

    Video In Sentences Out

    Authors: Andrei Barbu, Alexander Bridge, Zachary Burchill, Dan Coroian, Sven Dickinson, Sanja Fidler, Aaron Michaux, Sam Mussman, Siddharth Narayanaswamy, Dhaval Salvi, Lara Schmidt, Jiangnan Shangguan, Jeffrey Mark Siskind, Jarrell Waggoner, Song Wang, Jinlian Wei, Yifan Yin, Zhiqi Zhang

    Abstract: We present a system that produces sentential descriptions of video: who did what to whom, and where and how they did it. Action class is rendered as a verb, participant objects as noun phrases, properties of those objects as adjectival modifiers in those noun phrases, spatial relations between those participants as prepositional phrases, and characteristics of the event as prepositional-phrase adju…

    Submitted 12 April, 2012; originally announced April 2012.

  28. arXiv:1204.2741  [pdf, other]

    cs.CV cs.AI

    Simultaneous Object Detection, Tracking, and Event Recognition

    Authors: Andrei Barbu, Aaron Michaux, Siddharth Narayanaswamy, Jeffrey Mark Siskind

    Abstract: The common internal structure and algorithmic organization of object detection, detection-based tracking, and event recognition facilitates a general approach to integrating these three components. This supports multidirectional information flow between these components allowing object detection to influence tracking and event recognition and event recognition to influence tracking and object dete…

    Submitted 12 April, 2012; originally announced April 2012.

    Journal ref: Advances in Cognitive Systems, Vol. 2, pp. 203-220, 2012

  29. arXiv:1203.1450  [pdf, ps, other]

    cs.PL cs.MS math.NA

    AD in Fortran, Part 2: Implementation via Prepreprocessor

    Authors: Alexey Radul, Barak A. Pearlmutter, Jeffrey Mark Siskind

    Abstract: We describe an implementation of the Farfel Fortran AD extensions. These extensions integrate forward and reverse AD directly into the programming model, with attendant benefits to flexibility, modularity, and ease of use. The implementation we describe is a "prepreprocessor" that generates input to existing Fortran-based AD tools. In essence, blocks of code which are targeted for AD by Farfel con…

    Submitted 8 March, 2012; v1 submitted 7 March, 2012; originally announced March 2012.

    Journal ref: Recent Advances in Algorithmic Differentiation, Springer Lecture Notes in Computational Science and Engineering volume 87, 2012, ISBN 978-3-642-30022-6, pages 273-284

  30. arXiv:1203.1448  [pdf, ps, other]

    cs.PL cs.MS math.NA

    AD in Fortran, Part 1: Design

    Authors: Alexey Radul, Barak A. Pearlmutter, Jeffrey Mark Siskind

    Abstract: We propose extensions to Fortran which integrate forward and reverse Automatic Differentiation (AD) directly into the programming model. Irrespective of implementation technology, embedding AD constructs directly into the language extends the reach and convenience of AD while allowing abstraction of concepts of interest to scientific-computing practice, such as root finding, optimization, and find…

    Submitted 8 March, 2012; v1 submitted 7 March, 2012; originally announced March 2012.

  31. arXiv:1106.4572  [pdf, ps]

    cs.AI cs.LG

    Specific-to-General Learning for Temporal Events with Application to Learning Event Definitions from Video

    Authors: A. Fern, R. Givan, J. M. Siskind

    Abstract: We develop, analyze, and evaluate a novel, supervised, specific-to-general learner for a simple temporal logic and use the resulting algorithm to learn visual event definitions from video sequences. First, we introduce a simple, propositional, temporal, event-description language called AMA that is sufficiently expressive to represent many events yet sufficiently restrictive to sup…

    Submitted 22 June, 2011; originally announced June 2011.

    Journal ref: Journal Of Artificial Intelligence Research, Volume 17, pages 379-449, 2002

  32. Grounding the Lexical Semantics of Verbs in Visual Perception using Force Dynamics and Event Logic

    Authors: J. M. Siskind

    Abstract: This paper presents an implemented system for recognizing the occurrence of events described by simple spatial-motion verbs in short image sequences. The semantics of these verbs is specified with event-logic expressions that describe changes in the state of force-dynamic relations between the participants of the event. An efficient finite representation is introduced for the infin…

    Submitted 1 June, 2011; originally announced June 2011.

    Journal ref: Journal Of Artificial Intelligence Research, Volume 15, pages 31-90, 2001