Showing 1–13 of 13 results for author: Shocher, A

  1. arXiv:2311.01462  [pdf, other]

    cs.CV cs.LG

    Idempotent Generative Network

    Authors: Assaf Shocher, Amil Dravid, Yossi Gandelsman, Inbar Mosseri, Michael Rubinstein, Alexei A. Efros

    Abstract: We propose a new approach for generative modeling based on training a neural network to be idempotent. An idempotent operator is one that can be applied sequentially without changing the result beyond the initial application, namely $f(f(z))=f(z)$. The proposed model $f$ is trained to map a source distribution (e.g., Gaussian noise) to a target distribution (e.g., realistic images) using the followi…

    Submitted 2 November, 2023; originally announced November 2023.

  2. arXiv:2308.00566  [pdf, other]

    cs.CV cs.AI cs.LG

    Stochastic positional embeddings improve masked image modeling

    Authors: Amir Bar, Florian Bordes, Assaf Shocher, Mahmoud Assran, Pascal Vincent, Nicolas Ballas, Trevor Darrell, Amir Globerson, Yann LeCun

    Abstract: Masked Image Modeling (MIM) is a promising self-supervised learning approach that enables learning from unlabeled images. Despite its recent success, learning good representations through MIM remains challenging because it requires predicting the right semantic content in accurate locations. For example, given an incomplete picture of a dog, we can guess that there is a tail, but we cannot determi…

    Submitted 27 February, 2024; v1 submitted 31 July, 2023; originally announced August 2023.

    Comments: Code and models available in https://github.com/amirbar/StoP

  3. arXiv:2306.09346  [pdf, other]

    cs.CV

    Rosetta Neurons: Mining the Common Units in a Model Zoo

    Authors: Amil Dravid, Yossi Gandelsman, Alexei A. Efros, Assaf Shocher

    Abstract: Do different neural networks, trained for various vision tasks, share some common representations? In this paper, we demonstrate the existence of common features we call "Rosetta Neurons" across a range of models with different architectures, different tasks (generative and discriminative), and different types of supervision (class-supervised, text-supervised, self-supervised). We present an algor…

    Submitted 16 June, 2023; v1 submitted 15 June, 2023; originally announced June 2023.

    Comments: Project page: https://yossigandelsman.github.io/rosetta_neurons/

  4. arXiv:2306.00966  [pdf, other]

    cs.CV

    The Hidden Language of Diffusion Models

    Authors: Hila Chefer, Oran Lang, Mor Geva, Volodymyr Polosukhin, Assaf Shocher, Michal Irani, Inbar Mosseri, Lior Wolf

    Abstract: Text-to-image diffusion models have demonstrated an unparalleled ability to generate high-quality, diverse images from a textual prompt. However, the internal representations learned by these models remain an enigma. In this work, we present Conceptor, a novel method to interpret the internal representation of a textual concept by a diffusion model. This interpretation is obtained by decomposing t…

    Submitted 5 October, 2023; v1 submitted 1 June, 2023; originally announced June 2023.

  5. arXiv:2205.05725  [pdf, other]

    cs.CV

    Diverse Video Generation from a Single Video

    Authors: Niv Haim, Ben Feinstein, Niv Granot, Assaf Shocher, Shai Bagon, Tali Dekel, Michal Irani

    Abstract: GANs are able to perform generation and manipulation tasks, trained on a single video. However, these single video GANs require an unreasonable amount of time to train on a single video, rendering them almost impractical. In this paper we question the necessity of a GAN for generation from a single video, and introduce a non-parametric baseline for a variety of generation and manipulation tasks. We r…

    Submitted 11 May, 2022; originally announced May 2022.

    Comments: AI for Content Creation Workshop @ CVPR 2022

  6. arXiv:2109.08591  [pdf, other]

    cs.CV

    Diverse Generation from a Single Video Made Possible

    Authors: Niv Haim, Ben Feinstein, Niv Granot, Assaf Shocher, Shai Bagon, Tali Dekel, Michal Irani

    Abstract: GANs are able to perform generation and manipulation tasks, trained on a single video. However, these single video GANs require an unreasonable amount of time to train on a single video, rendering them almost impractical. In this paper we question the necessity of a GAN for generation from a single video, and introduce a non-parametric baseline for a variety of generation and manipulation tasks. We r…

    Submitted 5 December, 2021; v1 submitted 17 September, 2021; originally announced September 2021.

  7. arXiv:2103.15545  [pdf, other]

    cs.CV

    Drop the GAN: In Defense of Patches Nearest Neighbors as Single Image Generative Models

    Authors: Niv Granot, Ben Feinstein, Assaf Shocher, Shai Bagon, Michal Irani

    Abstract: Single image generative models perform synthesis and manipulation tasks by capturing the distribution of patches within a single image. The classical (pre Deep Learning) prevailing approaches for these tasks are based on an optimization process that maximizes patch similarity between the input and generated output. Recently, however, Single Image GANs were introduced both as a superior solution fo…

    Submitted 24 August, 2021; v1 submitted 29 March, 2021; originally announced March 2021.

    Comments: 11 pages, 10 figures, added references and acknowledgments

  8. arXiv:2006.11120  [pdf, other]

    cs.LG cs.CV stat.ML

    From Discrete to Continuous Convolution Layers

    Authors: Assaf Shocher, Ben Feinstein, Niv Haim, Michal Irani

    Abstract: A basic operation in Convolutional Neural Networks (CNNs) is spatial resizing of feature maps. This is done either by strided convolution (downscaling) or transposed convolution (upscaling). Such operations are limited to a fixed filter moving at predetermined integer steps (strides). Spatial sizes of consecutive layers are related by integer scale factors, predetermined at architectural design, a…

    Submitted 19 June, 2020; originally announced June 2020.

  9. arXiv:2003.06221  [pdf, other]

    cs.CV cs.LG

    Semantic Pyramid for Image Generation

    Authors: Assaf Shocher, Yossi Gandelsman, Inbar Mosseri, Michal Yarom, Michal Irani, William T. Freeman, Tali Dekel

    Abstract: We present a novel GAN-based model that utilizes the space of deep features learned by a pre-trained classification model. Inspired by classical image pyramid representations, we construct our model as a Semantic Generation Pyramid -- a hierarchical framework which leverages the continuum of semantic information encapsulated in such deep features; this ranges from low level information contained i…

    Submitted 16 March, 2020; v1 submitted 13 March, 2020; originally announced March 2020.

    Journal ref: IEEE Conference on Computer Vision and Pattern Recognition, 2020. CVPR 2020

  10. arXiv:1909.06581  [pdf, other]

    cs.CV

    Blind Super-Resolution Kernel Estimation using an Internal-GAN

    Authors: Sefi Bell-Kligler, Assaf Shocher, Michal Irani

    Abstract: Super resolution (SR) methods typically assume that the low-resolution (LR) image was downscaled from the unknown high-resolution (HR) image by a fixed 'ideal' downscaling kernel (e.g. Bicubic downscaling). However, this is rarely the case in real LR images, in contrast to synthetically generated SR datasets. When the assumed downscaling kernel deviates from the true one, the performance of SR met…

    Submitted 7 January, 2020; v1 submitted 14 September, 2019; originally announced September 2019.

  11. arXiv:1812.00467  [pdf, other]

    cs.CV cs.LG

    "Double-DIP": Unsupervised Image Decomposition via Coupled Deep-Image-Priors

    Authors: Yossi Gandelsman, Assaf Shocher, Michal Irani

    Abstract: Many seemingly unrelated computer vision tasks can be viewed as a special case of image decomposition into separate layers. For example, image segmentation (separation into foreground and background layers); transparent layer separation (into reflection and transmission layers); image dehazing (separation into a clear image and a haze map), and more. In this paper we propose a unified framework fo…

    Submitted 5 December, 2018; v1 submitted 2 December, 2018; originally announced December 2018.

    Comments: Project page: http://www.wisdom.weizmann.ac.il/~vision/DoubleDIP/

  12. arXiv:1812.00231  [pdf, other]

    cs.CV

    InGAN: Capturing and Remapping the "DNA" of a Natural Image

    Authors: Assaf Shocher, Shai Bagon, Phillip Isola, Michal Irani

    Abstract: Generative Adversarial Networks (GANs) typically learn a distribution of images in a large image dataset, and are then able to generate new images from this distribution. However, each natural image has its own internal statistics, captured by its unique distribution of patches. In this paper we propose an "Internal GAN" (InGAN) - an image-specific GAN - which trains on a single input image and le…

    Submitted 24 April, 2019; v1 submitted 1 December, 2018; originally announced December 2018.

  13. arXiv:1712.06087  [pdf, other]

    cs.CV cs.LG cs.NE eess.IV

    "Zero-Shot" Super-Resolution using Deep Internal Learning

    Authors: Assaf Shocher, Nadav Cohen, Michal Irani

    Abstract: Deep Learning has led to a dramatic leap in Super-Resolution (SR) performance in the past few years. However, being supervised, these SR methods are restricted to specific training data, where the acquisition of the low-resolution (LR) images from their high-resolution (HR) counterparts is predetermined (e.g., bicubic downscaling), without any distracting artifacts (e.g., sensor noise, image compr…

    Submitted 17 December, 2017; originally announced December 2017.