Showing 1–37 of 37 results for author: Kovashka, A

  1. arXiv:2405.11092  [pdf, other]

    cs.HC cs.RO

    What metrics of participation balance predict outcomes of collaborative learning with a robot?

    Authors: Yuya Asano, Diane Litman, Quentin King-Shepard, Tristan Maidment, Tyree Langley, Teresa Davison, Timothy Nokes-Malach, Adriana Kovashka, Erin Walker

    Abstract: One of the keys to the success of collaborative learning is balanced participation by all learners, but this does not always happen naturally. Pedagogical robots have the potential to facilitate balance. However, it remains unclear what participation balance robots should aim at; various metrics have been proposed, but it is still an open question whether we should balance human participation in h…

    Submitted 17 May, 2024; originally announced May 2024.

    Comments: To appear in Seventeenth International Conference on Educational Data Mining (EDM 2024)

  2. arXiv:2401.01482  [pdf, other]

    cs.CV cs.AI cs.LG

    Incorporating Geo-Diverse Knowledge into Prompting for Increased Geographical Robustness in Object Recognition

    Authors: Kyle Buettner, Sina Malakouti, Xiang Lorraine Li, Adriana Kovashka

    Abstract: Existing object recognition models have been shown to lack robustness in diverse geographical scenarios due to domain shifts in design and context. Class representations need to be adapted to more accurately reflect an object concept under these shifts. In the absence of training data from target geographies, we hypothesize that geographically diverse descriptive knowledge of categories can enhanc…

    Submitted 29 March, 2024; v1 submitted 2 January, 2024; originally announced January 2024.

    Comments: To appear in IEEE/CVF Computer Vision and Pattern Recognition Conference (CVPR), 2024

  3. arXiv:2309.13525  [pdf, other]

    cs.CV

    Semi-Supervised Domain Generalization for Object Detection via Language-Guided Feature Alignment

    Authors: Sina Malakouti, Adriana Kovashka

    Abstract: Existing domain adaptation (DA) and generalization (DG) methods in object detection enforce feature alignment in the visual space but face challenges like object appearance variability and scene complexity, which make it difficult to distinguish between objects and achieve accurate detection. In this paper, we are the first to address the problem of semi-supervised domain generalization by explori…

    Submitted 23 September, 2023; originally announced September 2023.

    Comments: Accepted at BMVC 2023

  4. arXiv:2306.07302  [pdf, other]

    cs.HC cs.AI cs.CL

    Impact of Experiencing Misrecognition by Teachable Agents on Learning and Rapport

    Authors: Yuya Asano, Diane Litman, Mingzhi Yu, Nikki Lobczowski, Timothy Nokes-Malach, Adriana Kovashka, Erin Walker

    Abstract: While speech-enabled teachable agents have some advantages over typing-based ones, they are vulnerable to errors stemming from misrecognition by automatic speech recognition (ASR). These errors may propagate, resulting in unexpected changes in the flow of conversation. We analyzed how such changes are linked with learning gains and learners' rapport with the agents. Our results show they are not r…

    Submitted 11 June, 2023; originally announced June 2023.

    Comments: Accepted to AIED 2023

  5. Hypernymization of named entity-rich captions for grounding-based multi-modal pretraining

    Authors: Giacomo Nebbia, Adriana Kovashka

    Abstract: Named entities are ubiquitous in text that naturally accompanies images, especially in domains such as news or Wikipedia articles. In previous work, named entities have been identified as a likely reason for low performance of image-text retrieval models pretrained on Wikipedia and evaluated on named entities-free benchmark datasets. Because they are rarely mentioned, named entities could be chall…

    Submitted 25 April, 2023; originally announced April 2023.

  6. arXiv:2303.10937  [pdf, other]

    cs.CV

    Boosting Weakly Supervised Object Detection using Fusion and Priors from Hallucinated Depth

    Authors: Cagri Gungor, Adriana Kovashka

    Abstract: Despite recent attention and exploration of depth for various tasks, it is still an unexplored modality for weakly-supervised object detection (WSOD). We propose an amplifier method for enhancing the performance of WSOD by integrating depth information. Our approach can be applied to any WSOD method based on multiple-instance learning, without necessitating additional annotations or inducing large…

    Submitted 8 November, 2023; v1 submitted 20 March, 2023; originally announced March 2023.

  7. arXiv:2303.10093  [pdf, other]

    cs.CV cs.AI cs.CL cs.LG

    Investigating the Role of Attribute Context in Vision-Language Models for Object Recognition and Detection

    Authors: Kyle Buettner, Adriana Kovashka

    Abstract: Vision-language alignment learned from image-caption pairs has been shown to benefit tasks like object recognition and detection. Methods are mostly evaluated in terms of how well object class names are learned, but captions also contain rich attribute context that should be considered when learning object alignment. It is unclear how methods use this context in learning, as well as whether models…

    Submitted 6 November, 2023; v1 submitted 17 March, 2023; originally announced March 2023.

    Comments: Accepted at Winter Conference on Applications of Computer Vision (WACV), 2024

  8. arXiv:2303.09608  [pdf, other]

    cs.CV

    VEIL: Vetting Extracted Image Labels from In-the-Wild Captions for Weakly-Supervised Object Detection

    Authors: Arushi Rai, Adriana Kovashka

    Abstract: The use of large-scale vision-language datasets is limited for object detection due to the negative impact of label noise on localization. Prior methods have shown how such large-scale datasets can be used for pretraining, which can provide initial signal for localization, but is insufficient without clean bounding-box data for at least some categories. We propose a technique to "vet" labels extra…

    Submitted 10 March, 2024; v1 submitted 16 March, 2023; originally announced March 2023.

    Comments: 18th Conference of the European Chapter of the Association for Computational Linguistics (EACL) 2024 camera-ready

  9. arXiv:2303.05546  [pdf, other]

    cs.CV cs.AI

    Weakly-Supervised HOI Detection from Interaction Labels Only and Language/Vision-Language Priors

    Authors: Mesut Erhan Unal, Adriana Kovashka

    Abstract: Human-object interaction (HOI) detection aims to extract interacting human-object pairs and their interaction categories from a given natural image. Even though the labeling effort required for building HOI detection datasets is inherently more extensive than for many other computer vision tasks, weakly-supervised directions in this area have not been sufficiently explored due to the difficulty of…

    Submitted 9 March, 2023; originally announced March 2023.

    Comments: 8 pages, 3 figures and 5 tables

  10. arXiv:2212.04613  [pdf, other]

    cs.CV cs.AI cs.LG

    Contrastive View Design Strategies to Enhance Robustness to Domain Shifts in Downstream Object Detection

    Authors: Kyle Buettner, Adriana Kovashka

    Abstract: Contrastive learning has emerged as a competitive pretraining method for object detection. Despite this progress, there has been minimal investigation into the robustness of contrastively pretrained detectors when faced with domain shifts. To address this gap, we conduct an empirical study of contrastive learning and out-of-domain object detection, studying how contrastive view design affects robu…

    Submitted 8 December, 2022; originally announced December 2022.

    Comments: To appear, 2nd International Workshop on Practical Deep Learning in the Wild at AAAI Conference on Artificial Intelligence 2023

  11. arXiv:2209.11842  [pdf, other]

    cs.CL cs.HC cs.RO

    Comparison of Lexical Alignment with a Teachable Robot in Human-Robot and Human-Human-Robot Interactions

    Authors: Yuya Asano, Diane Litman, Mingzhi Yu, Nikki Lobczowski, Timothy Nokes-Malach, Adriana Kovashka, Erin Walker

    Abstract: Speakers build rapport in the process of aligning conversational behaviors with each other. Rapport engendered with a teachable agent while instructing domain material has been shown to promote learning. Past work on lexical alignment in the field of education suffers from limitations in both the measures used to quantify alignment and the types of interactions in which alignment with agents has b…

    Submitted 23 September, 2022; originally announced September 2022.

    Comments: To be published in SIGDial 2022

  12. arXiv:2206.04863  [pdf, other]

    cs.CV cs.LG

    Symbolic image detection using scene and knowledge graphs

    Authors: Nasrin Kalanat, Adriana Kovashka

    Abstract: Sometimes the meaning conveyed by images goes beyond the list of objects they contain; instead, images may express a powerful message to affect the viewers' minds. Inferring this message requires reasoning about the relationships between the objects, and general common-sense knowledge about the components. In this paper, we use a scene graph, a graph representation of an image, to capture visual c…

    Submitted 10 June, 2022; originally announced June 2022.

  13. arXiv:2205.05895  [pdf, other]

    cs.CV

    Weakly-Supervised Action Detection Guided by Audio Narration

    Authors: Keren Ye, Adriana Kovashka

    Abstract: Videos are more well-organized curated data sources for visual concept learning than images. Unlike the 2-dimensional images which only involve the spatial information, the additional temporal dimension bridges and synchronizes multiple modalities. However, in most video detection benchmarks, these additional modalities are not fully utilized. For example, EPIC Kitchens is the largest dataset in f…

    Submitted 12 May, 2022; originally announced May 2022.

    Comments: To appear, in Joint 1st Ego4D and 10th EPIC Workshop, held in conjunction with the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

  14. arXiv:2112.13910  [pdf, other]

    cs.CL cs.AI cs.CV

    Visual Persuasion in COVID-19 Social Media Content: A Multi-Modal Characterization

    Authors: Mesut Erhan Unal, Adriana Kovashka, Wen-Ting Chung, Yu-Ru Lin

    Abstract: Social media content routinely incorporates multi-modal design to convey information, shape meanings, and sway interpretations toward desirable implications, but the choices and outcomes of using both texts and visual images have not been sufficiently studied. This work proposes a computational approach to analyze the outcome of persuasive information in multi-modal content, focusing on two aspe…

    Submitted 4 December, 2021; originally announced December 2021.

    Comments: 10 pages

  15. arXiv:2109.09532  [pdf, other]

    cs.SI cs.CY

    Characterizing User Susceptibility to COVID-19 Misinformation on Twitter

    Authors: Xian Teng, Yu-Ru Lin, Wen-Ting Chung, Ang Li, Adriana Kovashka

    Abstract: Though significant efforts such as removing false claims and promoting reliable sources have been increased to combat the COVID-19 "misinfodemic", it remains an unsolved societal challenge without a proper understanding of susceptible online users, i.e., those who are likely to be attracted by, believe, and spread misinformation. This study attempts to answer who constitutes the population vul…

    Submitted 20 September, 2021; originally announced September 2021.

    Comments: Accepted into ICWSM 2022, 9 figures (main text)

  16. arXiv:2106.13122  [pdf, other]

    cs.CV cs.LG

    Exploring Corruption Robustness: Inductive Biases in Vision Transformers and MLP-Mixers

    Authors: Katelyn Morrison, Benjamin Gilby, Colton Lipchak, Adam Mattioli, Adriana Kovashka

    Abstract: Recently, vision transformers and MLP-based models have been developed in order to address some of the prevalent weaknesses in convolutional neural networks. Due to the novelty of transformers being used in this domain along with the self-attention mechanism, it remains unclear to what degree these architectures are robust to corruptions. Despite some works proposing that data augmentation remains…

    Submitted 3 July, 2021; v1 submitted 24 June, 2021; originally announced June 2021.

    Comments: Under review at the Uncertainty and Robustness in Deep Learning workshop at ICML 2021. Our appendix is attached to the last page of the paper

  17. arXiv:2105.13994  [pdf, other]

    cs.CV

    Linguistic Structures as Weak Supervision for Visual Scene Graph Generation

    Authors: Keren Ye, Adriana Kovashka

    Abstract: Prior work in scene graph generation requires categorical supervision at the level of triplets - subjects and objects, and predicates that relate them, either with or without bounding box information. However, scene graph generation is a holistic task: thus holistic, contextual supervision should intuitively improve performance. In this work, we explore how linguistic structures in captions can be…

    Submitted 28 May, 2021; originally announced May 2021.

    Comments: To appear in CVPR 2021

  18. arXiv:2105.03014  [pdf, other]

    cs.CV

    BasisNet: Two-stage Model Synthesis for Efficient Inference

    Authors: Mingda Zhang, Chun-Te Chu, Andrey Zhmoginov, Andrew Howard, Brendan Jou, Yukun Zhu, Li Zhang, Rebecca Hwa, Adriana Kovashka

    Abstract: In this work, we present BasisNet which combines recent advancements in efficient neural network architectures, conditional computation, and early termination in a simple new form. Our approach incorporates a lightweight model to preview the input and generate input-dependent combination coefficients, which later controls the synthesis of a more accurate specialist model to make final prediction.…

    Submitted 6 May, 2021; originally announced May 2021.

    Comments: To appear, 4th Workshop on Efficient Deep Learning for Computer Vision (ECV2021), CVPR2021 Workshop

  19. arXiv:2103.15974  [pdf, other]

    cs.CV

    Domain-robust VQA with diverse datasets and methods but no target labels

    Authors: Mingda Zhang, Tristan Maidment, Ahmad Diab, Adriana Kovashka, Rebecca Hwa

    Abstract: The observation that computer vision methods overfit to dataset specifics has inspired diverse attempts to make object recognition models robust to domain shifts. However, similar work on domain-robust visual question answering methods is very limited. Domain adaptation for VQA differs from adaptation for object recognition due to additional complexity: VQA models handle multimodal inputs, methods…

    Submitted 29 March, 2021; originally announced March 2021.

    Comments: To appear in CVPR 2021

  20. arXiv:2101.01260  [pdf, other]

    cs.CV

    SpotPatch: Parameter-Efficient Transfer Learning for Mobile Object Detection

    Authors: Keren Ye, Adriana Kovashka, Mark Sandler, Menglong Zhu, Andrew Howard, Marco Fornoni

    Abstract: Deep learning based object detectors are commonly deployed on mobile devices to solve a variety of tasks. For maximum accuracy, each detector is usually trained to solve one single specific task, and comes with a completely independent set of parameters. While this guarantees high performance, it is also highly inefficient, as each model has to be separately downloaded and stored. In this paper we…

    Submitted 4 January, 2021; originally announced January 2021.

    Comments: Accepted by the ACCV2020 (Oral)

  21. arXiv:2012.01642  [pdf, other]

    cs.CV

    Learning to Transfer Visual Effects from Videos to Images

    Authors: Christopher Thomas, Yale Song, Adriana Kovashka

    Abstract: We study the problem of animating images by transferring spatio-temporal visual effects (such as melting) from a collection of videos. We tackle two primary challenges in visual effect transfer: 1) how to capture the effect we wish to distill; and 2) how to ensure that only the effect, rather than content or artistic style, is transferred from the source videos to the input image. To address the f…

    Submitted 17 December, 2020; v1 submitted 2 December, 2020; originally announced December 2020.

  22. arXiv:2007.08617  [pdf, other]

    cs.CV cs.CL cs.IR cs.LG

    Preserving Semantic Neighborhoods for Robust Cross-modal Retrieval

    Authors: Christopher Thomas, Adriana Kovashka

    Abstract: The abundance of multimodal data (e.g. social media posts) has inspired interest in cross-modal retrieval methods. Popular approaches rely on a variety of metric learning losses, which prescribe what the proximity of image and text should be, in the learned space. However, most prior methods have focused on the case where image and text convey redundant information; in contrast, real-world image-t…

    Submitted 16 July, 2020; originally announced July 2020.

    Journal ref: ECCV 2020

  23. arXiv:1911.00147  [pdf, other]

    cs.LG cs.CV

    Predicting the Politics of an Image Using Webly Supervised Data

    Authors: Christopher Thomas, Adriana Kovashka

    Abstract: The news media shape public opinion, and often, the visual bias they contain is evident for human observers. This bias can be inferred from how different media sources portray different subjects or topics. In this paper, we model visual political bias in contemporary media sources at scale, using webly supervised data. We collect a dataset of over one million unique images and associated news arti…

    Submitted 31 October, 2019; originally announced November 2019.

    Journal ref: 33rd Conference on Neural Information Processing Systems (NeurIPS 2019), Vancouver, Canada

  24. arXiv:1907.10164  [pdf, other]

    cs.CV

    Cap2Det: Learning to Amplify Weak Caption Supervision for Object Detection

    Authors: Keren Ye, Mingda Zhang, Adriana Kovashka, Wei Li, Danfeng Qin, Jesse Berent

    Abstract: Learning to localize and name object instances is a fundamental problem in vision, but state-of-the-art approaches rely on expensive bounding box supervision. While weakly supervised detection (WSOD) methods relax the need for boxes to that of image-level annotations, even cheaper supervision is naturally available in the form of unstructured textual descriptions that users may freely provide when…

    Submitted 16 August, 2019; v1 submitted 23 July, 2019; originally announced July 2019.

    Comments: To appear in ICCV 2019

  25. arXiv:1901.07366  [pdf, other]

    cs.CV

    Measuring Effectiveness of Video Advertisements

    Authors: James Hahn, Adriana Kovashka

    Abstract: Advertisements are unavoidable in modern society. Times Square is notorious for its incessant display of advertisements. Its popularity is worldwide and smaller cities possess miniature versions of the display, such as Pittsburgh and its digital works in Oakland on Forbes Avenue. Tokyo's Ginza district recently rose to popularity due to its upscale shops and constant onslaught of advertisements to…

    Submitted 28 January, 2019; v1 submitted 14 January, 2019; originally announced January 2019.

    Comments: 9 pages, 7 figures, 2 tables

  26. arXiv:1812.11139  [pdf, other]

    cs.CV

    Artistic Object Recognition by Unsupervised Style Adaptation

    Authors: Christopher Thomas, Adriana Kovashka

    Abstract: Computer vision systems currently lack the ability to reliably recognize artistically rendered objects, especially when such data is limited. In this paper, we propose a method for recognizing objects in artistic modalities (such as paintings, cartoons, or sketches), without requiring any labeled data from those modalities. Our method explicitly accounts for stylistic domain shifts between and wit…

    Submitted 28 December, 2018; originally announced December 2018.

    Journal ref: Asian Conference on Computer Vision 2018 (ACCV)

  27. arXiv:1811.10080  [pdf, other]

    cs.CV

    Learning to discover and localize visual objects with open vocabulary

    Authors: Keren Ye, Mingda Zhang, Wei Li, Danfeng Qin, Adriana Kovashka, Jesse Berent

    Abstract: To alleviate the cost of obtaining accurate bounding boxes for training today's state-of-the-art object detection models, recent weakly supervised detection work has proposed techniques to learn from image-level labels. However, requiring discrete image-level labels is both restrictive and suboptimal. Real-world "supervision" usually consists of more unstructured text, such as captions. In this wo…

    Submitted 25 November, 2018; originally announced November 2018.

  28. arXiv:1807.11122  [pdf, other]

    cs.CV

    Story Understanding in Video Advertisements

    Authors: Keren Ye, Kyle Buettner, Adriana Kovashka

    Abstract: In order to resonate with the viewers, many video advertisements explore creative narrative techniques such as "Freytag's pyramid" where a story begins with exposition, followed by rising action, then climax, concluding with denouement. In the dramatic structure of ads in particular, climax depends on changes in sentiment. We dedicate our study to understand the dynamic structure of video ads auto…

    Submitted 29 July, 2018; originally announced July 2018.

    Comments: To appear, Proceedings of the British Machine Vision Conference (BMVC)

  29. arXiv:1807.09882  [pdf, other]

    cs.CV

    Persuasive Faces: Generating Faces in Advertisements

    Authors: Christopher Thomas, Adriana Kovashka

    Abstract: In this paper, we examine the visual variability of objects across different ad categories, i.e. what causes an advertisement to be visually persuasive. We focus on modeling and generating faces which appear to come from different types of ads. For example, if faces in beauty ads tend to be women wearing lipstick, a generative model should portray this distinct visual appearance. Training generati…

    Submitted 25 July, 2018; originally announced July 2018.

    Journal ref: In British Machine Vision Conference (BMVC), Newcastle upon Tyne, UK, September 2018

  30. arXiv:1807.08205  [pdf, other]

    cs.CV

    Equal But Not The Same: Understanding the Implicit Relationship Between Persuasive Images and Text

    Authors: Mingda Zhang, Rebecca Hwa, Adriana Kovashka

    Abstract: Images and text in advertisements interact in complex, non-literal ways. The two channels are usually complementary, with each channel telling a different part of the story. Current approaches, such as image captioning methods, only examine literal, redundant relationships, where image and text show exactly the same content. To understand more complex relationships, we first collect a dataset of a…

    Submitted 21 July, 2018; originally announced July 2018.

    Comments: To appear in BMVC2018

  31. arXiv:1805.03134  [pdf, other]

    cs.CV

    Image Retrieval with Mixed Initiative and Multimodal Feedback

    Authors: Nils Murrugarra-Llerena, Adriana Kovashka

    Abstract: How would you search for a unique, fashionable shoe that a friend wore and you want to buy, but you didn't take a picture? Existing approaches propose interactive image search as a promising venue. However, they either entrust the user with taking the initiative to provide informative feedback, or give all control to the system which determines informative questions to ask. Instead, we propose a m…

    Submitted 8 May, 2018; originally announced May 2018.

    Comments: In submission to BMVC 2018

  32. arXiv:1711.06666  [pdf, other]

    cs.CV

    ADVISE: Symbolism and External Knowledge for Decoding Advertisements

    Authors: Keren Ye, Adriana Kovashka

    Abstract: In order to convey the most content in their limited space, advertisements embed references to outside knowledge via symbolism. For example, a motorcycle stands for adventure (a positive property the ad wants associated with the product being sold), and a gun stands for danger (a negative property to dissuade viewers from undesirable behaviors). We show how to use symbolic references to better und…

    Submitted 29 July, 2018; v1 submitted 17 November, 2017; originally announced November 2017.

    Comments: To appear, Proceedings of the European Conference on Computer Vision (ECCV)

  33. arXiv:1707.03067  [pdf, other]

    cs.CV

    Automatic Understanding of Image and Video Advertisements

    Authors: Zaeem Hussain, Mingda Zhang, Xiaozhong Zhang, Keren Ye, Christopher Thomas, Zuha Agha, Nathan Ong, Adriana Kovashka

    Abstract: There is more to images than their objective physical content: for example, advertisements are created to persuade a viewer to take a certain action. We propose the novel problem of automatic advertisement understanding. To enable research on this problem, we create two datasets: an image dataset of 64,832 image ads, and a video dataset of 3,477 ads. Our data contains rich annotations encompassing…

    Submitted 10 July, 2017; originally announced July 2017.

    Comments: To appear in CVPR 2017; data available on http://cs.pitt.edu/~kovashka/ads

  34. Crowdsourcing in Computer Vision

    Authors: Adriana Kovashka, Olga Russakovsky, Li Fei-Fei, Kristen Grauman

    Abstract: Computer vision systems require large amounts of manually annotated data to properly learn challenging visual concepts. Crowdsourcing platforms offer an inexpensive method to capture human knowledge and understanding, for a vast number of visual perception tasks. In this survey, we describe the types of annotations computer vision researchers have collected using crowdsourcing, and how they have e…

    Submitted 7 November, 2016; originally announced November 2016.

    Comments: A 69-page meta review of the field, Foundations and Trends in Computer Graphics and Vision, 2016

  35. arXiv:1508.05038  [pdf, other]

    cs.CV

    Seeing Behind the Camera: Identifying the Authorship of a Photograph

    Authors: Christopher Thomas, Adriana Kovashka

    Abstract: We introduce the novel problem of identifying the photographer behind a photograph. To explore the feasibility of current computer vision techniques to address this problem, we created a new dataset of over 180,000 images taken by 41 well-known photographers. Using this dataset, we examined the effectiveness of a variety of features (low and high-level, including CNN features) at identifying the p…

    Submitted 31 May, 2016; v1 submitted 20 August, 2015; originally announced August 2015.

    Comments: Dataset downloadable at http://www.cs.pitt.edu/~chris/photographer ; to appear in CVPR 2016

  36. WhittleSearch: Interactive Image Search with Relative Attribute Feedback

    Authors: Adriana Kovashka, Devi Parikh, Kristen Grauman

    Abstract: We propose a novel mode of feedback for image search, where a user describes which properties of exemplar images should be adjusted in order to more closely match his/her mental model of the image sought. For example, perusing image results for a query "black shoes", the user might state, "Show me shoe images like these, but sportier." Offline, our approach first learns a set of ranking functions,…

    Submitted 18 May, 2015; v1 submitted 15 May, 2015; originally announced May 2015.

    Comments: Published in the International Journal of Computer Vision (IJCV), April 2015. The final publication is available at Springer via http://dx.doi.org/10.1007/s11263-015-0814-0

    Journal ref: International Journal of Computer Vision, 1573-1405 (2015, Springer)

  37. Discovering Attribute Shades of Meaning with the Crowd

    Authors: Adriana Kovashka, Kristen Grauman

    Abstract: To learn semantic attributes, existing methods typically train one discriminative model for each word in a vocabulary of nameable properties. However, this "one model per word" assumption is problematic: while a word might have a precise linguistic definition, it need not have a precise visual definition. We propose to discover shades of attribute meaning. Given an attribute name, we use crowdsour…

    Submitted 15 May, 2015; originally announced May 2015.

    Comments: Published in the International Journal of Computer Vision (IJCV), January 2015. The final publication is available at Springer via http://dx.doi.org/10.1007/s11263-014-0798-1

    Journal ref: International Journal of Computer Vision 1573-1405 (2015, Springer)