Skip to main content

Showing 1–22 of 22 results for author: Redi, M

  1. arXiv:2403.07613  [pdf, other

    cs.HC cs.MM

    Imagine a dragon made of seaweed: How images enhance learning in Wikipedia

    Authors: Anita Silva, Maria Tracy, Katharina Reinecke, Eytan Adar, Miriam Redi

    Abstract: Though images are ubiquitous across Wikipedia, it is not obvious that the image choices optimally support learning. When well selected, images can enhance learning by dual coding, complementing, or supporting articles. When chosen poorly, images can mislead, distract, and confuse. We developed a large dataset containing 470 questions & answers to 94 Wikipedia articles with images on a wide range o… ▽ More

    Submitted 12 March, 2024; originally announced March 2024.

    Comments: 16 pages, 10 figures

  2. A Comparative Study of Reference Reliability in Multiple Language Editions of Wikipedia

    Authors: Aitolkyn Baigutanova, Diego Saez-Trumper, Miriam Redi, Meeyoung Cha, Pablo Aragón

    Abstract: Information presented in Wikipedia articles must be attributable to reliable published sources in the form of references. This study examines over 5 million Wikipedia articles to assess the reliability of references in multiple language editions. We quantify the cross-lingual patterns of the perennial sources list, a collection of reliability labels for web domains identified and collaboratively a… ▽ More

    Submitted 4 September, 2023; v1 submitted 31 August, 2023; originally announced September 2023.

    Comments: Conference on Information & Knowledge Management (CIKM '23)

  3. arXiv:2304.01961  [pdf, other

    cs.IR cs.CL cs.CV

    AToMiC: An Image/Text Retrieval Test Collection to Support Multimedia Content Creation

    Authors: Jheng-Hong Yang, Carlos Lassance, Rafael Sampaio de Rezende, Krishna Srinivasan, Miriam Redi, Stéphane Clinchant, Jimmy Lin

    Abstract: This paper presents the AToMiC (Authoring Tools for Multimedia Content) dataset, designed to advance research in image/text cross-modal retrieval. While vision-language pretrained transformers have led to significant improvements in retrieval effectiveness, existing research has relied on image-caption datasets that feature only simplistic image-text relationships and underspecified user models of… ▽ More

    Submitted 4 April, 2023; originally announced April 2023.

  4. Longitudinal Assessment of Reference Quality on Wikipedia

    Authors: Aitolkyn Baigutanova, Jaehyeon Myung, Diego Saez-Trumper, Ai-Jou Chou, Miriam Redi, Changwook Jung, Meeyoung Cha

    Abstract: Wikipedia plays a crucial role in the integrity of the Web. This work analyzes the reliability of this global encyclopedia through the lens of its references. We operationalize the notion of reference quality by defining reference need (RN), i.e., the percentage of sentences missing a citation, and reference risk (RR), i.e., the proportion of non-authoritative references. We release Citation Detec… ▽ More

    Submitted 9 March, 2023; originally announced March 2023.

    Comments: Published at the Web Conference 2023 (WWW '23)

    Journal ref: Proceedings of the ACM Web Conference 2023 (WWW '23), May 1-5, 2023, Austin, TX, USA. ACM

  5. arXiv:2112.01868  [pdf, other

    cs.CY

    A Large Scale Study of Reader Interactions with Images on Wikipedia

    Authors: Daniele Rama, Tiziano Piccardi, Miriam Redi, Rossano Schifanella

    Abstract: Wikipedia is the largest source of free encyclopedic knowledge and one of the most visited sites on the Web. To increase reader understanding of the article, Wikipedia editors add images within the text of the article's body. However, despite their widespread usage on web platforms and the huge volume of visual content on Wikipedia, little is known about the importance of images in the context of… ▽ More

    Submitted 3 December, 2021; originally announced December 2021.

    Comments: 29 pages, 12 figures, final version to be published in EPJ Data Science

  6. arXiv:2105.04117  [pdf, other

    cs.IR cs.CL cs.LG

    Wiki-Reliability: A Large Scale Dataset for Content Reliability on Wikipedia

    Authors: KayYen Wong, Miriam Redi, Diego Saez-Trumper

    Abstract: Wikipedia is the largest online encyclopedia, used by algorithms and web users as a central hub of reliable information on the web. The quality and reliability of Wikipedia content is maintained by a community of volunteer editors. Machine learning and information retrieval algorithms could help scale up editors' manual efforts around Wikipedia content reliability. However, there is a lack of larg… ▽ More

    Submitted 1 June, 2021; v1 submitted 10 May, 2021; originally announced May 2021.

    Comments: Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR '21), 2021

  7. On the Value of Wikipedia as a Gateway to the Web

    Authors: Tiziano Piccardi, Miriam Redi, Giovanni Colavizza, Robert West

    Abstract: By linking to external websites, Wikipedia can act as a gateway to the Web. To date, however, little is known about the amount of traffic generated by Wikipedia's external links. We fill this gap in a detailed analysis of usage logs gathered from Wikipedia users' client devices. Our analysis proceeds in three steps: First, we quantify the level of engagement with external links, finding that, in o… ▽ More

    Submitted 15 February, 2021; originally announced February 2021.

    Comments: The Web Conference WWW 2021, 12 pages

  8. arXiv:2008.12314  [pdf, other

    cs.CY

    A Taxonomy of Knowledge Gaps for Wikimedia Projects (Second Draft)

    Authors: Miriam Redi, Martin Gerlach, Isaac Johnson, Jonathan Morgan, Leila Zia

    Abstract: In January 2019, prompted by the Wikimedia Movement's 2030 strategic direction, the Research team at the Wikimedia Foundation identified the need to develop a knowledge gaps index -- a composite index to support the decision makers across the Wikimedia movement by providing: a framework to encourage structured and targeted brainstorming discussions; data on the state of the knowledge gaps across t… ▽ More

    Submitted 29 January, 2021; v1 submitted 27 August, 2020; originally announced August 2020.

    Comments: Second draft: see summary of changes at https://meta.wikimedia.org/wiki/Research:Knowledge_Gaps_Index/Taxonomy/Summary_of_Changes_for_Second_Version

  9. arXiv:2007.11659   

    cs.IR cs.DB

    Proceedings of the KG-BIAS Workshop 2020 at AKBC 2020

    Authors: Edgar Meij, Tara Safavi, Chenyan Xiong, Gianluca Demartini, Miriam Redi, Fatma Özcan

    Abstract: The KG-BIAS 2020 workshop touches on biases and how they surface in knowledge graphs (KGs), biases in the source data that is used to create KGs, methods for measuring or remediating bias in KGs, but also identifying other biases such as how and which languages are represented in automatically constructed KGs or how personal KGs might incur inherent biases. The goal of this workshop is to uncover… ▽ More

    Submitted 18 June, 2020; originally announced July 2020.

  10. arXiv:2006.13400  [pdf, other

    cs.SI

    Wikipedia and Westminster: Quality and Dynamics of Wikipedia Pages about UK Politicians

    Authors: Pushkal Agarwal, Miriam Redi, Nishanth Sastry, Edward Wood, Andrew Blick

    Abstract: Wikipedia is a major source of information providing a large variety of content online, trusted by readers from around the world. Readers go to Wikipedia to get reliable information about different subjects, one of the most popular being living people, and especially politicians. While a lot is known about the general usage and information consumption on Wikipedia, less is known about the life-cyc… ▽ More

    Submitted 23 June, 2020; originally announced June 2020.

    Comments: A preprint of accepted publication at the 31ST ACM Conference on Hypertext and Social Media (HT'20)

  11. arXiv:2001.08614  [pdf, other

    cs.CY

    Quantifying Engagement with Citations on Wikipedia

    Authors: Tiziano Piccardi, Miriam Redi, Giovanni Colavizza, Robert West

    Abstract: Wikipedia, the free online encyclopedia that anyone can edit, is one of the most visited sites on the Web and a common source of information for many users. As an encyclopedia, Wikipedia is not a source of original information, but was conceived as a gateway to secondary sources: according to Wikipedia's guidelines, facts must be backed up by reliable sources that reflect the full spectrum of view… ▽ More

    Submitted 26 January, 2020; v1 submitted 23 January, 2020; originally announced January 2020.

    Comments: The Web Conference WWW 2020, 10 pages

  12. FaceLift: A transparent deep learning framework to beautify urban scenes

    Authors: Sagar Joglekar, Daniele Quercia, Miriam Redi, Luca Maria Aiello, Tobias Kauer, Nishanth Sastry

    Abstract: In the area of computer vision, deep learning techniques have recently been used to predict whether urban scenes are likely to be considered beautiful: it turns out that these techniques are able to make accurate predictions. Yet they fall short when it comes to generating actionable insights for urban design. To support urban interventions, one needs to go beyond predicting beauty, and tackle the… ▽ More

    Submitted 16 January, 2020; originally announced January 2020.

  13. arXiv:1902.11116  [pdf, other

    cs.CY cs.CL cs.DL

    Citation Needed: A Taxonomy and Algorithmic Assessment of Wikipedia's Verifiability

    Authors: Miriam Redi, Besnik Fetahu, Jonathan Morgan, Dario Taraborelli

    Abstract: Wikipedia is playing an increasingly central role on the web,and the policies its contributors follow when sourcing and fact-checking content affect million of readers. Among these core guiding principles, verifiability policies have a particularly important role. Verifiability requires that information included in a Wikipedia article be corroborated against reliable secondary sources. Because of… ▽ More

    Submitted 28 February, 2019; originally announced February 2019.

  14. arXiv:1806.08282  [pdf, other

    cs.SI

    Online Petitioning Through Data Exploration and What We Found There: A Dataset of Petitions from Avaaz.org

    Authors: Pablo Aragón, Diego Sáez-Trumper, Miriam Redi, Scott A. Hale, Vicenç Gómez, Andreas Kaltenbrunner

    Abstract: The Internet has become a fundamental resource for activism as it facilitates political mobilization at a global scale. Petition platforms are a clear example of how thousands of people around the world can contribute to social change. Avaaz.org, with a presence in over 200 countries, is one of the most popular of this type. However, little research has focused on this platform, probably due to a… ▽ More

    Submitted 21 June, 2018; originally announced June 2018.

    Comments: Accepted as a dataset paper at the 12th International AAAI Conference on Web and Social Media (ICWSM-18). This preprint includes an additional appendix with the reasons, provided by Avaaz.org, about the anomalies detected when exploring the dataset. For academic purposes, please cite the ICWSM version

  15. arXiv:1711.00536  [pdf, other

    cs.SI cs.AI cs.CV cs.MM physics.soc-ph

    Beautiful and damned. Combined effect of content quality and social ties on user engagement

    Authors: Luca M. Aiello, Rossano Schifanella, Miriam Redi, Stacey Svetlichnaya, Frank Liu, Simon Osindero

    Abstract: User participation in online communities is driven by the intertwinement of the social network structure with the crowd-generated content that flows along its links. These aspects are rarely explored jointly and at scale. By looking at how users generate and access pictures of varying beauty on Flickr, we investigate how the production of quality impacts the dynamics of online social systems. We d… ▽ More

    Submitted 1 November, 2017; originally announced November 2017.

    Comments: 13 pages, 12 figures, final version published in IEEE Transactions on Knowledge and Data Engineering (Volume: PP, Issue: 99)

  16. arXiv:1609.01388  [pdf, other

    cs.MM

    To Click or Not To Click: Automatic Selection of Beautiful Thumbnails from Videos

    Authors: Yale Song, Miriam Redi, Jordi Vallmitjana, Alejandro Jaimes

    Abstract: Thumbnails play such an important role in online videos. As the most representative snapshot, they capture the essence of a video and provide the first impression to the viewers; ultimately, a great thumbnail makes a video more attractive to click and watch. We present an automatic thumbnail selection system that exploits two important characteristics commonly associated with meaningful and attrac… ▽ More

    Submitted 6 September, 2016; originally announced September 2016.

    Comments: To appear in CIKM 2016

  17. arXiv:1606.02276  [pdf, other

    cs.CL cs.CV cs.IR cs.MM

    Multilingual Visual Sentiment Concept Matching

    Authors: Nikolaos Pappas, Miriam Redi, Mercan Topkara, Brendan Jou, Hongyi Liu, Tao Chen, Shih-Fu Chang

    Abstract: The impact of culture in visual emotion perception has recently captured the attention of multimedia research. In this study, we pro- vide powerful computational linguistics tools to explore, retrieve and browse a dataset of 16K multilingual affective visual concepts and 7.3M Flickr images. First, we design an effective crowdsourc- ing experiment to collect human judgements of sentiment connected… ▽ More

    Submitted 7 June, 2016; originally announced June 2016.

    Journal ref: Proceedings ICMR '16 Proceedings of the 2016 ACM on International Conference on Multimedia Retrieval Pages 151-158

  18. arXiv:1508.03868  [pdf, other

    cs.MM cs.CL cs.CV cs.IR

    Visual Affect Around the World: A Large-scale Multilingual Visual Sentiment Ontology

    Authors: Brendan Jou, Tao Chen, Nikolaos Pappas, Miriam Redi, Mercan Topkara, Shih-Fu Chang

    Abstract: Every culture and language is unique. Our work expressly focuses on the uniqueness of culture and language in relation to human affect, specifically sentiment and emotion semantics, and how they manifest in social multimedia. We develop sets of sentiment- and emotion-polarized visual concepts by adapting semantic structures called adjective-noun pairs, originally introduced by Borth et al. (2013),… ▽ More

    Submitted 7 October, 2015; v1 submitted 16 August, 2015; originally announced August 2015.

    Comments: 11 pages, to appear at ACM MM'15

    ACM Class: H.1.2; H.5.1; H.5.4; I.2.10

  19. arXiv:1505.07522  [pdf, other

    cs.HC cs.CV cs.CY

    Like Partying? Your Face Says It All. Predicting the Ambiance of Places with Profile Pictures

    Authors: Miriam Redi, Daniele Quercia, Lindsay T. Graham, Samuel D. Gosling

    Abstract: To choose restaurants and coffee shops, people are increasingly relying on social-networking sites. In a popular site such as Foursquare or Yelp, a place comes with descriptions and reviews, and with profile pictures of people who frequent them. Descriptions and reviews have been widely explored in the research area of data mining. By contrast, profile pictures have received little attention. Prev… ▽ More

    Submitted 27 May, 2015; originally announced May 2015.

    Comments: 10 pages

  20. arXiv:1505.03358  [pdf, other

    cs.SI cs.CV cs.CY cs.MM

    An Image is Worth More than a Thousand Favorites: Surfacing the Hidden Beauty of Flickr Pictures

    Authors: Rossano Schifanella, Miriam Redi, Luca Aiello

    Abstract: The dynamics of attention in social media tend to obey power laws. Attention concentrates on a relatively small number of popular items and neglecting the vast majority of content produced by the crowd. Although popularity can be an indication of the perceived value of an item within its community, previous research has hinted to the fact that popularity is distinct from intrinsic quality. As a re… ▽ More

    Submitted 15 May, 2015; v1 submitted 13 May, 2015; originally announced May 2015.

    Comments: ICWSM 2015

  21. arXiv:1501.07304  [pdf, other

    cs.CV cs.CY cs.MM

    The Beauty of Capturing Faces: Rating the Quality of Digital Portraits

    Authors: Miriam Redi, Nikhil Rasiwasia, Gaurav Aggarwal, Alejandro Jaimes

    Abstract: Digital portrait photographs are everywhere, and while the number of face pictures keeps growing, not much work has been done to on automatic portrait beauty assessment. In this paper, we design a specific framework to automatically evaluate the beauty of digital portraits. To this end, we procure a large dataset of face images annotated not only with aesthetic scores but also with information abo… ▽ More

    Submitted 28 January, 2015; originally announced January 2015.

    Comments: FG 2015, 8 pages

  22. arXiv:1411.4080  [pdf, other

    cs.MM cs.CV cs.HC

    6 Seconds of Sound and Vision: Creativity in Micro-Videos

    Authors: Miriam Redi, Neil O Hare, Rossano Schifanella, Michele Trevisiol, Alejandro Jaimes

    Abstract: The notion of creativity, as opposed to related concepts such as beauty or interestingness, has not been studied from the perspective of automatic analysis of multimedia content. Meanwhile, short online videos shared on social media platforms, or micro-videos, have arisen as a new medium for creative expression. In this paper we study creative micro-videos in an effort to understand the features t… ▽ More

    Submitted 14 November, 2014; originally announced November 2014.

    Comments: 8 pages, 1 figures, conference IEEE CVPR 2014