Showing 1–43 of 43 results for author: Macdonald, C

  1. arXiv:2406.11460  [pdf, other]

    cs.CL cs.AI

    TRACE the Evidence: Constructing Knowledge-Grounded Reasoning Chains for Retrieval-Augmented Generation

    Authors: Jinyuan Fang, Zaiqiao Meng, Craig Macdonald

    Abstract: Retrieval-augmented generation (RAG) offers an effective approach for addressing question answering (QA) tasks. However, the imperfections of the retrievers in RAG models often result in the retrieval of irrelevant information, which could introduce noise and degrade the performance, especially when handling multi-hop questions that require multiple steps of reasoning. To enhance the multi-hop re…

    Submitted 17 June, 2024; originally announced June 2024.

  2. Shallow Cross-Encoders for Low-Latency Retrieval

    Authors: Aleksandr V. Petrov, Sean MacAvaney, Craig Macdonald

    Abstract: Transformer-based Cross-Encoders achieve state-of-the-art effectiveness in text retrieval. However, Cross-Encoders based on large transformer models (such as BERT or T5) are computationally expensive and allow for scoring only a small number of documents within a reasonably small latency window. However, keeping search latencies low is important for user satisfaction and energy usage. In this pape…

    Submitted 29 March, 2024; originally announced March 2024.

    Comments: Accepted by ECIR2024

  3. arXiv:2403.04875  [pdf, other]

    cs.IR cs.LG

    Aligning GPTRec with Beyond-Accuracy Goals with Reinforcement Learning

    Authors: Aleksandr Petrov, Craig Macdonald

    Abstract: Adaptations of Transformer models, such as BERT4Rec and SASRec, achieve state-of-the-art performance in the sequential recommendation task according to accuracy-based metrics, such as NDCG. These models treat items as tokens and then utilise a score-and-rank approach (Top-K strategy), where the model first computes item scores and then ranks them according to this score. While this approach works…

    Submitted 7 March, 2024; originally announced March 2024.

    Comments: Accepted by the 2nd Workshop on Recommendation with Generative Models, in conjunction with The Web Conference 2024

  4. arXiv:2401.05783  [pdf, other]

    cs.IR

    What Else Would I Like? A User Simulator using Alternatives for Improved Evaluation of Fashion Conversational Recommendation Systems

    Authors: Maria Vlachou, Craig Macdonald

    Abstract: In Conversational Recommendation Systems (CRS), a user can provide feedback on recommended items at each interaction turn, leading the CRS towards more desirable recommendations. Currently, different types of CRS offer various possibilities for feedback, i.e., natural language feedback, or answering clarifying questions. In most cases, a user simulator is employed for training as well as evaluatin…

    Submitted 11 January, 2024; originally announced January 2024.

  5. RecJPQ: Training Large-Catalogue Sequential Recommenders

    Authors: Aleksandr V. Petrov, Craig Macdonald

    Abstract: Sequential Recommendation is a popular recommendation task that uses the order of user-item interaction to model evolving users' interests and sequential patterns in their behaviour. Current state-of-the-art Transformer-based models for sequential recommendation, such as BERT4Rec and SASRec, generate sequence embeddings and compute scores for catalogue items, but the increasing catalogue size make…

    Submitted 18 December, 2023; v1 submitted 11 December, 2023; originally announced December 2023.

    Comments: Accepted by ACM WSDM 2024

  6. arXiv:2311.15790  [pdf, other]

    cs.IR cs.AI

    A Social-aware Gaussian Pre-trained Model for Effective Cold-start Recommendation

    Authors: Siwei Liu, Xi Wang, Craig Macdonald, Iadh Ounis

    Abstract: The use of pre-training is an emerging technique to enhance a neural model's performance, which has been shown to be effective for many neural language models such as BERT. This technique has also been used to enhance the performance of recommender systems. In such recommender systems, pre-training models are used to learn a better initialisation for both users and items. However, recent existing…

    Submitted 27 November, 2023; originally announced November 2023.

    Comments: 20 pages

    MSC Class: 68P20 ACM Class: H.3.3

  7. arXiv:2310.20343  [pdf, other]

    cs.IR cs.MM

    Large Multi-modal Encoders for Recommendation

    Authors: Zixuan Yi, Zijun Long, Iadh Ounis, Craig Macdonald, Richard McCreadie

    Abstract: In recent years, the rapid growth of online multimedia services, such as e-commerce platforms, has necessitated the development of personalised recommendation approaches that can encode diverse content about each item. Indeed, modern multi-modal recommender systems exploit diverse features obtained from raw images and item descriptions to enhance the recommendation performance. However, the existi…

    Submitted 3 November, 2023; v1 submitted 31 October, 2023; originally announced October 2023.

  8. arXiv:2310.11405  [pdf, other]

    cs.IR

    On Coherence-based Predictors for Dense Query Performance Prediction

    Authors: Maria Vlachou, Craig Macdonald

    Abstract: Query Performance Prediction (QPP) estimates the effectiveness of a search engine's results in response to a query without relevance judgments. Traditionally, post-retrieval predictors have focused upon either the distribution of the retrieval scores, or the coherence of the top-ranked documents using traditional bag-of-words index representations. More recently, BERT-based models using dense embe…

    Submitted 17 October, 2023; originally announced October 2023.

  9. arXiv:2308.10685  [pdf, other]

    cs.IR

    Contrastive Graph Prompt-tuning for Cross-domain Recommendation

    Authors: Zixuan Yi, Iadh Ounis, Craig Macdonald

    Abstract: Recommender systems are frequently challenged by the data sparsity problem. One approach to mitigate this issue is through cross-domain recommendation techniques. In a cross-domain context, sharing knowledge between domains can enhance the effectiveness in the target domain. Recent cross-domain methods have employed a pre-training approach, but we argue that these methods often result in suboptima…

    Submitted 3 November, 2023; v1 submitted 21 August, 2023; originally announced August 2023.

  10. gSASRec: Reducing Overconfidence in Sequential Recommendation Trained with Negative Sampling

    Authors: Aleksandr Petrov, Craig Macdonald

    Abstract: A large catalogue size is one of the central challenges in training recommendation models: a large number of items makes them memory and computationally inefficient to compute scores for all items during training, forcing these models to deploy negative sampling. However, negative sampling increases the proportion of positive interactions in the training data, and therefore models trained with neg…

    Submitted 14 August, 2023; originally announced August 2023.

    Comments: Accepted at ACM RecSys 2023

  11. arXiv:2308.00415  [pdf, other]

    cs.IR

    Generative Query Reformulation for Effective Adhoc Search

    Authors: Xiao Wang, Sean MacAvaney, Craig Macdonald, Iadh Ounis

    Abstract: Performing automatic reformulations of a user's query is a popular paradigm used in information retrieval (IR) for improving effectiveness -- as exemplified by the pseudo-relevance feedback approaches, which expand the query in order to alleviate the vocabulary mismatch problem. Recent advancements in generative language models have demonstrated their ability in generating responses that are relev…

    Submitted 1 August, 2023; originally announced August 2023.

    Comments: Accepted to Gen-IR@SIGIR2023 Workshop

  12. arXiv:2306.11114  [pdf, other]

    cs.IR

    Generative Sequential Recommendation with GPTRec

    Authors: Aleksandr V. Petrov, Craig Macdonald

    Abstract: Sequential recommendation is an important recommendation task that aims to predict the next item in a sequence. Recently, adaptations of language models, particularly Transformer-based models such as SASRec and BERT4Rec, have achieved state-of-the-art results in sequential recommendation. In these models, item ids replace tokens in the original language models. However, this approach has limitatio…

    Submitted 19 June, 2023; originally announced June 2023.

    Comments: Accepted at Gen-IR@SIGIR2023 workshop

  13. arXiv:2301.03266  [pdf, other]

    cs.IR

    Doc2Query--: When Less is More

    Authors: Mitko Gospodinov, Sean MacAvaney, Craig Macdonald

    Abstract: Doc2Query -- the process of expanding the content of a document before indexing using a sequence-to-sequence model -- has emerged as a prominent technique for improving the first-stage retrieval effectiveness of search engines. However, sequence-to-sequence models are known to be prone to "hallucinating" content that is not present in the source text. We argue that Doc2Query is indeed prone to hal…

    Submitted 27 February, 2023; v1 submitted 9 January, 2023; originally announced January 2023.

    Comments: ECIR 2023

  14. arXiv:2211.10678  [pdf, other]

    cs.CL cs.IR cs.LG

    Entity-Assisted Language Models for Identifying Check-worthy Sentences

    Authors: Ting Su, Craig Macdonald, Iadh Ounis

    Abstract: We propose a new uniform framework for text classification and ranking that can automate the process of identifying check-worthy sentences in political debates and speech transcripts. Our framework combines the semantic analysis of the sentences, with additional entity embeddings obtained through the identified entities within the sentences. In particular, we analyse the semantic meaning of each s…

    Submitted 19 November, 2022; originally announced November 2022.

    Comments: 22 pages, 15 tables, 3 figures

  15. arXiv:2211.10672  [pdf, other]

    cs.SI cs.IR

    Leveraging Users' Social Network Embeddings for Fake News Detection on Twitter

    Authors: Ting Su, Craig Macdonald, Iadh Ounis

    Abstract: Social networks (SNs) are increasingly important sources of news for many people. The online connections made by users allow information to spread more easily than traditional news media (e.g., newspaper, television). However, they also make the spread of fake news easier than in traditional media, especially through the users' social network connections. In this paper, we focus on investigating…

    Submitted 19 November, 2022; originally announced November 2022.

    Comments: 15 pages, 5 figures

  16. Adaptive Re-Ranking with a Corpus Graph

    Authors: Sean MacAvaney, Nicola Tonellotto, Craig Macdonald

    Abstract: Search systems often employ a re-ranking pipeline, wherein documents (or passages) from an initial pool of candidates are assigned new ranking scores. The process enables the use of highly-effective but expensive scoring functions that are not suitable for use directly in structures like inverted indices or approximate nearest neighbour indices. However, re-ranking pipelines are inherently limited…

    Submitted 18 August, 2022; originally announced August 2022.

    Comments: CIKM 2022

  17. arXiv:2207.07483  [pdf, other]

    cs.IR cs.AI cs.LG

    A Systematic Review and Replicability Study of BERT4Rec for Sequential Recommendation

    Authors: Aleksandr Petrov, Craig Macdonald

    Abstract: BERT4Rec is an effective model for sequential recommendation based on the Transformer architecture. In the original publication, BERT4Rec claimed superiority over other available sequential recommendation approaches (e.g. SASRec), and it is now frequently being used as a state-of-the-art baseline for sequential recommendations. However, not all subsequent publications confirmed this result and pro…

    Submitted 15 July, 2022; originally announced July 2022.

    Comments: This paper is accepted at the Reproducibility track of the ACM RecSys '22 conference

  18. arXiv:2207.02643  [pdf, other]

    cs.IR cs.AI cs.LG

    Effective and Efficient Training for Sequential Recommendation using Recency Sampling

    Authors: Aleksandr Petrov, Craig Macdonald

    Abstract: Many modern sequential recommender systems use deep neural networks, which can effectively estimate the relevance of items but require a lot of time to train. Slow training increases expenses, hinders product development timescales and prevents the model from being regularly updated to adapt to changing user preferences. Training such sequential models involves appropriately sampling past user int…

    Submitted 15 July, 2022; v1 submitted 6 July, 2022; originally announced July 2022.

    Comments: This full research paper is accepted at 16th ACM Conference on Recommender Systems (ACM RecSys)

  19. arXiv:2201.08622  [pdf, other]

    cs.IR

    Reproducing Personalised Session Search over the AOL Query Log

    Authors: Sean MacAvaney, Craig Macdonald, Iadh Ounis

    Abstract: Despite its troubled past, the AOL Query Log continues to be an important resource to the research community -- particularly for tasks like search personalisation. When using the query log for these ranking experiments, little attention is usually paid to the document corpus. Recent work typically uses a corpus containing versions of the documents collected long after the log was produced. Given that…

    Submitted 21 January, 2022; originally announced January 2022.

    Comments: ECIR 2022 (reproducibility)

  20. arXiv:2111.13466  [pdf, ps, other]

    cs.IR

    Streamlining Evaluation with ir-measures

    Authors: Sean MacAvaney, Craig Macdonald, Iadh Ounis

    Abstract: We present ir-measures, a new tool that makes it convenient to calculate a diverse set of evaluation measures used in information retrieval. Rather than implementing its own measure calculations, ir-measures provides a common interface to a handful of evaluation tools. The necessary tools are automatically invoked (potentially multiple times) to calculate all the desired metrics, simplifying the e…

    Submitted 26 November, 2021; originally announced November 2021.

    Comments: ECIR 2022 (demo)

  21. On Approximate Nearest Neighbour Selection for Multi-Stage Dense Retrieval

    Authors: Craig Macdonald, Nicola Tonellotto

    Abstract: Dense retrieval, which describes the use of contextualised language models such as BERT to identify documents from a collection by leveraging approximate nearest neighbour (ANN) techniques, has been increasing in popularity. Two families of approaches have emerged, depending on whether documents and queries are represented by single or multiple embeddings. ColBERT, the exemplar of the latter, uses…

    Submitted 25 August, 2021; originally announced August 2021.

    ACM Class: H.3.3

  22. Query Embedding Pruning for Dense Retrieval

    Authors: Nicola Tonellotto, Craig Macdonald

    Abstract: Recent advances in dense retrieval techniques have offered the promise of being able not just to re-rank documents using contextualised language models such as BERT, but also to use such models to identify documents from the collection in the first place. However, when using dense retrieval approaches that use multiple embedded representations for each query, a large number of documents can be ret…

    Submitted 23 August, 2021; originally announced August 2021.

  23. arXiv:2108.06279  [pdf, other]

    cs.IR cs.CL

    On Single and Multiple Representations in Dense Passage Retrieval

    Authors: Craig Macdonald, Nicola Tonellotto, Iadh Ounis

    Abstract: The advent of contextualised language models has brought gains in search effectiveness, not just when applied for re-ranking the output of classical weighting models such as BM25, but also when used directly for passage indexing and retrieval, a technique which is called dense retrieval. In the existing literature in neural ranking, two dense retrieval families have become apparent: single represe…

    Submitted 19 August, 2021; v1 submitted 13 August, 2021; originally announced August 2021.

    Comments: Published at the 11th Italian Information Retrieval Workshop (IIR 2021)

  24. arXiv:2108.04026  [pdf, other]

    cs.IR

    IntenT5: Search Result Diversification using Causal Language Models

    Authors: Sean MacAvaney, Craig Macdonald, Roderick Murray-Smith, Iadh Ounis

    Abstract: Search result diversification is a beneficial approach to overcome under-specified queries, such as those that are ambiguous or multi-faceted. Existing approaches often rely on massive query logs and interaction data to generate a variety of possible query intents, which then can be used to re-rank documents. However, relying on user interaction data is problematic because one first needs a massiv…

    Submitted 9 August, 2021; originally announced August 2021.

  25. arXiv:2107.03936  [pdf, other]

    cs.IR

    Graph Neural Pre-training for Enhancing Recommendations using Side Information

    Authors: Zaiqiao Meng, Siwei Liu, Craig Macdonald, Iadh Ounis

    Abstract: Leveraging the side information associated with entities (i.e. users and items) to enhance the performance of recommendation systems has been widely recognized as an important modelling dimension. While many existing approaches focus on the integration scheme to incorporate entity side information -- by combining the recommendation loss function with an extra side information-aware loss -- in this…

    Submitted 9 July, 2021; v1 submitted 8 July, 2021; originally announced July 2021.

  26. Pseudo-Relevance Feedback for Multiple Representation Dense Retrieval

    Authors: Xiao Wang, Craig Macdonald, Nicola Tonellotto, Iadh Ounis

    Abstract: Pseudo-relevance feedback mechanisms, from Rocchio to the relevance models, have shown the usefulness of expanding and reweighting the users' initial queries using information occurring in an initial set of retrieved documents, known as the pseudo-relevant set. Recently, dense retrieval -- through the use of neural contextual language models such as BERT for analysing the documents' and queries' c…

    Submitted 1 July, 2021; v1 submitted 21 June, 2021; originally announced June 2021.

    Comments: 10 pages

    Journal ref: Proceedings of ICTIR 2021

  27. arXiv:2104.08912  [pdf, other]

    cs.IR

    The Simpson's Paradox in the Offline Evaluation of Recommendation Systems

    Authors: Amir H. Jadidinejad, Craig Macdonald, Iadh Ounis

    Abstract: Recommendation systems are often evaluated based on users' interactions that were collected from an existing, already deployed recommendation system. In this situation, users only provide feedback on the exposed items and they may not leave feedback on other items since they have not been exposed to them by the deployed system. As a result, the collected feedback dataset that is used to evaluate a…

    Submitted 18 April, 2021; originally announced April 2021.

  28. arXiv:2102.03089  [pdf, other]

    cs.IR

    Leveraging Review Properties for Effective Recommendation

    Authors: Xi Wang, Iadh Ounis, Craig Macdonald

    Abstract: Many state-of-the-art recommendation systems leverage explicit item reviews posted by users by considering their usefulness in representing the users' preferences and describing the items' attributes. These posted reviews may have various associated properties, such as their length, their age since they were posted, or their item rating. However, it remains unclear how these different review prope…

    Submitted 5 February, 2021; originally announced February 2021.

    Comments: To be published in the International World Wide Web Conference (WWW) 2021

  29. Negative Confidence-Aware Weakly Supervised Binary Classification for Effective Review Helpfulness Classification

    Authors: Xi Wang, Iadh Ounis, Craig Macdonald

    Abstract: The incompleteness of positive labels and the presence of many unlabelled instances are common problems in binary classification applications such as in review helpfulness classification. Various studies from the classification literature consider all unlabelled instances as negative examples. However, a classification model that learns to classify binary instances with incomplete positive labels…

    Submitted 14 August, 2020; originally announced August 2020.

    Comments: CIKM 2020

  30. Declarative Experimentation in Information Retrieval using PyTerrier

    Authors: Craig Macdonald, Nicola Tonellotto

    Abstract: The advent of deep machine learning platforms such as Tensorflow and Pytorch, developed in expressive high-level languages such as Python, has allowed more expressive representations of deep neural network architectures. We argue that such a powerful formalism is missing in information retrieval (IR), and propose a framework called PyTerrier that allows advanced retrieval pipelines to be expresse…

    Submitted 28 July, 2020; originally announced July 2020.

    Journal ref: 2020 ACM SIGIR International Conference on the Theory of Information Retrieval (ICTIR '20)

  31. arXiv:2007.13237  [pdf, other]

    cs.IR

    Exploring Data Splitting Strategies for the Evaluation of Recommendation Models

    Authors: Zaiqiao Meng, Richard McCreadie, Craig Macdonald, Iadh Ounis

    Abstract: Effective methodologies for evaluating recommender systems are critical, so that such systems can be compared in a sound manner. A commonly overlooked aspect of recommender system evaluation is the selection of the data splitting strategy. In this paper, we both show that there is no standard splitting strategy and that the selection of splitting strategy can have a strong impact on the ranking of…

    Submitted 26 July, 2020; originally announced July 2020.

  32. arXiv:2007.07987  [pdf, other]

    cs.IR

    Deep Reinforced Query Reformulation for Information Retrieval

    Authors: Xiao Wang, Craig Macdonald, Iadh Ounis

    Abstract: Query reformulations have long been a key mechanism to alleviate the vocabulary-mismatch problem in information retrieval, for example by expanding the queries with related query terms or by generating paraphrases of the queries. In this work, we propose a deep reinforced query reformulation (DRQR) model to automatically generate new reformulations of the query. To encourage the model to generate…

    Submitted 15 July, 2020; originally announced July 2020.

    Comments: 10 pages, 4 figures

  33. arXiv:2003.08276  [pdf, other]

    cs.IR

    Supporting Interoperability Between Open-Source Search Engines with the Common Index File Format

    Authors: Jimmy Lin, Joel Mackenzie, Chris Kamphuis, Craig Macdonald, Antonio Mallia, Michał Siedlaczek, Andrew Trotman, Arjen de Vries

    Abstract: There exists a natural tension between encouraging a diverse ecosystem of open-source search engines and supporting fair, replicable comparisons across those systems. To balance these two goals, we examine two approaches to providing interoperability between the inverted indexes of several systems. The first takes advantage of internal abstractions around index structures and building wrappers tha…

    Submitted 18 March, 2020; originally announced March 2020.

  34. arXiv:1911.10737  [pdf, other]

    cs.CV cs.LG

    Nearest Neighbor Sampling of Point Sets using Rays

    Authors: Liangchen Liu, Louis Ly, Colin Macdonald, Yen-Hsi Richard Tsai

    Abstract: We propose a new framework for the sampling, compression, and analysis of distributions of point sets and other geometric objects embedded in Euclidean spaces. Our approach involves constructing a tensor called the RaySense sketch, which captures nearest neighbors from the underlying geometry of points along a set of rays. We explore various operations that can be performed on the RaySense sketch,…

    Submitted 13 September, 2023; v1 submitted 25 November, 2019; originally announced November 2019.

    Comments: 48 pages, 14 figures, accepted to Communication on Applied Mathematics and Computation (CAMC), Focused Issue in Honor of Prof. Stanley Osher on the Occasion of His 80th Birthday. Fixed typos and improved notations

    MSC Class: 68T09; 65D19 (Primary) 68T07; 65D40 (Secondary) ACM Class: G.0; G.2.3; G.3; I.2.10; I.4.7

  35. arXiv:1909.07705  [pdf, other]

    cs.IR cs.LG

    Variational Bayesian Context-aware Representation for Grocery Recommendation

    Authors: Zaiqiao Meng, Richard McCreadie, Craig Macdonald, Iadh Ounis

    Abstract: Grocery recommendation is an important recommendation use-case, which aims to predict which items a user might choose to buy in the future, based on their shopping history. However, existing methods only represent each user and item by single deterministic points in a low-dimensional continuous space. In addition, most of these methods are trained by maximizing the co-occurrence likelihood with a…

    Submitted 29 October, 2019; v1 submitted 17 September, 2019; originally announced September 2019.

    Comments: Accepted for CARS 2.0 - Context-Aware Recommender Systems Workshop @ RecSys'19

  36. arXiv:1908.11319  [pdf]

    stat.ML cs.CY cs.LG

    Machine Learning and the Internet of Things Enable Steam Flood Optimization for Improved Oil Production

    Authors: Mi Yan, Jonathan C. MacDonald, Chris T. Reaume, Wesley Cobb, Tamas Toth, Sarah S. Karthigan

    Abstract: Recently developed machine learning techniques, in association with the Internet of Things (IoT) allow for the implementation of a method of increasing oil production from heavy-oil wells. Steam flood injection, a widely used enhanced oil recovery technique, uses thermal and gravitational potential to mobilize and dilute heavy oil in situ to increase oil production. In contrast to traditional stea…

    Submitted 29 August, 2019; v1 submitted 29 August, 2019; originally announced August 2019.

    Comments: Accepted by the 1st International Workshop on Artificial Intelligence of Things at KDD 2019

    Journal ref: The 1st International Workshop on Artificial Intelligence of Things at KDD 2019

  37. arXiv:1907.02956  [pdf, other]

    cs.CY cs.IR

    The FACTS of Technology-Assisted Sensitivity Review

    Authors: Graham McDonald, Craig Macdonald, Iadh Ounis

    Abstract: At least ninety countries implement Freedom of Information laws that state that government documents must be made freely available, or opened, to the public. However, many government documents contain sensitive information, such as personal or confidential information. Therefore, all government documents that are opened to the public must first be reviewed to identify, and protect, any sensitive i…

    Submitted 5 July, 2019; originally announced July 2019.

    Comments: 4 pages

  38. arXiv:1702.00171  [pdf, other]

    cs.IR

    Inferring Conceptual Relationships When Ranking Patients

    Authors: Nut Limsopatham, Craig Macdonald, Iadh Ounis

    Abstract: Searching patients based on the relevance of their medical records is challenging because of the inherent implicit knowledge within the patients' medical records and queries. Such knowledge is known to the medical practitioners but may be hidden from a search system. For example, when searching for the patients with a heart disease, medical practitioners commonly know that patients who are taking…

    Submitted 1 February, 2017; originally announced February 2017.

  39. arXiv:1610.04002  [pdf, other]

    cs.IR cs.SI

    Emergency Identification and Analysis with EAIMS

    Authors: Richard McCreadie, Craig Macdonald, Iadh Ounis

    Abstract: Social media platforms are now a key source of information for a large segment of the public. As such, these platforms have a great potential as a means to provide real-time information to emergency management agencies. Moreover, during an emergency, these agencies are very interested in social media as a means to find public-driven response efforts, as well as to track how their handling of that…

    Submitted 13 October, 2016; originally announced October 2016.

  40. arXiv:1606.07828  [pdf, ps, other]

    cs.IR

    Modelling User Preferences using Word Embeddings for Context-Aware Venue Recommendation

    Authors: Jarana Manotumruksa, Craig Macdonald, Iadh Ounis

    Abstract: Venue recommendation aims to assist users by making personalised suggestions of venues to visit, building upon data available from location-based social networks (LBSNs) such as Foursquare. A particular challenge for this task is context-aware venue recommendation (CAVR), which additionally takes the surrounding context of the user (e.g. the user's location and the time of day) into account in ord…

    Submitted 24 June, 2016; originally announced June 2016.

    Comments: 4 pages

  41. arXiv:1606.07006  [pdf, other]

    cs.IR cs.CL

    Using Word Embeddings in Twitter Election Classification

    Authors: Xiao Yang, Craig Macdonald, Iadh Ounis

    Abstract: Word embeddings and convolutional neural networks (CNN) have attracted extensive attention in various classification tasks for Twitter, e.g. sentiment classification. However, the effect of the configuration used to train and generate the word embeddings on the classification performance has not been studied in the existing literature. In this paper, using a Twitter election classification task th…

    Submitted 21 March, 2017; v1 submitted 22 June, 2016; originally announced June 2016.

    Comments: NeuIR Workshop 2016

  42. arXiv:1601.06826  [pdf, other]

    quant-ph cs.IT

    Covert Communication over Classical-Quantum Channels

    Authors: Michael S. Bullock, Azadeh Sheikholeslami, Mehrdad Tahmasbi, Robert C. Macdonald, Saikat Guha, Boulat A. Bash

    Abstract: We investigate covert communication over general memoryless classical-quantum channels with fixed finite-size input alphabets. We show that the square root law (SRL) governs covert communication in this setting when product of $n$ input states is used: $L_{\rm SRL}\sqrt{n}+o(\sqrt{n})$ covert bits (but no more) can be reliably transmitted in $n$ uses of classical-quantum channel, where…

    Submitted 21 July, 2023; v1 submitted 25 January, 2016; originally announced January 2016.

    Comments: Corrected typos, 52 pages, 2 figures

  43. arXiv:1312.1611  [pdf, ps, other]

    cs.IR

    Intent Models for Contextualising and Diversifying Query Suggestions

    Authors: Eugene Kharitonov, Craig Macdonald, Pavel Serdyukov, Iadh Ounis

    Abstract: The query suggestion or auto-completion mechanisms help users to type less while interacting with a search engine. A basic approach that ranks suggestions according to their frequency in the query logs is suboptimal. Firstly, many candidate queries with the same prefix can be removed as redundant. Secondly, the suggestions can also be personalised based on the user's context. These two directions…

    Submitted 5 December, 2013; originally announced December 2013.

    Comments: A short version of this paper was presented at CIKM 2013