Showing 1–23 of 23 results for author: Saeidi, M

  1. arXiv:2312.10728  [pdf, other]

    cs.AI

    Benchmarks for Physical Reasoning AI

    Authors: Andrew Melnik, Robin Schiewer, Moritz Lange, Andrei Muresanu, Mozhgan Saeidi, Animesh Garg, Helge Ritter

    Abstract: Physical reasoning is a crucial aspect in the development of general AI systems, given that human learning starts with interacting with the physical world before progressing to more complex concepts. Although researchers have studied and assessed the physical reasoning of AI approaches through various specific benchmarks, there is no comprehensive approach to evaluating and measuring progress. The…

    Submitted 17 December, 2023; originally announced December 2023.

  2. arXiv:2311.08195  [pdf, other]

    cs.CL cs.AI

    Automated Fact-Checking in Dialogue: Are Specialized Models Needed?

    Authors: Eric Chamoun, Marzieh Saeidi, Andreas Vlachos

    Abstract: Prior research has shown that typical fact-checking models for stand-alone claims struggle with claims made in dialogues. As a solution, fine-tuning these models on labelled dialogue data has been proposed. However, creating separate models for each use case is impractical, and we show that fine-tuning models for dialogue results in poor performance on typical fact-checking. To overcome this chall…

    Submitted 14 November, 2023; originally announced November 2023.

    Comments: Accepted to EMNLP 2023

  3. arXiv:2308.03676  [pdf, other]

    eess.SP cs.IT

    A Tractable Handoff-aware Rate Outage Approximation with Applications to THz-enabled Vehicular Network Optimization

    Authors: Mohammad Amin Saeidi, Haider Shoaib, Hina Tabassum

    Abstract: In this paper, we first develop a tractable mathematical model of the handoff (HO)-aware rate outage experienced by a typical connected and autonomous vehicle (CAV) in a given THz vehicular network. The derived model captures the impact of line-of-sight (LOS) Nakagami-m fading channels, interference, and molecular absorption effects. We first derive the statistics of the interference-plus-molecula…

    Submitted 25 August, 2023; v1 submitted 7 August, 2023; originally announced August 2023.

    Comments: This paper has been accepted in the IEEE Global Communications (GLOBECOM) 2023 conference

  4. arXiv:2306.11167  [pdf, other]

    cs.CL cs.AI cs.LG

    Large Language Models are Fixated by Red Herrings: Exploring Creative Problem Solving and Einstellung Effect using the Only Connect Wall Dataset

    Authors: Saeid Naeini, Raeid Saqur, Mozhgan Saeidi, John Giorgi, Babak Taati

    Abstract: The quest for human imitative AI has been an enduring topic in AI research since its inception. The technical evolution and emerging capabilities of the latest cohort of large language models (LLMs) have reinvigorated the subject beyond academia to the cultural zeitgeist. While recent NLP evaluation benchmark tasks test some aspects of human-imitative behaviour (e.g., BIG-bench's 'human-like behav…

    Submitted 8 November, 2023; v1 submitted 19 June, 2023; originally announced June 2023.

    Comments: v4,v3: Minor cosmetic adjustments, typo-fixes etc. from V2. Fixed Fig. 2 caption overlapping with text in S2.2. V2: with added OCW-Randomized and OCW-WordNet results in Section 4.3 (added). 22 pages with Appendix

    ACM Class: I.2.7

  5. arXiv:2306.08781  [pdf, ps, other]

    cs.IT eess.SP

    Resource Allocation and Performance Analysis of Hybrid RSMA-NOMA in the Downlink

    Authors: Mohammad Amin Saeidi, Hina Tabassum

    Abstract: Rate splitting multiple access (RSMA) and non-orthogonal multiple access (NOMA) are the key enabling multiple access techniques to enable massive connectivity. However, it is unclear whether RSMA would consistently outperform NOMA from a system sum-rate perspective, users' fairness, as well as convergence and feasibility of the resource allocation solutions. This paper investigates the weighted su…

    Submitted 14 June, 2023; originally announced June 2023.

    Comments: This paper has been accepted in the 2023 IEEE 34th Annual International Symposium on Personal, Indoor and Mobile Radio Communications (PIMRC)

  6. arXiv:2306.01069  [pdf, other]

    cs.CL cs.AI cs.IR

    TimelineQA: A Benchmark for Question Answering over Timelines

    Authors: Wang-Chiew Tan, Jane Dwivedi-Yu, Yuliang Li, Lambert Mathias, Marzieh Saeidi, Jing Nathan Yan, Alon Y. Halevy

    Abstract: Lifelogs are descriptions of experiences that a person had during their life. Lifelogs are created by fusing data from the multitude of digital services, such as online photos, maps, shopping and content streaming services. Question answering over lifelogs can offer personal assistants a critical resource when they try to provide advice in context. However, obtaining answers to questions over life…

    Submitted 1 June, 2023; originally announced June 2023.

  7. arXiv:2212.07606  [pdf, other]

    cs.IT

    Multi-band Wireless Networks: Architectures, Challenges, and Comparative Analysis

    Authors: Mohammad Amin Saeidi, Hina Tabassum, Mohamed-Slim Alouini

    Abstract: This paper presents the vision of multi-band communication networks (MBN) in 6G, where optical and TeraHertz (THz) transmissions will coexist with the conventional radio frequency (RF) spectrum. This paper will first pin-point the fundamental challenges in MBN architectures at the PHYsical (PHY) and Medium Access (MAC) layer, such as unique channel propagation and estimation issues, user offloadin…

    Submitted 20 June, 2023; v1 submitted 14 December, 2022; originally announced December 2022.

    Comments: This work has been accepted to be published in IEEE Communications Magazine

  8. arXiv:2211.01482  [pdf, other]

    cs.CL cs.AI cs.LG

    RQUGE: Reference-Free Metric for Evaluating Question Generation by Answering the Question

    Authors: Alireza Mohammadshahi, Thomas Scialom, Majid Yazdani, Pouya Yanki, Angela Fan, James Henderson, Marzieh Saeidi

    Abstract: Existing metrics for evaluating the quality of automatically generated questions such as BLEU, ROUGE, BERTScore, and BLEURT compare the reference and predicted questions, providing a high score when there is a considerable lexical overlap or semantic similarity between the candidate and the reference questions. This approach has two major shortcomings. First, we need expensive human-provided refer…

    Submitted 26 May, 2023; v1 submitted 2 November, 2022; originally announced November 2022.

    Comments: Accepted to Findings of ACL 2023

  9. arXiv:2205.12259  [pdf, other]

    cs.CL cs.LG

    Policy Compliance Detection via Expression Tree Inference

    Authors: Neema Kotonya, Andreas Vlachos, Majid Yazdani, Lambert Mathias, Marzieh Saeidi

    Abstract: Policy Compliance Detection (PCD) is a task we encounter when reasoning over texts, e.g. legal frameworks. Previous work to address PCD relies heavily on modeling the task as a special case of Recognizing Textual Entailment. Entailment is applicable to the problem of PCD, however viewing the policy as a single proposition, as opposed to multiple interlinked propositions, yields poor performance an…

    Submitted 24 May, 2022; originally announced May 2022.

  10. arXiv:2204.01172  [pdf, other]

    cs.CL

    PERFECT: Prompt-free and Efficient Few-shot Learning with Language Models

    Authors: Rabeeh Karimi Mahabadi, Luke Zettlemoyer, James Henderson, Marzieh Saeidi, Lambert Mathias, Veselin Stoyanov, Majid Yazdani

    Abstract: Current methods for few-shot fine-tuning of pretrained masked language models (PLMs) require carefully engineered prompts and verbalizers for each new task to convert examples into a cloze-format that the PLM can score. In this work, we propose PERFECT, a simple and efficient method for few-shot fine-tuning of PLMs without relying on any such handcrafting, which is highly effective given as few as…

    Submitted 25 April, 2022; v1 submitted 3 April, 2022; originally announced April 2022.

    Comments: ACL, 2022

  11. arXiv:2109.14497  [pdf, other]

    cs.DS

    Ruler Wrapping

    Authors: Travis Gagie, Mozhgan Saeidi, Allan Sapucaia

    Abstract: In 1985 Hopcroft, Joseph and Whitesides showed it is NP-complete to decide whether a carpenter's ruler with segments of given positive lengths can be folded into a line of at most a given length, such that the folded hinges alternate between 180 degrees clockwise and 180 degrees counter-clockwise. At the open-problem session of 33rd Canadian Conference on Computational Geometry (CCCG '21), O'Rourk…

    Submitted 9 January, 2022; v1 submitted 29 September, 2021; originally announced September 2021.

  12. arXiv:2109.03731  [pdf, other]

    cs.CL

    Cross-Policy Compliance Detection via Question Answering

    Authors: Marzieh Saeidi, Majid Yazdani, Andreas Vlachos

    Abstract: Policy compliance detection is the task of ensuring that a scenario conforms to a policy (e.g. a claim is valid according to government rules or a post in an online platform conforms to community guidelines). This task has been previously instantiated as a form of textual entailment, which results in poor accuracy due to the complexity of the policies. In this paper we propose to address policy co…

    Submitted 8 September, 2021; originally announced September 2021.

    Journal ref: EMNLP 2021

  13. arXiv:2106.01074  [pdf, other]

    cs.CL cs.AI cs.DB

    Database Reasoning Over Text

    Authors: James Thorne, Majid Yazdani, Marzieh Saeidi, Fabrizio Silvestri, Sebastian Riedel, Alon Halevy

    Abstract: Neural models have shown impressive performance gains in answering queries from natural language text. However, existing works are unable to support database queries, such as "List/Count all female athletes who were born in 20th century", which require reasoning over sets of relevant facts with operations such as join, filtering and aggregation. We show that while state-of-the-art transformer mode…

    Submitted 2 June, 2021; originally announced June 2021.

    Comments: To appear at ACL2021

  14. arXiv:2012.12518  [pdf, ps, other]

    cs.CR

    If This Context Then That Concern: Exploring users' concerns with IFTTT applets

    Authors: Mahsa Saeidi, McKenzie Calvert, Audrey W. Au, Anita Sarma, Rakesh B. Bobba

    Abstract: End users are increasingly using trigger-action platforms like If-This-Then-That (IFTTT) to create applets to connect smart home devices and services. However, there are inherent risks in using such applets -- even non-malicious ones -- as sensitive information may leak through their use in certain contexts (e.g., where the device is located, who can observe the resultant action). This work aims…

    Submitted 23 December, 2020; originally announced December 2020.

  15. arXiv:2011.05448  [pdf, other]

    cs.CL

    Generating Fact Checking Briefs

    Authors: Angela Fan, Aleksandra Piktus, Fabio Petroni, Guillaume Wenzek, Marzieh Saeidi, Andreas Vlachos, Antoine Bordes, Sebastian Riedel

    Abstract: Fact checking at scale is difficult -- while the number of active fact checking websites is growing, it remains too small for the needs of the contemporary media ecosystem. However, despite good intentions, contributions from volunteers are often error-prone, and thus in practice restricted to claim detection. We investigate how to increase the accuracy and efficiency of fact checking by providing…

    Submitted 10 November, 2020; originally announced November 2020.

  16. arXiv:2010.06973  [pdf, other]

    cs.CL cs.DB cs.LG

    Neural Databases

    Authors: James Thorne, Majid Yazdani, Marzieh Saeidi, Fabrizio Silvestri, Sebastian Riedel, Alon Halevy

    Abstract: In recent years, neural networks have shown impressive performance gains on long-standing AI problems, and in particular, answering queries from natural language text. These advances raise the question of whether they can be extended to a point where we can relax the fundamental assumption of database management, namely, that our data is represented as fields of a pre-defined schema. This paper…

    Submitted 14 October, 2020; originally announced October 2020.

    Comments: Submitted to PVLDB vol 14

  17. arXiv:2010.01339  [pdf, ps, other]

    cs.IT

    Weighted Sum-Rate Maximization for Multi-IRS-assisted Full-Duplex Systems with Hardware Impairments

    Authors: Mohammad Amin Saeidi, Mohammad Javad Emadi, Hamed Masoumi, Mohammad Robat Mili, Derrick Wing Kwan Ng, Ioannis Krikidis

    Abstract: Smart and reconfigurable wireless communication environments can be established by exploiting well-designed intelligent reflecting surfaces (IRSs) to shape the communication channels. In this paper, we investigate how multiple IRSs affect the performance of multi-user full-duplex communication systems under hardware impairment at each node, wherein the base station (BS) and the uplink users are su…

    Submitted 3 October, 2020; originally announced October 2020.

    Comments: 30 pages, This work has been submitted for possible publication

  18. arXiv:2009.10311  [pdf, other]

    cs.SI cs.AI

    Preserving Integrity in Online Social Networks

    Authors: Alon Halevy, Cristian Canton Ferrer, Hao Ma, Umut Ozertem, Patrick Pantel, Marzieh Saeidi, Fabrizio Silvestri, Ves Stoyanov

    Abstract: Online social networks provide a platform for sharing information and free expression. However, these networks are also used for malicious purposes, such as distributing misinformation and hate speech, selling illegal drugs, and coordinating sex trafficking or child exploitation. This paper surveys the state of the art in keeping online platforms and their users safe from such harm, also known as…

    Submitted 25 September, 2020; v1 submitted 22 September, 2020; originally announced September 2020.

  19. arXiv:2008.06274  [pdf, other]

    cs.CL cs.LG

    Graph-based Modeling of Online Communities for Fake News Detection

    Authors: Shantanu Chandra, Pushkar Mishra, Helen Yannakoudakis, Madhav Nimishakavi, Marzieh Saeidi, Ekaterina Shutova

    Abstract: Over the past few years, there has been a substantial effort towards automated detection of fake news on social media platforms. Existing research has modeled the structure, style, content, and patterns in dissemination of online posts, as well as the demographic traits of users who interact with them. However, no attention has been directed towards modeling the properties of online communities th…

    Submitted 23 November, 2020; v1 submitted 14 August, 2020; originally announced August 2020.

  20. arXiv:1809.01494  [pdf, other]

    cs.CL cs.LG stat.ML

    Interpretation of Natural Language Rules in Conversational Machine Reading

    Authors: Marzieh Saeidi, Max Bartolo, Patrick Lewis, Sameer Singh, Tim Rocktäschel, Mike Sheldon, Guillaume Bouchard, Sebastian Riedel

    Abstract: Most work in machine reading focuses on question answering problems where the answer is directly expressed in the text to read. However, many real-world question answering problems require the reading of text not because it contains the literal answer, but because it contains a recipe to derive an answer together with the reader's background knowledge. One example is the task of interpreting regul…

    Submitted 28 August, 2018; originally announced September 2018.

    Comments: EMNLP 2018

  21. On the Effect of Semantically Enriched Context Models on Software Modularization

    Authors: Amir Saeidi, Jurriaan Hage, Ravi Khadka, Slinger Jansen

    Abstract: Many of the existing approaches for program comprehension rely on the linguistic information found in source code, such as identifier names and comments. Semantic clustering is one such technique for modularization of the system that relies on the informal semantics of the program, encoded in the vocabulary used in the source code. Treating the source code as a collection of tokens loses the seman…

    Submitted 4 August, 2017; originally announced August 2017.

    Journal ref: The Art, Science, and Engineering of Programming, 2018, Vol. 2, Issue 1, Article 2

  22. arXiv:1701.04653  [pdf, other]

    cs.CL cs.SI

    Community Question Answering Platforms vs. Twitter for Predicting Characteristics of Urban Neighbourhoods

    Authors: Marzieh Saeidi, Alessandro Venerandi, Licia Capra, Sebastian Riedel

    Abstract: In this paper, we investigate whether text from a Community Question Answering (QA) platform can be used to predict and describe real-world attributes. We experiment with predicting a wide range of 62 demographic attributes for neighbourhoods of London. We use the text from QA platform of Yahoo! Answers and compare our results to the ones obtained from Twitter microblogs. Outcomes show that the co…

    Submitted 17 January, 2017; originally announced January 2017.

    Comments: Submitted to ICWSM2017

  23. arXiv:1610.03771  [pdf, other]

    cs.CL

    SentiHood: Targeted Aspect Based Sentiment Analysis Dataset for Urban Neighbourhoods

    Authors: Marzieh Saeidi, Guillaume Bouchard, Maria Liakata, Sebastian Riedel

    Abstract: In this paper, we introduce the task of targeted aspect-based sentiment analysis. The goal is to extract fine-grained information with respect to entities mentioned in user comments. This work extends both aspect-based sentiment analysis that assumes a single entity per document and targeted sentiment analysis that assumes a single sentiment towards a target entity. In particular, we identify the…

    Submitted 12 October, 2016; originally announced October 2016.

    Comments: Accepted at COLING 2016