Skip to main content

Showing 1–50 of 71 results for author: Rahmani, H

  1. arXiv:2407.08394  [pdf, other

    cs.CV

    Diff-Tracker: Text-to-Image Diffusion Models are Unsupervised Trackers

    Authors: Zhengbo Zhang, Li Xu, Duo Peng, Hossein Rahmani, Jun Liu

    Abstract: We introduce Diff-Tracker, a novel approach for the challenging unsupervised visual tracking task leveraging the pre-trained text-to-image diffusion model. Our main idea is to leverage the rich knowledge encapsulated within the pre-trained diffusion model, such as the understanding of image semantics and structural information, to address unsupervised visual tracking. To this end, we design an ini… ▽ More

    Submitted 16 July, 2024; v1 submitted 11 July, 2024; originally announced July 2024.

    Comments: ECCV 2024

  2. arXiv:2407.07673  [pdf, other

    cs.CV

    Towards Adaptive Pseudo-label Learning for Semi-Supervised Temporal Action Localization

    Authors: Feixiang Zhou, Bryan Williams, Hossein Rahmani

    Abstract: Alleviating noisy pseudo labels remains a key challenge in Semi-Supervised Temporal Action Localization (SS-TAL). Existing methods often filter pseudo labels based on strict conditions, but they typically assess classification and localization quality separately, leading to suboptimal pseudo-label ranking and selection. In particular, there might be inaccurate pseudo labels within selected positiv… ▽ More

    Submitted 12 July, 2024; v1 submitted 10 July, 2024; originally announced July 2024.

    Comments: Accepted to ECCV 2024

  3. arXiv:2406.17803  [pdf, other

    cs.CL cs.AI cs.IR

    Understanding the Role of User Profile in the Personalization of Large Language Models

    Authors: Bin Wu, Zhengyan Shi, Hossein A. Rahmani, Varsha Ramineni, Emine Yilmaz

    Abstract: Utilizing user profiles to personalize Large Language Models (LLMs) has been shown to enhance the performance on a wide range of tasks. However, the precise role of user profiles and their effect mechanism on LLMs remains unclear. This study first confirms that the effectiveness of user profiles is primarily due to personalization information rather than semantic information. Furthermore, we inves… ▽ More

    Submitted 22 June, 2024; originally announced June 2024.

  4. arXiv:2405.15196  [pdf, other

    cs.CV

    DisC-GS: Discontinuity-aware Gaussian Splatting

    Authors: Haoxuan Qu, Zhuoling Li, Hossein Rahmani, Yujun Cai, Jun Liu

    Abstract: Recently, Gaussian Splatting, a method that represents a 3D scene as a collection of Gaussian distributions, has gained significant attention in addressing the task of novel view synthesis. In this paper, we highlight a fundamental limitation of Gaussian Splatting: its inability to accurately render discontinuities and boundaries in images due to the continuous nature of Gaussian distributions. To… ▽ More

    Submitted 23 May, 2024; originally announced May 2024.

  5. arXiv:2405.12663  [pdf, other

    cs.GR cs.CV

    LAGA: Layered 3D Avatar Generation and Customization via Gaussian Splatting

    Authors: Jia Gong, Shenyu Ji, Lin Geng Foo, Kang Chen, Hossein Rahmani, Jun Liu

    Abstract: Creating and customizing a 3D clothed avatar from textual descriptions is a critical and challenging task. Traditional methods often treat the human body and clothing as inseparable, limiting users' ability to freely mix and match garments. In response to this limitation, we present LAyered Gaussian Avatar (LAGA), a carefully designed framework enabling the creation of high-fidelity decomposable a… ▽ More

    Submitted 21 May, 2024; originally announced May 2024.

  6. arXiv:2405.07801  [pdf, other

    cs.CV

    Deep Learning-Based Object Pose Estimation: A Comprehensive Survey

    Authors: Jian Liu, Wei Sun, Hui Yang, Zhiwen Zeng, Chongpei Liu, Jin Zheng, Xingyu Liu, Hossein Rahmani, Nicu Sebe, Ajmal Mian

    Abstract: Object pose estimation is a fundamental computer vision problem with broad applications in augmented reality and robotics. Over the past decade, deep learning models, due to their superior accuracy and robustness, have increasingly supplanted conventional algorithms reliant on engineered point pair features. Nevertheless, several challenges persist in contemporary methods, including their dependen… ▽ More

    Submitted 31 May, 2024; v1 submitted 13 May, 2024; originally announced May 2024.

    Comments: 27 pages, 7 figures

  7. arXiv:2405.07767  [pdf, other

    cs.IR cs.AI

    Synthetic Test Collections for Retrieval Evaluation

    Authors: Hossein A. Rahmani, Nick Craswell, Emine Yilmaz, Bhaskar Mitra, Daniel Campos

    Abstract: Test collections play a vital role in evaluation of information retrieval (IR) systems. Obtaining a diverse set of user queries for test collection construction can be challenging, and acquiring relevance judgments, which indicate the appropriateness of retrieved documents to a query, is often costly and resource-intensive. Generating synthetic datasets using Large Language Models (LLMs) has recen… ▽ More

    Submitted 13 May, 2024; originally announced May 2024.

    Comments: SIGIR 2024

  8. arXiv:2404.01051  [pdf, other

    cs.CV

    Action Detection via an Image Diffusion Process

    Authors: Lin Geng Foo, Tianjiao Li, Hossein Rahmani, Jun Liu

    Abstract: Action detection aims to localize the starting and ending points of action instances in untrimmed videos, and predict the classes of those instances. In this paper, we make the observation that the outputs of the action detection task can be formulated as images. Thus, from a novel perspective, we tackle action detection via a three-image generation process to generate starting point, ending point… ▽ More

    Submitted 1 April, 2024; originally announced April 2024.

    Comments: Accepted to CVPR 2024

  9. arXiv:2404.00925  [pdf, other

    cs.CV cs.CL

    LLMs are Good Sign Language Translators

    Authors: Jia Gong, Lin Geng Foo, Yixuan He, Hossein Rahmani, Jun Liu

    Abstract: Sign Language Translation (SLT) is a challenging task that aims to translate sign videos into spoken language. Inspired by the strong translation capabilities of large language models (LLMs) that are trained on extensive multilingual text corpora, we aim to harness off-the-shelf LLMs to handle SLT. In this paper, we regularize the sign videos to embody linguistic characteristics of spoken language… ▽ More

    Submitted 1 April, 2024; originally announced April 2024.

    Comments: Accepted to CVPR 2024

  10. arXiv:2403.18212  [pdf, other

    cs.RO cs.AI cs.FL cs.LO

    Preference-Based Planning in Stochastic Environments: From Partially-Ordered Temporal Goals to Most Preferred Policies

    Authors: Hazhar Rahmani, Abhishek N. Kulkarni, Jie Fu

    Abstract: Human preferences are not always represented via complete linear orders: It is natural to employ partially-ordered preferences for expressing incomparable outcomes. In this work, we consider decision-making and probabilistic planning in stochastic systems modeled as Markov decision processes (MDPs), given a partially ordered preference over a set of temporally extended goals. Specifically, each te… ▽ More

    Submitted 26 March, 2024; originally announced March 2024.

    Comments: arXiv admin note: substantial text overlap with arXiv:2209.12267

  11. arXiv:2402.17891  [pdf, other

    cs.CV

    Weakly Supervised Co-training with Swapping Assignments for Semantic Segmentation

    Authors: Xinyu Yang, Hossein Rahmani, Sue Black, Bryan M. Williams

    Abstract: Class activation maps (CAMs) are commonly employed in weakly supervised semantic segmentation (WSSS) to produce pseudo-labels. Due to incomplete or excessive class activation, existing studies often resort to offline CAM refinement, introducing additional stages or proposing offline modules. This can cause optimization difficulties for single-stage methods and limit generalizability. In this study… ▽ More

    Submitted 9 July, 2024; v1 submitted 27 February, 2024; originally announced February 2024.

    Comments: Accepted at ECCV24

  12. arXiv:2402.09101  [pdf, other

    eess.IV cs.CV

    DestripeCycleGAN: Stripe Simulation CycleGAN for Unsupervised Infrared Image Destriping

    Authors: Shiqi Yang, Hanlin Qin, Shuai Yuan, Xiang Yan, Hossein Rahmani

    Abstract: CycleGAN has been proven to be an advanced approach for unsupervised image restoration. This framework consists of two generators: a denoising one for inference and an auxiliary one for modeling noise to fulfill cycle-consistency constraints. However, when applied to the infrared destriping task, it becomes challenging for the vanilla auxiliary generator to consistently produce vertical noise unde… ▽ More

    Submitted 14 February, 2024; originally announced February 2024.

  13. arXiv:2402.05810  [pdf, other

    cs.IR

    Natural Language User Profiles for Transparent and Scrutable Recommendations

    Authors: Jerome Ramos, Hossen A. Rahmani, Xi Wang, Xiao Fu, Aldo Lipani

    Abstract: Current state-of-the-art recommender systems predominantly rely on either implicit or explicit feedback from users to suggest new items. While effective in recommending novel options, these conventional systems often use uninterpretable embeddings. This lack of transparency not only limits user understanding of why certain items are suggested but also reduces the user's ability to easily scrutiniz… ▽ More

    Submitted 8 February, 2024; originally announced February 2024.

  14. arXiv:2402.01934  [pdf, other

    cs.IR

    Clarifying the Path to User Satisfaction: An Investigation into Clarification Usefulness

    Authors: Hossein A. Rahmani, Xi Wang, Mohammad Aliannejadi, Mohammadmehdi Naghiaei, Emine Yilmaz

    Abstract: Clarifying questions are an integral component of modern information retrieval systems, directly impacting user satisfaction and overall system performance. Poorly formulated questions can lead to user frustration and confusion, negatively affecting the system's performance. This research addresses the urgent need to identify and leverage key features that contribute to the classification of clari… ▽ More

    Submitted 2 February, 2024; originally announced February 2024.

    Comments: EACL

  15. arXiv:2402.00485  [pdf, other

    cs.AI cs.CY cs.IR

    A Personalized Framework for Consumer and Producer Group Fairness Optimization in Recommender Systems

    Authors: Hossein A. Rahmani, Mohammadmehdi Naghiaei, Yashar Deldjoo

    Abstract: In recent years, there has been an increasing recognition that when machine learning (ML) algorithms are used to automate decisions, they may mistreat individuals or groups, with legal, ethical, or economic implications. Recommender systems are prominent examples of these machine learning (ML) systems that aid users in making decisions. The majority of past literature research on RS fairness treat… ▽ More

    Submitted 1 February, 2024; originally announced February 2024.

    Comments: TORS. arXiv admin note: substantial text overlap with arXiv:2204.08085

  16. arXiv:2401.01505  [pdf, other

    cs.CV

    Sports-QA: A Large-Scale Video Question Answering Benchmark for Complex and Professional Sports

    Authors: Haopeng Li, Andong Deng, Qiuhong Ke, Jun Liu, Hossein Rahmani, Yulan Guo, Bernt Schiele, Chen Chen

    Abstract: Reasoning over sports videos for question answering is an important task with numerous applications, such as player training and information retrieval. However, this task has not been explored due to the lack of relevant datasets and the challenging nature it presents. Most datasets for video question answering (VideoQA) focus mainly on general and coarse-grained understanding of daily-life videos… ▽ More

    Submitted 14 February, 2024; v1 submitted 2 January, 2024; originally announced January 2024.

  17. arXiv:2312.13770  [pdf, other

    cs.CV

    3D Points Splatting for Real-Time Dynamic Hand Reconstruction

    Authors: Zheheng Jiang, Hossein Rahmani, Sue Black, Bryan M. Williams

    Abstract: We present 3D Points Splatting Hand Reconstruction (3D-PSHR), a real-time and photo-realistic hand reconstruction approach. We propose a self-adaptive canonical points upsampling strategy to achieve high-resolution hand geometry representation. This is followed by a self-adaptive deformation that deforms the hand from the canonical space to the target pose, adapting to the dynamic changing of cano… ▽ More

    Submitted 21 December, 2023; originally announced December 2023.

  18. arXiv:2310.16738  [pdf, other

    cs.CL cs.IR

    Improving Conversational Recommendation Systems via Bias Analysis and Language-Model-Enhanced Data Augmentation

    Authors: Xi Wang, Hossein A. Rahmani, Jiqun Liu, Emine Yilmaz

    Abstract: Conversational Recommendation System (CRS) is a rapidly growing research area that has gained significant attention alongside advancements in language modelling techniques. However, the current state of conversational recommendation faces numerous challenges due to its relative novelty and limited existing contributions. In this study, we delve into benchmark datasets for developing CRS models and… ▽ More

    Submitted 25 October, 2023; originally announced October 2023.

    Comments: Accepted by EMNLP 2023 (Findings)

  19. arXiv:2309.04250  [pdf, other

    cs.IR

    Provider Fairness and Beyond-Accuracy Trade-offs in Recommender Systems

    Authors: Saeedeh Karimi, Hossein A. Rahmani, Mohammadmehdi Naghiaei, Leila Safari

    Abstract: Recommender systems, while transformative in online user experiences, have raised concerns over potential provider-side fairness issues. These systems may inadvertently favor popular items, thereby marginalizing less popular ones and compromising provider fairness. While previous research has recognized provider-side fairness issues, the investigation into how these biases affect beyond-accuracy a… ▽ More

    Submitted 8 September, 2023; originally announced September 2023.

    Comments: FAccTRec at RecSys 2023

  20. Cellular Wireless Networks in the Upper Mid-Band

    Authors: Seongjoon Kang, Marco Mezzavilla, Sundeep Rangan, Arjuna Madanayake, Satheesh Bojja Venkatakrishnan, Gregory Hellbourg, Monisha Ghosh, Hamed Rahmani, Aditya Dhananjay

    Abstract: The upper mid-band - roughly from 7 to 24 GHz - has attracted considerable recent interest for new cellular services. This frequency range has vastly more spectrum than the highly congested bands below 7 GHz while offering more favorable propagation and coverage than the millimeter wave (mmWave) frequencies. The upper mid-band can thus provide a powerful and complementary frequency range to balanc… ▽ More

    Submitted 6 March, 2024; v1 submitted 6 September, 2023; originally announced September 2023.

    Comments: 18 pages

  21. arXiv:2308.14177  [pdf, other

    cs.CV

    AI-Generated Content (AIGC) for Various Data Modalities: A Survey

    Authors: Lin Geng Foo, Hossein Rahmani, Jun Liu

    Abstract: AI-generated content (AIGC) methods aim to produce text, images, videos, 3D assets, and other media using AI algorithms. Due to its wide range of applications and the demonstrated potential of recent works, AIGC developments have been attracting lots of attention recently, and AIGC methods have been developed for various data modalities, such as image, video, text, 3D shape (as voxels, point cloud… ▽ More

    Submitted 21 October, 2023; v1 submitted 27 August, 2023; originally announced August 2023.

  22. arXiv:2308.13369  [pdf, other

    cs.CV

    Distribution-Aligned Diffusion for Human Mesh Recovery

    Authors: Lin Geng Foo, Jia Gong, Hossein Rahmani, Jun Liu

    Abstract: Recovering a 3D human mesh from a single RGB image is a challenging task due to depth ambiguity and self-occlusion, resulting in a high degree of uncertainty. Meanwhile, diffusion models have recently seen much success in generating high-quality outputs by progressively denoising noisy inputs. Inspired by their capability, we explore a diffusion-based approach for human mesh recovery, and propose… ▽ More

    Submitted 24 October, 2023; v1 submitted 25 August, 2023; originally announced August 2023.

    Comments: Accepted to ICCV 2023

  23. arXiv:2308.00911  [pdf, other

    cs.RO

    Optimal Sensor Deception to Deviate from an Allowed Itinerary

    Authors: Hazhar Rahmani, Arash Ahadi, Jie Fu

    Abstract: In this work, we study a class of deception planning problems in which an agent aims to alter a security monitoring system's sensor readings so as to disguise its adversarial itinerary as an allowed itinerary in the environment. The adversarial itinerary set and allowed itinerary set are captured by regular languages. To deviate without being detected, we investigate whether there exists a strateg… ▽ More

    Submitted 27 June, 2024; v1 submitted 1 August, 2023; originally announced August 2023.

  24. CAPRI: Context-Aware Interpretable Point-of-Interest Recommendation Framework

    Authors: Ali Tourani, Hossein A. Rahmani, Mohammadmehdi Naghiaei, Yashar Deldjoo

    Abstract: Point-of-Interest (POI ) recommendation systems have gained popularity for their unique ability to suggest geographical destinations with the incorporation of contextual information such as time, location, and user-item interaction. Existing recommendation frameworks lack the contextual fusion required for POI systems. This paper presents CAPRI, a novel POI recommendation framework that effectivel… ▽ More

    Submitted 20 June, 2023; originally announced June 2023.

    ACM Class: H.3.3

  25. arXiv:2305.15933  [pdf, other

    cs.IR

    A Survey on Asking Clarification Questions Datasets in Conversational Systems

    Authors: Hossein A. Rahmani, Xi Wang, Yue Feng, Qiang Zhang, Emine Yilmaz, Aldo Lipani

    Abstract: The ability to understand a user's underlying needs is critical for conversational systems, especially with limited input from users in a conversation. Thus, in such a domain, Asking Clarification Questions (ACQs) to reveal users' true intent from their queries or utterances arise as an essential task. However, it is noticeable that a key limitation of the existing ACQs studies is their incomparab… ▽ More

    Submitted 25 May, 2023; originally announced May 2023.

    Comments: ACL 2023, 17 pages

  26. arXiv:2305.13690  [pdf, other

    cs.CL cs.IR

    Towards Asking Clarification Questions for Information Seeking on Task-Oriented Dialogues

    Authors: Yue Feng, Hossein A. Rahmani, Aldo Lipani, Emine Yilmaz

    Abstract: Task-oriented dialogue systems aim at providing users with task-specific services. Users of such systems often do not know all the information about the task they are trying to accomplish, requiring them to seek information about the task. To provide accurate and personalized task-oriented information seeking results, task-oriented dialogue systems need to address two potential issues: 1) users' i… ▽ More

    Submitted 23 May, 2023; originally announced May 2023.

  27. arXiv:2305.05754  [pdf, other

    cs.CL cs.AI cs.LG

    When and What to Ask Through World States and Text Instructions: IGLU NLP Challenge Solution

    Authors: Zhengxiang Shi, Jerome Ramos, To Eun Kim, Xi Wang, Hossein A. Rahmani, Aldo Lipani

    Abstract: In collaborative tasks, effective communication is crucial for achieving joint goals. One such task is collaborative building where builders must communicate with each other to construct desired structures in a simulated environment such as Minecraft. We aim to develop an intelligent builder agent to build structures based on user input through dialogue. However, in collaborative building, builder… ▽ More

    Submitted 9 May, 2023; originally announced May 2023.

    Comments: The work won NIPS 2022 IGLU Competition Research Prize. The first two authors contribute equally. arXiv admin note: substantial text overlap with arXiv:2204.08373

  28. arXiv:2304.14299  [pdf, other

    cs.CV

    A Probabilistic Attention Model with Occlusion-aware Texture Regression for 3D Hand Reconstruction from a Single RGB Image

    Authors: Zheheng Jiang, Hossein Rahmani, Sue Black, Bryan M. Williams

    Abstract: Recently, deep learning based approaches have shown promising results in 3D hand reconstruction from a single RGB image. These approaches can be roughly divided into model-based approaches, which are heavily dependent on the model's parameter space, and model-free approaches, which require large numbers of 3D ground truths to reduce depth ambiguity and struggle in weakly-supervised scenarios. To o… ▽ More

    Submitted 27 April, 2023; originally announced April 2023.

  29. arXiv:2304.11641  [pdf, other

    cs.FL cs.AI

    Probabilistic Planning with Prioritized Preferences over Temporal Logic Objectives

    Authors: Lening Li, Hazhar Rahmani, Jie Fu

    Abstract: This paper studies temporal planning in probabilistic environments, modeled as labeled Markov decision processes (MDPs), with user preferences over multiple temporal goals. Existing works reflect such preferences as a prioritized list of goals. This paper introduces a new specification language, termed prioritized qualitative choice linear temporal logic on finite traces, which augments linear tem… ▽ More

    Submitted 23 April, 2023; originally announced April 2023.

    Comments: 11 pages, 4 figures, accepted by 2023 International Joint Conference on Artificial Intelligence

  30. arXiv:2304.06724  [pdf, other

    cs.CR cs.CV cs.LG

    GradMDM: Adversarial Attack on Dynamic Networks

    Authors: Jianhong Pan, Lin Geng Foo, Qichen Zheng, Zhipeng Fan, Hossein Rahmani, Qiuhong Ke, Jun Liu

    Abstract: Dynamic neural networks can greatly reduce computation redundancy without compromising accuracy by adapting their structures based on the input. In this paper, we explore the robustness of dynamic neural networks against energy-oriented attacks targeted at reducing their efficiency. Specifically, we attack dynamic models with our novel algorithm GradMDM. GradMDM is a technique that adjusts the dir… ▽ More

    Submitted 1 April, 2023; originally announced April 2023.

    Comments: Accepted to IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI)

  31. arXiv:2304.04175  [pdf, other

    cs.CV

    Token Boosting for Robust Self-Supervised Visual Transformer Pre-training

    Authors: Tianjiao Li, Lin Geng Foo, Ping Hu, Xindi Shang, Hossein Rahmani, Zehuan Yuan, Jun Liu

    Abstract: Learning with large-scale unlabeled data has become a powerful tool for pre-training Visual Transformers (VTs). However, prior works tend to overlook that, in real-world scenarios, the input data may be corrupted and unreliable. Pre-training VTs on such corrupted data can be challenging, especially when we pre-train via the masked autoencoding approach, where both the inputs and masked ``ground tr… ▽ More

    Submitted 12 April, 2023; v1 submitted 9 April, 2023; originally announced April 2023.

    Comments: Accepted to CVPR 2023

  32. arXiv:2304.00280  [pdf, other

    cs.CV

    Progressive Channel-Shrinking Network

    Authors: Jianhong Pan, Siyuan Yang, Lin Geng Foo, Qiuhong Ke, Hossein Rahmani, Zhipeng Fan, Jun Liu

    Abstract: Currently, salience-based channel pruning makes continuous breakthroughs in network compression. In the realization, the salience mechanism is used as a metric of channel salience to guide pruning. Therefore, salience-based channel pruning can dynamically adjust the channel width at run-time, which provides a flexible pruning scheme. However, there are two problems emerging: a gating function is o… ▽ More

    Submitted 1 April, 2023; originally announced April 2023.

  33. arXiv:2303.15681  [pdf, ps, other

    cs.LG cs.CE

    GNN-based physics solver for time-independent PDEs

    Authors: Rini Jasmine Gladstone, Helia Rahmani, Vishvas Suryakumar, Hadi Meidani, Marta D'Elia, Ahmad Zareei

    Abstract: Physics-based deep learning frameworks have shown to be effective in accurately modeling the dynamics of complex physical systems with generalization capability across problem inputs. However, time-independent problems pose the challenge of requiring long-range exchange of information across the computational domain for obtaining accurate predictions. In the context of graph neural networks (GNNs)… ▽ More

    Submitted 27 March, 2023; originally announced March 2023.

    Comments: 12 pages, 2 figures

  34. arXiv:2211.16940  [pdf, other

    cs.CV

    DiffPose: Toward More Reliable 3D Pose Estimation

    Authors: Jia Gong, Lin Geng Foo, Zhipeng Fan, Qiuhong Ke, Hossein Rahmani, Jun Liu

    Abstract: Monocular 3D human pose estimation is quite challenging due to the inherent ambiguity and occlusion, which often lead to high uncertainty and indeterminacy. On the other hand, diffusion models have recently emerged as an effective tool for generating high-quality images from noise. Inspired by their capability, we explore a novel pose estimation framework (DiffPose) that formulates 3D pose estimat… ▽ More

    Submitted 9 April, 2023; v1 submitted 30 November, 2022; originally announced November 2022.

    Comments: Accepted to CVPR 2023

  35. arXiv:2209.12267  [pdf, other

    cs.RO cs.AI

    Probabilistic Planning with Partially Ordered Preferences over Temporal Goals

    Authors: Hazhar Rahmani, Abhishek N. Kulkarni, Jie Fu

    Abstract: In this paper, we study planning in stochastic systems, modeled as Markov decision processes (MDPs), with preferences over temporally extended goals. Prior work on temporal planning with preferences assumes that the user preferences form a total order, meaning that every pair of outcomes are comparable with each other. In this work, we consider the case where the preferences over possible outcomes… ▽ More

    Submitted 7 March, 2023; v1 submitted 25 September, 2022; originally announced September 2022.

  36. arXiv:2209.01425  [pdf, other

    cs.CV

    Dynamic Spatio-Temporal Specialization Learning for Fine-Grained Action Recognition

    Authors: Tianjiao Li, Lin Geng Foo, Qiuhong Ke, Hossein Rahmani, Anran Wang, Jinghua Wang, Jun Liu

    Abstract: The goal of fine-grained action recognition is to successfully discriminate between action categories with subtle differences. To tackle this, we derive inspiration from the human visual system which contains specialized regions in the brain that are dedicated towards handling specific tasks. We design a novel Dynamic Spatio-Temporal Specialization (DSTS) module, which consists of specialized neur… ▽ More

    Submitted 3 September, 2022; originally announced September 2022.

    Comments: Accepted to ECCV 2022

  37. arXiv:2208.10192  [pdf, other

    cs.IR

    Towards Confidence-aware Calibrated Recommendation

    Authors: Mohammadmehdi Naghiaei, Hossein A. Rahmani, Mohammad Aliannejadi, Nasim Sonboli

    Abstract: Recommender systems utilize users' historical data to learn and predict their future interests, providing them with suggestions tailored to their tastes. Calibration ensures that the distribution of recommended item categories is consistent with the user's historical data. Mitigating miscalibration brings various benefits to a recommender system. For example, it becomes less likely that a system o… ▽ More

    Submitted 22 August, 2022; originally announced August 2022.

    Comments: CIKM 2022

  38. arXiv:2207.12100  [pdf, other

    cs.CV

    IGFormer: Interaction Graph Transformer for Skeleton-based Human Interaction Recognition

    Authors: Yunsheng Pang, Qiuhong Ke, Hossein Rahmani, James Bailey, Jun Liu

    Abstract: Human interaction recognition is very important in many applications. One crucial cue in recognizing an interaction is the interactive body parts. In this work, we propose a novel Interaction Graph Transformer (IGFormer) network for skeleton-based interaction recognition via modeling the interactive body parts as graphs. More specifically, the proposed IGFormer constructs interaction graphs accord… ▽ More

    Submitted 25 July, 2022; originally announced July 2022.

    Comments: Accepted by ECCV 2022

  39. Exploring the Impact of Temporal Bias in Point-of-Interest Recommendation

    Authors: Hossein A. Rahmani, Mohammadmehdi Naghiaei, Ali Tourani, Yashar Deldjoo

    Abstract: Recommending appropriate travel destinations to consumers based on contextual information such as their check-in time and location is a primary objective of Point-of-Interest (POI) recommender systems. However, the issue of contextual bias (i.e., how much consumers prefer one situation over another) has received little attention from the research community. This paper examines the effect of tempor… ▽ More

    Submitted 23 July, 2022; originally announced July 2022.

    Comments: RecSys 2022

  40. arXiv:2207.09675  [pdf, other

    cs.CV

    ERA: Expert Retrieval and Assembly for Early Action Prediction

    Authors: Lin Geng Foo, Tianjiao Li, Hossein Rahmani, Qiuhong Ke, Jun Liu

    Abstract: Early action prediction aims to successfully predict the class label of an action before it is completely performed. This is a challenging task because the beginning stages of different actions can be very similar, with only minor subtle differences for discrimination. In this paper, we propose a novel Expert Retrieval and Assembly (ERA) module that retrieves and assembles a set of experts most sp… ▽ More

    Submitted 22 July, 2022; v1 submitted 20 July, 2022; originally announced July 2022.

    Comments: Accepted to ECCV 2022

  41. arXiv:2206.10298  [pdf, other

    cs.CL cs.SI

    ViralBERT: A User Focused BERT-Based Approach to Virality Prediction

    Authors: Rikaz Rameez, Hossein A. Rahmani, Emine Yilmaz

    Abstract: Recently, Twitter has become the social network of choice for sharing and spreading information to a multitude of users through posts called 'tweets'. Users can easily re-share these posts to other users through 'retweets', which allow information to cascade to many more users, increasing its outreach. Clearly, being able to know the extent to which a post can be retweeted has great value in adver… ▽ More

    Submitted 17 May, 2022; originally announced June 2022.

    Comments: UMAP 2022

  42. arXiv:2205.08289  [pdf, other

    cs.IR cs.AI

    Experiments on Generalizability of User-Oriented Fairness in Recommender Systems

    Authors: Hossein A. Rahmani, Mohammadmehdi Naghiaei, Mahdi Dehghan, Mohammad Aliannejadi

    Abstract: Recent work in recommender systems mainly focuses on fairness in recommendations as an important aspect of measuring recommendations quality. A fairness-aware recommender system aims to treat different user groups similarly. Relevant work on user-oriented fairness highlights the discriminative behavior of fairness-unaware recommendation algorithms towards a certain user group, defined based on use… ▽ More

    Submitted 17 May, 2022; originally announced May 2022.

    Comments: SIGIR 2022

  43. arXiv:2204.08085  [pdf, other

    cs.IR cs.AI

    CPFair: Personalized Consumer and Producer Fairness Re-ranking for Recommender Systems

    Authors: Mohammadmehdi Naghiaei, Hossein A. Rahmani, Yashar Deldjoo

    Abstract: Recently, there has been a rising awareness that when machine learning (ML) algorithms are used to automate choices, they may treat/affect individuals unfairly, with legal, ethical, or economic consequences. Recommender systems are prominent examples of such ML systems that assist users in making high-stakes judgments. A common trend in the previous literature research on fairness in recommender s… ▽ More

    Submitted 17 April, 2022; originally announced April 2022.

    Comments: SIGIR 2022

  44. arXiv:2202.13446  [pdf, other

    cs.IR cs.AI

    The Unfairness of Popularity Bias in Book Recommendation

    Authors: Mohammadmehdi Naghiaei, Hossein A. Rahmani, Mahdi Dehghan

    Abstract: Recent studies have shown that recommendation systems commonly suffer from popularity bias. Popularity bias refers to the problem that popular items (i.e., frequently rated items) are recommended frequently while less popular items are recommended rarely or not at all. Researchers adopted two approaches to examining popularity bias: (i) from the users' perspective, by analyzing how far a recommend… ▽ More

    Submitted 27 February, 2022; originally announced February 2022.

    Comments: Accepted at Bias@ECIR 2022

  45. arXiv:2202.13307  [pdf, other

    cs.IR cs.AI

    The Unfairness of Active Users and Popularity Bias in Point-of-Interest Recommendation

    Authors: Hossein A. Rahmani, Yashar Deldjoo, Ali Tourani, Mohammadmehdi Naghiaei

    Abstract: Point-of-Interest (POI) recommender systems provide personalized recommendations to users and help businesses attract potential customers. Despite their success, recent studies suggest that highly data-driven recommendations could be impacted by data biases, resulting in unfair outcomes for different stakeholders, mainly consumers (users) and providers (items). Most existing fairness-related resea… ▽ More

    Submitted 8 April, 2022; v1 submitted 27 February, 2022; originally announced February 2022.

    Comments: Accepted at Bias@ECIR 2022

  46. arXiv:2201.08150  [pdf, other

    cs.IR cs.AI

    A Systematic Analysis on the Impact of Contextual Information on Point-of-Interest Recommendation

    Authors: Hossein A. Rahmani, Mohammad Aliannejadi, Mitra Baratchi, Fabio Crestani

    Abstract: As the popularity of Location-based Social Networks (LBSNs) increases, designing accurate models for Point-of-Interest (POI) recommendation receives more attention. POI recommendation is often performed by incorporating contextual information into previously designed recommendation algorithms. Some of the major contextual information that has been considered in POI recommendation are the location… ▽ More

    Submitted 20 January, 2022; originally announced January 2022.

    Comments: To appear in ACM TOIS

  47. arXiv:2201.03450  [pdf, other

    cs.IR cs.AI

    Leveraging Social Influence based on Users Activity Centers for Point-of-Interest Recommendation

    Authors: Kosar Seyedhoseinzadeh, Hossein A. Rahmani, Mohsen Afsharchi, Mohammad Aliannejadi

    Abstract: Recommender Systems (RSs) aim to model and predict the user preference while interacting with items, such as Points of Interest (POIs). These systems face several challenges, such as data sparsity, limiting their effectiveness. In this paper, we address this problem by incorporating social, geographical, and temporal information into the Matrix Factorization (MF) technique. To this end, we model s… ▽ More

    Submitted 10 January, 2022; originally announced January 2022.

    Comments: To appear in Information Processing and Management (IP&M) journal

  48. arXiv:2110.09248  [pdf, ps, other

    cs.IR

    Demographic Biases of Crowd Workers in Key Opinion Leaders Finding

    Authors: Hossein A. Rahmani, Jie Yang

    Abstract: Key Opinion Leaders (KOLs) are people that have a strong influence and their opinions are listened to by people when making important decisions. Crowdsourcing provides an efficient and cost-effective means to gather data for the KOL finding task. However, data collected through crowdsourcing is affected by the inherent demographic biases of crowd workers. To avoid such demographic biases, we need… ▽ More

    Submitted 19 October, 2021; v1 submitted 18 October, 2021; originally announced October 2021.

    Comments: 3 pages, CSCW 2021 Workshop - Investigating and Mitigating Biases in Crowdsourced Data

  49. arXiv:2109.11369  [pdf, other

    cs.CV

    Recent Advances of Continual Learning in Computer Vision: An Overview

    Authors: Haoxuan Qu, Hossein Rahmani, Li Xu, Bryan Williams, Jun Liu

    Abstract: In contrast to batch learning where all training data is available at once, continual learning represents a family of methods that accumulate knowledge and learn continuously with data available in sequential order. Similar to the human learning process with the ability of learning, fusing, and accumulating new knowledge coming at different time steps, continual learning is considered to have high… ▽ More

    Submitted 30 November, 2023; v1 submitted 23 September, 2021; originally announced September 2021.

  50. arXiv:2108.08344  [pdf, other

    cs.CV cs.AI

    The Multi-Modal Video Reasoning and Analyzing Competition

    Authors: Haoran Peng, He Huang, Li Xu, Tianjiao Li, Jun Liu, Hossein Rahmani, Qiuhong Ke, Zhicheng Guo, Cong Wu, Rongchang Li, Mang Ye, Jiahao Wang, Jiaxu Zhang, Yuanzhong Liu, Tao He, Fuwei Zhang, Xianbin Liu, Tao Lin

    Abstract: In this paper, we introduce the Multi-Modal Video Reasoning and Analyzing Competition (MMVRAC) workshop in conjunction with ICCV 2021. This competition is composed of four different tracks, namely, video question answering, skeleton-based action recognition, fisheye video-based action recognition, and person re-identification, which are based on two datasets: SUTD-TrafficQA and UAV-Human. We summa… ▽ More

    Submitted 18 August, 2021; originally announced August 2021.

    Comments: Accepted to ICCV 2021 Workshops

    ACM Class: I.2.10; I.2.6