Skip to main content

Showing 1–50 of 55 results for author: Soleymani, M

  1. arXiv:2406.02170  [pdf, other

    cs.IT eess.SP

    MIMO Capacity Maximization with Beyond-Diagonal RIS

    Authors: Ignacio Santamaria, Mohammad Soleymani, Eduard Jorswieck, Jesús Gutiérrez

    Abstract: This paper addresses the problem of maximizing the capacity of a multiple-input multiple-output (MIMO) link assisted by a beyond-diagonal reconfigurable intelligent surface (BD-RIS). We maximize the capacity by alternately optimizing the transmit covariance matrix, and the BD-RIS scattering matrix, which, according to network theory, should be unitary and symmetric. These constraints make the opti… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

    Comments: 5 pages, 4 figures

  2. arXiv:2405.14017  [pdf, other

    cs.CV

    MagicPose4D: Crafting Articulated Models with Appearance and Motion Control

    Authors: Hao Zhang, Di Chang, Fang Li, Mohammad Soleymani, Narendra Ahuja

    Abstract: With the success of 2D and 3D visual generative models, there is growing interest in generating 4D content. Existing methods primarily rely on text prompts to produce 4D content, but they often fall short of accurately defining complex or rare motions. To address this limitation, we propose MagicPose4D, a novel framework for refined control over both appearance and motion in 4D generation. Unlike… ▽ More

    Submitted 22 May, 2024; originally announced May 2024.

    Comments: Project Page: https://boese0601.github.io/magicpose4d

  3. arXiv:2403.10737  [pdf, other

    cs.CV

    Leveraging Synthetic Data for Generalizable and Fair Facial Action Unit Detection

    Authors: Liupei Lu, Yufeng Yin, Yuming Gu, Yizhen Wu, Pratusha Prasad, Yajie Zhao, Mohammad Soleymani

    Abstract: Facial action unit (AU) detection is a fundamental block for objective facial expression analysis. Supervised learning approaches require a large amount of manual labeling which is costly. The limited labeled data are also not diverse in terms of gender which can affect model fairness. In this paper, we propose to use synthetically generated data and multi-source domain adaptation (MSDA) to addres… ▽ More

    Submitted 15 March, 2024; originally announced March 2024.

    Comments: The work was done in 2021

  4. arXiv:2403.09069  [pdf, other

    cs.CV

    Dyadic Interaction Modeling for Social Behavior Generation

    Authors: Minh Tran, Di Chang, Maksim Siniukov, Mohammad Soleymani

    Abstract: Human-human communication is like a delicate dance where listeners and speakers concurrently interact to maintain conversational dynamics. Hence, an effective model for generating listener nonverbal behaviors requires understanding the dyadic context and interaction. In this paper, we present an effective framework for creating 3D facial motions in dyadic interactions. Existing work consider a lis… ▽ More

    Submitted 26 March, 2024; v1 submitted 13 March, 2024; originally announced March 2024.

  5. arXiv:2402.16434  [pdf, other

    cs.IT eess.SP

    Optimization of the Downlink Spectral- and Energy-Efficiency of RIS-aided Multi-user URLLC MIMO Systems

    Authors: Mohammad Soleymani, Ignacio Santamaria, Eduard Jorswieck, Robert Schober, Lajos Hanzo

    Abstract: Modern wireless communication systems are expected to provide improved latency and reliability. To meet these expectations, a short packet length is needed, which makes the first-order Shannon rate an inaccurate performance metric for such communication systems. A more accurate approximation of the achievable rates of finite-block-length (FBL) coding regimes is known as the normal approximation (N… ▽ More

    Submitted 26 February, 2024; originally announced February 2024.

  6. arXiv:2402.15513  [pdf, other

    cs.MM cs.LG eess.SP physics.med-ph

    Investigating the Generalizability of Physiological Characteristics of Anxiety

    Authors: Emily Zhou, Mohammad Soleymani, Maja J. Matarić

    Abstract: Recent works have demonstrated the effectiveness of machine learning (ML) techniques in detecting anxiety and stress using physiological signals, but it is unclear whether ML models are learning physiological features specific to stress. To address this ambiguity, we evaluated the generalizability of physiological features that have been shown to be correlated with anxiety and stress to high-arous… ▽ More

    Submitted 23 January, 2024; originally announced February 2024.

    Journal ref: 2023 IEEE International Conference on Bioinformatics and Biomedicine (BIBM), 2023, pp. 4848-4855

  7. arXiv:2402.01647  [pdf, other

    cs.CY cs.AI cs.HC cs.LG cs.RO

    Build Your Own Robot Friend: An Open-Source Learning Module for Accessible and Engaging AI Education

    Authors: Zhonghao Shi, Allison O'Connell, Zongjian Li, Siqi Liu, Jennifer Ayissi, Guy Hoffman, Mohammad Soleymani, Maja J. Matarić

    Abstract: As artificial intelligence (AI) is playing an increasingly important role in our society and global economy, AI education and literacy have become necessary components in college and K-12 education to prepare students for an AI-powered society. However, current AI curricula have not yet been made accessible and engaging enough for students and schools from all socio-economic backgrounds with diffe… ▽ More

    Submitted 6 January, 2024; originally announced February 2024.

    Comments: Accepted to the Proceedings of the AAAI Conference on Artificial Intelligence (2024)

  8. arXiv:2401.11921  [pdf, other

    cs.IT eess.SP

    Maximizing Spectral and Energy Efficiency in Multi-user MIMO OFDM Systems with RIS and Hardware Impairment

    Authors: Mohammad Soleymani, Ignacio Santamaria, Aydin Sezgin, Eduard Jorswieck

    Abstract: An emerging technology to enhance the spectral efficiency (SE) and energy efficiency (EE) of wireless communication systems is reconfigurable intelligent surface (RIS), which is shown to be very powerful in single-carrier systems. However, in multi-user orthogonal frequency division multiplexing (OFDM) systems, RIS may not be as promising as in single-carrier systems since an independent optimizat… ▽ More

    Submitted 22 January, 2024; originally announced January 2024.

  9. arXiv:2311.12052  [pdf, other

    cs.CV

    MagicPose: Realistic Human Poses and Facial Expressions Retargeting with Identity-aware Diffusion

    Authors: Di Chang, Yichun Shi, Quankai Gao, Jessica Fu, Hongyi Xu, Guoxian Song, Qing Yan, Yizhe Zhu, Xiao Yang, Mohammad Soleymani

    Abstract: In this work, we propose MagicPose, a diffusion-based model for 2D human pose and facial expression retargeting. Specifically, given a reference image, we aim to generate a person's new images by controlling the poses and facial expressions while keeping the identity unchanged. To this end, we propose a two-stage training strategy to disentangle human motions and appearance (e.g., facial expressio… ▽ More

    Submitted 5 May, 2024; v1 submitted 18 November, 2023; originally announced November 2023.

    Comments: Accepted by ICML 2024. MagicPose and MagicDance are the same project. Website:https://boese0601.github.io/magicdance/ Code:https://github.com/Boese0601/MagicDance

  10. arXiv:2310.08289  [pdf, other

    cs.IT eess.SP

    Maximization of minimum rate in MIMO OFDM RIS-assisted Broadcast Channels

    Authors: Mohammad Soleymani, Ignacio Santamaria, Aydin Sezgin, Eduard Jorswieck

    Abstract: Reconfigurable intelligent surface (RIS) is a promising technology to enhance the spectral efficiency of wireless communication systems. By optimizing the RIS elements, the performance of the overall system can be improved. Yet, in contrast to single-carrier systems, in multi-carrier systems, it is not possible to independently optimize RIS elements at each sub-carrier, which may reduce the benefi… ▽ More

    Submitted 12 October, 2023; originally announced October 2023.

    Comments: Accepted at IEEE CAMSAP 2023

  11. arXiv:2309.02418  [pdf, other

    eess.AS cs.SD eess.SP

    Personalized Adaptation with Pre-trained Speech Encoders for Continuous Emotion Recognition

    Authors: Minh Tran, Yufeng Yin, Mohammad Soleymani

    Abstract: There are individual differences in expressive behaviors driven by cultural norms and personality. This between-person variation can result in reduced emotion recognition performance. Therefore, personalization is an important step in improving the generalization and robustness of speech emotion recognition. In this paper, to achieve unsupervised personalized emotion recognition, we first pre-trai… ▽ More

    Submitted 5 September, 2023; originally announced September 2023.

    Comments: Accepted by INTERSPEECH 2023

  12. arXiv:2308.12544  [pdf, other

    cs.DC cs.CR cs.IT

    Analog Multi-Party Computing: Locally Differential Private Protocols for Collaborative Computations

    Authors: Hsuan-Po Liu, Mahdi Soleymani, Hessam Mahdavifar

    Abstract: We consider a fully-decentralized scenario in which no central trusted entity exists and all clients are honest-but-curious. The state-of-the-art approaches to this problem often rely on cryptographic protocols, such as multiparty computation (MPC), that require mapping real-valued data to a discrete alphabet, specifically a finite field. These approaches, however, can result in substantial accura… ▽ More

    Submitted 18 October, 2023; v1 submitted 24 August, 2023; originally announced August 2023.

  13. arXiv:2308.12380  [pdf, other

    cs.CV

    FG-Net: Facial Action Unit Detection with Generalizable Pyramidal Features

    Authors: Yufeng Yin, Di Chang, Guoxian Song, Shen Sang, Tiancheng Zhi, Jing Liu, Linjie Luo, Mohammad Soleymani

    Abstract: Automatic detection of facial Action Units (AUs) allows for objective facial expression analysis. Due to the high cost of AU labeling and the limited size of existing benchmarks, previous AU detection methods tend to overfit the dataset, resulting in a significant performance loss when evaluated across corpora. To address this problem, we propose FG-Net for generalizable facial action unit detecti… ▽ More

    Submitted 23 August, 2023; originally announced August 2023.

  14. arXiv:2308.11078  [pdf, other

    cs.IT

    Matrix Completion over Finite Fields: Bounds and Belief Propagation Algorithms

    Authors: Mahdi Soleymani, Qiang Liu, Hessam Mahdavifar, Laura Balzano

    Abstract: We consider the low rank matrix completion problem over finite fields. This problem has been extensively studied in the domain of real/complex numbers, however, to the best of authors' knowledge, there exists merely one efficient algorithm to tackle the problem in the binary field, due to Saunderson et al. [1]. In this paper, we improve upon the theoretical guarantees for the algorithm provided in… ▽ More

    Submitted 21 August, 2023; originally announced August 2023.

  15. arXiv:2308.10713  [pdf, other

    cs.CV

    LibreFace: An Open-Source Toolkit for Deep Facial Expression Analysis

    Authors: Di Chang, Yufeng Yin, Zongjian Li, Minh Tran, Mohammad Soleymani

    Abstract: Facial expression analysis is an important tool for human-computer interaction. In this paper, we introduce LibreFace, an open-source toolkit for facial expression analysis. This open-source toolbox offers real-time and offline analysis of facial behavior through deep learning models, including facial action unit (AU) detection, AU intensity estimation, and facial expression recognition. To accomp… ▽ More

    Submitted 23 August, 2023; v1 submitted 17 August, 2023; originally announced August 2023.

    Comments: 10 pages, 5 figures. Accepted by WACV 2024 Round 1. (Application Track) Project Page: https://boese0601.github.io/libreface/

  16. arXiv:2308.02696  [pdf, other

    cs.IT eess.SP

    NOMA-based Improper Signaling for MIMO STAR-RIS-assisted Broadcast Channels with Hardware Impairments

    Authors: Mohammad Soleymani, Ignacio Santamaria, Eduard Jorswieck

    Abstract: This paper proposes schemes to improve the spectral efficiency of a multiple-input multiple-output (MIMO) broadcast channel (BC) with I/Q imbalance (IQI) at transceivers by employing a combination of improper Gaussian signaling (IGS), non-orthogonal multiple access (NOMA) and simultaneously transmit and reflect (STAR) reconfigurable intelligent surface (RIS). When there exists IQI, the output RF s… ▽ More

    Submitted 4 August, 2023; originally announced August 2023.

    Comments: IEEE GLOBECOM 2023

  17. SNR Maximization in Beyond Diagonal RIS-assisted Single and Multiple Antenna Links

    Authors: Ignacio Santamaria, Mohammad Soleymani, Eduard Jorswieck, Jesus Gutierrez

    Abstract: Reconfigurable intelligent surface (RIS) architectures not limited to diagonal phase shift matrices have recently been considered to increase their flexibility in shaping the wireless channel. One of these beyond-diagonal RIS or BD-RIS architectures leads to a unitary and symmetric RIS matrix. In this letter, we consider the problem of maximizing the signal-to-noise ratio (SNR) in single and multi… ▽ More

    Submitted 20 July, 2023; originally announced July 2023.

    Comments: 5 pages, 2 figures

    Journal ref: IEEE Signal Processing Letters, 2023

  18. Optimization of Rate-Splitting Multiple Access in Beyond Diagonal RIS-assisted URLLC Systems

    Authors: Mohammad Soleymani, Ignacio Santamaria, Eduard Jorswieck, Bruno Clerckx

    Abstract: This paper proposes a general optimization framework for rate splitting multiple access (RSMA) in beyond diagonal (BD) reconfigurable intelligent surface (RIS) assisted ultra-reliable low-latency communications (URLLC) systems. This framework can provide a suboptimal solution for a large family of optimization problems in which the objective and/or constraints are linear functions of the rates and… ▽ More

    Submitted 13 October, 2023; v1 submitted 11 July, 2023; originally announced July 2023.

    Comments: Accepted at IEEE Transaction of Wireless Communications

  19. arXiv:2306.01309  [pdf, other

    cs.IT eess.SP

    Energy-efficient Rate Splitting for MIMO STAR-RIS-assisted Broadcast Channels with I/Q Imbalance

    Authors: Mohammad Soleymani, Ignacio Santamaria, Eduard Jorswieck

    Abstract: This paper proposes an energy-efficient scheme for multicell multiple-input, multiple-output (MIMO) simultaneous transmit and reflect (STAR) reconfigurable intelligent surfaces (RIS)-assisted broadcast channels by employing rate splitting (RS) and improper Gaussian signaling (IGS). Regular RISs can only reflect signals. Thus, a regular RIS can assist only when the transmitter and receiver are in t… ▽ More

    Submitted 2 June, 2023; originally announced June 2023.

    Comments: Accepted at the 31st European Signal Processing Conference (EUSIPCO 2023)

  20. arXiv:2303.10590  [pdf, other

    cs.CV

    Multi-modal Facial Action Unit Detection with Large Pre-trained Models for the 5th Competition on Affective Behavior Analysis in-the-wild

    Authors: Yufeng Yin, Minh Tran, Di Chang, Xinrui Wang, Mohammad Soleymani

    Abstract: Facial action unit detection has emerged as an important task within facial expression analysis, aimed at detecting specific pre-defined, objective facial expressions, such as lip tightening and cheek raising. This paper presents our submission to the Affective Behavior Analysis in-the-wild (ABAW) 2023 Competition for AU detection. We propose a multi-modal method for facial action unit detection w… ▽ More

    Submitted 17 April, 2023; v1 submitted 19 March, 2023; originally announced March 2023.

    Comments: 8 pages, 7 figures, 5 tables

  21. arXiv:2303.03014  [pdf, other

    cs.IT eess.SP

    Interference Leakage Minimization in RIS-assisted MIMO Interference Channels

    Authors: Ignacio Santamaria, Mohammad Soleymani, Eduard Jorswieck, Jesus Gutierrez

    Abstract: We address the problem of interference leakage (IL) minimization in the $K$-user multiple-input multiple-output (MIMO) interference channel (IC) assisted by a reconfigurable intelligent surface (RIS). We describe an iterative algorithm based on block coordinate descent to minimize the IL cost function. A reformulation of the problem provides a geometric interpretation and shows interesting connect… ▽ More

    Submitted 6 March, 2023; originally announced March 2023.

    Comments: Accepted at ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

  22. arXiv:2301.00594  [pdf, other

    cs.IT eess.SP

    Rate Region of MIMO RIS-assisted Broadcast Channels with Rate Splitting and Improper Signaling

    Authors: Mohammad Soleymani, Ignacio Santamaria, Eduard Jorswieck

    Abstract: In this paper, we study the achievable rate region of 1-layer rate splitting (RS) in the presence of hardware impairment (HWI) and improper Gaussian signaling (IGS) for a single-cell reconfigurable intelligent surface (RIS) assisted broadcast channel (BC). We assume that the transceivers may suffer from an imbalance in in-band and quadrature signals, which is known as I/Q imbalance (IQI). The rece… ▽ More

    Submitted 2 January, 2023; originally announced January 2023.

  23. Spectral and Energy Efficiency Maximization of MISO STAR-RIS-assisted URLLC Systems

    Authors: Mohammad Soleymani, Ignacio Santamaria, Eduard Jorswieck

    Abstract: This paper proposes a general optimization framework to improve the spectral and energy efficiency (EE) of ultra-reliable low-latency communication (URLLC) simultaneous-transfer-and-receive (STAR) reconfigurable intelligent surface (RIS)-assisted interference-limited systems with finite block length (FBL). This framework can solve a large variety of optimization problems in which the objective and… ▽ More

    Submitted 10 July, 2023; v1 submitted 2 January, 2023; originally announced January 2023.

    Comments: Accepted at IEEE ACCESS

  24. arXiv:2210.15760  [pdf

    cs.CV

    Towards Improving Workers' Safety and Progress Monitoring of Construction Sites Through Construction Site Understanding

    Authors: Mahdi Bonyani, Maryam Soleymani

    Abstract: An important component of computer vision research is object detection. In recent years, there has been tremendous progress in the study of construction site images. However, there are obvious problems in construction object detection, including complex backgrounds, varying-sized objects, and poor imaging quality. In the state-of-the-art approaches, elaborate attention mechanisms are developed to… ▽ More

    Submitted 27 October, 2022; originally announced October 2022.

  25. Rate Splitting in MIMO RIS-assisted Systems with Hardware Impairments and Improper Signaling

    Authors: Mohammad Soleymani, Ignacio Santamaria, Eduard Jorswieck

    Abstract: In this paper, we propose an optimization framework for rate splitting (RS) techniques in multiple-input multiple-output (MIMO) reconfigurable intelligent surface (RIS)-assisted systems, possibly with I/Q imbalance (IQI). This framework can be applied to any optimization problem in which the objective and/or constraints are linear functions of the rates and/or transmit covariance matrices. Such pr… ▽ More

    Submitted 15 November, 2022; v1 submitted 18 August, 2022; originally announced August 2022.

    Comments: accepted at IEEE Transaction on Vehicular Technology

  26. arXiv:2208.08087  [pdf

    cs.LG cs.CV

    Autonomous Resource Management in Construction Companies Using Deep Reinforcement Learning Based on IoT

    Authors: Maryam Soleymani, Mahdi Bonyani, Meghdad Attarzadeh

    Abstract: Resource allocation is one of the most critical issues in planning construction projects, due to its direct impact on cost, time, and quality. There are usually specific allocation methods for autonomous resource management according to the projects objectives. However, integrated planning and optimization of utilizing resources in an entire construction organization are scarce. The purpose of thi… ▽ More

    Submitted 6 September, 2022; v1 submitted 17 August, 2022; originally announced August 2022.

  27. NOMA-based Improper Signaling for Multicell MISO RIS-assisted Broadcast Channels

    Authors: Mohammad Soleymani, Ignacio Santamaria, Eduard Jorswieck, Sepehr Rezvani

    Abstract: In this paper, we study the performance of reconfigurable intelligent surfaces (RISs) in a multicell broadcast channel (BC) that employs improper Gaussian signaling (IGS) jointly with non-orthogonal multiple access (NOMA) to optimize either the minimum-weighted rate or the energy efficiency (EE) of the network. We show that although the RIS can significantly improve the system performance, it cann… ▽ More

    Submitted 15 March, 2023; v1 submitted 8 June, 2022; originally announced June 2022.

    Comments: Accepted at IEEE Transactions on Signal Processing

  28. arXiv:2203.14171  [pdf, other

    eess.AS cs.CR cs.SD

    A Speech Representation Anonymization Framework via Selective Noise Perturbation

    Authors: Minh Tran, Mohammad Soleymani

    Abstract: Privacy and security are major concerns when communicating speech signals to cloud services such as automatic speech recognition (ASR) and speech emotion recognition (SER). Existing solutions for speech anonymization mainly focus on voice conversion or voice modification to convert a raw utterance into another one with similar content but different, or no, identity-related information. However, an… ▽ More

    Submitted 27 October, 2022; v1 submitted 26 March, 2022; originally announced March 2022.

  29. arXiv:2202.09914  [pdf, other

    cs.LG

    SOInter: A Novel Deep Energy Based Interpretation Method for Explaining Structured Output Models

    Authors: S. Fatemeh Seyyedsalehi, Mahdieh Soleymani, Hamid R. Rabiee

    Abstract: We propose a novel interpretation technique to explain the behavior of structured output models, which learn mappings between an input vector to a set of output variables simultaneously. Because of the complex relationship between the computational path of output variables in structured models, a feature can affect the value of output through other ones. We focus on one of the outputs as the targe… ▽ More

    Submitted 20 February, 2022; originally announced February 2022.

  30. arXiv:2201.09165  [pdf, other

    cs.MM cs.CV cs.SD eess.AS

    A Pre-trained Audio-Visual Transformer for Emotion Recognition

    Authors: Minh Tran, Mohammad Soleymani

    Abstract: In this paper, we introduce a pretrained audio-visual Transformer trained on more than 500k utterances from nearly 4000 celebrities from the VoxCeleb2 dataset for human behavior understanding. The model aims to capture and extract useful information from the interactions between human facial and auditory behaviors, with application in emotion recognition. We evaluate the model performance on two d… ▽ More

    Submitted 22 January, 2022; originally announced January 2022.

    Comments: Accepted by IEEE ICASSP 2022

  31. arXiv:2109.09868  [pdf, other

    cs.LG cs.DC cs.IT

    ApproxIFER: A Model-Agnostic Approach to Resilient and Robust Prediction Serving Systems

    Authors: Mahdi Soleymani, Ramy E. Ali, Hessam Mahdavifar, A. Salman Avestimehr

    Abstract: Due to the surge of cloud-assisted AI services, the problem of designing resilient prediction serving systems that can effectively cope with stragglers/failures and minimize response delays has attracted much interest. The common approach for tackling this problem is replication which assigns the same prediction task to multiple workers. This approach, however, is very inefficient and incurs signi… ▽ More

    Submitted 20 September, 2021; originally announced September 2021.

  32. arXiv:2109.05056  [pdf, other

    cs.CL cs.SD eess.AS

    Speaker Turn Modeling for Dialogue Act Classification

    Authors: Zihao He, Leili Tavabi, Kristina Lerman, Mohammad Soleymani

    Abstract: Dialogue Act (DA) classification is the task of classifying utterances with respect to the function they serve in a dialogue. Existing approaches to DA classification model utterances without incorporating the turn changes among speakers throughout the dialogue, therefore treating it no different than non-interactive written text. In this paper, we propose to integrate the turn changes in conversa… ▽ More

    Submitted 10 September, 2021; originally announced September 2021.

  33. arXiv:2108.09934  [pdf, other

    cs.CV

    Modeling Dynamics of Facial Behavior for Mental Health Assessment

    Authors: Minh Tran, Ellen Bradley, Michelle Matvey, Joshua Woolley, Mohammad Soleymani

    Abstract: Facial action unit (FAU) intensities are popular descriptors for the analysis of facial behavior. However, FAUs are sparsely represented when only a few are activated at a time. In this study, we explore the possibility of representing the dynamics of facial expressions by adopting algorithms used for word representation in natural language processing. Specifically, we perform clustering on a larg… ▽ More

    Submitted 23 August, 2021; originally announced August 2021.

    Comments: Accepted to FG 2021

  34. arXiv:2108.09527  [pdf

    cs.CV

    Construction material classification on imbalanced datasets using Vision Transformer (ViT) architecture

    Authors: Maryam Soleymani, Mahdi Bonyani, Hadi Mahami, Farnad Nasirzadeh

    Abstract: This research proposes a reliable model for identifying different construction materials with the highest accuracy, which is exploited as an advantageous tool for a wide range of construction applications such as automated progress monitoring. In this study, a novel deep learning architecture called Vision Transformer (ViT) is used for detecting and classifying construction materials. The robustne… ▽ More

    Submitted 6 September, 2022; v1 submitted 21 August, 2021; originally announced August 2021.

    Comments: 18 pages, 11 figures, 7 tables

  35. V2X in 3GPP Standardization: NR Sidelink in Rel-16 and Beyond

    Authors: Mehdi Harounabadi, Dariush Mohammad Soleymani, Shubhangi Bhadauria, Martin Leyh, Elke Roth-Mandutz

    Abstract: The 5G mobile network brings several new features that can be applied to existing and new applications. High reliability, low latency, and high data rate are some of the features which fulfill the requirements of vehicular networks. Vehicular networks aim to provide safety for road users and several additional advantages such as enhanced traffic efficiency and in-vehicle infotainment services. Thi… ▽ More

    Submitted 22 April, 2021; originally announced April 2021.

  36. arXiv:2103.01503  [pdf, other

    cs.IT cs.DC

    Coded Computing via Binary Linear Codes: Designs and Performance Limits

    Authors: Mahdi Soleymani, Mohammad Vahid Jamali, Hessam Mahdavifar

    Abstract: We consider the problem of coded distributed computing where a large linear computational job, such as a matrix multiplication, is divided into $k$ smaller tasks, encoded using an $(n,k)$ linear code, and performed over $n$ distributed nodes. The goal is to reduce the average execution time of the computational job. We provide a connection between the problem of characterizing the average executio… ▽ More

    Submitted 4 October, 2021; v1 submitted 2 March, 2021; originally announced March 2021.

    Comments: Accepted for publication in IEEE Journal on Selected Areas in Information Theory. arXiv admin note: substantial text overlap with arXiv:1906.10105

  37. arXiv:2101.11653  [pdf, other

    cs.IT cs.DC cs.LG

    List-Decodable Coded Computing: Breaking the Adversarial Toleration Barrier

    Authors: Mahdi Soleymani, Ramy E. Ali, Hessam Mahdavifar, A. Salman Avestimehr

    Abstract: We consider the problem of coded computing, where a computational task is performed in a distributed fashion in the presence of adversarial workers. We propose techniques to break the adversarial toleration threshold barrier previously known in coded computing. More specifically, we leverage list-decoding techniques for folded Reed-Solomon codes and propose novel algorithms to recover the correct… ▽ More

    Submitted 19 August, 2021; v1 submitted 27 January, 2021; originally announced January 2021.

  38. arXiv:2011.00403  [pdf, other

    cs.CL cs.LG cs.SI

    Towards A Friendly Online Community: An Unsupervised Style Transfer Framework for Profanity Redaction

    Authors: Minh Tran, Yipeng Zhang, Mohammad Soleymani

    Abstract: Offensive and abusive language is a pressing problem on social media platforms. In this work, we propose a method for transforming offensive comments, statements containing profanity or offensive language, into non-offensive ones. We design a RETRIEVE, GENERATE and EDIT unsupervised style transfer pipeline to redact the offensive comments in a word-restricted manner while maintaining a high level… ▽ More

    Submitted 31 October, 2020; originally announced November 2020.

    Comments: COLING 2020

  39. arXiv:2008.08565  [pdf, other

    cs.IT cs.DC cs.LG

    Analog Lagrange Coded Computing

    Authors: Mahdi Soleymani, Hessam Mahdavifar, A. Salman Avestimehr

    Abstract: A distributed computing scenario is considered, where the computational power of a set of worker nodes is used to perform a certain computation task over a dataset that is dispersed among the workers. Lagrange coded computing (LCC), proposed by Yu et al., leverages the well-known Lagrange polynomial to perform polynomial evaluation of the dataset in such a scenario in an efficient parallel fashion… ▽ More

    Submitted 29 January, 2021; v1 submitted 19 August, 2020; originally announced August 2020.

  40. arXiv:2007.08803  [pdf, other

    cs.LG cs.CR cs.DC cs.IT stat.ML

    Privacy-Preserving Distributed Learning in the Analog Domain

    Authors: Mahdi Soleymani, Hessam Mahdavifar, A. Salman Avestimehr

    Abstract: We consider the critical problem of distributed learning over data while keeping it private from the computational servers. The state-of-the-art approaches to this problem rely on quantizing the data into a finite field, so that the cryptographic approaches for secure multiparty computing can then be employed. These approaches, however, can result in substantial accuracy losses due to fixed-point… ▽ More

    Submitted 17 July, 2020; originally announced July 2020.

  41. arXiv:2001.10403  [pdf, ps, other

    eess.SP cs.IT

    Improper Gaussian Signaling for the $K$-user MIMO Interference Channels with Hardware Impairments

    Authors: Mohammad Soleymani, Ignacio Santamaria, Peter J. Schreier

    Abstract: This paper investigates the performance of improper Gaussian signaling (IGS) for the $K$-user multiple-input, multiple-output (MIMO) interference channel (IC) with hardware impairments (HWI). HWI may arise due to imperfections in the devices like I/Q imbalance, phase noise, etc. With I/Q imbalance, the received signal is a widely linear transformation of the transmitted signal and noise. Thus, the… ▽ More

    Submitted 6 August, 2020; v1 submitted 28 January, 2020; originally announced January 2020.

    Comments: accepted

    Journal ref: Transaction on Vehicular Technology 2020

  42. arXiv:1911.05609  [pdf, other

    cs.MM cs.CV

    Affective Computing for Large-Scale Heterogeneous Multimedia Data: A Survey

    Authors: Sicheng Zhao, Shangfei Wang, Mohammad Soleymani, Dhiraj Joshi, Qiang Ji

    Abstract: The wide popularity of digital photography and social networks has generated a rapidly growing volume of multimedia data (i.e., image, music, and video), resulting in a great demand for managing, retrieving, and understanding these data. Affective computing (AC) of these data can help to understand human behaviors and enable wide applications. In this article, we survey the state-of-the-art AC tec… ▽ More

    Submitted 3 October, 2019; originally announced November 2019.

    Comments: Accepted by ACM TOMM

  43. arXiv:1909.07533  [pdf, other

    cs.IT

    Analog Subspace Coding: A New Approach to Coding for Non-Coherent Wireless Networks

    Authors: Mahdi Soleymani, Hessam Mahdavifar

    Abstract: We provide a novel framework to study subspace codes for non-coherent communications in wireless networks. To this end, an analog operator channel is defined with inputs and outputs being subspaces of $\mathbb{C}^n$. Then a certain distance is defined to capture the performance of subspace codes in terms of their capability to recover from interference and rank-deficiency of the network. We also s… ▽ More

    Submitted 28 January, 2022; v1 submitted 16 September, 2019; originally announced September 2019.

  44. arXiv:1907.11510  [pdf, ps, other

    cs.HC cs.CV cs.IR cs.LG stat.ML

    AVEC 2019 Workshop and Challenge: State-of-Mind, Detecting Depression with AI, and Cross-Cultural Affect Recognition

    Authors: Fabien Ringeval, Björn Schuller, Michel Valstar, NIcholas Cummins, Roddy Cowie, Leili Tavabi, Maximilian Schmitt, Sina Alisamir, Shahin Amiriparian, Eva-Maria Messner, Siyang Song, Shuo Liu, Ziping Zhao, Adria Mallol-Ragolta, Zhao Ren, Mohammad Soleymani, Maja Pantic

    Abstract: The Audio/Visual Emotion Challenge and Workshop (AVEC 2019) "State-of-Mind, Detecting Depression with AI, and Cross-cultural Affect Recognition" is the ninth competition event aimed at the comparison of multimedia processing and machine learning methods for automatic audiovisual health and emotion analysis, with all participants competing strictly under the same conditions. The goal of the Challen… ▽ More

    Submitted 10 July, 2019; originally announced July 2019.

  45. arXiv:1906.10105  [pdf, ps, other

    cs.IT cs.DC

    Coded Distributed Computing: Performance Limits and Code Designs

    Authors: Mohammad Vahid Jamali, Mahdi Soleymani, Hessam Mahdavifar

    Abstract: We consider the problem of coded distributed computing where a large linear computational job, such as a matrix multiplication, is divided into $k$ smaller tasks, encoded using an $(n,k)$ linear code, and performed over $n$ distributed nodes. The goal is to reduce the average execution time of the computational job. We provide a connection between the problem of characterizing the average executio… ▽ More

    Submitted 24 June, 2019; originally announced June 2019.

  46. arXiv:1906.04402  [pdf, other

    cs.CV

    Polysemous Visual-Semantic Embedding for Cross-Modal Retrieval

    Authors: Yale Song, Mohammad Soleymani

    Abstract: Visual-semantic embedding aims to find a shared latent space where related visual and textual instances are close to each other. Most current methods learn injective embedding functions that map an instance to a single point in the shared space. Unfortunately, injective embedding cannot effectively handle polysemous instances with multiple possible meanings; at best, it would find an average repre… ▽ More

    Submitted 17 July, 2019; v1 submitted 11 June, 2019; originally announced June 2019.

    Comments: CVPR 2019. Includes supplementary material. Have updated results on TGIF and MRW

  47. arXiv:1806.04903  [pdf, other

    cs.SD eess.AS

    A data-driven approach to mid-level perceptual musical feature modeling

    Authors: Anna Aljanaki, Mohammad Soleymani

    Abstract: Musical features and descriptors could be coarsely divided into three levels of complexity. The bottom level contains the basic building blocks of music, e.g., chords, beats and timbre. The middle level contains concepts that emerge from combining the basic blocks: tonal and rhythmic stability, harmonic and rhythmic complexity, etc. High-level descriptors (genre, mood, expressive style) are usuall… ▽ More

    Submitted 13 June, 2018; originally announced June 2018.

    Comments: 7 pages, ISMIR conference paper

  48. arXiv:1804.04318  [pdf, other

    cs.CV cs.AI cs.MM

    Cross-Modal Retrieval with Implicit Concept Association

    Authors: Yale Song, Mohammad Soleymani

    Abstract: Traditional cross-modal retrieval assumes explicit association of concepts across modalities, where there is no ambiguity in how the concepts are linked to each other, e.g., when we do the image search with a query "dogs", we expect to see dog images. In this paper, we consider a different setting for cross-modal retrieval where data from different modalities are implicitly linked via concepts tha… ▽ More

    Submitted 25 April, 2018; v1 submitted 12 April, 2018; originally announced April 2018.

  49. arXiv:1801.04384  [pdf, other

    cs.IT

    Distributed Multi-User Secret Sharing

    Authors: Mahdi Soleymani, Hessam Mahdavifar

    Abstract: We consider a distributed secret sharing system that consists of a dealer, $n$ storage nodes, and $m$ users. Each user is given access to a certain subset of storage nodes, where it can download the stored data. The dealer wants to securely convey a specific secret $s_j$ to user $j$ via storage nodes, for $j=1,2,...,m$. More specifically, two secrecy conditions are considered in this multi-user co… ▽ More

    Submitted 29 September, 2020; v1 submitted 13 January, 2018; originally announced January 2018.

  50. arXiv:1609.09761  [pdf, other

    cs.HC

    Detecting Cognitive Appraisals from Facial Expressions for Interest Recognition

    Authors: Mohammad Soleymani

    Abstract: Interest makes one hold her attention on the object of interest. Automatic recognition of interest has numerous applications in human-computer interaction. In this paper, we study the facial expressions associated with interest and its underlying and closely related components, namely, curiosity, coping potential, novelty and complexity. To this end, we conducted an experiment in which participant… ▽ More

    Submitted 10 October, 2016; v1 submitted 30 September, 2016; originally announced September 2016.

    Comments: 6 pages, discussions and analysis were added. More results are also added and discussed