subscribe to arXiv mailings

Navigating Efficiency in MobileViT through Gaussian Process on Global Architecture Factors

Abstract: Numerous techniques have been meticulously designed to achieve optimal architectures for convolutional neural networks (CNNs), yet a comparable focus on vision transformers (ViTs) has been somewhat lacking. Despite the remarkable success of ViTs in various vision tasks, their heavyweight nature presents challenges of computational costs. In this paper, we leverage the Gaussian process to systemati… ▽ More Numerous techniques have been meticulously designed to achieve optimal architectures for convolutional neural networks (CNNs), yet a comparable focus on vision transformers (ViTs) has been somewhat lacking. Despite the remarkable success of ViTs in various vision tasks, their heavyweight nature presents challenges of computational costs. In this paper, we leverage the Gaussian process to systematically explore the nonlinear and uncertain relationship between performance and global architecture factors of MobileViT, such as resolution, width, and depth including the depth of in-verted residual blocks and the depth of ViT blocks, and joint factors including resolution-depth and resolution-width. We present design principles twisting magic 4D cube of the global architecture factors that minimize model sizes and computational costs with higher model accuracy. We introduce a formula for downsizing architectures by iteratively deriving smaller MobileViT V2, all while adhering to a specified constraint of multiply-accumulate operations (MACs). Experiment results show that our formula significantly outperforms CNNs and mobile ViTs across diversified datasets △ Less

Submitted 7 June, 2024; originally announced June 2024.

arXiv:2405.05549 [pdf, other]

Intelligent Reflecting Surface Aided AirComp: Multi-Timescale Design and Performance Analysis

Authors: Guangji Chen, Jun Li, Qingqing Wu, Meng Hua, Kaitao Meng, Zhonghao Lyu

Abstract: The integration of intelligent reflecting surface (IRS) into over-the-air computation (AirComp) is an effective solution for reducing the computational mean squared error (MSE) via its high passive beamforming gain. Prior works on IRS aided AirComp generally rely on the full instantaneous channel state information (I-CSI), which is not applicable to large-scale systems due to its heavy signalling… ▽ More The integration of intelligent reflecting surface (IRS) into over-the-air computation (AirComp) is an effective solution for reducing the computational mean squared error (MSE) via its high passive beamforming gain. Prior works on IRS aided AirComp generally rely on the full instantaneous channel state information (I-CSI), which is not applicable to large-scale systems due to its heavy signalling overhead. To address this issue, we propose a novel multi-timescale transmission protocol. In particular, the receive beamforming at the access point (AP) is pre-determined based on the static angle information and the IRS phase-shifts are optimized relying on the long-term statistical CSI. With the obtained AP receive beamforming and IRS phase-shifts, the effective low-dimensional I-CSI is exploited to determine devices' transmit power in each coherence block, thus substantially reducing the signalling overhead. Theoretical analysis unveils that the achievable MSE scales on the order of ${\cal O}\left( {K/\left( {{N^2}M} \right)} \right)$, where $M$, $N$, and $K$ are the number of AP antennas, IRS elements, and devices, respectively. We also prove that the channel-inversion power control is asymptotically optimal for large $N$, which reveals that the full power transmission policy is not needed for lowering the power consumption of energy-limited devices. △ Less

Submitted 9 May, 2024; originally announced May 2024.

Comments: submitted to IEEE Journal for possible publication

arXiv:2404.14835 [pdf, other]

Semi-supervised 2D Human Pose Estimation via Adaptive Keypoint Masking

Authors: Kexin Meng, Ruirui Li, Daguang Jiang

Abstract: Human pose estimation is a fundamental and challenging task in computer vision. Larger-scale and more accurate keypoint annotations, while helpful for improving the accuracy of supervised pose estimation, are often expensive and difficult to obtain. Semi-supervised pose estimation tries to leverage a large amount of unlabeled data to improve model performance, which can alleviate the problem of in… ▽ More Human pose estimation is a fundamental and challenging task in computer vision. Larger-scale and more accurate keypoint annotations, while helpful for improving the accuracy of supervised pose estimation, are often expensive and difficult to obtain. Semi-supervised pose estimation tries to leverage a large amount of unlabeled data to improve model performance, which can alleviate the problem of insufficient labeled samples. The latest semi-supervised learning usually adopts a strong and weak data augmented teacher-student learning framework to deal with the challenge of "Human postural diversity and its long-tailed distribution". Appropriate data augmentation method is one of the key factors affecting the accuracy and generalization of semi-supervised models. Aiming at the problem that the difference of sample learning is not considered in the fixed keypoint masking augmentation method, this paper proposes an adaptive keypoint masking method, which can fully mine the information in the samples and obtain better estimation performance. In order to further improve the generalization and robustness of the model, this paper proposes a dual-branch data augmentation scheme, which can perform Mixup on samples and features on the basis of adaptive keypoint masking. The effectiveness of the proposed method is verified on COCO and MPII, outperforming the state-of-the-art semi-supervised pose estimation by 5.2% and 0.3%, respectively. △ Less

Submitted 23 April, 2024; originally announced April 2024.

Comments: China Multimedia 2023

arXiv:2404.14514 [pdf, other]

Cooperative ISAC Networks: Performance Analysis, Scaling Laws and Optimization

Authors: Kaitao Meng, Christos Masouros, Athina P. Petropulu, Lajos Hanzo

Abstract: Integrated sensing and communication (ISAC) networks are investigated with the objective of effectively balancing the sensing and communication (S&C) performance at the network level. Through the simultaneous utilization of multi-point (CoMP) coordinated joint transmission and distributed multiple-input multiple-output (MIMO) radar techniques, we propose an innovative networked ISAC scheme, where… ▽ More Integrated sensing and communication (ISAC) networks are investigated with the objective of effectively balancing the sensing and communication (S&C) performance at the network level. Through the simultaneous utilization of multi-point (CoMP) coordinated joint transmission and distributed multiple-input multiple-output (MIMO) radar techniques, we propose an innovative networked ISAC scheme, where multiple transceivers are employed for collaboratively enhancing the S&C services. Then, the potent tool of stochastic geometry is exploited for characterizing the S&C performance, which allows us to illuminate the key cooperative dependencies in the ISAC network and optimize salient network-level parameters. Remarkably, the Cramer-Rao lower bound (CRLB) expression of the localization accuracy derived unveils a significant finding: Deploying N ISAC transceivers yields an enhanced average cooperative sensing performance across the entire network, in accordance with the ln^2N scaling law. Crucially, this scaling law is less pronounced in comparison to the performance enhancement of N^2 achieved when the transceivers are equidistant from the target, which is primarily due to the substantial path loss from the distant base stations (BSs) and leads to reduced contributions to sensing performance gain. Moreover, we derive a tight expression of the communication rate, and present a low-complexity algorithm to determine the optimal cooperative cluster size. Based on our expression derived for the S&C performance, we formulate the optimization problem of maximizing the network performance in terms of two joint S&C metrics. To this end, we jointly optimize the cooperative BS cluster sizes and the transmit power to strike a flexible tradeoff between the S&C performance. △ Less

Submitted 11 June, 2024; v1 submitted 22 April, 2024; originally announced April 2024.

Comments: 13 pages, 10 figures, this work has been submitted to IEEE for possible publication. arXiv admin note: text overlap with arXiv:2403.20228

arXiv:2403.20228 [pdf, other]

Cooperative Sensing and Communication for ISAC Networks: Performance Analysis and Optimization

Authors: Kaitao Meng, Christos Masouros

Abstract: In this work, we study integrated sensing and communication (ISAC) networks intending to effectively balance sensing and communication (S&C) performance at the network level. Through the simultaneous utilization of multi-point (CoMP) coordinated joint transmission and distributed multiple-input multiple-output (MIMO) radar techniques, we propose a cooperative networked ISAC scheme to enhance both… ▽ More In this work, we study integrated sensing and communication (ISAC) networks intending to effectively balance sensing and communication (S&C) performance at the network level. Through the simultaneous utilization of multi-point (CoMP) coordinated joint transmission and distributed multiple-input multiple-output (MIMO) radar techniques, we propose a cooperative networked ISAC scheme to enhance both S&C services. Then, the tool of stochastic geometry is exploited to capture the S&C performance, which allows us to illuminate key cooperative dependencies in the ISAC network. Remarkably, the derived expression of the Cramer-Rao lower bound (CRLB) of the localization accuracy unveils a significant finding: Deploying $N$ ISAC transceivers yields an enhanced sensing performance across the entire network, in accordance with the $\ln^2N$ scaling law. Simulation results demonstrate that compared to the time-sharing scheme, the proposed cooperative ISAC scheme can effectively improve the average data rate and reduce the CRLB. △ Less

Submitted 11 June, 2024; v1 submitted 29 March, 2024; originally announced March 2024.

Comments: 7 pages, 5 figures, this paper has been submitted to IEEE for possible publication

arXiv:2403.03771 [pdf, other]

doi 10.1109/TVT.2024.3375027

Joint Sparsity Pattern Learning Based Channel Estimation for Massive MIMO-OTFS Systems

Authors: Kuo Meng, Shaoshi Yang, Xiao-Yang Wang, Yan Bu, Yurong Tang, Jianhua Zhang, Lajos Hanzo

Abstract: We propose a channel estimation scheme based on joint sparsity pattern learning (JSPL) for massive multi-input multi-output (MIMO) orthogonal time-frequency-space (OTFS) modulation aided systems. By exploiting the potential joint sparsity of the delay-Doppler-angle (DDA) domain channel, the channel estimation problem is transformed into a sparse recovery problem. To solve it, we first apply the sp… ▽ More We propose a channel estimation scheme based on joint sparsity pattern learning (JSPL) for massive multi-input multi-output (MIMO) orthogonal time-frequency-space (OTFS) modulation aided systems. By exploiting the potential joint sparsity of the delay-Doppler-angle (DDA) domain channel, the channel estimation problem is transformed into a sparse recovery problem. To solve it, we first apply the spike and slab prior model to iteratively estimate the support set of the channel matrix, and a higher-accuracy parameter update rule relying on the identified support set is introduced into the iteration. Then the specific values of the channel elements corresponding to the support set are estimated by the orthogonal matching pursuit (OMP) method. Both our simulation results and analysis demonstrate that the proposed JSPL channel estimation scheme achieves an improved performance over the representative state-of-the-art baseline schemes, despite its reduced pilot overhead. △ Less

Submitted 6 March, 2024; originally announced March 2024.

Comments: 6 pages, 6 figures, accepted to appear on IEEE Transactions on Vehicular Technology, Mar. 2024

arXiv:2402.18683 [pdf, other]

Integrated Sensing and Communication Meets Smart Propagation Engineering: Opportunities and Challenges

Authors: Kaitao Meng, Christos Masouros, Kai-Kit Wong, Athina P. Petropulu, Lajos Hanzo

Abstract: Both smart propagation engineering as well as integrated sensing and communication (ISAC) constitute promising candidates for next-generation (NG) mobile networks. We provide a synergistic view of these technologies, and explore their mutual benefits. First, moving beyond just intelligent surfaces, we provide a holistic view of the engineering aspects of smart propagation environments. By delving… ▽ More Both smart propagation engineering as well as integrated sensing and communication (ISAC) constitute promising candidates for next-generation (NG) mobile networks. We provide a synergistic view of these technologies, and explore their mutual benefits. First, moving beyond just intelligent surfaces, we provide a holistic view of the engineering aspects of smart propagation environments. By delving into the fundamental characteristics of intelligent surfaces, fluid antennas, and unmanned aerial vehicles, we reveal that more efficient control of the pathloss and fading can be achieved, thus facilitating intrinsic integration and mutual assistance between sensing and communication functionalities. In turn, with the exploitation of the sensing capabilities of ISAC to orchestrate the efficient configuration of radio environments, both the computational effort and signaling overheads can be reduced. We present indicative simulation results, which verify that cooperative smart propagation environment design significantly enhances the ISAC performance. Finally, some promising directions are outlined for combining ISAC with smart propagation engineering. △ Less

Submitted 28 February, 2024; originally announced February 2024.

Comments: 7 pages, 5 figures, submitted to IEEE journal for possible publication

arXiv:2401.04918 [pdf, other]

BS Coordination Optimization in Integrated Sensing and Communication: A Stochastic Geometric View

Authors: Kaitao Meng, Christos Masouros, Guangji Chen, Fan Liu

Abstract: In this study, we explore integrated sensing and communication (ISAC) networks to strike a more effective balance between sensing and communication (S&C) performance at the network scale. We leverage stochastic geometry to analyze the S&C performance, shedding light on critical cooperative dependencies of ISAC networks. According to the derived expressions of network performance, we optimize the u… ▽ More In this study, we explore integrated sensing and communication (ISAC) networks to strike a more effective balance between sensing and communication (S&C) performance at the network scale. We leverage stochastic geometry to analyze the S&C performance, shedding light on critical cooperative dependencies of ISAC networks. According to the derived expressions of network performance, we optimize the user/target loads and the cooperative base station cluster sizes for S&C to achieve a flexible trade-off between network-scale S&C performance. It is observed that the optimal strategy emphasizes the full utilization of spatial resources to enhance multiplexing and diversity gain when maximizing communication ASE. In contrast, for sensing objectives, parts of spatial resources are allocated to cancel inter-cell sensing interference to maximize sensing ASE. Simulation results validate that the proposed ISAC scheme realizes a remarkable enhancement in overall S&C network performance. △ Less

Submitted 9 January, 2024; originally announced January 2024.

Comments: 8 pages, 7 figures, accepted by IEEE WCNC 2024. arXiv admin note: substantial text overlap with arXiv:2311.09052

arXiv:2401.03726 [pdf, other]

doi 10.1109/LCOMM.2024.3379504

UAV-enabled Integrated Sensing and Communication: Tracking Design and Optimization

Authors: Yifan Jiang, Qingqing Wu, Wen Chen, Kaitao Meng

Abstract: Integrated sensing and communications (ISAC) enabled by unmanned aerial vehicles (UAVs) is a promising technology to facilitate target tracking applications. In contrast to conventional UAV-based ISAC system designs that mainly focus on estimating the target position, the target velocity estimation also needs to be considered due to its crucial impacts on link maintenance and real-time response, w… ▽ More Integrated sensing and communications (ISAC) enabled by unmanned aerial vehicles (UAVs) is a promising technology to facilitate target tracking applications. In contrast to conventional UAV-based ISAC system designs that mainly focus on estimating the target position, the target velocity estimation also needs to be considered due to its crucial impacts on link maintenance and real-time response, which requires new designs on resource allocation and tracking scheme. In this paper, we propose an extended Kalman filtering-based tracking scheme for a UAV-enabled ISAC system where a UAV tracks a moving object and also communicates with a device attached to the object. Specifically, a weighted sum of predicted posterior Cramér-Rao bound (PCRB) for object relative position and velocity estimation is minimized by optimizing the UAV trajectory, where an efficient solution is obtained based on the successive convex approximation method. Furthermore, under a special case with the measurement mean square error (MSE), the optimal relative motion state is obtained and proved to keep a fixed elevation angle and zero relative velocity. Numerical results validate that the obtained solution to the predicted PCRB minimization can be approximated by the optimal relative motion state when predicted measurement MSE dominates the predicted PCRBs, as well as the effectiveness of the proposed tracking scheme. Moreover, three interesting trade-offs on system performance resulted from the fixed elevation angle are illustrated. △ Less

Submitted 16 April, 2024; v1 submitted 8 January, 2024; originally announced January 2024.

Comments: 3 figures, 5 pages, Accepted by IEEE Communications Letters

arXiv:2312.12107 [pdf, other]

GraphScope Flex: LEGO-like Graph Computing Stack

Authors: Tao He, Shuxian Hu, Longbin Lai, Dongze Li, Neng Li, Xue Li, Lexiao Liu, Xiaojian Luo, Binqing Lyu, Ke Meng, Sijie Shen, Li Su, Lei Wang, Jingbo Xu, Wenyuan Yu, Weibin Zeng, Lei Zhang, Siyuan Zhang, Jingren Zhou, Xiaoli Zhou, Diwen Zhu

Abstract: Graph computing has become increasingly crucial in processing large-scale graph data, with numerous systems developed for this purpose. Two years ago, we introduced GraphScope as a system addressing a wide array of graph computing needs, including graph traversal, analytics, and learning in one system. Since its inception, GraphScope has achieved significant technological advancements and gained w… ▽ More Graph computing has become increasingly crucial in processing large-scale graph data, with numerous systems developed for this purpose. Two years ago, we introduced GraphScope as a system addressing a wide array of graph computing needs, including graph traversal, analytics, and learning in one system. Since its inception, GraphScope has achieved significant technological advancements and gained widespread adoption across various industries. However, one key lesson from this journey has been understanding the limitations of a "one-size-fits-all" approach, especially when dealing with the diversity of programming interfaces, applications, and data storage formats in graph computing. In response to these challenges, we present GraphScope Flex, the next iteration of GraphScope. GraphScope Flex is designed to be both resource-efficient and cost-effective, while also providing flexibility and user-friendliness through its LEGO-like modularity. This paper explores the architectural innovations and fundamental design principles of GraphScope Flex, all of which are direct outcomes of the lessons learned during our ongoing development process. We validate the adaptability and efficiency of GraphScope Flex with extensive evaluations on synthetic and real-world datasets. The results show that GraphScope Flex achieves 2.4X throughput and up to 55.7X speedup over other systems on the LDBC Social Network and Graphalytics benchmarks, respectively. Furthermore, GraphScope Flex accomplishes up to a 2,400X performance gain in real-world applications, demonstrating its proficiency across a wide range of graph computing scenarios with increased effectiveness. △ Less

Submitted 19 December, 2023; originally announced December 2023.

arXiv:2311.09052 [pdf, other]

Network-Level Integrated Sensing and Communication: Interference Management and BS Coordination Using Stochastic Geometry

Authors: Kaitao Meng, Christos Masouros, Guangji Chen, Fan Liu

Abstract: In this work, we study integrated sensing and communication (ISAC) networks with the aim of effectively balancing sensing and communication (S&C) performance at the network level. Focusing on monostatic sensing, the tool of stochastic geometry is exploited to capture the S&C performance, which facilitates us to illuminate key cooperative dependencies in the ISAC network and optimize key network-le… ▽ More In this work, we study integrated sensing and communication (ISAC) networks with the aim of effectively balancing sensing and communication (S&C) performance at the network level. Focusing on monostatic sensing, the tool of stochastic geometry is exploited to capture the S&C performance, which facilitates us to illuminate key cooperative dependencies in the ISAC network and optimize key network-level parameters. Based on the derived tractable expression of area spectral efficiency (ASE), we formulate the optimization problem to maximize the network performance from the view point of two joint S&C metrics. Towards this end, we further jointly optimize the cooperative BS cluster sizes for S&C and the serving/probing numbers of users/targets to achieve a flexible tradeoff between S&C at the network level. It is verified that interference nulling can effectively improve the average data rate and radar information rate. Surprisingly, the optimal communication tradeoff for the case of the ASE maximization tends to employ all spacial resources towards multiplexing and diversity gain, without interference nulling. By contrast, for the sensing objectives, resource allocation tends to eliminate certain interference especially when the antenna resources are sufficient, because the inter-cell interference becomes a more dominant factor affecting sensing performance. Furthermore, we prove that the ratio of the optimal number of users and the number of transmit antennas is a constant value when the communication performance is optimal. Simulation results demonstrate that the proposed cooperative ISAC scheme achieves a substantial gain in S&C performance at the network level. △ Less

Submitted 15 November, 2023; originally announced November 2023.

Comments: 13 pages, 12 figures. This work has been submitted to the IEEE for possible publication

arXiv:2311.00418 [pdf, other]

Intelligent Surface Empowered Integrated Sensing and Communication: From Coexistence to Reciprocity

Authors: Kaitao Meng, Qingqing Wu, Christos Masouros, Wen Chen, Deshi Li

Abstract: Integrated sensing and communication (ISAC) has attracted growing interests for sixth-generation (6G) and beyond wireless networks. The primary challenges faced by highly efficient ISAC include limited sensing and communication (S&C) coverage, constrained integration gain between S&C under weak channel correlations, and unknown performance boundary. Intelligent reflecting/refracting surfaces (IRSs… ▽ More Integrated sensing and communication (ISAC) has attracted growing interests for sixth-generation (6G) and beyond wireless networks. The primary challenges faced by highly efficient ISAC include limited sensing and communication (S&C) coverage, constrained integration gain between S&C under weak channel correlations, and unknown performance boundary. Intelligent reflecting/refracting surfaces (IRSs) can effectively expand S&C coverage and control the degree of freedom of channels between the transmitters and receivers, thereby realizing increasing integration gains. In this work, we first delve into the fundamental characteristics of IRS-empowered ISAC and innovative IRS-assisted sensing architectures. Then, we discuss various objectives for IRS channel control and deployment optimization in ISAC systems. Furthermore, the interplay between S&C in different deployment strategies is investigated and some promising directions for IRS enhanced ISAC are outlined. △ Less

Submitted 1 November, 2023; originally announced November 2023.

Comments: 8 pages, 4 figures, submitted to IEEE Journal for possible publication

arXiv:2310.15574 [pdf, other]

3D Multi-Target Localization Via Intelligent Reflecting Surface: Protocol and Analysis

Authors: Meng Hua, Guangji Chen, Kaitao Meng, Shaodan Ma, Chau Yuen, Hing Cheung So

Abstract: With the emerging environment-aware applications, ubiquitous sensing is expected to play a key role in future networks. In this paper, we study a 3-dimensional (3D) multi-target localization system where multiple intelligent reflecting surfaces (IRSs) are applied to create virtual line-of-sight (LoS) links that bypass the base station (BS) and targets. To fully unveil the fundamental limit of IRS… ▽ More With the emerging environment-aware applications, ubiquitous sensing is expected to play a key role in future networks. In this paper, we study a 3-dimensional (3D) multi-target localization system where multiple intelligent reflecting surfaces (IRSs) are applied to create virtual line-of-sight (LoS) links that bypass the base station (BS) and targets. To fully unveil the fundamental limit of IRS for sensing, we first study a single-target-single-IRS case and propose a novel \textit{two-stage localization protocol} by controlling the on/off state of IRS. To be specific, in the IRS-off stage, we derive the Cramér-Rao bound (CRB) of the azimuth/elevation direction-of-arrival (DoA) of the BS-target link and design a DoA estimator based on the MUSIC algorithm. In the IRS-on stage, the CRB of the azimuth/elevation DoA of the IRS-target link is derived and a simple DoA estimator based on the on-grid IRS beam scanning method is proposed. Particularly, the impact of echo signals reflected by IRS from different paths on sensing performance is analyzed. Moreover, we prove that the single-beam of the IRS is not capable of sensing, but it can be achieved with \textit{multi-beam}. Based on the two obtained DoAs, the 3D single-target location is constructed. We then extend to the multi-target-multi-IRS case and propose an \textit{IRS-adaptive sensing protocol} by controlling the on/off state of multiple IRSs, and a multi-target localization algorithm is developed. Simulation results demonstrate the effectiveness of our scheme and show that sub-meter-level positioning accuracy can be achieved. △ Less

Submitted 28 February, 2024; v1 submitted 24 October, 2023; originally announced October 2023.

Comments: This paper has been submitted to IEEE journal for possible publication

arXiv:2310.14783 [pdf, other]

Interpretable Deep Reinforcement Learning for Optimizing Heterogeneous Energy Storage Systems

Authors: Luolin Xiong, Yang Tang, Chensheng Liu, Shuai Mao, Ke Meng, Zhaoyang Dong, Feng Qian

Abstract: Energy storage systems (ESS) are pivotal component in the energy market, serving as both energy suppliers and consumers. ESS operators can reap benefits from energy arbitrage by optimizing operations of storage equipment. To further enhance ESS flexibility within the energy market and improve renewable energy utilization, a heterogeneous photovoltaic-ESS (PV-ESS) is proposed, which leverages the u… ▽ More Energy storage systems (ESS) are pivotal component in the energy market, serving as both energy suppliers and consumers. ESS operators can reap benefits from energy arbitrage by optimizing operations of storage equipment. To further enhance ESS flexibility within the energy market and improve renewable energy utilization, a heterogeneous photovoltaic-ESS (PV-ESS) is proposed, which leverages the unique characteristics of battery energy storage (BES) and hydrogen energy storage (HES). For scheduling tasks of the heterogeneous PV-ESS, cost description plays a crucial role in guiding operator's strategies to maximize benefits. We develop a comprehensive cost function that takes into account degradation, capital, and operation/maintenance costs to reflect real-world scenarios. Moreover, while numerous methods excel in optimizing ESS energy arbitrage, they often rely on black-box models with opaque decision-making processes, limiting practical applicability. To overcome this limitation and enable transparent scheduling strategies, a prototype-based policy network with inherent interpretability is introduced. This network employs human-designed prototypes to guide decision-making by comparing similarities between prototypical situations and encountered situations, which allows for naturally explained scheduling strategies. Comparative results across four distinct cases underscore the effectiveness and practicality of our proposed pre-hoc interpretable optimization method when contrasted with black-box models. △ Less

Submitted 19 October, 2023; originally announced October 2023.

arXiv:2308.09124 [pdf, other]

Linearity of Relation Decoding in Transformer Language Models

Authors: Evan Hernandez, Arnab Sen Sharma, Tal Haklay, Kevin Meng, Martin Wattenberg, Jacob Andreas, Yonatan Belinkov, David Bau

Abstract: Much of the knowledge encoded in transformer language models (LMs) may be expressed in terms of relations: relations between words and their synonyms, entities and their attributes, etc. We show that, for a subset of relations, this computation is well-approximated by a single linear transformation on the subject representation. Linear relation representations may be obtained by constructing a fir… ▽ More Much of the knowledge encoded in transformer language models (LMs) may be expressed in terms of relations: relations between words and their synonyms, entities and their attributes, etc. We show that, for a subset of relations, this computation is well-approximated by a single linear transformation on the subject representation. Linear relation representations may be obtained by constructing a first-order approximation to the LM from a single prompt, and they exist for a variety of factual, commonsense, and linguistic relations. However, we also identify many cases in which LM predictions capture relational knowledge accurately, but this knowledge is not linearly encoded in their representations. Our results thus reveal a simple, interpretable, but heterogeneously deployed knowledge representation strategy in transformer LMs. △ Less

Submitted 15 February, 2024; v1 submitted 17 August, 2023; originally announced August 2023.

arXiv:2308.05862 [pdf, other]

Unleashing the Strengths of Unlabeled Data in Pan-cancer Abdominal Organ Quantification: the FLARE22 Challenge

Authors: Jun Ma, Yao Zhang, Song Gu, Cheng Ge, Shihao Ma, Adamo Young, Cheng Zhu, Kangkang Meng, Xin Yang, Ziyan Huang, Fan Zhang, Wentao Liu, YuanKe Pan, Shoujin Huang, Jiacheng Wang, Mingze Sun, Weixin Xu, Dengqiang Jia, Jae Won Choi, Natália Alves, Bram de Wilde, Gregor Koehler, Yajun Wu, Manuel Wiesenfarth, Qiongjie Zhu , et al. (4 additional authors not shown)

Abstract: Quantitative organ assessment is an essential step in automated abdominal disease diagnosis and treatment planning. Artificial intelligence (AI) has shown great potential to automatize this process. However, most existing AI algorithms rely on many expert annotations and lack a comprehensive evaluation of accuracy and efficiency in real-world multinational settings. To overcome these limitations,… ▽ More Quantitative organ assessment is an essential step in automated abdominal disease diagnosis and treatment planning. Artificial intelligence (AI) has shown great potential to automatize this process. However, most existing AI algorithms rely on many expert annotations and lack a comprehensive evaluation of accuracy and efficiency in real-world multinational settings. To overcome these limitations, we organized the FLARE 2022 Challenge, the largest abdominal organ analysis challenge to date, to benchmark fast, low-resource, accurate, annotation-efficient, and generalized AI algorithms. We constructed an intercontinental and multinational dataset from more than 50 medical groups, including Computed Tomography (CT) scans with different races, diseases, phases, and manufacturers. We independently validated that a set of AI algorithms achieved a median Dice Similarity Coefficient (DSC) of 90.0\% by using 50 labeled scans and 2000 unlabeled scans, which can significantly reduce annotation requirements. The best-performing algorithms successfully generalized to holdout external validation sets, achieving a median DSC of 89.5\%, 90.9\%, and 88.3\% on North American, European, and Asian cohorts, respectively. They also enabled automatic extraction of key organ biology features, which was labor-intensive with traditional manual measurements. This opens the potential to use unlabeled data to boost performance and alleviate annotation shortages for modern AI models. △ Less

Submitted 10 August, 2023; originally announced August 2023.

Comments: MICCAI FLARE22: https://flare22.grand-challenge.org/

arXiv:2307.15434 [pdf, other]

doi 10.1109/TCOMM.2023.3349158

Cooperative Cellular Localization with Intelligent Reflecting Surface: Design, Analysis and Optimization

Authors: Kaitao Meng, Qingqing Wu, Wen Chen, Deshi Li

Abstract: Autonomous driving and intelligent transportation applications have dramatically increased the demand for high-accuracy and low-latency localization services. While cellular networks are potentially capable of target detection and localization, achieving accurate and reliable positioning faces critical challenges. Particularly, the relatively small radar cross sections (RCS) of moving targets and… ▽ More Autonomous driving and intelligent transportation applications have dramatically increased the demand for high-accuracy and low-latency localization services. While cellular networks are potentially capable of target detection and localization, achieving accurate and reliable positioning faces critical challenges. Particularly, the relatively small radar cross sections (RCS) of moving targets and the high complexity for measurement association give rise to weak echo signals and discrepancies in the measurements. To tackle this issue, we propose a novel approach for multi-target localization by leveraging the controllable signal reflection capabilities of intelligent reflecting surfaces (IRSs). Specifically, IRSs are strategically mounted on the targets (e.g., vehicles and robots), enabling effective association of multiple measurements and facilitating the localization process. We aim to minimize the maximum Cramér-Rao lower bound (CRLB) of targets by jointly optimizing the target association, the IRS phase shifts, and the dwell time. However, solving this CRLB optimization problem is non-trivial due to the non-convex objective function and closely coupled variables. For single-target localization, a simplified closed-form expression is presented for the case where base stations (BSs) can be deployed flexibly, and the optimal BS location is derived to provide a lower performance bound of the original problem ... △ Less

Submitted 4 January, 2024; v1 submitted 28 July, 2023; originally announced July 2023.

Comments: 14 pages

Journal ref: IEEE Transactions on Communications, 2024

arXiv:2304.11559 [pdf, other]

doi 10.1109/LWC.2023.3270423

Lightweight Machine Learning for Digital Cross-Link Interference Cancellation with RF Chain Characteristics in Flexible Duplex MIMO Systems

Authors: Jing-Sheng Tan, Shaoshi Yang, Kuo Meng, Jianhua Zhang, Yurong Tang, Yan Bu, Guizhen Wang

Abstract: The flexible duplex (FD) technique, including dynamic time-division duplex (D-TDD) and dynamic frequency-division duplex (D-FDD), is regarded as a promising solution to achieving a more flexible uplink/downlink transmission in 5G-Advanced or 6G mobile communication systems. However, it may introduce serious cross-link interference (CLI). For better mitigating the impact of CLI, we first present a… ▽ More The flexible duplex (FD) technique, including dynamic time-division duplex (D-TDD) and dynamic frequency-division duplex (D-FDD), is regarded as a promising solution to achieving a more flexible uplink/downlink transmission in 5G-Advanced or 6G mobile communication systems. However, it may introduce serious cross-link interference (CLI). For better mitigating the impact of CLI, we first present a more realistic base station (BS)-to-BS channel model incorporating the radio frequency (RF) chain characteristics, which exhibit a hardware-dependent nonlinear property, and hence the accuracy of conventional channel modelling is inadequate for CLI cancellation. Then, we propose a channel parameter estimation based polynomial CLI canceller and two machine learning (ML) based CLI cancellers that use the lightweight feedforward neural network (FNN). Our simulation results and analysis show that the ML based CLI cancellers achieve notable performance improvement and dramatic reduction of computational complexity, in comparison with the polynomial CLI canceller. △ Less

Submitted 23 April, 2023; originally announced April 2023.

Comments: 5 pages, 6 figures

arXiv:2212.12942 [pdf, ps, other]

Rethinking Dense Cells for Integrated Sensing and Communications: A Stochastic Geometric View

Authors: Abdelhamid Salem, Kaitao Meng, Christos Masouros, Fan Liu, David López-Pérez

Abstract: The inclusion of the sensing functionality in the coming generations of cellular networks necessitates a rethink of dense cell deployments. In this paper, we analyze and optimize dense cell topologies for dual-functional radar-communication (DFRC) cellular networks. With the aid of tools from stochastic geometry, we derive new analytical expressions of the potential area spectral efficiencies in (… ▽ More The inclusion of the sensing functionality in the coming generations of cellular networks necessitates a rethink of dense cell deployments. In this paper, we analyze and optimize dense cell topologies for dual-functional radar-communication (DFRC) cellular networks. With the aid of tools from stochastic geometry, we derive new analytical expressions of the potential area spectral efficiencies in (bit/sec/m2) of radar and communication systems. Based on the new formulations of the potential area spectral efficiencies, the energy efficiency (bit/Joule) of DFRC systems is provided in a closed-form formula. Then, an optimization problem to obtain the optimal base station (BS) density that maximizes the network-level energy efficiency is formulated and investigated. In this regard, the mathematical expression of the energy efficiency is shown to be a uni-modal and pseudo-concave function in the density of the BSs. Therefore, the optimal density of the BSs that maximizes the energy efficiency can be obtained. Our analytical and numerical results demonstrate that the inclusion of the sensing functionality clearly differentiates the optimal BS topologies for the DFRC systems against classical communication-only systems. △ Less

Submitted 26 August, 2023; v1 submitted 25 December, 2022; originally announced December 2022.

Comments: 30 pages

arXiv:2212.12909 [pdf, other]

doi 10.1109/LCOMM.2023.3279142

Intelligent Surface Empowered Sensing and Communication: A Novel Mutual Assistance Design

Authors: Kaitao Meng, Qingqing Wu, Wen Chen, Enrico Paolini, Elisabetta Matricardi

Abstract: Integrated sensing and communication (ISAC) is a promising paradigm to provide both sensing and communication (S&C) services in vehicular networks. However, the power of echo signals reflected from vehicles may be too weak to be used for future precise positioning, due to the practically small radar cross section of vehicles with random reflection/scattering coefficient. To tackle this issue, we p… ▽ More Integrated sensing and communication (ISAC) is a promising paradigm to provide both sensing and communication (S&C) services in vehicular networks. However, the power of echo signals reflected from vehicles may be too weak to be used for future precise positioning, due to the practically small radar cross section of vehicles with random reflection/scattering coefficient. To tackle this issue, we propose a novel mutual assistance scheme for intelligent surface-mounted vehicles, where S&C are innovatively designed to assist each other for achieving an efficient win-win integration, i.e., sensing-assisted phase shift design and communication-assisted high-precision sensing. Specifically, we first derive closed-form expressions of the echo power and achievable rate under uncertain angle information. Then, the communication rate is maximized while satisfying sensing requirements, which is proved to be a monotonic optimization problem on time allocation. Furthermore, we unveil the feasible condition of the problem and propose a polyblock-based optimal algorithm. Simulation results validate that the performance trade-off bound of S&C is significantly enlarged by the novel design exploiting mutual assistance in intelligent surface-aided vehicular networks. △ Less

Submitted 20 May, 2023; v1 submitted 25 December, 2022; originally announced December 2022.

Comments: 5 pages, 5 figures, accept by IEEE Communications Letters

Journal ref: IEEE Communications Letters, 2023

arXiv:2211.11475 [pdf, other]

Sensing-Assisted Communication in Vehicular Networks with Intelligent Surface

Authors: Kaitao Meng, Qingqing Wu, Wen Chen, Deshi Li

Abstract: The recent development of integrated sensing and communications (ISAC) technology offers new opportunities to meet high-throughput and low-latency communication as well as high-resolution localization requirements in vehicular networks. However, considering the limited transmit power of the road site units (RSUs) and the relatively small radar cross section (RCS) of vehicles with random reflection… ▽ More The recent development of integrated sensing and communications (ISAC) technology offers new opportunities to meet high-throughput and low-latency communication as well as high-resolution localization requirements in vehicular networks. However, considering the limited transmit power of the road site units (RSUs) and the relatively small radar cross section (RCS) of vehicles with random reflection coefficients, the power of echo signals may be too weak to be utilized for effective target detection and tracking. Moreover, high-frequency signals usually suffer from large fading loss when penetrating vehicles, which seriously degrades the quality of communication services inside the vehicles. To handle this issue, we propose a novel sensing-assisted communication mechanism by employing an intelligent omni-surface (IOS) on the surface of vehicles to enhance both sensing and communication (S&C) performance. To this end, we first propose a two-stage ISAC protocol, including the joint S&C stage and the communication-only stage, to fulfill more efficient communication performance improvements benefited from sensing. The achievable communication rate maximization problem is formulated by jointly optimizing the transmit beamforming, the IOS phase shifts, and the duration of the joint S&C stage. However, solving this ISAC optimization problem is highly non-trivial since inaccurate estimation and measurement information renders the achievable rate lack of closed-form expression. To handle this issue, we first derive a closed-form expression of the achievable rate under uncertain location information, and then unveil a sufficient and necessary condition for the existence of the joint S&C stage to offer useful insights for practical system design. Moreover, two typical scenarios including interference-limited and noise-limited cases are analyzed. △ Less

Submitted 14 August, 2023; v1 submitted 21 November, 2022; originally announced November 2022.

Comments: IEEE Transactions on Vehicular Technology, 2023. arXiv admin note: text overlap with arXiv:2211.04200

arXiv:2211.04200 [pdf, other]

Intelligent Surface Enabled Sensing-Assisted Communication

Authors: Kaitao Meng, Qingqing Wu, Wen Chen, Deshi Li

Abstract: Vehicle-to-everything (V2X) communication is expected to support many promising applications in next-generation wireless networks. The recent development of integrated sensing and communications (ISAC) technology offers new opportunities to meet the stringent sensing and communication (S&C) requirements in V2X networks. However, considering the relatively small radar cross section (RCS) of the veh… ▽ More Vehicle-to-everything (V2X) communication is expected to support many promising applications in next-generation wireless networks. The recent development of integrated sensing and communications (ISAC) technology offers new opportunities to meet the stringent sensing and communication (S&C) requirements in V2X networks. However, considering the relatively small radar cross section (RCS) of the vehicles and the limited transmit power of the road site units (RSUs), the power of echoes may be too weak to achieve effective target detection and tracking. To handle this issue, we propose a novel sensing-assisted communication scheme by employing an intelligent Omni-surface (IOS) on the surface of the vehicle. First, a two-phase ISAC protocol, including the S&C phase and the communication-only phase, was presented to maximize the throughput by jointly optimizing the IOS phase shifts and the sensing duration. Then, we derive a closed-form expression of the achievable rate which achieves a good approximation. Furthermore, a sufficient and necessary condition for the existence of the S&C phase is derived to provide useful insights for practical system design. Simulation results demonstrate the effectiveness of the proposed sensing-assisted communication scheme in achieving high throughput with low transmit power requirements. △ Less

Submitted 10 December, 2022; v1 submitted 8 November, 2022; originally announced November 2022.

Comments: 8 pages, Submitted to IEEE for possible publication

arXiv:2210.12662 [pdf, other]

Improving Chinese Named Entity Recognition by Search Engine Augmentation

Authors: Qinghua Mao, Jiatong Li, Kui Meng

Abstract: Compared with English, Chinese suffers from more grammatical ambiguities, like fuzzy word boundaries and polysemous words. In this case, contextual information is not sufficient to support Chinese named entity recognition (NER), especially for rare and emerging named entities. Semantic augmentation using external knowledge is a potential way to alleviate this problem, while how to obtain and lever… ▽ More Compared with English, Chinese suffers from more grammatical ambiguities, like fuzzy word boundaries and polysemous words. In this case, contextual information is not sufficient to support Chinese named entity recognition (NER), especially for rare and emerging named entities. Semantic augmentation using external knowledge is a potential way to alleviate this problem, while how to obtain and leverage external knowledge for the NER task remains a challenge. In this paper, we propose a neural-based approach to perform semantic augmentation using external knowledge from search engine for Chinese NER. In particular, a multi-channel semantic fusion model is adopted to generate the augmented input representations, which aggregates external related texts retrieved from the search engine. Experiments have shown the superiority of our model across 4 NER datasets, including formal and social media language contexts, which further prove the effectiveness of our approach. △ Less

Submitted 23 October, 2022; originally announced October 2022.

arXiv:2210.07229 [pdf, other]

Mass-Editing Memory in a Transformer

Authors: Kevin Meng, Arnab Sen Sharma, Alex Andonian, Yonatan Belinkov, David Bau

Abstract: Recent work has shown exciting promise in updating large language models with new memories, so as to replace obsolete information or add specialized knowledge. However, this line of work is predominantly limited to updating single associations. We develop MEMIT, a method for directly updating a language model with many memories, demonstrating experimentally that it can scale up to thousands of ass… ▽ More Recent work has shown exciting promise in updating large language models with new memories, so as to replace obsolete information or add specialized knowledge. However, this line of work is predominantly limited to updating single associations. We develop MEMIT, a method for directly updating a language model with many memories, demonstrating experimentally that it can scale up to thousands of associations for GPT-J (6B) and GPT-NeoX (20B), exceeding prior work by orders of magnitude. Our code and data are at https://memit.baulab.info. △ Less

Submitted 1 August, 2023; v1 submitted 13 October, 2022; originally announced October 2022.

Comments: 18 pages, 11 figures. Code and data at https://memit.baulab.info

arXiv:2207.14258 [pdf, other]

Exploiting and Defending Against the Approximate Linearity of Apple's NeuralHash

Authors: Jagdeep Singh Bhatia, Kevin Meng

Abstract: Perceptual hashes map images with identical semantic content to the same $n$-bit hash value, while mapping semantically-different images to different hashes. These algorithms carry important applications in cybersecurity such as copyright infringement detection, content fingerprinting, and surveillance. Apple's NeuralHash is one such system that aims to detect the presence of illegal content on us… ▽ More Perceptual hashes map images with identical semantic content to the same $n$-bit hash value, while mapping semantically-different images to different hashes. These algorithms carry important applications in cybersecurity such as copyright infringement detection, content fingerprinting, and surveillance. Apple's NeuralHash is one such system that aims to detect the presence of illegal content on users' devices without compromising consumer privacy. We make the surprising discovery that NeuralHash is approximately linear, which inspires the development of novel black-box attacks that can (i) evade detection of "illegal" images, (ii) generate near-collisions, and (iii) leak information about hashed images, all without access to model parameters. These vulnerabilities pose serious threats to NeuralHash's security goals; to address them, we propose a simple fix using classical cryptographic standards. △ Less

Submitted 28 July, 2022; originally announced July 2022.

Comments: Accepted to the ML4Cyber Workshop at ICML 2022

arXiv:2207.04498 [pdf, other]

doi 10.1109/TWC.2022.3224143

Multi-UAV Collaborative Sensing and Communication: Joint Task Allocation and Power Optimization

Authors: Kaitao Meng, Xiaofan He, Qingqing Wu, Deshi Li

Abstract: Compared to a single UAV with limited sensing coverage and communication capability, multi-UAV cooperation is able to provide more effective sensing and transmission (S&T) services. Nevertheless, most existing works on multi-UAV sensing mainly focus on mutually exclusive task allocation and independent data transmission, which did not fully exploit the benefit of multi-UAV sensing and communicatio… ▽ More Compared to a single UAV with limited sensing coverage and communication capability, multi-UAV cooperation is able to provide more effective sensing and transmission (S&T) services. Nevertheless, most existing works on multi-UAV sensing mainly focus on mutually exclusive task allocation and independent data transmission, which did not fully exploit the benefit of multi-UAV sensing and communication. Motivated by this, we propose a novel multi-UAV cooperative S&T scheme with replicated sensing task allocation. Although replicated task allocation may sound counter-intuitive, it can actually foster cooperative transmission among multiple UAVs and thus reduce the overall sensing mission completion time. To obtain the optimal task allocation and transmit power of the proposed scheme, a mission completion time minimization problem is formulated. To solve this problem, a necessary condition for replicated sensing task allocation is derived. For the cases of replicated sensing, the considered problem is transformed into a monotonic optimization and is solved by the generic Polyblock algorithm. To efficiently evaluate the mission completion time in each iteration of the Polyblock algorithm, new auxiliary variables are introduced to decouple the otherwise sophisticated joint optimization of transmission time and power. While for the degenerated case of non-replicated sensing, the closed-form expression of the optimal transmission time is derived △ Less

Submitted 24 November, 2022; v1 submitted 10 July, 2022; originally announced July 2022.

Comments: 32 pages, submitted to IEEE for possible publication

Journal ref: IEEE Transactions on Wireless Communications, 2022

arXiv:2207.01230 [pdf, other]

doi 10.1109/TCOMM.2022.3217564

Intelligent Reflecting Surface Enabled Multi-Target Sensing

Authors: Kaitao Meng, Qingqing Wu, Robert Schober, Wen Chen

Abstract: Besides improving communication performance, intelligent reflecting surfaces (IRSs) are also promising enablers for achieving larger sensing coverage and enhanced sensing quality. Nevertheless, in the absence of a direct path between the base station (BS) and the targets, multi-target sensing is generally very difficult, since IRSs are incapable of proactively transmitting sensing beams or analyzi… ▽ More Besides improving communication performance, intelligent reflecting surfaces (IRSs) are also promising enablers for achieving larger sensing coverage and enhanced sensing quality. Nevertheless, in the absence of a direct path between the base station (BS) and the targets, multi-target sensing is generally very difficult, since IRSs are incapable of proactively transmitting sensing beams or analyzing target information. Moreover, the echoes of different targets reflected via the IRS-established virtual links share the same directionality at the BS. In this paper, we study a wireless system comprising a multi-antenna BS and an IRS for multi-target sensing, where the beamforming vector and the IRS phase shifts are jointly optimized to improve the sensing performance. To meet the different sensing requirements, such as a minimum received power and a minimum sensing frequency, we propose three novel IRS-assisted sensing schemes: Time division (TD) sensing, signature sequence (SS) sensing, and hybrid TD-SS sensing. First, for TD sensing, the sensing tasks are performed in sequence over time. Subsequently, a novel signature sequence (SS) sensing scheme is proposed to improve sensing efficiency by establishing a relationship between directions and SSs. To strike a flexible balance between the beam pattern gain and sensing efficiency, we also propose a general hybrid TD-SS sensing scheme with target grouping, where targets belonging to the same group are sensed simultaneously via SS sensing, while the targets in different groups are assigned to orthogonal time slots. By controlling the number of groups, the hybrid TD-SS sensing scheme can provide a more flexible balance between beam pattern gain and sensing frequency. Moreover, ... △ Less

Submitted 5 November, 2022; v1 submitted 4 July, 2022; originally announced July 2022.

Comments: 33 pages. This work has been accept by IEEE Transactions on Communications

Journal ref: IEEE Transactions on Communications, 2022

arXiv:2206.03408 [pdf, other]

doi 10.1109/MWC.131.2200442.

UAV-Enabled Integrated Sensing and Communication: Opportunities and Challenges

Authors: Kaitao Meng, Qingqing Wu, Jie Xu, Wen Chen, Zhiyong Feng, Robert Schober, A. Lee Swindlehurst

Abstract: Unmanned aerial vehicle (UAV)-enabled integrated sensing and communication (ISAC) has attracted growing research interests in the context of sixth-generation (6G) wireless networks, in which UAVs will be exploited as aerial wireless platforms to provide better coverage and enhanced sensing and communication (S&C) services. However, due to the UAVs' size, weight, and power (SWAP) constraints, contr… ▽ More Unmanned aerial vehicle (UAV)-enabled integrated sensing and communication (ISAC) has attracted growing research interests in the context of sixth-generation (6G) wireless networks, in which UAVs will be exploited as aerial wireless platforms to provide better coverage and enhanced sensing and communication (S&C) services. However, due to the UAVs' size, weight, and power (SWAP) constraints, controllable mobility, and line-of-sight (LoS) air-ground channels, UAV-enabled ISAC introduces both new opportunities and challenges. This article provides an overview of UAV-enabled ISAC, and proposes various solutions for optimizing the S&C performance. In particular, we first introduce UAV-enabled joint S&C, and discuss UAV motion control, wireless resource allocation, and interference management for the cases of single and multiple UAVs. Then, we present two application scenarios for exploiting the synergy between S&C, namely sensing-assisted UAV communication and communication-assisted UAV sensing. Finally, we highlight several interesting research directions to guide and motivate future work. △ Less

Submitted 19 May, 2023; v1 submitted 7 June, 2022; originally announced June 2022.

Comments: 9 pages, 6 figures

Journal ref: IEEE Wireless Communications, 2023

arXiv:2203.10223 [pdf, other]

doi 10.1109/LWC.2022.3161338

UAV Trajectory and Beamforming Optimization for Integrated Periodic Sensing and Communication

Authors: Kaitao Meng, Qingqing Wu, Shaodan Ma, Wen Chen, Tony Q. S. Quek

Abstract: Unmanned aerial vehicle (UAV) is expected to bring transformative improvement to the integrated sensing and communication (ISAC) system. However, due to shared spectrum resources, it is challenging to achieve a critical trade-off between these two integrated functionalities. To address this issue, we propose in this paper a new integrated \emph{periodic} sensing and communication mechanism for the… ▽ More Unmanned aerial vehicle (UAV) is expected to bring transformative improvement to the integrated sensing and communication (ISAC) system. However, due to shared spectrum resources, it is challenging to achieve a critical trade-off between these two integrated functionalities. To address this issue, we propose in this paper a new integrated \emph{periodic} sensing and communication mechanism for the UAV-enable ISAC system. Specifically, the user achievable rate is maximized via jointly optimizing UAV trajectory, transmit precoder, and sensing start instant, subject to the sensing frequency and beam pattern gain constraints. Despite that this problem is highly non-convex and involves an infinite number of variables, we obtain the optimal transmit precoder and derive the optimal achievable rate in closed-form for any given UAV location to facilitate the UAV trajectory design. Furthermore, we first prove the structural symmetry between optimal solutions in different ISAC frames without location constraints and then propose a high-quality UAV trajectory and sensing optimization algorithm for the general location-constrained case. Simulation results corroborate the effectiveness of the proposed design and also unveil a more flexible trade-off in ISAC systems over benchmark schemes. △ Less

Submitted 18 March, 2022; originally announced March 2022.

Comments: Accepted by IEEE Wireless Communications Letters

Journal ref: IEEE Wireless Communications Letters, 2022

arXiv:2203.06358 [pdf, other]

doi 10.1109/TWC.2022.3197623

Throughput Maximization for UAV-enabled Integrated Periodic Sensing and Communication

Authors: Kaitao Meng, Qingqing Wu, Shaodan Ma, Wen Chen, Kunlun Wang, Jun Li

Abstract: Unmanned aerial vehicle (UAV) is expected to revolutionize the existing integrated sensing and communication (ISAC) system and promise a more flexible joint design. Nevertheless, the existing works on ISAC mainly focus on exploring the performance of both functionalities simultaneously during the entire considered period, which may ignore the practical asymmetric sensing and communication requirem… ▽ More Unmanned aerial vehicle (UAV) is expected to revolutionize the existing integrated sensing and communication (ISAC) system and promise a more flexible joint design. Nevertheless, the existing works on ISAC mainly focus on exploring the performance of both functionalities simultaneously during the entire considered period, which may ignore the practical asymmetric sensing and communication requirements. In particular, always forcing sensing along with communication may make it is harder to balance between these two functionalities due to shared spectrum resources and limited transmit power. To address this issue, we propose a new integrated periodic sensing and communication mechanism for the UAV-enabled ISAC system to provide a more flexible trade-off between two integrated functionalities. Specifically, the system achievable rate is maximized via jointly optimizing UAV trajectory, user association, target sensing selection, and transmit beamforming, while meeting the sensing frequency and beam pattern gain requirement for the given targets. Despite that this problem is highly non-convex and involves closely coupled integer variables, we derive the closed-form optimal beamforming vector to dramatically reduce the complexity of beamforming design, and present a tight lower bound of the achievable rate to facilitate UAV trajectory design. Based on the above results, we propose a penalty-based algorithm to efficiently solve the considered problem. The optimal achievable rate and the optimal UAV location are analyzed under a special case of infinity number of antennas. Furthermore, we prove the structural symmetry between the optimal solutions in different ISAC frames without location constraints and propose an efficient algorithm for solving the problem with location constraints. △ Less

Submitted 31 March, 2022; v1 submitted 12 March, 2022; originally announced March 2022.

Comments: 32 pages, This work has been submitted to the IEEE for possible publication

Journal ref: IEEE Transactions on Wireless Communications, 2022

arXiv:2202.05262 [pdf, other]

Locating and Editing Factual Associations in GPT

Authors: Kevin Meng, David Bau, Alex Andonian, Yonatan Belinkov

Abstract: We analyze the storage and recall of factual associations in autoregressive transformer language models, finding evidence that these associations correspond to localized, directly-editable computations. We first develop a causal intervention for identifying neuron activations that are decisive in a model's factual predictions. This reveals a distinct set of steps in middle-layer feed-forward modul… ▽ More We analyze the storage and recall of factual associations in autoregressive transformer language models, finding evidence that these associations correspond to localized, directly-editable computations. We first develop a causal intervention for identifying neuron activations that are decisive in a model's factual predictions. This reveals a distinct set of steps in middle-layer feed-forward modules that mediate factual predictions while processing subject tokens. To test our hypothesis that these computations correspond to factual association recall, we modify feed-forward weights to update specific factual associations using Rank-One Model Editing (ROME). We find that ROME is effective on a standard zero-shot relation extraction (zsRE) model-editing task, comparable to existing methods. To perform a more sensitive evaluation, we also evaluate ROME on a new dataset of counterfactual assertions, on which it simultaneously maintains both specificity and generalization, whereas other methods sacrifice one or another. Our results confirm an important role for mid-layer feed-forward modules in storing factual associations and suggest that direct manipulation of computational mechanisms may be a feasible approach for model editing. The code, dataset, visualizations, and an interactive demo notebook are available at https://rome.baulab.info/ △ Less

Submitted 13 January, 2023; v1 submitted 10 February, 2022; originally announced February 2022.

Comments: NeurIPS 2022. 35 pages, 30 figures. Code and data at https://rome.baulab.info/

ACM Class: I.2.7

arXiv:2109.07877 [pdf, other]

MFE-NER: Multi-feature Fusion Embedding for Chinese Named Entity Recognition

Authors: Jiatong Li, Kui Meng

Abstract: In Chinese Named Entity Recognition, character substitution is a complicated linguistic phenomenon. Some Chinese characters are quite similar as they share the same components or have similar pronunciations. People replace characters in a named entity with similar characters to generate a new collocation but referring to the same object. As a result, it always leads to unrecognizable or mislabelin… ▽ More In Chinese Named Entity Recognition, character substitution is a complicated linguistic phenomenon. Some Chinese characters are quite similar as they share the same components or have similar pronunciations. People replace characters in a named entity with similar characters to generate a new collocation but referring to the same object. As a result, it always leads to unrecognizable or mislabeling errors in the NER task. In this paper, we propose a lightweight method, MFE-NER, which fuses glyph and phonetic features, to help pre-trained language models handle the character substitution problem in the NER task with limited extra cost. Basically, in the glyph domain, we disassemble Chinese characters into Five-Stroke components to represent structure features. In the phonetic domain, an improved phonetic system is proposed in our work, making it reasonable to describe phonetic similarity among Chinese characters. Experiments demonstrate that our method performs especially well in detecting character substitutions while slightly improving the overall performance of Chinese NER. △ Less

Submitted 17 April, 2024; v1 submitted 16 September, 2021; originally announced September 2021.

arXiv:2006.09473 [pdf, other]

Guiding Optimizations with Meliora: A Deep Walk down Memory Lane

Authors: Kewen Meng, Boyana Norris

Abstract: Performance models can be very useful for understanding the behavior of applications and hence can help guide design and optimization decisions. Unfortunately, performance modeling of nontrivial computations typically requires significant expertise and human effort. Moreover, even when performed by experts, it is necessarily limited in scope, accuracy, or both. However, since models are not typica… ▽ More Performance models can be very useful for understanding the behavior of applications and hence can help guide design and optimization decisions. Unfortunately, performance modeling of nontrivial computations typically requires significant expertise and human effort. Moreover, even when performed by experts, it is necessarily limited in scope, accuracy, or both. However, since models are not typically available, programmers, compilers or autotuners cannot use them easily to guide optimizations and are limited to heuristic-based methods that potentially take a lot of time to perform unnecessary transformations. We believe that streamlining model generation and making it scalable (both in terms of human effort and code size) would enable dramatic improvements in compilation techniques, as well as manual optimization and autotuning. To that end, we are building the Meliora code analysis infrastructure for machine learning-based performance model generation of arbitrary codes based on static analysis of intermediate language representations. We demonstrate good accuracy in matching known codes and show how Meliora can be used to optimize new codes though reusing optimization knowledge, either manually or in conjunction with an autotuner. When autotuning, Meliora eliminates or dramatically reduces the empirical search space, while generally achieving competitive performance. △ Less

Submitted 8 June, 2020; originally announced June 2020.

arXiv:2002.07725 [pdf, other]

Gradient-Based Adversarial Training on Transformer Networks for Detecting Check-Worthy Factual Claims

Authors: Kevin Meng, Damian Jimenez, Fatma Arslan, Jacob Daniel Devasier, Daniel Obembe, Chengkai Li

Abstract: We present a study on the efficacy of adversarial training on transformer neural network models, with respect to the task of detecting check-worthy claims. In this work, we introduce the first adversarially-regularized, transformer-based claim spotter model that achieves state-of-the-art results on multiple challenging benchmarks. We obtain a 4.70 point F1-score improvement over current state-of-t… ▽ More We present a study on the efficacy of adversarial training on transformer neural network models, with respect to the task of detecting check-worthy claims. In this work, we introduce the first adversarially-regularized, transformer-based claim spotter model that achieves state-of-the-art results on multiple challenging benchmarks. We obtain a 4.70 point F1-score improvement over current state-of-the-art models on the ClaimBuster Dataset and CLEF2019 Dataset, respectively. In the process, we propose a method to apply adversarial training to transformer models, which has the potential to be generalized to many similar text classification tasks. Along with our results, we are releasing our codebase and manually labeled datasets. We also showcase our models' real world usage via a live public API. △ Less

Submitted 21 May, 2020; v1 submitted 18 February, 2020; originally announced February 2020.

Comments: 11 pages, 4 figures, 6 tables

arXiv:1908.02110 [pdf, other]

Threshold Changeable Secret Sharing Scheme and Its Application to Group Authentication

Authors: Fuyou Miao, Yue Yu, Keju Meng, Wenchao Huang, Yan Xiong

Abstract: Group oriented applications are getting more and more popular in mobile Internet and call for secure and efficient secret sharing (SS) scheme to meet their requirements. A $(t,n)$ threshold SS scheme divides a secret into $n$ shares such that any $t$ or more than $t$ shares can recover the secret while less than $t$ shares cannot. However, an adversary, even without a valid share, may obtain the s… ▽ More Group oriented applications are getting more and more popular in mobile Internet and call for secure and efficient secret sharing (SS) scheme to meet their requirements. A $(t,n)$ threshold SS scheme divides a secret into $n$ shares such that any $t$ or more than $t$ shares can recover the secret while less than $t$ shares cannot. However, an adversary, even without a valid share, may obtain the secret by impersonating a shareholder to recover the secret with $t$ or more legal shareholders. Therefore, this paper uses linear code to propose a threshold changeable secret sharing (TCSS) scheme, in which threshold should increase from $t$ to the exact number of all participants during secret reconstruction. The scheme does not depend on any computational assumption and realizes asymptotically perfect security. Furthermore, based on the proposed TCSS scheme, a group authentication scheme is constructed, which allows a group user to authenticate whether all users are legal group members at once and thus provides efficient and flexible m-to-m authentication for group oriented applications. △ Less

Submitted 10 April, 2021; v1 submitted 6 August, 2019; originally announced August 2019.

arXiv:1905.02004 [pdf]

doi 10.1109/ICCTEC.2017.00310

Realize General Access Structure Based On Single Share

Authors: Yang Xie, Sijjad Ali Khuhro, Fuyou Miao, Keju Meng

Abstract: Traditional threshold secret sharing cannot realizing all access structures of secret sharing. So, Ito introduced the concept of Secret sharing scheme realizing general access structure. But Its scheme has to send multiple shares to each trustee. In this paper, we proposed two new secret sharing schemes realizing general access structures by only assigning one share to each trustee. Our proposed s… ▽ More Traditional threshold secret sharing cannot realizing all access structures of secret sharing. So, Ito introduced the concept of Secret sharing scheme realizing general access structure. But Its scheme has to send multiple shares to each trustee. In this paper, we proposed two new secret sharing schemes realizing general access structures by only assigning one share to each trustee. Our proposed second scheme is a perfect secret sharing scheme. Furthermore, our schemes can realize any access structures. △ Less

Submitted 7 September, 2022; v1 submitted 6 May, 2019; originally announced May 2019.

Comments: updated version

arXiv:1904.00739 [pdf]

Through-Wall Pose Imaging in Real-Time with a Many-to-Many Encoder/Decoder Paradigm

Authors: Kevin Meng, Yu Meng

Abstract: Overcoming the visual barrier and developing "see-through vision" has been one of mankind's long-standing dreams. Unlike visible light, Radio Frequency (RF) signals penetrate opaque obstructions and reflect highly off humans. This paper establishes a deep-learning model that can be trained to reconstruct continuous video of a 15-point human skeleton even through visual occlusion. The training proc… ▽ More Overcoming the visual barrier and developing "see-through vision" has been one of mankind's long-standing dreams. Unlike visible light, Radio Frequency (RF) signals penetrate opaque obstructions and reflect highly off humans. This paper establishes a deep-learning model that can be trained to reconstruct continuous video of a 15-point human skeleton even through visual occlusion. The training process adopts a student/teacher learning procedure inspired by the Feynman learning technique, in which video frames and RF data are first collected simultaneously using a co-located setup containing an optical camera and an RF antenna array transceiver. Next, the video frames are processed with a computer-vision-based gait analysis "teacher" module to generate ground-truth human skeletons for each frame. Then, the same type of skeleton is predicted from corresponding RF data using a "student" deep-learning model consisting of a Residual Convolutional Neural Network (CNN), Region Proposal Network (RPN), and Recurrent Neural Network with Long-Short Term Memory (LSTM) that 1) extracts spatial features from RF images, 2) detects all people present in a scene, and 3) aggregates information over many time-steps, respectively. The model is shown to both accurately and completely predict the pose of humans behind visual obstruction solely using RF signals. Primary academic contributions include the novel many-to-many imaging methodology, unique integration of RPN and LSTM networks, and original training pipeline. △ Less

Submitted 20 October, 2019; v1 submitted 15 March, 2019; originally announced April 2019.

arXiv:1705.07575 [pdf, other]

Mira: A Framework for Static Performance Analysis

Authors: Kewen Meng, Boyana Norris

Abstract: The performance model of an application can pro- vide understanding about its runtime behavior on particular hardware. Such information can be analyzed by developers for performance tuning. However, model building and analyzing is frequently ignored during software development until perfor- mance problems arise because they require significant expertise and can involve many time-consuming applicat… ▽ More The performance model of an application can pro- vide understanding about its runtime behavior on particular hardware. Such information can be analyzed by developers for performance tuning. However, model building and analyzing is frequently ignored during software development until perfor- mance problems arise because they require significant expertise and can involve many time-consuming application runs. In this paper, we propose a fast, accurate, flexible and user-friendly tool, Mira, for generating performance models by applying static program analysis, targeting scientific applications running on supercomputers. We parse both the source code and binary to estimate performance attributes with better accuracy than considering just source or just binary code. Because our analysis is static, the target program does not need to be executed on the target architecture, which enables users to perform analysis on available machines instead of conducting expensive exper- iments on potentially expensive resources. Moreover, statically generated models enable performance prediction on non-existent or unavailable architectures. In addition to flexibility, because model generation time is significantly reduced compared to dynamic analysis approaches, our method is suitable for rapid application performance analysis and improvement. We present several scientific application validation results to demonstrate the current capabilities of our approach on small benchmarks and a mini application. △ Less

Submitted 22 May, 2017; originally announced May 2017.

Showing 1–38 of 38 results for author: Meng, K