-
Not all explicit cues help communicate: Pedestrians' perceptions, fixations, and decisions toward automated vehicles with varied appearance
Authors:
Wei Lyu,
Yaqin Cao,
Yi Ding,
Jingyu Li,
Kai Tian,
Hui Zhang
Abstract:
Given pedestrians' vulnerability in road traffic, it remains unclear how novel AV appearances will impact pedestrians crossing behaviour. To address this gap, this study pioneers an investigation into the influence of AVs' exterior design, correlated with their kinematics, on pedestrians' road-crossing perception and decision-making. A video-based eye-tracking experimental study was conducted with…
▽ More
Given pedestrians' vulnerability in road traffic, it remains unclear how novel AV appearances will impact pedestrians crossing behaviour. To address this gap, this study pioneers an investigation into the influence of AVs' exterior design, correlated with their kinematics, on pedestrians' road-crossing perception and decision-making. A video-based eye-tracking experimental study was conducted with 61 participants who responded to video stimuli depicting a manipulated vehicle approaching a predefined road-crossing location on an unsignalized, two-way road. The vehicle's kinematic pattern was manipulated into yielding and non-yielding, and its external appearances were varied across five types: with a human driver (as a conventional vehicle), with no driver (as an AV), with text-based identity indications, with roof radar sensors, with dynamic eHMIs adjusted to vehicle kinematics. Participants' perceived clarity, crossing initiation distance (CID), crossing decision time (CDT), and gaze behaviour, during interactions were recorded and reported. The results indicated that AVs' kinematic profiles play a dominant role in pedestrians' road-crossing decisions, supported by their subjective evaluations, CID, CDT, and gaze patterns during interactions. Moreover, the use of clear eHMI, such as dynamic pedestrian icons, reduced pedestrians' visual load, enhanced their perceptual clarity, expedited road-crossing decisions, and thereby improved overall crossing efficiency. However, it was found that both textual identity indications and roof radar sensors have no significant effect on pedestrians' decisions but negatively impact pedestrians' visual attention, as evidenced by heightened fixation counts and prolonged fixation durations, particularly under yielding conditions. Excessive visual and cognitive resource occupation suggests that not all explicit cues facilitate human-vehicle communication.
△ Less
Submitted 8 July, 2024;
originally announced July 2024.
-
BadCLM: Backdoor Attack in Clinical Language Models for Electronic Health Records
Authors:
Weimin Lyu,
Zexin Bi,
Fusheng Wang,
Chao Chen
Abstract:
The advent of clinical language models integrated into electronic health records (EHR) for clinical decision support has marked a significant advancement, leveraging the depth of clinical notes for improved decision-making. Despite their success, the potential vulnerabilities of these models remain largely unexplored. This paper delves into the realm of backdoor attacks on clinical language models…
▽ More
The advent of clinical language models integrated into electronic health records (EHR) for clinical decision support has marked a significant advancement, leveraging the depth of clinical notes for improved decision-making. Despite their success, the potential vulnerabilities of these models remain largely unexplored. This paper delves into the realm of backdoor attacks on clinical language models, introducing an innovative attention-based backdoor attack method, BadCLM (Bad Clinical Language Models). This technique clandestinely embeds a backdoor within the models, causing them to produce incorrect predictions when a pre-defined trigger is present in inputs, while functioning accurately otherwise. We demonstrate the efficacy of BadCLM through an in-hospital mortality prediction task with MIMIC III dataset, showcasing its potential to compromise model integrity. Our findings illuminate a significant security risk in clinical decision support systems and pave the way for future endeavors in fortifying clinical language models against such vulnerabilities.
△ Less
Submitted 6 July, 2024;
originally announced July 2024.
-
Extremely Large-Scale Dynamic Metasurface Antennas (XL-DMAs): Near-Field Modeling and Channel Estimation
Authors:
Songjie Yang,
Wanting Lyu,
Boyu Ning,
Yue Xiu,
Youzhi Xiong,
Hua Chen,
Chadi Assi,
Chau Yuen
Abstract:
Dynamic metasurface antennas (DMAs) represent a novel transceiver array architecture for extremely large-scale (XL) communications, offering the advantages of reduced power consumption and lower hardware costs compared to conventional arrays.
This paper focuses on near-field channel estimation for XL-DMAs. We begin by analyzing the near-field characteristics of uniform planar arrays (UPAs) and i…
▽ More
Dynamic metasurface antennas (DMAs) represent a novel transceiver array architecture for extremely large-scale (XL) communications, offering the advantages of reduced power consumption and lower hardware costs compared to conventional arrays.
This paper focuses on near-field channel estimation for XL-DMAs. We begin by analyzing the near-field characteristics of uniform planar arrays (UPAs) and introducing the Oblong Approx. model. This model decouples elevation-azimuth (EL-AZ) parameters for XL-DMAs, providing an effective means to characterize the near-field effect. It offers simpler mathematical expressions than the second-order Taylor expansion model, all while maintaining negligible model errors for oblong-shaped arrays.
Building on the Oblong Approx. model, we propose an EL-AZ-decoupled estimation framework that involves near- and far-field parameter estimation for AZ/EL and EL/AZ directions, respectively. The former is formulated as a distributed compressive sensing problem, addressed using the proposed off-grid distributed orthogonal least squares algorithm, while the latter involves a straightforward parallelizable search. Crucially, we illustrate the viability of decoupled EL-AZ estimation for near-field UPAs, exhibiting commendable performance and linear complexity correlated with the number of metasurface elements.
Moreover, we design an measurement matrix optimization method with the Lorentzian constraint on DMAs and highlight the estimation performance degradation resulting from this constraint.
△ Less
Submitted 6 July, 2024;
originally announced July 2024.
-
Flexible Antenna Arrays for Wireless Communications: Modeling and Performance Evaluation
Authors:
Songjie Yang,
Jiancheng An,
Yue Xiu,
Wanting Lyu,
Boyu Ning,
Zhongpei Zhang,
Merouane Debbah,
Chau Yuen
Abstract:
Flexible antenna arrays (FAAs), distinguished by their rotatable, bendable, and foldable properties, are extensively employed in flexible radio systems to achieve customized radiation patterns. This paper aims to illustrate that FAAs, capable of dynamically adjusting surface shapes, can enhance communication performances with both omni-directional and directional antenna patterns, in terms of mult…
▽ More
Flexible antenna arrays (FAAs), distinguished by their rotatable, bendable, and foldable properties, are extensively employed in flexible radio systems to achieve customized radiation patterns. This paper aims to illustrate that FAAs, capable of dynamically adjusting surface shapes, can enhance communication performances with both omni-directional and directional antenna patterns, in terms of multi-path channel power and channel angle Cramér-Rao bounds. To this end, we develop a mathematical model that elucidates the impacts of the variations in antenna positions and orientations as the array transitions from a flat to a rotated, bent, and folded state, all contingent on the flexible degree-of-freedom. Moreover, since the array shape adjustment operates across the entire beamspace, especially with directional patterns, we discuss the sum-rate in the multi-sector base station that covers the $360^\circ$ communication area. Particularly, to thoroughly explore the multi-sector sum-rate, we propose separate flexible precoding (SFP), joint flexible precoding (JFP), and semi-joint flexible precoding (SJFP), respectively. In our numerical analysis comparing the optimized FAA to the fixed uniform planar array, we find that the bendable FAA achieves a remarkable $156\%$ sum-rate improvement compared to the fixed planar array in the case of JFP with the directional pattern. Furthermore, the rotatable FAA exhibits notably superior performance in SFP and SJFP cases with omni-directional patterns, with respective $35\%$ and $281\%$.
△ Less
Submitted 5 July, 2024;
originally announced July 2024.
-
Near-Field Channel Estimation for Extremely Large-Scale Terahertz Communications
Authors:
Songjie Yang,
Yizhou Peng,
Wanting Lyu,
Ya Li,
Hongjun He,
Zhongpei Zhang,
Chau Yuen
Abstract:
Future Terahertz communications exhibit significant potential in accommodating ultra-high-rate services. Employing extremely large-scale array antennas is a key approach to realize this potential, as they can harness substantial beamforming gains to overcome the severe path loss and leverage the electromagnetic advantages in the near field. This paper proposes novel estimation methods designed to…
▽ More
Future Terahertz communications exhibit significant potential in accommodating ultra-high-rate services. Employing extremely large-scale array antennas is a key approach to realize this potential, as they can harness substantial beamforming gains to overcome the severe path loss and leverage the electromagnetic advantages in the near field. This paper proposes novel estimation methods designed to enhance efficiency in Terahertz widely-spaced multi-subarray (WSMS) systems. Initially, we introduce three sparse channel representation methods: polar-domain representation (PD-R), multi-angular-domain representation (MAD-R), and two-dimensional polar-angular-domain representation (2D-PAD-R). Each method is meticulously developed for near-field WSMS channels, capitalizing on their sparsity characteristics. Building on this, we propose four estimation frameworks using the sparse recovery theory: polar-domain estimation (PD-E), multi-angular-domain estimation (MAD-E), two-stage polar-angular-domain estimation (TS-PAD-E), and two-dimensional polar-angular-domain estimation (2D-PAD-E). Particularly, 2D-PAD-E, integrating a 2D dictionary process, and TS-PAD-E, with its sequential approach to angle and distance estimation, stand out as particularly effective for near-field angle-distance estimation, enabling decoupled calculation of these parameters. Overall, these frameworks provide versatile and efficient solutions for WSMS channel estimation, balancing low complexity with high-performance outcomes. Additionally, they represent a fresh perspective on near-field signal processing.
△ Less
Submitted 8 June, 2024;
originally announced June 2024.
-
Explore the properties of $Λ(1670)$ in the Cabibbo-favored process $Λ^+_c \to p K^- π^+$ decay
Authors:
Sheng-Chao Zhang,
Man-Yu Duan,
Wen-Tao Lyu,
Guan-Ying Wang,
Jing-Yu Zhu,
En Wang
Abstract:
Recently, the Belle and LHCb Collaborations have measured the $Λ^+_c \to p K^- π^+$ decay and reported the $p K^-$ invariant mass distribution, which shows a clear cusp structure around the $ηΛ$ threshold. In this work, we have analyzed this process by considering the triangle mechanism and the $S$-wave pseudoscalar meson-octet baryon interactions within the chiral unitary approach, which dynamica…
▽ More
Recently, the Belle and LHCb Collaborations have measured the $Λ^+_c \to p K^- π^+$ decay and reported the $p K^-$ invariant mass distribution, which shows a clear cusp structure around the $ηΛ$ threshold. In this work, we have analyzed this process by considering the triangle mechanism and the $S$-wave pseudoscalar meson-octet baryon interactions within the chiral unitary approach, which dynamically generate the $Λ(1670)$. Our results are in good agreement with the Belle measurements, which implies that the cusp structure around $ηΛ$ threshold could be associated with the $Λ(1670)$ with the molecular nature.
△ Less
Submitted 23 May, 2024;
originally announced May 2024.
-
Dual-Robust Integrated Sensing and Communication: Beamforming under CSI Imperfection and Location Uncertainty
Authors:
Wanting Lyu,
Songjie Yang,
Yue Xiu,
Xinyi Chen,
Zhongpei Zhang,
Chadi Assi,
Chau Yuan
Abstract:
A dual-robust design of beamforming is investigated in an integrated sensing and communication (ISAC) system.Existing research on robust ISAC waveform design, while proposing solutions to imperfect channel state information (CSI), generally depends on prior knowledge of the target's approximate location to design waveforms. This approach, however, limits the precision in sensing the target's exact…
▽ More
A dual-robust design of beamforming is investigated in an integrated sensing and communication (ISAC) system.Existing research on robust ISAC waveform design, while proposing solutions to imperfect channel state information (CSI), generally depends on prior knowledge of the target's approximate location to design waveforms. This approach, however, limits the precision in sensing the target's exact location. In this paper, considering both CSI imperfection and target location uncertainty, a novel framework of joint robust optimization is proposed by maximizing the weighted sum of worst-case data rate and beampattern gain. To address this challenging problem, we propose an efficient two-layer iteration algorithm based on S-Procedure and convex hull. Finally, numerical results verify the effectiveness and performance improvement of our dual-robust algorithm, as well as the trade-off between communication and sensing performance.
△ Less
Submitted 17 May, 2024;
originally announced May 2024.
-
Flexible Beamforming for Movable Antenna-Enabled Integrated Sensing and Communication
Authors:
Wanting Lyu,
Songjie Yang,
Yue Xiu,
Zhongpei Zhang,
Chadi Assi,
Chau Yuen
Abstract:
This paper investigates flexible beamforming design in an integrated sensing and communication (ISAC) network with movable antennas (MAs). A bistatic radar system is integrated into a multi-user multiple-input-single-output (MU-MISO) system, with the base station (BS) equipped with MAs. This enables array response reconfiguration by adjusting the positions of antennas. Thus, a joint beamforming an…
▽ More
This paper investigates flexible beamforming design in an integrated sensing and communication (ISAC) network with movable antennas (MAs). A bistatic radar system is integrated into a multi-user multiple-input-single-output (MU-MISO) system, with the base station (BS) equipped with MAs. This enables array response reconfiguration by adjusting the positions of antennas. Thus, a joint beamforming and antenna position optimization problem, namely flexible beamforming, is proposed to maximize communication rate and sensing mutual information (MI). The fractional programming (FP) method is adopted to transform the non-convex objective function, and we alternatively update the beamforming matrix and antenna positions. Karush-Kuhn-Tucker (KKT) conditions are employed to derive the close-form solution of the beamforming matrix, while we propose an efficient search-based projected gradient ascent (SPGA) method to update the antenna positions. Simulation results demonstrate that MAs significantly enhance the ISAC performance when employing our proposed algorithm, achieving a 59.8% performance gain compared to fixed uniform arrays.
△ Less
Submitted 16 May, 2024;
originally announced May 2024.
-
Evidence of the low-lying baryon $Σ^*(1/2^-)$ in the process $Λ_c^+\to ηπ^+Λ$
Authors:
Wen-Tao Lyu,
Sheng-Chao Zhang,
Guan-Ying Wang,
Jia-Jun Wu,
En Wang,
Li-Sheng Geng,
Ju-Jun Xie
Abstract:
Motivated by the Belle measurements of the process $Λ_c^+\to ηπ^+Λ$, we investigate this process by considering the contributions from the $Λ(1670)$, $a_0(980)$, and $Σ(1385)$. In addition, we also consider the predicted low-lying baryon $Σ^*(1/2^-)$. Our results involving the $Σ^*(1/2^-)$ are favored by fitting to the Belle data of the $ηΛ$ and $π^+Λ$ invariant mass distributions. Furthermore, we…
▽ More
Motivated by the Belle measurements of the process $Λ_c^+\to ηπ^+Λ$, we investigate this process by considering the contributions from the $Λ(1670)$, $a_0(980)$, and $Σ(1385)$. In addition, we also consider the predicted low-lying baryon $Σ^*(1/2^-)$. Our results involving the $Σ^*(1/2^-)$ are favored by fitting to the Belle data of the $ηΛ$ and $π^+Λ$ invariant mass distributions. Furthermore, we predict the $ηπ^+$ invariant mass distribution and the angular distribution $dΓ/d{\rm cos}θ$, which are significantly different depending on whether or not the contribution from the $Σ^*(1/2^-)$ is considered. Finally, we show that, with the contribution from the $Σ^*(1/2^-)$, the calculated Dalizt plot agrees with the Belle measurements. Future precise measurements of the process $Λ_c^+\to ηπ^+Λ$ could shed further light on the existence of the low-lying $Σ^*(1/2^-)$.
△ Less
Submitted 15 May, 2024;
originally announced May 2024.
-
Evaluating the Effectiveness of LLMs in Introductory Computer Science Education: A Semester-Long Field Study
Authors:
Wenhan Lyu,
Yimeng Wang,
Tingting,
Chung,
Yifan Sun,
Yixuan Zhang
Abstract:
The integration of AI assistants, especially through the development of Large Language Models (LLMs), into computer science education has sparked significant debate. An emerging body of work has looked into using LLMs in education, but few have examined the impacts of LLMs on students in entry-level programming courses, particularly in real-world contexts and over extended periods. To address this…
▽ More
The integration of AI assistants, especially through the development of Large Language Models (LLMs), into computer science education has sparked significant debate. An emerging body of work has looked into using LLMs in education, but few have examined the impacts of LLMs on students in entry-level programming courses, particularly in real-world contexts and over extended periods. To address this research gap, we conducted a semester-long, between-subjects study with 50 students using CodeTutor, an LLM-powered assistant developed by our research team. Our study results show that students who used CodeTutor (the experimental group) achieved statistically significant improvements in their final scores compared to peers who did not use the tool (the control group). Within the experimental group, those without prior experience with LLM-powered tools demonstrated significantly greater performance gain than their counterparts. We also found that students expressed positive feedback regarding CodeTutor's capability, though they also had concerns about CodeTutor's limited role in developing critical thinking skills. Over the semester, students' agreement with CodeTutor's suggestions decreased, with a growing preference for support from traditional human teaching assistants. Our analysis further reveals that the quality of user prompts was significantly correlated with CodeTutor's response effectiveness. Building upon our results, we discuss the implications of our findings for integrating Generative AI literacy into curricula to foster critical thinking skills and turn to examining the temporal dynamics of user engagement with LLM-powered tools. We further discuss the discrepancy between the anticipated functions of tools and students' actual capabilities, which sheds light on the need for tailored strategies to improve educational outcomes.
△ Less
Submitted 2 May, 2024; v1 submitted 20 April, 2024;
originally announced April 2024.
-
Broadband microwave waveform generation with programmable chirp shapes via recirculating phase-modulated optical fiber loop controlled by low-speed electronics
Authors:
Weiqiang Lyu,
Huan Tian,
Zhenwei Fu,
Lingjie Zhang,
Zhen Zeng,
Yaowen Zhang,
Heping Li,
Zhiyao Zhang,
Yong Liu
Abstract:
Broadband microwave waveforms with programmable chirp shapes are captivating in numerous practical applications. Compared with electronic technology, photonic-assisted solutions exhibit excellent performance in bandwidth and flexibility, but still suffer from complex architecture and requirement of high-speed electronics. Besides, rapid manipulation of chirp shape is still a challenge in the scien…
▽ More
Broadband microwave waveforms with programmable chirp shapes are captivating in numerous practical applications. Compared with electronic technology, photonic-assisted solutions exhibit excellent performance in bandwidth and flexibility, but still suffer from complex architecture and requirement of high-speed electronics. Besides, rapid manipulation of chirp shape is still a challenge in the scientific community. In this paper, we propose and demonstrate a novel concept for generating broadband microwave waveforms with programmable chirp shapes. This concept is realized on a simple fiber-optic platform involving a continuous-wave laser source, a recirculating phase-modulated optical fiber loop, and low-speed electronics with a sampling rate at the level of MS/s. Based on this method, chirped microwave waveforms with a bandwidth up to tens of GHz can be generated, where the chirp shape is identical to the low-frequency driving waveform of the recirculating phase-modulated optical fiber loop. In addition, all the parameters of the generated chirped microwave waveforms can be easily reconfigured in real time, including the bandwidth, the central frequency, and the temporal duration. In the experiment, broadband microwave waveforms with customized chirp shapes are generated, where the center frequency and bandwidth tuning ranges exceed 21 GHz, the temporal duration is tuned in the range of 9 ns to 180 ns, and the coherent time of the generated microwave waveform is larger than 100 μs. This simple fiber-optic platform paves a way to generate broadband microwave waveforms with user-definable chirp shapes, which can find applications in broadband radar systems, electronic warfare and wireless communications.
△ Less
Submitted 12 May, 2024; v1 submitted 18 April, 2024;
originally announced April 2024.
-
Gaga: Group Any Gaussians via 3D-aware Memory Bank
Authors:
Weijie Lyu,
Xueting Li,
Abhijit Kundu,
Yi-Hsuan Tsai,
Ming-Hsuan Yang
Abstract:
We introduce Gaga, a framework that reconstructs and segments open-world 3D scenes by leveraging inconsistent 2D masks predicted by zero-shot segmentation models. Contrasted to prior 3D scene segmentation approaches that heavily rely on video object tracking, Gaga utilizes spatial information and effectively associates object masks across diverse camera poses. By eliminating the assumption of cont…
▽ More
We introduce Gaga, a framework that reconstructs and segments open-world 3D scenes by leveraging inconsistent 2D masks predicted by zero-shot segmentation models. Contrasted to prior 3D scene segmentation approaches that heavily rely on video object tracking, Gaga utilizes spatial information and effectively associates object masks across diverse camera poses. By eliminating the assumption of continuous view changes in training images, Gaga demonstrates robustness to variations in camera poses, particularly beneficial for sparsely sampled images, ensuring precise mask label consistency. Furthermore, Gaga accommodates 2D segmentation masks from diverse sources and demonstrates robust performance with different open-world zero-shot segmentation models, enhancing its versatility. Extensive qualitative and quantitative evaluations demonstrate that Gaga performs favorably against state-of-the-art methods, emphasizing its potential for real-world applications such as scene understanding and manipulation.
△ Less
Submitted 11 April, 2024;
originally announced April 2024.
-
Formal deformations, cohomology theory and $L_\infty[1]$-structures for differential Lie algebras of arbitrary weight
Authors:
Weiguo Lyu,
Zihao Qi,
Jian Yang,
Guodong Zhou
Abstract:
Generalising a previous work of Jiang and Sheng, a cohomology theory for differential Lie algebras of arbitrary weight is introduced. The underlying $L_\infty[1]$-structure on the cochain complex is also determined via a generalised version of higher derived brackets. The equivalence between $L_\infty[1]$-structures for absolute and relative differential Lie algebras are established. Formal deform…
▽ More
Generalising a previous work of Jiang and Sheng, a cohomology theory for differential Lie algebras of arbitrary weight is introduced. The underlying $L_\infty[1]$-structure on the cochain complex is also determined via a generalised version of higher derived brackets. The equivalence between $L_\infty[1]$-structures for absolute and relative differential Lie algebras are established. Formal deformations and abelian extensions are interpreted by using lower degree cohomology groups. Also we introduce the homotopy differential Lie algebras. In a forthcoming paper, we will show that the operad of homotopy (relative) differential Lie algebras is the minimal model of the operad of (relative) differential Lie algebras.
△ Less
Submitted 26 March, 2024;
originally announced March 2024.
-
Task-Agnostic Detector for Insertion-Based Backdoor Attacks
Authors:
Weimin Lyu,
Xiao Lin,
Songzhu Zheng,
Lu Pang,
Haibin Ling,
Susmit Jha,
Chao Chen
Abstract:
Textual backdoor attacks pose significant security threats. Current detection approaches, typically relying on intermediate feature representation or reconstructing potential triggers, are task-specific and less effective beyond sentence classification, struggling with tasks like question answering and named entity recognition. We introduce TABDet (Task-Agnostic Backdoor Detector), a pioneering ta…
▽ More
Textual backdoor attacks pose significant security threats. Current detection approaches, typically relying on intermediate feature representation or reconstructing potential triggers, are task-specific and less effective beyond sentence classification, struggling with tasks like question answering and named entity recognition. We introduce TABDet (Task-Agnostic Backdoor Detector), a pioneering task-agnostic method for backdoor detection. TABDet leverages final layer logits combined with an efficient pooling technique, enabling unified logit representation across three prominent NLP tasks. TABDet can jointly learn from diverse task-specific models, demonstrating superior detection efficacy over traditional task-specific methods.
△ Less
Submitted 25 March, 2024;
originally announced March 2024.
-
Flexible Precoding for Multi-User Movable Antenna Communications
Authors:
Songjie Yang,
Wanting Lyu,
Boyu Ning,
Zhongpei Zhang,
Chau Yuen
Abstract:
This letter rethinks traditional precoding in multi-user wireless communications with movable antennas (MAs). Utilizing MAs for optimal antenna positioning, we introduce a sparse optimization (SO)-based approach focusing on regularized zero-forcing (RZF). This framework targets the optimization of antenna positions and the precoding matrix to minimize inter-user interference and transmit power. We…
▽ More
This letter rethinks traditional precoding in multi-user wireless communications with movable antennas (MAs). Utilizing MAs for optimal antenna positioning, we introduce a sparse optimization (SO)-based approach focusing on regularized zero-forcing (RZF). This framework targets the optimization of antenna positions and the precoding matrix to minimize inter-user interference and transmit power. We propose an off-grid regularized least squares-based orthogonal matching pursuit (RLS-OMP) method for this purpose. Moreover, we provide deeper insights into antenna position optimization using RLS-OMP, viewed from a subspace projection angle. Overall, our proposed flexible precoding scheme demonstrates a sum rate that exceeds more than twice that of fixed antenna positions.
△ Less
Submitted 28 February, 2024;
originally announced February 2024.
-
A Dataset of Open-Domain Question Answering with Multiple-Span Answers
Authors:
Zhiyi Luo,
Yingying Zhang,
Shuyun Luo,
Ying Zhao,
Wentao Lyu
Abstract:
Multi-span answer extraction, also known as the task of multi-span question answering (MSQA), is critical for real-world applications, as it requires extracting multiple pieces of information from a text to answer complex questions. Despite the active studies and rapid progress in English MSQA research, there is a notable lack of publicly available MSQA benchmark in Chinese. Previous efforts for c…
▽ More
Multi-span answer extraction, also known as the task of multi-span question answering (MSQA), is critical for real-world applications, as it requires extracting multiple pieces of information from a text to answer complex questions. Despite the active studies and rapid progress in English MSQA research, there is a notable lack of publicly available MSQA benchmark in Chinese. Previous efforts for constructing MSQA datasets predominantly emphasized entity-centric contextualization, resulting in a bias towards collecting factoid questions and potentially overlooking questions requiring more detailed descriptive responses. To overcome these limitations, we present CLEAN, a comprehensive Chinese multi-span question answering dataset that involves a wide range of open-domain subjects with a substantial number of instances requiring descriptive answers. Additionally, we provide established models from relevant literature as baselines for CLEAN. Experimental results and analysis show the characteristics and challenge of the newly proposed CLEAN dataset for the community. Our dataset, CLEAN, will be publicly released at zhiyiluo.site/misc/clean_v1.0_ sample.json.
△ Less
Submitted 15 February, 2024;
originally announced February 2024.
-
An Analysis of the Recovery Path of the Consumer Sector in the Post-Pandemic Era
Authors:
Wenbo Lyu,
Jiayi Zhu,
Yunan Ding,
Keming Zhang
Abstract:
This paper proposes a referencable pattern of the recovery of the consumption sector, a new dimension to observe and evaluate the intrinsic value of the consumption sector, and proposes the concept of sensory-based consumption and the ranking of the weights of different categories;creates the concept of digital consumption index, coupled with digital RMB index and China-style digital economy index…
▽ More
This paper proposes a referencable pattern of the recovery of the consumption sector, a new dimension to observe and evaluate the intrinsic value of the consumption sector, and proposes the concept of sensory-based consumption and the ranking of the weights of different categories;creates the concept of digital consumption index, coupled with digital RMB index and China-style digital economy index. Finally we explain the internal logic of digital consumption as a consumption upgrade tool and a higher valuation target in the context of China's economic performance in 2022 and the Chinese government's policy in 2023, leading to the investment strategy of roller conduction effect.
△ Less
Submitted 11 February, 2024;
originally announced February 2024.
-
Research on the multi-stage impact of digital economy on rural revitalization in Hainan Province based on GPM model
Authors:
Wenbo Lyu
Abstract:
The rapid development of the digital economy has had a profound impact on the implementation of the rural revitalization strategy. Based on this, this study takes Hainan Province as the research object to deeply explore the impact of digital economic development on rural revitalization. The study collected panel data from 2003 to 2022 to construct an evaluation index system for the digital economy…
▽ More
The rapid development of the digital economy has had a profound impact on the implementation of the rural revitalization strategy. Based on this, this study takes Hainan Province as the research object to deeply explore the impact of digital economic development on rural revitalization. The study collected panel data from 2003 to 2022 to construct an evaluation index system for the digital economy and rural revitalization and used panel regression analysis and other methods to explore the promotion effect of the digital economy on rural revitalization. Research results show that the digital economy has a significant positive impact on rural revitalization, and this impact increases as the level of fiscal expenditure increases. The issuance of digital RMB has further exerted a regulatory effect and promoted the development of the digital economy and the process of rural revitalization. At the same time, the establishment of the Hainan Free Trade Port has also played a positive role in promoting the development of the digital economy and rural revitalization. In the prediction of the optimal strategy for rural revitalization based on the development levels of the primary, secondary, and tertiary industries (Rate1, Rate2, and Rate3), it was found that rate1 can encourage Hainan Province to implement digital economic innovation, encourage rate3 to implement promotion behaviors, and increase rate2 can At the level of sustainable development when rate3 promotes rate2's digital economic innovation behavior, it can standardize rate2's production behavior to the greatest extent, accelerate the faster application of the digital economy to the rural revitalization industry, and promote the technological advancement of enterprises.
△ Less
Submitted 11 February, 2024;
originally announced February 2024.
-
"The Roller Conduction Effect" from the A-share Data Evidence
Authors:
Wenbo Lyu
Abstract:
In the post-epidemic era, consumption recovery has obvious time and space transmission laws, and there are different valuation criteria for consumption segments. Using the A-share data of the consumption recovery stage from January to April 2022, this paper quantitatively compares the rotation effect between different consumption sectors when the valuation returns to the reasonable range. Accordin…
▽ More
In the post-epidemic era, consumption recovery has obvious time and space transmission laws, and there are different valuation criteria for consumption segments. Using the A-share data of the consumption recovery stage from January to April 2022, this paper quantitatively compares the rotation effect between different consumption sectors when the valuation returns to the reasonable range. According to the new classification of "sensory-based consumption", it interprets the internal logic of digital consumption as A consumption upgrade tool and a higher valuation target, and expounds the "the roller conduction effect". The law of consumption recovery and valuation return period is explained from the perspective of time and space conduction. The study found that in the early stage of consumption recovery, the recovery of consumer confidence was slow. In this period, A-shares were mainly dominated by the stock capital game, and there was an obvious plate rotation law in the game. Being familiar with this law has strong significance, which not only helps policy makers to adjust the direction of policy guidance, but also helps financial investors to make better investment strategies. The disadvantage of this paper is that it has not yet studied the roller conduction effect of the global financial market, and more rigorous mathematical models are still needed to support the definition of stock funds, which is also the main direction of the author's future research.
△ Less
Submitted 15 October, 2023;
originally announced January 2024.
-
TrustLLM: Trustworthiness in Large Language Models
Authors:
Lichao Sun,
Yue Huang,
Haoran Wang,
Siyuan Wu,
Qihui Zhang,
Yuan Li,
Chujie Gao,
Yixin Huang,
Wenhan Lyu,
Yixuan Zhang,
Xiner Li,
Zhengliang Liu,
Yixin Liu,
Yijue Wang,
Zhikun Zhang,
Bertie Vidgen,
Bhavya Kailkhura,
Caiming Xiong,
Chaowei Xiao,
Chunyuan Li,
Eric Xing,
Furong Huang,
Hao Liu,
Heng Ji,
Hongyi Wang
, et al. (45 additional authors not shown)
Abstract:
Large language models (LLMs), exemplified by ChatGPT, have gained considerable attention for their excellent natural language processing capabilities. Nonetheless, these LLMs present many challenges, particularly in the realm of trustworthiness. Therefore, ensuring the trustworthiness of LLMs emerges as an important topic. This paper introduces TrustLLM, a comprehensive study of trustworthiness in…
▽ More
Large language models (LLMs), exemplified by ChatGPT, have gained considerable attention for their excellent natural language processing capabilities. Nonetheless, these LLMs present many challenges, particularly in the realm of trustworthiness. Therefore, ensuring the trustworthiness of LLMs emerges as an important topic. This paper introduces TrustLLM, a comprehensive study of trustworthiness in LLMs, including principles for different dimensions of trustworthiness, established benchmark, evaluation, and analysis of trustworthiness for mainstream LLMs, and discussion of open challenges and future directions. Specifically, we first propose a set of principles for trustworthy LLMs that span eight different dimensions. Based on these principles, we further establish a benchmark across six dimensions including truthfulness, safety, fairness, robustness, privacy, and machine ethics. We then present a study evaluating 16 mainstream LLMs in TrustLLM, consisting of over 30 datasets. Our findings firstly show that in general trustworthiness and utility (i.e., functional effectiveness) are positively related. Secondly, our observations reveal that proprietary LLMs generally outperform most open-source counterparts in terms of trustworthiness, raising concerns about the potential risks of widely accessible open-source LLMs. However, a few open-source LLMs come very close to proprietary ones. Thirdly, it is important to note that some LLMs may be overly calibrated towards exhibiting trustworthiness, to the extent that they compromise their utility by mistakenly treating benign prompts as harmful and consequently not responding. Finally, we emphasize the importance of ensuring transparency not only in the models themselves but also in the technologies that underpin trustworthiness. Knowing the specific trustworthy technologies that have been employed is crucial for analyzing their effectiveness.
△ Less
Submitted 17 March, 2024; v1 submitted 10 January, 2024;
originally announced January 2024.
-
CRB Minimization for RIS-aided mmWave Integrated Sensing and Communications
Authors:
Wanting Lyu,
Songjie Yang,
Yue Xiu,
Ya Li,
Hongjun He,
Chau Yuen,
Zhongpei Zhang
Abstract:
In this paper, reconfigurable intelligent surface (RIS) is employed in a millimeter wave (mmWave) integrated sensing and communications (ISAC) system. To alleviate the multi-hop attenuation, the semi-self sensing RIS approach is adopted, wherein sensors are configured at the RIS to receive the radar echo signal. Focusing on the estimation accuracy, the Cramer-Rao bound (CRB) for estimating the dir…
▽ More
In this paper, reconfigurable intelligent surface (RIS) is employed in a millimeter wave (mmWave) integrated sensing and communications (ISAC) system. To alleviate the multi-hop attenuation, the semi-self sensing RIS approach is adopted, wherein sensors are configured at the RIS to receive the radar echo signal. Focusing on the estimation accuracy, the Cramer-Rao bound (CRB) for estimating the direction-of-the-angles is derived as the metric for sensing performance. A joint optimization problem on hybrid beamforming and RIS phaseshifts is proposed to minimize the CRB, while maintaining satisfactory communication performance evaluated by the achievable data rate. The CRB minimization problem is first transformed as a more tractable form based on Fisher information matrix (FIM). To solve the complex non-convex problem, a double layer loop algorithm is proposed based on penalty concave-convex procedure (penalty-CCCP) and block coordinate descent (BCD) method with two sub-problems. Successive convex approximation (SCA) algorithm and second order cone (SOC) constraints are employed to tackle the non-convexity in the hybrid beamforming optimization. To optimize the unit modulus constrained analog beamforming and phase shifts, manifold optimization (MO) is adopted. Finally, the numerical results verify the effectiveness of the proposed CRB minimization algorithm, and show the performance improvement compared with other baselines. Additionally, the proposed hybrid beamforming algorithm can achieve approximately 96% of the sensing performance exhibited by the full digital approach within only a limited number of radio frequency (RF) chains.
△ Less
Submitted 2 January, 2024;
originally announced January 2024.
-
PTT: Point-Trajectory Transformer for Efficient Temporal 3D Object Detection
Authors:
Kuan-Chih Huang,
Weijie Lyu,
Ming-Hsuan Yang,
Yi-Hsuan Tsai
Abstract:
Recent temporal LiDAR-based 3D object detectors achieve promising performance based on the two-stage proposal-based approach. They generate 3D box candidates from the first-stage dense detector, followed by different temporal aggregation methods. However, these approaches require per-frame objects or whole point clouds, posing challenges related to memory bank utilization. Moreover, point clouds a…
▽ More
Recent temporal LiDAR-based 3D object detectors achieve promising performance based on the two-stage proposal-based approach. They generate 3D box candidates from the first-stage dense detector, followed by different temporal aggregation methods. However, these approaches require per-frame objects or whole point clouds, posing challenges related to memory bank utilization. Moreover, point clouds and trajectory features are combined solely based on concatenation, which may neglect effective interactions between them. In this paper, we propose a point-trajectory transformer with long short-term memory for efficient temporal 3D object detection. To this end, we only utilize point clouds of current-frame objects and their historical trajectories as input to minimize the memory bank storage requirement. Furthermore, we introduce modules to encode trajectory features, focusing on long short-term and future-aware perspectives, and then effectively aggregate them with point cloud features. We conduct extensive experiments on the large-scale Waymo dataset to demonstrate that our approach performs well against state-of-the-art methods. Code and models will be made publicly available at https://github.com/kuanchihhuang/PTT.
△ Less
Submitted 24 April, 2024; v1 submitted 13 December, 2023;
originally announced December 2023.
-
Efficient Robust Bayesian Optimization for Arbitrary Uncertain Inputs
Authors:
Lin Yang,
Junlong Lyu,
Wenlong Lyu,
Zhitang Chen
Abstract:
Bayesian Optimization (BO) is a sample-efficient optimization algorithm widely employed across various applications. In some challenging BO tasks, input uncertainty arises due to the inevitable randomness in the optimization process, such as machining errors, execution noise, or contextual variability. This uncertainty deviates the input from the intended value before evaluation, resulting in sign…
▽ More
Bayesian Optimization (BO) is a sample-efficient optimization algorithm widely employed across various applications. In some challenging BO tasks, input uncertainty arises due to the inevitable randomness in the optimization process, such as machining errors, execution noise, or contextual variability. This uncertainty deviates the input from the intended value before evaluation, resulting in significant performance fluctuations in the final result. In this paper, we introduce a novel robust Bayesian Optimization algorithm, AIRBO, which can effectively identify a robust optimum that performs consistently well under arbitrary input uncertainty. Our method directly models the uncertain inputs of arbitrary distributions by empowering the Gaussian Process with the Maximum Mean Discrepancy (MMD) and further accelerates the posterior inference via Nystrom approximation. Rigorous theoretical regret bound is established under MMD estimation error and extensive experiments on synthetic functions and real problems demonstrate that our approach can handle various input uncertainties and achieve state-of-the-art performance.
△ Less
Submitted 3 November, 2023; v1 submitted 30 October, 2023;
originally announced October 2023.
-
Attention-Enhancing Backdoor Attacks Against BERT-based Models
Authors:
Weimin Lyu,
Songzhu Zheng,
Lu Pang,
Haibin Ling,
Chao Chen
Abstract:
Recent studies have revealed that \textit{Backdoor Attacks} can threaten the safety of natural language processing (NLP) models. Investigating the strategies of backdoor attacks will help to understand the model's vulnerability. Most existing textual backdoor attacks focus on generating stealthy triggers or modifying model weights. In this paper, we directly target the interior structure of neural…
▽ More
Recent studies have revealed that \textit{Backdoor Attacks} can threaten the safety of natural language processing (NLP) models. Investigating the strategies of backdoor attacks will help to understand the model's vulnerability. Most existing textual backdoor attacks focus on generating stealthy triggers or modifying model weights. In this paper, we directly target the interior structure of neural networks and the backdoor mechanism. We propose a novel Trojan Attention Loss (TAL), which enhances the Trojan behavior by directly manipulating the attention patterns. Our loss can be applied to different attacking methods to boost their attack efficacy in terms of attack successful rates and poisoning rates. It applies to not only traditional dirty-label attacks, but also the more challenging clean-label attacks. We validate our method on different backbone models (BERT, RoBERTa, and DistilBERT) and various tasks (Sentiment Analysis, Toxic Detection, and Topic Classification).
△ Less
Submitted 24 October, 2023; v1 submitted 22 October, 2023;
originally announced October 2023.
-
Mutual Information-Based Integrated Sensing and Communications: A WMMSE Framework
Authors:
Yizhou Peng,
Songjie Yang,
Wanting Lyu,
Ya Li,
Hongjun He,
Zhongpei Zhang,
Chadi Assi
Abstract:
In this letter, a weighted minimum mean square error (WMMSE) empowered integrated sensing and communication (ISAC) system is investigated. One transmitting base station and one receiving wireless access point are considered to serve multiple users a sensing target. Based on the theory of mutual-information (MI), communication MI and sensing MI rate are utilized as the performance metrics under the…
▽ More
In this letter, a weighted minimum mean square error (WMMSE) empowered integrated sensing and communication (ISAC) system is investigated. One transmitting base station and one receiving wireless access point are considered to serve multiple users a sensing target. Based on the theory of mutual-information (MI), communication MI and sensing MI rate are utilized as the performance metrics under the presence of clutters. In particular, we propose an novel MI-based WMMSE-ISAC method by developing a unique transceiver design mechanism to maximize the weighted sensing and communication sum-rate of this system. Such a maximization process is achieved by utilizing the classical method -- WMMSE, aiming to better manage the effect of sensing clutters and the interference among users. Numerical results show the effectiveness of our proposed method, and the performance trade-off between sensing and communication is also validated.
△ Less
Submitted 19 January, 2024; v1 submitted 19 October, 2023;
originally announced October 2023.
-
Theoretical study of the open-flavor tetraquark $T_{c\bar{s}}(2900)$ in the process $Λ_b\to K^0D^0Λ$
Authors:
Wen-Tao Lyu,
Yun-He Lyu,
Man-Yu Duan,
Guan-Ying Wang,
Dian-Yong Chen,
En Wang
Abstract:
Recently, the LHCb Collaboration has measured the processes $B^0\to\bar{D}^0D_s^+π^-$ and $B^+\to\bar{D}^0D_s^+π^+$, where the $D_s^+π^-$ and $D_s^+π^+$ invariant mass distributions show the significant signals of two new open-flavor tetraquark states $T_{c\bar{s}}(2900)^0$ and $T_{c\bar{s}}(2900)^{++}$, as the two of the isospin triplet. In this work, we have investigated the process…
▽ More
Recently, the LHCb Collaboration has measured the processes $B^0\to\bar{D}^0D_s^+π^-$ and $B^+\to\bar{D}^0D_s^+π^+$, where the $D_s^+π^-$ and $D_s^+π^+$ invariant mass distributions show the significant signals of two new open-flavor tetraquark states $T_{c\bar{s}}(2900)^0$ and $T_{c\bar{s}}(2900)^{++}$, as the two of the isospin triplet. In this work, we have investigated the process $Λ_b\to K^0D^0Λ$ by taking into account the intermediate nucleon resonance $N^*(1535)$ and the tetraquark state $T_{c\bar{s}}(2900)^0$, which could be dynamically generated by the interactions of the $D^*K^*/D^*_sρ$ and the pseoduscalar mesons-octet baryons, respectively. Our results show that a clear peak of the open-flavor tetraquark $T_{c\bar{s}}(2900)$ may appear in the $K^0D^0$ invariant mass distribution of the process $Λ_b\to K^0D^0Λ$, which could be tested by future experiments.
△ Less
Submitted 22 October, 2023; v1 submitted 17 October, 2023;
originally announced October 2023.
-
Enhancing Near-Field Sensing and Communications with Sparse Arrays: Potentials, Challenges, and Emerging Trends
Authors:
Songjie Yang,
Wanting Lyu,
Zhongpei Zhang,
Chau Yuen
Abstract:
As a promising technique, extremely large-scale (XL)-arrays offer potential solutions for overcoming the severe path loss in millimeter-wave (mmWave) and TeraHertz (THz) channels, crucial for enabling 6G. Nevertheless, XL-arrays introduce deviations in electromagnetic propagation compared to traditional arrays, fundamentally challenging the assumption with the planar-wave model. Instead, it ushers…
▽ More
As a promising technique, extremely large-scale (XL)-arrays offer potential solutions for overcoming the severe path loss in millimeter-wave (mmWave) and TeraHertz (THz) channels, crucial for enabling 6G. Nevertheless, XL-arrays introduce deviations in electromagnetic propagation compared to traditional arrays, fundamentally challenging the assumption with the planar-wave model. Instead, it ushers in the spherical-wave (SW) model to accurately represent the near-field propagation characteristics, significantly increasing signal processing complexity. Fortunately, the SW model shows remarkable benefits on sensing and communications (S\&C), e.g., improving communication multiplexing capability, spatial resolution, and degrees of freedom. In this context, this article first overviews hardware/algorithm challenges, fundamental potentials, promising applications of near-field S\&C enabled by XL-arrays. To overcome the limitations of existing XL-arrays with dense uniform array layouts and improve S\&C applications, we introduce sparse arrays (SAs). Exploring their potential, we propose XL-SAs for mmWave/THz systems using multi-subarray designs. Finally, several applications, challenges and resarch directions are identified.
△ Less
Submitted 15 September, 2023;
originally announced September 2023.
-
Hybrid NOMA assisted Integrated Sensing and Communication via RIS
Authors:
Wanting Lyu,
Yue Xiu,
Xinyang Li,
Songjie Yang,
Phee Lep Yeoh,
Yonghui Li,
Zhongpei Zhang
Abstract:
This paper investigates the optimization of reconfigurable intelligent surface (RIS) in an integrated sensing and communication (ISAC) system. \red{To meet the demand of growing number of devices, power domain non-orthogonal multiple access (NOMA) is considered. However, traditional NOMA with a large number of devices is challenging due to large decoding delay and propagation error introduced by s…
▽ More
This paper investigates the optimization of reconfigurable intelligent surface (RIS) in an integrated sensing and communication (ISAC) system. \red{To meet the demand of growing number of devices, power domain non-orthogonal multiple access (NOMA) is considered. However, traditional NOMA with a large number of devices is challenging due to large decoding delay and propagation error introduced by successive interference cancellation (SIC). Thus, OMA is integrated into NOMA to support more devices}. We formulate a max-min problem to optimize the sensing beampattern \red{with constraints on communication rate}, through joint power allocation, active beamforming and RIS phase shift design. To solve the non-convex problem with a non-smooth objective function, we propose a low complexity alternating optimization (AO) algorithm, where a closed form expression for the intra-cluster power allocation (intra-CPA) is derived, and penalty and successive convex approximation (SCA) methods are used to optimize the beamforming and phase shift design. Simulation results show the effectiveness of the proposed algorithm in terms of improving minimum beampattern gain (MBPG) compared with other baselines. Furthermore, the trade-off between sensing and communication is analyzed and demonstrated in the simulation results.
△ Less
Submitted 12 September, 2023;
originally announced September 2023.
-
Performance Bounds for Near-Field Localization with Widely-Spaced Multi-Subarray mmWave/THz MIMO
Authors:
Songjie Yang,
Xinyi Chen,
Yue Xiu,
Wanting Lyu,
Zhongpei Zhang,
Chau Yuen
Abstract:
This paper investigates the potential of near-field localization using widely-spaced multi-subarrays (WSMSs) and analyzing the corresponding angle and range Cramér-Rao bounds (CRBs). By employing the Riemann sum, closed-form CRB expressions are derived for the spherical wavefront-based WSMS (SW-WSMS). We find that the CRBs can be characterized by the angular span formed by the line connecting the…
▽ More
This paper investigates the potential of near-field localization using widely-spaced multi-subarrays (WSMSs) and analyzing the corresponding angle and range Cramér-Rao bounds (CRBs). By employing the Riemann sum, closed-form CRB expressions are derived for the spherical wavefront-based WSMS (SW-WSMS). We find that the CRBs can be characterized by the angular span formed by the line connecting the array's two ends to the target, and the different WSMSs with same angular spans but different number of subarrays have identical normalized CRBs. We provide a theoretical proof that, in certain scenarios, the CRB of WSMSs is smaller than that of uniform arrays. We further yield the closed-form CRBs for the hybrid spherical and planar wavefront-based WSMS (HSPW-WSMS), and its components can be seen as decompositions of the parameters from the CRBs for the SW-WSMS. Simulations are conducted to validate the accuracy of the derived closed-form CRBs and provide further insights into various system characteristics. Basically, this paper underscores the high resolution of utilizing WSMS for localization, reinforces the validity of adopting the HSPW assumption, and, considering its applications in communications, indicates a promising outlook for integrated sensing and communications based on HSPW-WSMSs.
△ Less
Submitted 11 September, 2023;
originally announced September 2023.
-
Efficient Bayesian Optimization with Deep Kernel Learning and Transformer Pre-trained on Multiple Heterogeneous Datasets
Authors:
Wenlong Lyu,
Shoubo Hu,
Jie Chuai,
Zhitang Chen
Abstract:
Bayesian optimization (BO) is widely adopted in black-box optimization problems and it relies on a surrogate model to approximate the black-box response function. With the increasing number of black-box optimization tasks solved and even more to solve, the ability to learn from multiple prior tasks to jointly pre-train a surrogate model is long-awaited to further boost optimization efficiency. In…
▽ More
Bayesian optimization (BO) is widely adopted in black-box optimization problems and it relies on a surrogate model to approximate the black-box response function. With the increasing number of black-box optimization tasks solved and even more to solve, the ability to learn from multiple prior tasks to jointly pre-train a surrogate model is long-awaited to further boost optimization efficiency. In this paper, we propose a simple approach to pre-train a surrogate, which is a Gaussian process (GP) with a kernel defined on deep features learned from a Transformer-based encoder, using datasets from prior tasks with possibly heterogeneous input spaces. In addition, we provide a simple yet effective mix-up initialization strategy for input tokens corresponding to unseen input variables and therefore accelerate new tasks' convergence. Experiments on both synthetic and real benchmark problems demonstrate the effectiveness of our proposed pre-training and transfer BO strategy over existing methods.
△ Less
Submitted 8 August, 2023;
originally announced August 2023.
-
Continual Learning in Open-vocabulary Classification with Complementary Memory Systems
Authors:
Zhen Zhu,
Weijie Lyu,
Yao Xiao,
Derek Hoiem
Abstract:
We introduce a method for flexible and efficient continual learning in open-vocabulary image classification, drawing inspiration from the complementary learning systems observed in human cognition. Specifically, we propose to combine predictions from a CLIP zero-shot model and the exemplar-based model, using the zero-shot estimated probability that a sample's class is within the exemplar classes.…
▽ More
We introduce a method for flexible and efficient continual learning in open-vocabulary image classification, drawing inspiration from the complementary learning systems observed in human cognition. Specifically, we propose to combine predictions from a CLIP zero-shot model and the exemplar-based model, using the zero-shot estimated probability that a sample's class is within the exemplar classes. We also propose a "tree probe" method, an adaption of lazy learning principles, which enables fast learning from new examples with competitive accuracy to batch-trained linear models. We test in data incremental, class incremental, and task incremental settings, as well as ability to perform flexible inference on varying subsets of zero-shot and learned categories. Our proposed method achieves a good balance of learning speed, target task effectiveness, and zero-shot effectiveness. Code will be available at https://github.com/jessemelpolio/TreeProbe.
△ Less
Submitted 3 October, 2023; v1 submitted 3 July, 2023;
originally announced July 2023.
-
Consistent Multimodal Generation via A Unified GAN Framework
Authors:
Zhen Zhu,
Yijun Li,
Weijie Lyu,
Krishna Kumar Singh,
Zhixin Shu,
Soeren Pirk,
Derek Hoiem
Abstract:
We investigate how to generate multimodal image outputs, such as RGB, depth, and surface normals, with a single generative model. The challenge is to produce outputs that are realistic, and also consistent with each other. Our solution builds on the StyleGAN3 architecture, with a shared backbone and modality-specific branches in the last layers of the synthesis network, and we propose per-modality…
▽ More
We investigate how to generate multimodal image outputs, such as RGB, depth, and surface normals, with a single generative model. The challenge is to produce outputs that are realistic, and also consistent with each other. Our solution builds on the StyleGAN3 architecture, with a shared backbone and modality-specific branches in the last layers of the synthesis network, and we propose per-modality fidelity discriminators and a cross-modality consistency discriminator. In experiments on the Stanford2D3D dataset, we demonstrate realistic and consistent generation of RGB, depth, and normal images. We also show a training recipe to easily extend our pretrained model on a new domain, even with a few pairwise data. We further evaluate the use of synthetically generated RGB and depth pairs for training or fine-tuning depth estimators. Code will be available at https://github.com/jessemelpolio/MultimodalGAN.
△ Less
Submitted 3 July, 2023;
originally announced July 2023.
-
The roles of the $T_{c\bar{s}0}(2900)^0$ and $D_0^*(2300)$ in the process $B^-\to D_s^+K^-π^-$
Authors:
Wen-Tao Lyu,
Yun-He Lyu,
Man-Yu Duan,
De-Min Li,
Dian-Yong Chen,
En Wang
Abstract:
Motivated by the recent LHCb observations of $T_{c\bar{s}0}(2900)^0$ and $T_{c\bar{s}0}(2900)^{++}$ in the processes $B^0\to\bar{D}^0D_s^+π^-$ and $B^+\to D^-D_s^+π^+$, we have investigated the decay $B^-\to D_s^+K^-π^-$ by taking into account the contributions from the $S$-wave vector-vector interactions, and the $S$-wave $D^+_s K^-$ interactions. Our results show that the $D_s^+K^-$ invariant ma…
▽ More
Motivated by the recent LHCb observations of $T_{c\bar{s}0}(2900)^0$ and $T_{c\bar{s}0}(2900)^{++}$ in the processes $B^0\to\bar{D}^0D_s^+π^-$ and $B^+\to D^-D_s^+π^+$, we have investigated the decay $B^-\to D_s^+K^-π^-$ by taking into account the contributions from the $S$-wave vector-vector interactions, and the $S$-wave $D^+_s K^-$ interactions. Our results show that the $D_s^+K^-$ invariant mass distribution has an enhancement structure near the threshold, associated with the $D^*_0(2300)$, which is in good agreement with the Belle measurements. We have also predicted the $D^+_sπ^-$ invariant mass distribution and the Dalitz plot, which show the significant signal of the $T_{c\bar{s}0}(2900)$. With the same formalism, the $D^-_sK^0_s$ invariant mass distribution of the process $B^0 \to D^-_sK^0_sπ^+$ measured by Belle could be well reproduced, and the peak of $T_{c\bar{s}0}(2900)$ is expected to be observed around 2900~MeV in the $D^-_sπ^+$ invariant mass distribution. Our results could be tested by the Belle II and LHCb experiments in the future.
△ Less
Submitted 9 January, 2024; v1 submitted 28 June, 2023;
originally announced June 2023.
-
Exploring Effective Mask Sampling Modeling for Neural Image Compression
Authors:
Lin Liu,
Mingming Zhao,
Shanxin Yuan,
Wenlong Lyu,
Wengang Zhou,
Houqiang Li,
Yanfeng Wang,
Qi Tian
Abstract:
Image compression aims to reduce the information redundancy in images. Most existing neural image compression methods rely on side information from hyperprior or context models to eliminate spatial redundancy, but rarely address the channel redundancy. Inspired by the mask sampling modeling in recent self-supervised learning methods for natural language processing and high-level vision, we propose…
▽ More
Image compression aims to reduce the information redundancy in images. Most existing neural image compression methods rely on side information from hyperprior or context models to eliminate spatial redundancy, but rarely address the channel redundancy. Inspired by the mask sampling modeling in recent self-supervised learning methods for natural language processing and high-level vision, we propose a novel pretraining strategy for neural image compression. Specifically, Cube Mask Sampling Module (CMSM) is proposed to apply both spatial and channel mask sampling modeling to image compression in the pre-training stage. Moreover, to further reduce channel redundancy, we propose the Learnable Channel Mask Module (LCMM) and the Learnable Channel Completion Module (LCCM). Our plug-and-play CMSM, LCMM, LCCM modules can apply to both CNN-based and Transformer-based architectures, significantly reduce the computational cost, and improve the quality of images. Experiments on the public Kodak and Tecnick datasets demonstrate that our method achieves competitive performance with lower computational complexity compared to state-of-the-art image compression methods.
△ Less
Submitted 9 June, 2023;
originally announced June 2023.
-
Beyond Gaussian Quantum Channels: A model case
Authors:
Daniel Speed,
Wenyang Lyu,
Roman Schubert
Abstract:
Gaussian quantum channels are well understood and have many applications, e.g., in Quantum Information Theory and in Quantum Optics. For more general quantum channels one can in general use semiclassical approximations or perturbation theory, but it is not easy to judge the accuracy of such methods. We study a relatively simple model case, where the quantum channel is generated by a Lindblad equat…
▽ More
Gaussian quantum channels are well understood and have many applications, e.g., in Quantum Information Theory and in Quantum Optics. For more general quantum channels one can in general use semiclassical approximations or perturbation theory, but it is not easy to judge the accuracy of such methods. We study a relatively simple model case, where the quantum channel is generated by a Lindblad equation where one of the Lindblad operators is a multiple of the internal Hamiltonian, and therefore the channel is not Gaussian. For this model we can compute the characteristic function of the action of the channel on a Gaussian state explicitly and we can as well derive a representation of the propagator in an integral form. This allows us to compare the exact results with semiclassical approximations and perturbation theory and evaluate their accuracy. We finally apply these results to the study of the evolution of the von Neumann entropy of a state.
△ Less
Submitted 15 May, 2023;
originally announced May 2023.
-
Enhancing Clinical Predictive Modeling through Model Complexity-Driven Class Proportion Tuning for Class Imbalanced Data: An Empirical Study on Opioid Overdose Prediction
Authors:
Yinan Liu,
Xinyu Dong,
Weimin Lyu,
Richard N. Rosenthal,
Rachel Wong,
Tengfei Ma,
Fusheng Wang
Abstract:
Class imbalance problems widely exist in the medical field and heavily deteriorates performance of clinical predictive models. Most techniques to alleviate the problem rebalance class proportions and they predominantly assume the rebalanced proportions should be a function of the original data and oblivious to the model one uses. This work challenges this prevailing assumption and proposes that li…
▽ More
Class imbalance problems widely exist in the medical field and heavily deteriorates performance of clinical predictive models. Most techniques to alleviate the problem rebalance class proportions and they predominantly assume the rebalanced proportions should be a function of the original data and oblivious to the model one uses. This work challenges this prevailing assumption and proposes that links the optimal class proportions to the model complexity, thereby tuning the class proportions per model. Our experiments on the opioid overdose prediction problem highlight the performance gain of tuning class proportions. Rigorous regression analysis also confirms the advantages of the theoretical framework proposed and the statistically significant correlation between the hyperparameters controlling the model complexity and the optimal class proportions.
△ Less
Submitted 9 May, 2023;
originally announced May 2023.
-
Adversarial Examples Detection with Enhanced Image Difference Features based on Local Histogram Equalization
Authors:
Zhaoxia Yin,
Shaowei Zhu,
Hang Su,
Jianteng Peng,
Wanli Lyu,
Bin Luo
Abstract:
Deep Neural Networks (DNNs) have recently made significant progress in many fields. However, studies have shown that DNNs are vulnerable to adversarial examples, where imperceptible perturbations can greatly mislead DNNs even if the full underlying model parameters are not accessible. Various defense methods have been proposed, such as feature compression and gradient masking. However, numerous st…
▽ More
Deep Neural Networks (DNNs) have recently made significant progress in many fields. However, studies have shown that DNNs are vulnerable to adversarial examples, where imperceptible perturbations can greatly mislead DNNs even if the full underlying model parameters are not accessible. Various defense methods have been proposed, such as feature compression and gradient masking. However, numerous studies have proven that previous methods create detection or defense against certain attacks, which renders the method ineffective in the face of the latest unknown attack methods. The invisibility of adversarial perturbations is one of the evaluation indicators for adversarial example attacks, which also means that the difference in the local correlation of high-frequency information in adversarial examples and normal examples can be used as an effective feature to distinguish the two. Therefore, we propose an adversarial example detection framework based on a high-frequency information enhancement strategy, which can effectively extract and amplify the feature differences between adversarial examples and normal examples. Experimental results show that the feature augmentation module can be combined with existing detection models in a modular way under this framework. Improve the detector's performance and reduce the deployment cost without modifying the existing detection model.
△ Less
Submitted 7 May, 2023;
originally announced May 2023.
-
Near-Field Channel Estimation for Extremely Large-Scale Reconfigurable Intelligent Surface (XL-RIS)-Aided Wideband mmWave Systems
Authors:
Songjie Yang,
Chenfei Xie,
Wanting Lyu,
Boyu Ning,
Zhongpei Zhang,
Chau Yuen
Abstract:
Near-field communications present new opportunities over near-field channels, however, the spherical wavefront propagation makes near-field signal processing challenging. In this context, this paper proposes efficient near-field channel estimation methods for wideband MIMO mmWave systems with the aid of extremely large-scale reconfigurable intelligent surfaces (XL-RIS). For the wideband signals re…
▽ More
Near-field communications present new opportunities over near-field channels, however, the spherical wavefront propagation makes near-field signal processing challenging. In this context, this paper proposes efficient near-field channel estimation methods for wideband MIMO mmWave systems with the aid of extremely large-scale reconfigurable intelligent surfaces (XL-RIS). For the wideband signals reflected by the analog RIS, we characterize their near-field beam squint effect in both angle and distance domains. Based on the mathematical analysis of the near-field beam patterns over all frequencies, a wideband spherical-domain dictionary is constructed by minimizing the coherence of two arbitrary beams. In light of this, we formulate a two-dimensional compressive sensing problem to recover the channel parameter based on the spherical-domain sparsity of mmWave channels. To this end, we present a correlation coefficient-based atom matching method within our proposed multi-frequency parallelizable subspace recovery framework for efficient solutions. Additionally, we propose a two-dimensional oracle estimator as a benchmark and derive its lower bound across all subcarriers. Our findings emphasize the significance of system hyperparameters and the sensing matrix of each subcarrier in determining the accuracy of the estimation. Finally, numerical results show that our proposed method achieves considerable performance compared with the lower bound and has a time complexity linear to the number of RIS elements.
△ Less
Submitted 1 April, 2023;
originally announced April 2023.
-
Reconfigurable Intelligent Surface-Aided Full-Duplex mmWave MIMO: Channel Estimation, Passive and Hybrid Beamforming
Authors:
Songjie Yang,
Wanting Lyu,
Yunis Xanthos,
Zhongpei Zhang,
Chadi Assi,
Chau Yuen
Abstract:
Millimeter wave (mmWave) full-duplex (FD) is a promising technique for improving capacity by maximizing the utilization of both time and the rich mmWave frequency resources. Still, it has restrictions due to FD self-interference (SI) and mmWave's limited coverage. Therefore, this study dives into FD mmWave MIMO with the assistance of reconfigurable intelligent surfaces (RIS) for capacity improveme…
▽ More
Millimeter wave (mmWave) full-duplex (FD) is a promising technique for improving capacity by maximizing the utilization of both time and the rich mmWave frequency resources. Still, it has restrictions due to FD self-interference (SI) and mmWave's limited coverage. Therefore, this study dives into FD mmWave MIMO with the assistance of reconfigurable intelligent surfaces (RIS) for capacity improvement. First, we demonstrate the angular-domain reciprocity of FD antenna arrays under the far-field planar wavefront assumption. Accordingly, a strategy for joint downlink-uplink (DL-UL) channel estimation is presented. For estimating the SI channel, the direct channel, and the cascaded channel, the Khatri-Rao product-based compressive sensing (KR-CS), distributed CS (D-CS), and two-stage multiple measurement vector-based D-CS (M-D-CS) frameworks are proposed, respectively. Additionally, we propose a passive beamforming optimization solution based on the angular-domain cascaded channel. With hybrid beamforming architectures, a novel hybrid weighted minimum mean squared error method for SI cancellation (H-WMMSE-SIC) is proposed. Simulations have revealed that joint DL-UL processing significantly improves estimation performance in comparison to separate DL/UL channel estimation. Particularly, when the interference-to-noise ratio is less than 35 dB, our proposed H-WMMSE-SIC offers spectral efficiency performance comparable to fully-digital WMMSE-SIC. Finally, the computational complexity is analyzed for our proposed methods.
△ Less
Submitted 25 March, 2023;
originally announced March 2023.
-
Meta Computing
Authors:
Xiuzhen Cheng,
Minghui Xu,
Runyu Pan,
Dongxiao Yu,
Chenxu Wang,
Xue Xiao,
Weifeng Lyu
Abstract:
With the continuous improvement of information infrastructures, academia and industry have been constantly exploring new computing paradigms to fully exploit computing powers. In this paper, we propose Meta Computing, a new computing paradigm that aims to utilize all available computing resources hooked on the Internet, provide efficient, fault-tolerant, and personalized services with strong secur…
▽ More
With the continuous improvement of information infrastructures, academia and industry have been constantly exploring new computing paradigms to fully exploit computing powers. In this paper, we propose Meta Computing, a new computing paradigm that aims to utilize all available computing resources hooked on the Internet, provide efficient, fault-tolerant, and personalized services with strong security and privacy guarantee, and virtualize the Internet as a giant computer, that is, ``Network-as-a-Computer, NaaC'', or ``Meta Computer'' for short, for any task or any person on-demand.
△ Less
Submitted 19 February, 2023;
originally announced February 2023.
-
Nanomotion of micro-objects driven by light-induced elastic waves on solid interfaces
Authors:
Wei Lyu,
Weiwei Tang,
Wei Yan,
Min Qiu
Abstract:
It has been recently reported that elastic waves induced by nanosecond light pulses can be used to drive nano-motion of micro-objects on frictional solid interfaces, a challenging task for traditional techniques using tiny optical force. In this technique, the main physical quantities/parameters involved are: temporal width and energy of light pulses, thermal heating and cooling time, friction for…
▽ More
It has been recently reported that elastic waves induced by nanosecond light pulses can be used to drive nano-motion of micro-objects on frictional solid interfaces, a challenging task for traditional techniques using tiny optical force. In this technique, the main physical quantities/parameters involved are: temporal width and energy of light pulses, thermal heating and cooling time, friction force and elastic waves. Despite a few experimental observations based on micro-fiber systems, a microscopic theory, which reveals how these quantities collaboratively enable motion of the micro-objects and derives what the underlying manipulation principles emerge, is absent. In this article, a comprehensive theoretical analysis--centralized around the above listed physical quantities, and illuminated by a single-friction-point model in conjunction with numerical simulations--is established to pedagogically clarify the physics. Our results reveal the two essential factors in this technique: (1) the use of short light pulses for rapid thermal expansion overwhelming friction resistance and (2) the timescale asymmetry in thermal heating and cooling for accumulating a net sliding distance. Moreover, we examine the effects of spatially distributed friction beyond the single-friction-point consideration, and show "tug-of-war"-like friction stretching in the driving process. Given these insights, we positively predict that this elastic-wave-based manipulation principle could be directly translated to micro/nano-scale optical waveguides on optical chips, and propose a practical design. We wish that these results offer theoretical guidelines for ongoing efforts of optical manipulation on solid interfaces with light-induced elastic waves.
△ Less
Submitted 9 February, 2023;
originally announced February 2023.
-
Reweighted Interacting Langevin Diffusions: an Accelerated Sampling Methodfor Optimization
Authors:
Junlong Lyu,
Zhitang Chen,
Wenlong Lyu,
Jianye Hao
Abstract:
We proposed a new technique to accelerate sampling methods for solving difficult optimization problems. Our method investigates the intrinsic connection between posterior distribution sampling and optimization with Langevin dynamics, and then we propose an interacting particle scheme that approximates a Reweighted Interacting Langevin Diffusion system (RILD). The underlying system is designed by a…
▽ More
We proposed a new technique to accelerate sampling methods for solving difficult optimization problems. Our method investigates the intrinsic connection between posterior distribution sampling and optimization with Langevin dynamics, and then we propose an interacting particle scheme that approximates a Reweighted Interacting Langevin Diffusion system (RILD). The underlying system is designed by adding a multiplicative source term into the classical Langevin operator, leading to a higher convergence rate and a more concentrated invariant measure. We analyze the convergence rate of our algorithm and the improvement compared to existing results in the asymptotic situation. We also design various tests to verify our theoretical results, showing the advantages of accelerating convergence and breaking through barriers of suspicious local minimums, especially in high-dimensional non-convex settings. Our algorithms and analysis shed some light on combining gradient and genetic algorithms using Partial Differential Equations (PDEs) with provable guarantees.
△ Less
Submitted 29 January, 2023;
originally announced January 2023.
-
Brain Tissue Segmentation Across the Human Lifespan via Supervised Contrastive Learning
Authors:
Xiaoyang Chen,
Jinjian Wu,
Wenjiao Lyu,
Yicheng Zou,
Kim-Han Thung,
Siyuan Liu,
Ye Wu,
Sahar Ahmad,
Pew-Thian Yap
Abstract:
Automatic segmentation of brain MR images into white matter (WM), gray matter (GM), and cerebrospinal fluid (CSF) is critical for tissue volumetric analysis and cortical surface reconstruction. Due to dramatic structural and appearance changes associated with developmental and aging processes, existing brain tissue segmentation methods are only viable for specific age groups. Consequently, methods…
▽ More
Automatic segmentation of brain MR images into white matter (WM), gray matter (GM), and cerebrospinal fluid (CSF) is critical for tissue volumetric analysis and cortical surface reconstruction. Due to dramatic structural and appearance changes associated with developmental and aging processes, existing brain tissue segmentation methods are only viable for specific age groups. Consequently, methods developed for one age group may fail for another. In this paper, we make the first attempt to segment brain tissues across the entire human lifespan (0-100 years of age) using a unified deep learning model. To overcome the challenges related to structural variability underpinned by biological processes, intensity inhomogeneity, motion artifacts, scanner-induced differences, and acquisition protocols, we propose to use contrastive learning to improve the quality of feature representations in a latent space for effective lifespan tissue segmentation. We compared our approach with commonly used segmentation methods on a large-scale dataset of 2,464 MR images. Experimental results show that our model accurately segments brain tissues across the lifespan and outperforms existing methods.
△ Less
Submitted 3 January, 2023;
originally announced January 2023.
-
Standardizing Representation for Equality with a Population Seat Index
Authors:
Liang Zhao,
Akiko Tanimoto,
Wenruo Lyu
Abstract:
Proportional representation (PR) has long been believed the ideal system for the equality of individuals in apportioning the seats of a legislature body to subgroups. We observe that PR implicitly assumes the (standard) number of representatives is proportional to the population, a situation no longer observed since 1820s. To address this issue, we suggest to formulate the apportionment problem in…
▽ More
Proportional representation (PR) has long been believed the ideal system for the equality of individuals in apportioning the seats of a legislature body to subgroups. We observe that PR implicitly assumes the (standard) number of representatives is proportional to the population, a situation no longer observed since 1820s. To address this issue, we suggest to formulate the apportionment problem in a broader context by explicitly specifying a standard function $f$ such that $f(p)$ is the standard, possibly fractional number of representatives for population $p$, where PR assumes $f(p)\propto p$. For this generalized apportionment problem, we give a population seat index (PSI) $\frac{f^{-1}(s)}{p}$ for quantifying the contribution of an individual in assigning $s$ seats to a population $p$, where $f^{-1}$ is the inverse of $f$. With the PSI, we derive apportioning schemes with absolute and relative individual equality. Particularly, for $s$ seats, populations $p_1, \ldots, p_k$, and a standard function $f(p) = a + b p^γ$ with constants $a, b, γ\ge 0$, the ideal, possibly fractional number of seats for subgroup $i$ is $a + \frac{(S-ka)p_i^γ}{\sum p_j^γ}$, not $\frac{Sp_i}{\sum p_j}$ calculated by PR which works only for $a=0$, $γ=1$. Finally, since real-world observations indicate a standard function $f \propto p^γ$ with $γ< 1$, we conclude that PR represents individuals in less populous subgroups less than individuals in more populous subgroups.
△ Less
Submitted 12 May, 2023; v1 submitted 11 December, 2022;
originally announced December 2022.
-
Adversarial Example Defense via Perturbation Grading Strategy
Authors:
Shaowei Zhu,
Wanli Lyu,
Bin Li,
Zhaoxia Yin,
Bin Luo
Abstract:
Deep Neural Networks have been widely used in many fields. However, studies have shown that DNNs are easily attacked by adversarial examples, which have tiny perturbations and greatly mislead the correct judgment of DNNs. Furthermore, even if malicious attackers cannot obtain all the underlying model parameters, they can use adversarial examples to attack various DNN-based task systems. Researcher…
▽ More
Deep Neural Networks have been widely used in many fields. However, studies have shown that DNNs are easily attacked by adversarial examples, which have tiny perturbations and greatly mislead the correct judgment of DNNs. Furthermore, even if malicious attackers cannot obtain all the underlying model parameters, they can use adversarial examples to attack various DNN-based task systems. Researchers have proposed various defense methods to protect DNNs, such as reducing the aggressiveness of adversarial examples by preprocessing or improving the robustness of the model by adding modules. However, some defense methods are only effective for small-scale examples or small perturbations but have limited defense effects for adversarial examples with large perturbations. This paper assigns different defense strategies to adversarial perturbations of different strengths by grading the perturbations on the input examples. Experimental results show that the proposed method effectively improves defense performance. In addition, the proposed method does not modify any task model, which can be used as a preprocessing module, which significantly reduces the deployment cost in practical applications.
△ Less
Submitted 16 December, 2022;
originally announced December 2022.
-
Active 3D Double-RIS-Aided Multi-User Communications: Two-Timescale-Based Separate Channel Estimation via Bayesian Learning
Authors:
Songjie Yang,
Wanting Lyu,
Yue Xiu,
Zhongpei Zhang,
Chau Yuen
Abstract:
Double-reconfigurable intelligent surface (RIS) is a promising technique, achieving a substantial gain improvement compared to single-RIS techniques. However, in double-RIS-aided systems, accurate channel estimation is more challenging than in single-RIS-aided systems. This work solves the problem of double-RIS-based channel estimation based on active RIS architectures with only one radio frequenc…
▽ More
Double-reconfigurable intelligent surface (RIS) is a promising technique, achieving a substantial gain improvement compared to single-RIS techniques. However, in double-RIS-aided systems, accurate channel estimation is more challenging than in single-RIS-aided systems. This work solves the problem of double-RIS-based channel estimation based on active RIS architectures with only one radio frequency (RF) chain. Since the slow time-varying channels, i.e., the BS-RIS 1, BS-RIS 2, and RIS 1-RIS 2 channels, can be obtained with active RIS architectures, a novel multi-user two-timescale channel estimation protocol is proposed to minimize the pilot overhead. First, we propose an uplink training scheme for slow time-varying channel estimation, which can effectively address the double-reflection channel estimation problem. With channels' sparisty, a low-complexity Singular Value Decomposition Multiple Measurement Vector-Based Compressive Sensing (SVD-MMV-CS) framework with the line-of-sight (LoS)-aided off-grid MMV expectation maximization-based generalized approximate message passing (M-EM-GAMP) algorithm is proposed for channel parameter recovery. For fast time-varying channel estimation, based on the estimated large-timescale channels, a measurements-augmentation-estimate (MAE) framework is developed to decrease the pilot overhead.Additionally, a comprehensive analysis of pilot overhead and computing complexity is conducted. Finally, the simulation results demonstrate the effectiveness of our proposed multi-user two-timescale estimation strategy and the low-complexity Bayesian CS framework.
△ Less
Submitted 29 November, 2022;
originally announced November 2022.
-
Joint Localization and Beamforming for Reconfigurable Intelligent Surface Aided 5G mmWave Communication Systems
Authors:
Yunis Xanthos,
Wanting Lyu,
Songjie Yang,
Chadi Assi,
Xianbing Zou,
Ning Wei
Abstract:
Reconfigurable intelligent surface (RIS) is an attractive technology to improve the transmission rate of millimetre-wave (mmWave) communication systems. The previous {research} on RIS technology mainly focused on improving the transmission rate and security rate of the mmWave communication systems. Since the emergence of RIS technology creates the conditions for generating an intelligent radio env…
▽ More
Reconfigurable intelligent surface (RIS) is an attractive technology to improve the transmission rate of millimetre-wave (mmWave) communication systems. The previous {research} on RIS technology mainly focused on improving the transmission rate and security rate of the mmWave communication systems. Since the emergence of RIS technology creates the conditions for generating an intelligent radio environment, it also has potential advantages on improving the localization accuracy of the mmWave communication systems. Deployed on walls and objects, RISs are capable of significantly improving communications and positioning coverage by controlling the multi-path reflection. This paper considers the RIS-aided mmWave localization system and proposes a joint beamforming and localization problem. However, since the objective function depends on the unknown UE's position and instantaneous channel state information (CSI), this beamforming and localization technology based on RIS assistance is challenging. To solve this problem, we propose a new joint localization and beamforming optimization (JLBO) algorithm, and give the proof of its convergence. The simulation results show that the RIS can improve the user localization accuracy of the system and the proposed scheme has a significant performance improvement compared with the traditional schemes.
△ Less
Submitted 26 October, 2022;
originally announced October 2022.
-
Energy-Efficient Cell-Free Network Assisted by Hybrid RISs
Authors:
Wanting Lyu,
Yue Xiu,
Songjie Yang,
Chau Yuen,
Zhongpei Zhang
Abstract:
In this letter, we investigate a cell-free network aided by hybrid reconfigurable intelligent surfaces (RISs), which consists of a mixture of passive and active elements that are capable of amplifying and reflecting the incident signal. To maximize the energy efficiency (EE) of the system, we formulate a joint transmit beamforming and RIS coefficients optimization problem. To deal with the fractio…
▽ More
In this letter, we investigate a cell-free network aided by hybrid reconfigurable intelligent surfaces (RISs), which consists of a mixture of passive and active elements that are capable of amplifying and reflecting the incident signal. To maximize the energy efficiency (EE) of the system, we formulate a joint transmit beamforming and RIS coefficients optimization problem. To deal with the fractional objective function, Dinkelbach transform, Lagrangian dual reformulation, and quadratic transform are utilized, with a block coordinate descent (BCD) based algorithm proposed to decouple the variables. In addition, successive convex approximation (SCA) method is applied to iteratively to tackle the non-convexity of the sub-problems. Simulation results illustrate the effectiveness and convergence of the proposed algorithm through analyzing the EE and sum rate performance with varying parameter settings. The proposed hybrid RISs schemes can achieve 92% of the sum rate but 188% of EE of active RISs schemes. As compared with passive RISs, 11% gain in sum rate can be achieved with comparable EE.
△ Less
Submitted 4 October, 2022;
originally announced October 2022.
-
A Multimodal Transformer: Fusing Clinical Notes with Structured EHR Data for Interpretable In-Hospital Mortality Prediction
Authors:
Weimin Lyu,
Xinyu Dong,
Rachel Wong,
Songzhu Zheng,
Kayley Abell-Hart,
Fusheng Wang,
Chao Chen
Abstract:
Deep-learning-based clinical decision support using structured electronic health records (EHR) has been an active research area for predicting risks of mortality and diseases. Meanwhile, large amounts of narrative clinical notes provide complementary information, but are often not integrated into predictive models. In this paper, we provide a novel multimodal transformer to fuse clinical notes and…
▽ More
Deep-learning-based clinical decision support using structured electronic health records (EHR) has been an active research area for predicting risks of mortality and diseases. Meanwhile, large amounts of narrative clinical notes provide complementary information, but are often not integrated into predictive models. In this paper, we provide a novel multimodal transformer to fuse clinical notes and structured EHR data for better prediction of in-hospital mortality. To improve interpretability, we propose an integrated gradients (IG) method to select important words in clinical notes and discover the critical structured EHR features with Shapley values. These important words and clinical features are visualized to assist with interpretation of the prediction outcomes. We also investigate the significance of domain adaptive pretraining and task adaptive fine-tuning on the Clinical BERT, which is used to learn the representations of clinical notes. Experiments demonstrated that our model outperforms other methods (AUCPR: 0.538, AUCROC: 0.877, F1:0.490).
△ Less
Submitted 9 May, 2023; v1 submitted 8 August, 2022;
originally announced August 2022.
-
Attention Hijacking in Trojan Transformers
Authors:
Weimin Lyu,
Songzhu Zheng,
Tengfei Ma,
Haibin Ling,
Chao Chen
Abstract:
Trojan attacks pose a severe threat to AI systems. Recent works on Transformer models received explosive popularity and the self-attentions are now indisputable. This raises a central question: Can we reveal the Trojans through attention mechanisms in BERTs and ViTs? In this paper, we investigate the attention hijacking pattern in Trojan AIs, \ie, the trigger token ``kidnaps'' the attention weight…
▽ More
Trojan attacks pose a severe threat to AI systems. Recent works on Transformer models received explosive popularity and the self-attentions are now indisputable. This raises a central question: Can we reveal the Trojans through attention mechanisms in BERTs and ViTs? In this paper, we investigate the attention hijacking pattern in Trojan AIs, \ie, the trigger token ``kidnaps'' the attention weights when a specific trigger is present. We observe the consistent attention hijacking pattern in Trojan Transformers from both Natural Language Processing (NLP) and Computer Vision (CV) domains. This intriguing property helps us to understand the Trojan mechanism in BERTs and ViTs. We also propose an Attention-Hijacking Trojan Detector (AHTD) to discriminate the Trojan AIs from the clean ones.
△ Less
Submitted 9 August, 2022;
originally announced August 2022.