Skip to main content

Showing 1–50 of 74 results for author: Niu, K

  1. arXiv:2406.07390  [pdf, other

    eess.SP cs.IT eess.IV

    DiffCom: Channel Received Signal is a Natural Condition to Guide Diffusion Posterior Sampling

    Authors: Sixian Wang, Jincheng Dai, Kailin Tan, Xiaoqi Qin, Kai Niu, Ping Zhang

    Abstract: End-to-end visual communication systems typically optimize a trade-off between channel bandwidth costs and signal-level distortion metrics. However, under challenging physical conditions, this traditional discriminative communication paradigm often results in unrealistic reconstructions with perceptible blurring and aliasing artifacts, despite the inclusion of perceptual or adversarial losses for… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

  2. arXiv:2406.06446  [pdf, other

    cs.IT cs.LG cs.MM

    Deep Generative Modeling Reshapes Compression and Transmission: From Efficiency to Resiliency

    Authors: Jincheng Dai, Xiaoqi Qin, Sixian Wang, Lexi Xu, Kai Niu, Ping Zhang

    Abstract: Information theory and machine learning are inextricably linked and have even been referred to as "two sides of the same coin". One particularly elegant connection is the essential equivalence between probabilistic generative modeling and data compression or transmission. In this article, we reveal the dual-functionality of deep generative models that reshapes both data compression for efficiency… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: Publication in IEEE Wireless Communications

  3. arXiv:2406.06045  [pdf, other

    cs.CV cs.AI

    Synthesizing Efficient Data with Diffusion Models for Person Re-Identification Pre-Training

    Authors: Ke Niu, Haiyang Yu, Xuelin Qian, Teng Fu, Bin Li, Xiangyang Xue

    Abstract: Existing person re-identification (Re-ID) methods principally deploy the ImageNet-1K dataset for model initialization, which inevitably results in sub-optimal situations due to the large domain gap. One of the key challenges is that building large-scale person Re-ID datasets is time-consuming. Some previous efforts address this problem by collecting person images from the internet e.g., LUPerson,… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

  4. arXiv:2406.02962  [pdf, other

    cs.CL cs.AI cs.IR

    Docs2KG: Unified Knowledge Graph Construction from Heterogeneous Documents Assisted by Large Language Models

    Authors: Qiang Sun, Yuanyi Luo, Wenxiao Zhang, Sirui Li, Jichunyang Li, Kai Niu, Xiangrui Kong, Wei Liu

    Abstract: Even for a conservative estimate, 80% of enterprise data reside in unstructured files, stored in data lakes that accommodate heterogeneous formats. Classical search engines can no longer meet information seeking needs, especially when the task is to browse and explore for insight formulation. In other words, there are no obvious search keywords to use. Knowledge graphs, due to their natural visual… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

  5. arXiv:2405.19542  [pdf, other

    eess.SP cs.LG cs.RO

    Anatomical Region Recognition and Real-time Bone Tracking Methods by Dynamically Decoding A-Mode Ultrasound Signals

    Authors: Bangyu Lan, Stefano Stramigioli, Kenan Niu

    Abstract: Accurate bone tracking is crucial for kinematic analysis in orthopedic surgery and prosthetic robotics. Traditional methods (e.g., skin markers) are subject to soft tissue artifacts, and the bone pins used in surgery introduce the risk of additional trauma and infection. For electromyography (EMG), its inability to directly measure joint angles requires complex algorithms for kinematic estimation.… ▽ More

    Submitted 31 May, 2024; v1 submitted 29 May, 2024; originally announced May 2024.

  6. arXiv:2403.05879  [pdf, other

    eess.SP cs.LG cs.RO

    Deep Learning based acoustic measurement approach for robotic applications on orthopedics

    Authors: Bangyu Lan, Momen Abayazid, Nico Verdonschot, Stefano Stramigioli, Kenan Niu

    Abstract: In Total Knee Replacement Arthroplasty (TKA), surgical robotics can provide image-guided navigation to fit implants with high precision. Its tracking approach highly relies on inserting bone pins into the bones tracked by the optical tracking system. This is normally done by invasive, radiative manners (implantable markers and CT scans), which introduce unnecessary trauma and prolong the preparati… ▽ More

    Submitted 9 March, 2024; originally announced March 2024.

  7. arXiv:2401.14634  [pdf, other

    cs.IT

    Semantic Huffman Coding using Synonymous Mapping

    Authors: Jin Xu, Kai Niu, Zijian Liang, Ping Zhang

    Abstract: Semantic communication stands out as a highly promising avenue for future developments in communications. Theoretically, source compression coding based on semantics can achieve lower rates than Shannon entropy. This paper introduces a semantic Huffman coding built upon semantic information theory. By incorporating synonymous mapping and synonymous sets, semantic Huffman coding can achieve shorter… ▽ More

    Submitted 25 January, 2024; originally announced January 2024.

    Comments: 6 pages, 3 figures, this paper is submitted to the 2024 IEEE International Symposium on Information Theory (ISIT 2024)

  8. arXiv:2401.14633  [pdf, other

    cs.IT

    Semantic Arithmetic Coding using Synonymous Mappings

    Authors: Zijian Liang, Kai Niu, Jin Xu, Ping Zhang

    Abstract: Recent semantic communication methods explore effective ways to expand the communication paradigm and improve the system performance of the communication systems. Nonetheless, the common problem of these methods is that the essence of semantics is not explicitly pointed out and directly utilized. A new epistemology suggests that synonymy, which is revealed as the fundamental feature of semantics,… ▽ More

    Submitted 25 January, 2024; originally announced January 2024.

    Comments: 6 pages, 4 figures. This paper is submitted to the 2024 IEEE International Symposium on Information Theory (ISIT 2024)

  9. arXiv:2401.14160  [pdf, other

    cs.IT

    A Mathematical Theory of Semantic Communication: Overview

    Authors: Kai Niu, Ping Zhang

    Abstract: Semantic communication initiates a new direction for future communication. In this paper, we aim to establish a systematic framework of semantic information theory (SIT). First, we propose a semantic communication model and define the synonymous mapping to indicate the critical relationship between semantic information and syntactic information. Based on this core concept, we introduce the measure… ▽ More

    Submitted 25 January, 2024; originally announced January 2024.

    Comments: 6 pages, 2 figures. This paper is submitted to the 2024 IEEE International Symposium on Information Theory (ISIT 2024). arXiv admin note: substantial text overlap with arXiv:2401.13387

  10. arXiv:2401.13387  [pdf, other

    cs.IT

    A Mathematical Theory of Semantic Communication

    Authors: Kai Niu, Ping Zhang

    Abstract: The year 1948 witnessed the historic moment of the birth of classic information theory (CIT). Guided by CIT, modern communication techniques have approached the theoretic limitations, such as, entropy function $H(U)$, channel capacity $C=\max_{p(x)}I(X;Y)$ and rate-distortion function $R(D)=\min_{p(\hat{x}|x):\mathbb{E}d(x,\hat{x})\leq D} I(X;\hat{X})$. Semantic communication paves a new direction… ▽ More

    Submitted 26 March, 2024; v1 submitted 24 January, 2024; originally announced January 2024.

    Comments: (version 2.0 updated) 96 pages, 18 figures. This paper is submitted to IEEE Transactions on Information Theory (TIT)

  11. arXiv:2312.08862  [pdf, other

    cs.IT eess.SP

    Semantics-Division Duplexing: A Novel Full-Duplex Paradigm

    Authors: Kai Niu, Zijian Liang, Chao Dong, Jincheng Dai, Zhongwei Si, Ping Zhang

    Abstract: In-band full-duplex (IBFD) is a theoretically effective solution to increase the overall throughput for the future wireless communications system by enabling transmission and reception over the same time-frequency resources. However, reliable source reconstruction remains a great challenge in the practical IBFD systems due to the non-ideal elimination of the self-interference and the inherent limi… ▽ More

    Submitted 14 December, 2023; originally announced December 2023.

    Comments: 9 pages, 5 figures, submitted to IEEE Wireless Communications Magazine

  12. arXiv:2312.02456  [pdf, other

    cs.CR

    Watermarking for Neural Radiation Fields by Invertible Neural Network

    Authors: Wenquan Sun, Jia Liu, Weina Dong, Lifeng Chen, Ke Niu

    Abstract: To protect the copyright of the 3D scene represented by the neural radiation field, the embedding and extraction of the neural radiation field watermark are considered as a pair of inverse problems of image transformations. A scheme for protecting the copyright of the neural radiation field is proposed using invertible neural network watermarking, which utilizes watermarking techniques for 2D imag… ▽ More

    Submitted 4 December, 2023; originally announced December 2023.

  13. arXiv:2310.07121  [pdf, other

    cs.MM cs.CR

    Motion Vector-Domain Video Steganalysis Exploiting Skipped Macroblocks

    Authors: Jun Li, Minqing Zhang, Ke Niu, Yingnan Zhang, Xiaoyuan Yang

    Abstract: Video steganography has the potential to be used to convey illegal information, and video steganalysis is a vital tool to detect the presence of this illicit act. Currently, all the motion vector (MV)-based video steganalysis algorithms extract feature sets directly on the MVs, but ignoring the steganograhic operation may perturb the statistics distribution of other video encoding elements, such a… ▽ More

    Submitted 10 October, 2023; originally announced October 2023.

  14. arXiv:2309.11836  [pdf, other

    cs.IT

    Pre-configured Error Pattern Ordered Statistics Decoding for CRC-Polar Codes

    Authors: Xuanyu Li, Kai Niu, Yuxin Han, Jincheng Dai, Zhiyuan Tan, Zhiheng Guo

    Abstract: In this paper, we propose a pre-configured error pattern ordered statistics decoding (PEPOSD) algorithm and discuss its application to short cyclic redundancy check (CRC)-polar codes. Unlike the traditional OSD that changes the most reliable independent symbols, we regard the decoding process as testing the error patterns, like guessing random additive noise decoding (GRAND). Also, the pre-configu… ▽ More

    Submitted 23 September, 2023; v1 submitted 21 September, 2023; originally announced September 2023.

  15. arXiv:2309.04682  [pdf, other

    cs.CV

    DeNoising-MOT: Towards Multiple Object Tracking with Severe Occlusions

    Authors: Teng Fu, Xiaocong Wang, Haiyang Yu, Ke Niu, Bin Li, Xiangyang Xue

    Abstract: Multiple object tracking (MOT) tends to become more challenging when severe occlusions occur. In this paper, we analyze the limitations of traditional Convolutional Neural Network-based methods and Transformer-based methods in handling occlusions and propose DNMOT, an end-to-end trainable DeNoising Transformer for MOT. To address the challenge of occlusions, we explicitly simulate the scenarios wh… ▽ More

    Submitted 9 September, 2023; originally announced September 2023.

    Comments: ACM Multimedia 2023

  16. arXiv:2309.00885  [pdf, other

    eess.IV cs.CV cs.LG

    A Generic Fundus Image Enhancement Network Boosted by Frequency Self-supervised Representation Learning

    Authors: Heng Li, Haofeng Liu, Huazhu Fu, Yanwu Xu, Hui Shu, Ke Niu, Yan Hu, Jiang Liu

    Abstract: Fundus photography is prone to suffer from image quality degradation that impacts clinical examination performed by ophthalmologists or intelligent systems. Though enhancement algorithms have been developed to promote fundus observation on degraded images, high data demands and limited applicability hinder their clinical deployment. To circumvent this bottleneck, a generic fundus image enhancement… ▽ More

    Submitted 2 September, 2023; originally announced September 2023.

    Comments: Accepted by Medical Image Analysis in Auguest, 2023

    Journal ref: Medical Image Analysis, 2023, 90:102945

  17. arXiv:2308.06464  [pdf, other

    cs.CR cs.LG cs.MM

    A One-dimensional HEVC video steganalysis method using the Optimality of Predicted Motion Vectors

    Authors: Jun Li, Minqing Zhang, Ke Niu, Yingnan Zhang, Xiaoyuan Yang

    Abstract: Among steganalysis techniques, detection against motion vector (MV) domain-based video steganography in High Efficiency Video Coding (HEVC) standard remains a hot and challenging issue. For the purpose of improving the detection performance, this paper proposes a steganalysis feature based on the optimality of predicted MVs with a dimension of one. Firstly, we point out that the motion vector pred… ▽ More

    Submitted 12 August, 2023; originally announced August 2023.

    Comments: Submitted to TCSVT

  18. arXiv:2303.14640  [pdf, other

    eess.SP cs.IT

    NeurJSCC Enabled Semantic Communications: Paradigms, Applications, and Potentials

    Authors: Sixian Wang, Jincheng Dai, Xiaoqi Qin, Kai Niu, Ping Zhang

    Abstract: Recent advances in deep learning have led to increased interest in solving high-efficiency end-to-end transmission problems using methods that employ the nonlinear property of neural networks. These techniques, we call neural joint source-channel coding (NeurJSCC), extract latent semantic features of the source signal across space and time, and design corresponding variable-length NeurJSCC approac… ▽ More

    Submitted 23 June, 2023; v1 submitted 26 March, 2023; originally announced March 2023.

  19. arXiv:2303.14637  [pdf, other

    eess.SP cs.MM

    Improved Nonlinear Transform Source-Channel Coding to Catalyze Semantic Communications

    Authors: Sixian Wang, Jincheng Dai, Xiaoqi Qin, Zhongwei Si, Kai Niu, Ping Zhang

    Abstract: Recent deep learning methods have led to increased interest in solving high-efficiency end-to-end transmission problems. These methods, we call nonlinear transform source-channel coding (NTSCC), extract the semantic latent features of source signal, and learn entropy model to guide the joint source-channel coding with variable rate to transmit latent features over wireless channels. In this paper,… ▽ More

    Submitted 18 August, 2023; v1 submitted 26 March, 2023; originally announced March 2023.

  20. A Golden Decade of Polar Codes: From Basic Principle to 5G Applications

    Authors: Kai Niu, Ping Zhang, Jincheng Dai, Zhongwei Si, Chao Dong

    Abstract: After the pursuit of seventy years, the invention of polar codes indicates that we have found the first capacity-achieving coding with low complexity construction and decoding, which is the great breakthrough of the coding theory in the past two decades. In this survey, we retrospect the history of polar codes and summarize the advancement in the past ten years. First, the primary principle of cha… ▽ More

    Submitted 25 March, 2023; originally announced March 2023.

    Comments: 29 pages, 21 figures, Published in China Communications

    Journal ref: China Communications, vol.20, no. 2, pp. 94-121, 2023

  21. arXiv:2212.05294  [pdf, ps, other

    cs.SD cs.IT eess.AS

    Variational Speech Waveform Compression to Catalyze Semantic Communications

    Authors: Shengshi Yao, Zixuan Xiao, Sixian Wang, Jincheng Dai, Kai Niu, Ping Zhang

    Abstract: We propose a novel neural waveform compression method to catalyze emerging speech semantic communications. By introducing nonlinear transform and variational modeling, we effectively capture the dependencies within speech frames and estimate the probabilistic distribution of the speech feature more accurately, giving rise to better compression performance. In particular, the speech signals are ana… ▽ More

    Submitted 13 December, 2022; v1 submitted 10 December, 2022; originally announced December 2022.

  22. arXiv:2211.14541  [pdf, other

    cs.AI

    RL-Based Guidance in Outpatient Hysteroscopy Training: A Feasibility Study

    Authors: Vladimir Poliakov, Kenan Niu, Emmanuel Vander Poorten, Dzmitry Tsetserukou

    Abstract: This work presents an RL-based agent for outpatient hysteroscopy training. Hysteroscopy is a gynecological procedure for examination of the uterine cavity. Recent advancements enabled performing this type of intervention in the outpatient setup without anaesthesia. While being beneficial to the patient, this approach introduces new challenges for clinicians, who should take additional measures to… ▽ More

    Submitted 26 November, 2022; originally announced November 2022.

  23. arXiv:2211.04339  [pdf, other

    cs.IT cs.LG eess.SP

    Toward Adaptive Semantic Communications: Efficient Data Transmission via Online Learned Nonlinear Transform Source-Channel Coding

    Authors: Jincheng Dai, Sixian Wang, Ke Yang, Kailin Tan, Xiaoqi Qin, Zhongwei Si, Kai Niu, Ping Zhang

    Abstract: The emerging field semantic communication is driving the research of end-to-end data transmission. By utilizing the powerful representation ability of deep learning models, learned data transmission schemes have exhibited superior performance than the established source and channel coding methods. While, so far, research efforts mainly concentrated on architecture and model improvements toward a s… ▽ More

    Submitted 24 May, 2023; v1 submitted 8 November, 2022; originally announced November 2022.

    Comments: Accepted by IEEE JSAC

  24. arXiv:2211.02283  [pdf, ps, other

    cs.SD cs.IT eess.AS

    Wireless Deep Speech Semantic Transmission

    Authors: Zixuan Xiao, Shengshi Yao, Jincheng Dai, Sixian Wang, Kai Niu, Ping Zhang

    Abstract: In this paper, we propose a new class of high-efficiency semantic coded transmission methods for end-to-end speech transmission over wireless channels. We name the whole system as deep speech semantic transmission (DSST). Specifically, we introduce a nonlinear transform to map the speech source to semantic latent space and feed semantic features into source-channel encoder to generate the channel-… ▽ More

    Submitted 4 November, 2022; originally announced November 2022.

  25. arXiv:2211.00937  [pdf, other

    cs.CV cs.IT

    WITT: A Wireless Image Transmission Transformer for Semantic Communications

    Authors: Ke Yang, Sixian Wang, Jincheng Dai, Kailin Tan, Kai Niu, Ping Zhang

    Abstract: In this paper, we aim to redesign the vision Transformer (ViT) as a new backbone to realize semantic image transmission, termed wireless image transmission transformer (WITT). Previous works build upon convolutional neural networks (CNNs), which are inefficient in capturing global dependencies, resulting in degraded end-to-end transmission performance especially for high-resolution images. To tack… ▽ More

    Submitted 2 November, 2022; originally announced November 2022.

  26. arXiv:2210.16741  [pdf, ps, other

    cs.IT eess.SP

    Versatile Semantic Coded Transmission over MIMO Fading Channels

    Authors: Shengshi Yao, Sixian Wang, Jincheng Dai, Kai Niu, Ping Zhang

    Abstract: Semantic communications have shown great potential to boost the end-to-end transmission performance. To further improve the system efficiency, in this paper, we propose a class of novel semantic coded transmission (SCT) schemes over multiple-input multiple-output (MIMO) fading channels. In particular, we propose a high-efficiency SCT system supporting concurrent transmission of multiple streams, w… ▽ More

    Submitted 30 October, 2022; originally announced October 2022.

  27. arXiv:2209.08294  [pdf, ps, other

    cs.SI

    A Survey on the Network Models applied in the Industrial Network Optimization

    Authors: Chao Dong, Xiaoxiong Xiong, Qiulin Xue, Zhengzhen Zhang, Kai Niu, Ping Zhang

    Abstract: Network architecture design is very important for the optimization of industrial networks. The type of network architecture can be divided into small-scale network and large-scale network according to its scale. Graph theory is an efficient mathematical tool for network topology modeling. For small-scale networks, its structure often has regular topology. For large-scale networks, the existing res… ▽ More

    Submitted 17 September, 2022; originally announced September 2022.

    Comments: 26 pages, 11 figures, Journal

  28. arXiv:2209.01744  [pdf

    cs.CR cs.MM

    Investigation on Principles for Cost Assignment in Motion Vector-based Video Steganography

    Authors: Jun Li, Minqing Zhang, Ke Niu, Xiaoyuan Yang

    Abstract: Cost assignment in the motion vector domain remains a research focus in video steganography. Recent studies in image steganography have summarized many principles for cost assignment and achieved good results. But the basic principles for cost assignment in motion vector-based video steganography have not been fully discussed yet. Firstly, this paper proposes three principles for cost assignment i… ▽ More

    Submitted 4 September, 2022; originally announced September 2022.

    Comments: 16 pages, 8 figures,

  29. arXiv:2208.02481  [pdf, ps, other

    cs.IT cs.AI cs.LG

    Communication Beyond Transmitting Bits: Semantics-Guided Source and Channel Coding

    Authors: Jincheng Dai, Ping Zhang, Kai Niu, Sixian Wang, Zhongwei Si, Xiaoqi Qin

    Abstract: Classical communication paradigms focus on accurately transmitting bits over a noisy channel, and Shannon theory provides a fundamental theoretical limit on the rate of reliable communications. In this approach, bits are treated equally, and the communication system is oblivious to what meaning these bits convey or how they would be used. Future communications towards intelligence and conciseness… ▽ More

    Submitted 4 August, 2022; originally announced August 2022.

    Comments: IEEE Wireless Communications, text overlap with arXiv:2112.03093

  30. arXiv:2205.13129  [pdf, other

    cs.CV cs.IT

    Wireless Deep Video Semantic Transmission

    Authors: Sixian Wang, Jincheng Dai, Zijian Liang, Kai Niu, Zhongwei Si, Chao Dong, Xiaoqi Qin, Ping Zhang

    Abstract: In this paper, we design a new class of high-efficiency deep joint source-channel coding methods to achieve end-to-end video transmission over wireless channels. The proposed methods exploit nonlinear transform and conditional coding architecture to adaptively extract semantic features across video frames, and transmit semantic feature domain representations over wireless channels via deep joint s… ▽ More

    Submitted 2 November, 2022; v1 submitted 25 May, 2022; originally announced May 2022.

    Comments: published in IEEE JSAC

  31. arXiv:2205.13120  [pdf, ps, other

    cs.CV cs.IT

    Perceptual Learned Source-Channel Coding for High-Fidelity Image Semantic Transmission

    Authors: Jun Wang, Sixian Wang, Jincheng Dai, Zhongwei Si, Dekun Zhou, Kai Niu

    Abstract: As one novel approach to realize end-to-end wireless image semantic transmission, deep learning-based joint source-channel coding (deep JSCC) method is emerging in both deep learning and communication communities. However, current deep JSCC image transmission systems are typically optimized for traditional distortion metrics such as peak signal-to-noise ratio (PSNR) or multi-scale structural simil… ▽ More

    Submitted 25 May, 2022; originally announced May 2022.

  32. arXiv:2205.03534  [pdf, other

    cs.CL cs.CV cs.MM

    Attract me to Buy: Advertisement Copywriting Generation with Multimodal Multi-structured Information

    Authors: Zhipeng Zhang, Xinglin Hou, Kai Niu, Zhongzhen Huang, Tiezheng Ge, Yuning Jiang, Qi Wu, Peng Wang

    Abstract: Recently, online shopping has gradually become a common way of shopping for people all over the world. Wonderful merchandise advertisements often attract more people to buy. These advertisements properly integrate multimodal multi-structured information of commodities, such as visual spatial information and fine-grained structure information. However, traditional multimodal text generation focuses… ▽ More

    Submitted 6 May, 2022; originally announced May 2022.

  33. arXiv:2204.07435  [pdf, other

    cs.IT

    Performance and Construction of Polar Codes: The Perspective of Bit Error Probability

    Authors: Bolin Wu, Kai Niu, Jincheng Dai

    Abstract: Most existing works of polar codes focus on the analysis of block error probability. However, in many scenarios, bit error probability is also important for evaluating the performance of channel codes. In this paper, we establish a new framework to analyze the bit error probability of polar codes. Specifically, by revisiting the error event of bit-channel, we first introduce the conditional bit er… ▽ More

    Submitted 15 April, 2022; originally announced April 2022.

  34. arXiv:2204.03535  [pdf, ps, other

    cs.NI eess.SP

    Practical Issues and Challenges in CSI-based Integrated Sensing and Communication

    Authors: Daqing Zhang, Dan Wu, Kai Niu, Xuanzhi Wang, Fusang Zhang, Jian Yao, Dajie Jiang, Fei Qin

    Abstract: Next-generation mobile communication network (i.e., 6G) has been envisioned to go beyond classical communication functionality and provide integrated sensing and communication (ISAC) capability to enable more emerging applications, such as smart cities, connected vehicles, AIoT and health care/elder care. Among all the ISAC proposals, the most practical and promising approach is to empower existin… ▽ More

    Submitted 17 March, 2022; originally announced April 2022.

    Comments: ICC 2022 workshop on integrated sensing and communication (ISAC)

  35. arXiv:2204.03125  [pdf, other

    eess.SY cs.LG

    Deep transfer learning for system identification using long short-term memory neural networks

    Authors: Kaicheng Niu, Mi Zhou, Chaouki T. Abdallah, Mohammad Hayajneh

    Abstract: Recurrent neural networks (RNNs) have many advantages over more traditional system identification techniques. They may be applied to linear and nonlinear systems, and they require fewer modeling assumptions. However, these neural network models may also need larger amounts of data to learn and generalize. Furthermore, neural networks training is a time-consuming process. Hence, building upon long-… ▽ More

    Submitted 6 April, 2022; originally announced April 2022.

  36. arXiv:2203.06692  [pdf, other

    cs.IT

    Towards Semantic Communications: A Paradigm Shift

    Authors: Kai Niu, Jincheng Dai, Shengshi Yao, Sixian Wang, Zhongwei Si, Xiaoqi Qin, Ping Zhang

    Abstract: The last seventy years have witnessed the transition of communication from Shannon's theoretical concept to current high-efficient practical systems. Classical communication systems address the capability-deficiency issue mainly by module-stacking and technique-densification with ever-increasing complexity. In such a traditional viewpoint, classical source coding only uses explicit probabilistic m… ▽ More

    Submitted 30 March, 2022; v1 submitted 13 March, 2022; originally announced March 2022.

  37. arXiv:2202.14018  [pdf, other

    cs.AI

    Description Logic EL++ Embeddings with Intersectional Closure

    Authors: Xi Peng, Zhenwei Tang, Maxat Kulmanov, Kexin Niu, Robert Hoehndorf

    Abstract: Many ontologies, in particular in the biomedical domain, are based on the Description Logic EL++. Several efforts have been made to interpret and exploit EL++ ontologies by distributed representation learning. Specifically, concepts within EL++ theories have been represented as n-balls within an n-dimensional embedding space. However, the intersectional closure is not satisfied when using n-balls… ▽ More

    Submitted 28 February, 2022; originally announced February 2022.

  38. arXiv:2201.10340  [pdf, other

    cs.IT cs.LG

    Distributed Image Transmission using Deep Joint Source-Channel Coding

    Authors: Sixian Wang, Ke Yang, Jincheng Dai, Kai Niu

    Abstract: We study the problem of deep joint source-channel coding (D-JSCC) for correlated image sources, where each source is transmitted through a noisy independent channel to the common receiver. In particular, we consider a pair of images captured by two cameras with probably overlapping fields of view transmitted over wireless channels and reconstructed in the center node. The challenging problem invol… ▽ More

    Submitted 25 January, 2022; originally announced January 2022.

    Comments: ICASSP 2022

  39. arXiv:2201.03801  [pdf, other

    cs.LG cs.AI

    Winning solutions and post-challenge analyses of the ChaLearn AutoDL challenge 2019

    Authors: Zhengying Liu, Adrien Pavao, Zhen Xu, Sergio Escalera, Fabio Ferreira, Isabelle Guyon, Sirui Hong, Frank Hutter, Rongrong Ji, Julio C. S. Jacques Junior, Ge Li, Marius Lindauer, Zhipeng Luo, Meysam Madadi, Thomas Nierhoff, Kangning Niu, Chunguang Pan, Danny Stoll, Sebastien Treguer, Jin Wang, Peng Wang, Chenglin Wu, Youcheng Xiong, Arbe r Zela, Yang Zhang

    Abstract: This paper reports the results and post-challenge analyses of ChaLearn's AutoDL challenge series, which helped sorting out a profusion of AutoML solutions for Deep Learning (DL) that had been introduced in a variety of settings, but lacked fair comparisons. All input data modalities (time series, images, videos, text, tabular) were formatted as tensors and all tasks were multi-label classification… ▽ More

    Submitted 11 January, 2022; originally announced January 2022.

    Comments: The first three authors contributed equally; This is only a draft version

    Journal ref: IEEE Transactions on Pattern Analysis and Machine Intelligence (T-PAMI) 2021

  40. arXiv:2201.02924  [pdf, ps, other

    cs.IT

    Joint Successive Cancellation List Decoding for the Double Polar codes

    Authors: Yanfei Dong, Kai Niu, Jincheng Dai, Sen Wang, Yifei Yuan

    Abstract: As a new joint source-channel coding scheme, the double polar (D-Polar) codes have been proposed recently. In this letter, a novel joint source-channel decoder, namely the joint successive cancellation list (J-SCL) decoder, is proposed to improve the decoding performance of the D-Polar codes. We merge the trellis of the source polar code and that of the channel polar code to construct a compound t… ▽ More

    Submitted 8 January, 2022; originally announced January 2022.

  41. arXiv:2112.10961  [pdf, other

    cs.IT cs.CV cs.LG

    Nonlinear Transform Source-Channel Coding for Semantic Communications

    Authors: Jincheng Dai, Sixian Wang, Kailin Tan, Zhongwei Si, Xiaoqi Qin, Kai Niu, Ping Zhang

    Abstract: In this paper, we propose a class of high-efficiency deep joint source-channel coding methods that can closely adapt to the source distribution under the nonlinear transform, it can be collected under the name nonlinear transform source-channel coding (NTSCC). In the considered model, the transmitter first learns a nonlinear analysis transform to map the source data into latent space, then transmi… ▽ More

    Submitted 2 November, 2022; v1 submitted 20 December, 2021; originally announced December 2021.

    Comments: published in IEEE JSAC

  42. arXiv:2112.03093  [pdf, ps, other

    cs.IT

    Communication Beyond Transmitting Bits: Semantics-Guided Source and Channel Coding

    Authors: Jincheng Dai, Ping Zhang, Kai Niu, Sixian Wang, Zhongwei Si, Xiaoqi Qin

    Abstract: Classical communication paradigms focus on accurately transmitting bits over a noisy channel, and Shannon theory provides a fundamental theoretical limit on the rate of reliable communications. In this approach, bits are treated equally, and the communication system is oblivious to what meaning these bits convey or how they would be used. Future communications towards intelligence and conciseness… ▽ More

    Submitted 1 June, 2022; v1 submitted 6 December, 2021; originally announced December 2021.

  43. arXiv:2110.12224  [pdf, other

    cs.IT eess.SP

    Generalized Polarization Transform: A Novel Coded Transmission Paradigm

    Authors: Bolin Wu, Jincheng Dai, Kai Niu, Zhongwei Si, Ping Zhang, Sen Wang, Yifei Yuan, Chih-Lin I

    Abstract: For the upcoming 6G wireless networks, a new wave of applications and services will demand ultra-high data rates and reliability. To this end, future wireless systems are expected to pave the way for entirely new fundamental air interface technologies to attain a breakthrough in spectrum efficiency (SE). This article discusses a new paradigm, named generalized polarization transform (GPT), to achi… ▽ More

    Submitted 27 April, 2022; v1 submitted 23 October, 2021; originally announced October 2021.

  44. arXiv:2110.08268  [pdf, other

    cs.CY cs.AI cs.LG

    Explainable Student Performance Prediction With Personalized Attention for Explaining Why A Student Fails

    Authors: Kun Niu, Xipeng Cao, Yicong Yu

    Abstract: As student failure rates continue to increase in higher education, predicting student performance in the following semester has become a significant demand. Personalized student performance prediction helps educators gain a comprehensive view of student status and effectively intervene in advance. However, existing works scarcely consider the explainability of student performance prediction, which… ▽ More

    Submitted 15 October, 2021; originally announced October 2021.

    Comments: AAAI 2021 Workshop on AI Education/TIPCE 2021

  45. arXiv:2109.12965  [pdf, other

    cs.CV cs.AI

    Text-based Person Search in Full Images via Semantic-Driven Proposal Generation

    Authors: Shizhou Zhang, De Cheng, Wenlong Luo, Yinghui Xing, Duo Long, Hao Li, Kai Niu, Guoqiang Liang, Yanning Zhang

    Abstract: Finding target persons in full scene images with a query of text description has important practical applications in intelligent video surveillance.However, different from the real-world scenarios where the bounding boxes are not available, existing text-based person retrieval methods mainly focus on the cross modal matching between the query text descriptions and the gallery of cropped pedestrian… ▽ More

    Submitted 25 February, 2024; v1 submitted 27 September, 2021; originally announced September 2021.

  46. arXiv:2108.03508  [pdf, other

    cs.LG

    The Effect of Training Parameters and Mechanisms on Decentralized Federated Learning based on MNIST Dataset

    Authors: Zhuofan Zhang, Mi Zhou, Kaicheng Niu, Chaouki Abdallah

    Abstract: Federated Learning is an algorithm suited for training models on decentralized data, but the requirement of a central "server" node is a bottleneck. In this document, we first introduce the notion of Decentralized Federated Learning (DFL). We then perform various experiments on different setups, such as changing model aggregation frequency, switching from independent and identically distributed (I… ▽ More

    Submitted 7 August, 2021; originally announced August 2021.

  47. arXiv:2108.03495  [pdf, other

    cs.MA

    Game Theory and Machine Learning in UAVs-Assisted Wireless Communication Networks: A Survey

    Authors: M. Zhou, Y. Guan, M. Hayajneh, K. Niu, C. Abdallah

    Abstract: In recent years, Unmanned Aerial Vehicles (UAVs) have been used in fields such as architecture, business delivery, military and civilian theaters, and many others. With increased applications comes the increased demand for advanced algorithms for resource allocation and energy management. As is well known, game theory and machine learning are two powerful tools already widely used in the wireless… ▽ More

    Submitted 7 August, 2021; originally announced August 2021.

  48. arXiv:2104.05178  [pdf, other

    cs.IT

    Polar-Precoding: A Unitary Finite-Feedback Transmit Precoder for Polar-Coded MIMO Systems

    Authors: Jinnan Piao, Kai Niu, Jincheng Dai, Lajos Hanzo

    Abstract: We propose a unitary precoding scheme, namely polar-precoding, to improve the performance of polar-coded MIMO systems. In contrast to the traditional design of MIMO precoding criteria, the proposed polar-precoding scheme relies on the \emph{polarization criterion}. In particular, the precoding matrix design comprises two steps. After selecting a basic matrix for maximizing the capacity in the firs… ▽ More

    Submitted 13 September, 2021; v1 submitted 11 April, 2021; originally announced April 2021.

    Comments: Polar-coded MIMO system, polarization criterion, precoding, unitary matrix

  49. arXiv:2102.03828  [pdf, other

    cs.IT

    Learning to Decode Protograph LDPC Codes

    Authors: Jincheng Dai, Kailin Tan, Zhongwei Si, Kai Niu, Mingzhe Chen, H. Vincent Poor, Shuguang Cui

    Abstract: The recent development of deep learning methods provides a new approach to optimize the belief propagation (BP) decoding of linear codes. However, the limitation of existing works is that the scale of neural networks increases rapidly with the codelength, thus they can only support short to moderate codelengths. From the point view of practicality, we propose a high-performance neural min-sum (MS)… ▽ More

    Submitted 10 February, 2021; v1 submitted 7 February, 2021; originally announced February 2021.

    Comments: To appear in the IEEE JSAC Series on Machine Learning in Communications and Networks

  50. arXiv:2011.10308  [pdf, other

    cs.IT

    Progressive Rate-Filling: A Framework for Agile Construction of Multilevel Polar-Coded Modulation

    Authors: Jincheng Dai, Jinnan Piao, Kai Niu

    Abstract: In this letter, we propose a progressive rate-filling method as a framework to study agile construction of multilevel polar-coded modulation. We show that the bit indices within each component polar code can follow a fixed, precomputed ranking sequence, e.g., the Polar sequence in the 5G standard, while their allocated rates (i.e., the number of information bits of each component polar code) can b… ▽ More

    Submitted 20 November, 2020; originally announced November 2020.