Showing 1–24 of 24 results for author: Diao, X

  1. arXiv:2404.12605  [pdf, other]

    cs.AI

    GluMarker: A Novel Predictive Modeling of Glycemic Control Through Digital Biomarkers

    Authors: Ziyi Zhou, Ming Cheng, Xingjian Diao, Yanjun Cui, Xiangling Li

    Abstract: The escalating prevalence of diabetes globally underscores the need for diabetes management. Recent research highlights the growing focus on digital biomarkers in diabetes management, with innovations in computational frameworks and noninvasive monitoring techniques using personalized glucose metrics. However, they predominantly focus on insulin dosing and specific glucose values, or with limited…

    Submitted 18 April, 2024; originally announced April 2024.

  2. arXiv:2404.12400  [pdf, other]

    cs.LG

    Efflex: Efficient and Flexible Pipeline for Spatio-Temporal Trajectory Graph Modeling and Representation Learning

    Authors: Ming Cheng, Ziyi Zhou, Bowen Zhang, Ziyu Wang, Jiaqi Gan, Ziang Ren, Weiqi Feng, Yi Lyu, Hefan Zhang, Xingjian Diao

    Abstract: In the landscape of spatio-temporal data analytics, effective trajectory representation learning is paramount. To bridge the gap of learning accurate representations with efficient and flexible mechanisms, we introduce Efflex, a comprehensive pipeline for transformative graph modeling and representation learning of the large-volume spatio-temporal trajectories. Efflex pioneers the incorporation of…

    Submitted 15 April, 2024; originally announced April 2024.

  3. arXiv:2404.11924  [pdf, other]

    cs.AI

    Toward Short-Term Glucose Prediction Solely Based on CGM Time Series

    Authors: Ming Cheng, Xingjian Diao, Ziyi Zhou, Yanjun Cui, Wenjun Liu, Shitong Cheng

    Abstract: The global diabetes epidemic highlights the importance of maintaining good glycemic control. Glucose prediction is a fundamental aspect of diabetes management, facilitating real-time decision-making. Recent research has introduced models focusing on long-term glucose trend prediction, which are unsuitable for real-time decision-making and result in delayed responses. Conversely, models designed to…

    Submitted 18 April, 2024; originally announced April 2024.

  4. arXiv:2404.10901  [pdf, other]

    cs.AI

    CrossGP: Cross-Day Glucose Prediction Excluding Physiological Information

    Authors: Ziyi Zhou, Ming Cheng, Yanjun Cui, Xingjian Diao, Zhaorui Ma

    Abstract: The increasing number of diabetic patients is a serious issue in society today, which has significant negative impacts on people's health and the country's financial expenditures. Because diabetes may develop into potential serious complications, early glucose prediction for diabetic patients is necessary for timely medical treatment. Existing glucose prediction methods typically utilize patients'…

    Submitted 16 April, 2024; originally announced April 2024.

  5. arXiv:2404.08021  [pdf, other]

    cs.LG cs.AI cs.RO

    VeTraSS: Vehicle Trajectory Similarity Search Through Graph Modeling and Representation Learning

    Authors: Ming Cheng, Bowen Zhang, Ziyu Wang, Ziyi Zhou, Weiqi Feng, Yi Lyu, Xingjian Diao

    Abstract: Trajectory similarity search plays an essential role in autonomous driving, as it enables vehicles to analyze the information and characteristics of different trajectories to make informed decisions and navigate safely in dynamic environments. Existing work on the trajectory similarity search task primarily utilizes sequence-processing algorithms or Recurrent Neural Networks (RNNs), which suffer f…

    Submitted 11 April, 2024; originally announced April 2024.

  6. arXiv:2402.18390  [pdf, other]

    cs.ET cs.AI cs.NE eess.SY

    Neuromorphic Event-Driven Semantic Communication in Microgrids

    Authors: Xiaoguang Diao, Yubo Song, Subham Sahoo, Yuan Li

    Abstract: Synergies between advanced communications, computing and artificial intelligence are unraveling new directions of coordinated operation and resiliency in microgrids. On one hand, coordination among sources is facilitated by distributed, privacy-minded processing at multiple locations, whereas on the other hand, it also creates exogenous data arrival paths for adversaries that can lead to cyber-phy…

    Submitted 28 February, 2024; originally announced February 2024.

    Comments: The manuscript has been accepted for publication in IEEE Transactions on Smart Grid

  7. arXiv:2312.15190  [pdf, other]

    cs.SD cs.AI cs.CR eess.AS

    SAIC: Integration of Speech Anonymization and Identity Classification

    Authors: Ming Cheng, Xingjian Diao, Shitong Cheng, Wenjun Liu

    Abstract: Speech anonymization and de-identification have garnered significant attention recently, especially in the healthcare area including telehealth consultations, patient voiceprint matching, and patient real-time monitoring. Speaker identity classification tasks, which involve recognizing specific speakers from audio to learn identity features, are crucial for de-identification. Since rare studies ha…

    Submitted 23 December, 2023; originally announced December 2023.

  8. arXiv:2312.05430  [pdf, other]

    cs.CV

    FT2TF: First-Person Statement Text-To-Talking Face Generation

    Authors: Xingjian Diao, Ming Cheng, Wayner Barrios, SouYoung Jin

    Abstract: Talking face generation has gained immense popularity in the computer vision community, with various applications including AR/VR, teleconferencing, digital assistants, and avatars. Traditional methods are mainly audio-driven ones which have to deal with the inevitable resource-intensive nature of audio storage and processing. To address such a challenge, we propose FT2TF - First-Person Statement…

    Submitted 8 December, 2023; originally announced December 2023.

  9. arXiv:2309.14845  [pdf, other]

    cs.RO eess.SY

    Graph Neural Network Based Method for Path Planning Problem

    Authors: Xingrong Diao, Wenzheng Chi, Jiankun Wang

    Abstract: Sampling-based path planning is a widely used method in robotics, particularly in high-dimensional state space. Among the whole process of the path planning, collision detection is the most time-consuming operation. In this paper, we propose a learning-based path planning method that aims to reduce the number of collision detection. We develop an efficient neural network model based on Graph Neura…

    Submitted 22 November, 2023; v1 submitted 26 September, 2023; originally announced September 2023.

  10. arXiv:2309.08738  [pdf, other]

    cs.CV cs.MM

    AV-MaskEnhancer: Enhancing Video Representations through Audio-Visual Masked Autoencoder

    Authors: Xingjian Diao, Ming Cheng, Shitong Cheng

    Abstract: Learning high-quality video representation has shown significant applications in computer vision and remains challenging. Previous work based on mask autoencoders such as ImageMAE and VideoMAE has proven the effectiveness of learning representations in images and videos through reconstruction strategy in the visual modality. However, these models exhibit inherent limitations, particularly in scena…

    Submitted 20 December, 2023; v1 submitted 15 September, 2023; originally announced September 2023.

    Comments: 2023 IEEE 35th International Conference on Tools with Artificial Intelligence (ICTAI)

  11. arXiv:2309.07136  [pdf, other]

    eess.SP cs.AI cs.LG stat.AP

    Masked Transformer for Electrocardiogram Classification

    Authors: Ya Zhou, Xiaolin Diao, Yanni Huo, Yang Liu, Xiaohan Fan, Wei Zhao

    Abstract: Electrocardiogram (ECG) is one of the most important diagnostic tools in clinical applications. With the advent of advanced algorithms, various deep learning models have been adopted for ECG tasks. However, the potential of Transformer for ECG data has not been fully realized, despite their widespread success in computer vision and natural language processing. In this work, we present Masked Trans…

    Submitted 22 April, 2024; v1 submitted 31 August, 2023; originally announced September 2023.

    Comments: more experimental results; more implementation details; different abstracts

  12. Toward Zero-shot Character Recognition: A Gold Standard Dataset with Radical-level Annotations

    Authors: Xiaolei Diao, Daqian Shi, Jian Li, Lida Shi, Mingzhe Yue, Ruihua Qi, Chuntao Li, Hao Xu

    Abstract: Optical character recognition (OCR) methods have been applied to diverse tasks, e.g., street view text recognition and document analysis. Recently, zero-shot OCR has piqued the interest of the research community because it considers a practical OCR scenario with unbalanced data distribution. However, there is a lack of benchmarks for evaluating such zero-shot methods that apply a divide-and-conque…

    Submitted 1 August, 2023; originally announced August 2023.

    Comments: Accepted by ACM MM 2023

  13. arXiv:2307.14119  [pdf, other]

    cs.CV cs.AI cs.MM

    A semantics-driven methodology for high-quality image annotation

    Authors: Fausto Giunchiglia, Mayukh Bagchi, Xiaolei Diao

    Abstract: Recent work in Machine Learning and Computer Vision has highlighted the presence of various types of systematic flaws inside ground truth object recognition benchmark datasets. Our basic tenet is that these flaws are rooted in the many-to-many mappings which exist between the visual information encoded in images and the intended semantics of the labels annotating them. The net consequence is that…

    Submitted 26 July, 2023; originally announced July 2023.

    Comments: Accepted @ 26th European Conference on Artificial Intelligence (ECAI) 2023, Kraków, Poland

    Report number: KDECAI23

  14. arXiv:2304.08989  [pdf, other]

    cs.CV

    Incremental Image Labeling via Iterative Refinement

    Authors: Fausto Giunchiglia, Xiaolei Diao, Mayukh Bagchi

    Abstract: Data quality is critical for multimedia tasks, while various types of systematic flaws are found in image benchmark datasets, as discussed in recent work. In particular, the existence of the semantic gap problem leads to a many-to-many mapping between the information extracted from an image and its linguistic description. This unavoidable bias further leads to poor performance on current computer…

    Submitted 18 April, 2023; originally announced April 2023.

    Journal ref: IWCIM@ICASSP 2023

  15. arXiv:2212.06629  [pdf, other]

    cs.CV cs.AI

    Aligning Visual and Lexical Semantics

    Authors: Fausto Giunchiglia, Mayukh Bagchi, Xiaolei Diao

    Abstract: We discuss two kinds of semantics relevant to Computer Vision (CV) systems - Visual Semantics and Lexical Semantics. While visual semantics focus on how humans build concepts when using vision to perceive a target reality, lexical semantics focus on how humans build concepts of the same target reality through the use of language. The lack of coincidence between visual and lexical semantics, in tur…

    Submitted 13 December, 2022; originally announced December 2022.

    Comments: iConference 2023, Barcelona, March 27 - 29, 2023

  16. CharFormer: A Glyph Fusion based Attentive Framework for High-precision Character Image Denoising

    Authors: Daqian Shi, Xiaolei Diao, Lida Shi, Hao Tang, Yang Chi, Chuntao Li, Hao Xu

    Abstract: Degraded images commonly exist in the general sources of character images, leading to unsatisfactory character recognition results. Existing methods have dedicated efforts to restoring degraded character images. However, the denoising results obtained by these methods do not appear to improve character recognition performance. This is mainly because current methods only focus on pixel-level inform…

    Submitted 19 July, 2022; v1 submitted 15 July, 2022; originally announced July 2022.

    Comments: Accepted by ACM MM 2022

  17. RCRN: Real-world Character Image Restoration Network via Skeleton Extraction

    Authors: Daqian Shi, Xiaolei Diao, Hao Tang, Xiaomin Li, Hao Xing, Hao Xu

    Abstract: Constructing high-quality character image datasets is challenging because real-world images are often affected by image degradation. There are limitations when applying current image restoration methods to such real-world character images, since (i) the categories of noise in character images are different from those in general images; (ii) real-world character images usually contain more complex…

    Submitted 19 July, 2022; v1 submitted 15 July, 2022; originally announced July 2022.

    Comments: Accepted to ACM MM 2022

  18. arXiv:2207.05842  [pdf, other]

    cs.CV

    RZCR: Zero-shot Character Recognition via Radical-based Reasoning

    Authors: Xiaolei Diao, Daqian Shi, Hao Tang, Qiang Shen, Yanzeng Li, Lei Wu, Hao Xu

    Abstract: The long-tail effect is a common issue that limits the performance of deep learning models on real-world datasets. Character image datasets are also affected by such unbalanced data distribution due to differences in character usage frequency. Thus, current character recognition methods are limited when applied in the real world, especially for the categories in the tail that lack training samples…

    Submitted 28 April, 2023; v1 submitted 12 July, 2022; originally announced July 2022.

    Comments: Accepted to IJCAI 2023

  19. arXiv:2202.13021  [pdf, other]

    cs.CV

    Building a visual semantics aware object hierarchy

    Authors: Xiaolei Diao

    Abstract: The semantic gap is defined as the difference between the linguistic representations of the same concept, which usually leads to misunderstanding between individuals with different knowledge backgrounds. Since linguistically annotated images are extensively used for training machine learning models, semantic gap problem (SGP) also results in inevitable bias on image annotations and further leads t…

    Submitted 25 February, 2022; originally announced February 2022.

  20. arXiv:2202.08512  [pdf, other]

    cs.CV cs.AI

    Visual Ground Truth Construction as Faceted Classification

    Authors: Fausto Giunchiglia, Mayukh Bagchi, Xiaolei Diao

    Abstract: Recent work in Machine Learning and Computer Vision has provided evidence of systematic design flaws in the development of major object recognition benchmark datasets. One such example is ImageNet, wherein, for several categories of images, there are incongruences between the objects they represent and the labels used to annotate them. The consequences of this problem are major, in particular cons…

    Submitted 17 February, 2022; originally announced February 2022.

  21. arXiv:2008.07819  [pdf]

    cs.CV cs.HC

    ConvGRU in Fine-grained Pitching Action Recognition for Action Outcome Prediction

    Authors: Tianqi Ma, Lin Zhang, Xiumin Diao, Ou Ma

    Abstract: Prediction of the action outcome is a new challenge for a robot collaboratively working with humans. With the impressive progress in video action recognition in recent years, fine-grained action recognition from video data turns into a new concern. Fine-grained action recognition detects subtle differences of actions in more specific granularity and is significant in many fields such as human-robo…

    Submitted 18 August, 2020; originally announced August 2020.

  22. arXiv:1908.04920  [pdf, other]

    cs.CR cs.LG

    Aggregating Votes with Local Differential Privacy: Usefulness, Soundness vs. Indistinguishability

    Authors: Shaowei Wang, Jiachun Du, Wei Yang, Xinrong Diao, Zichun Liu, Yiwen Nie, Liusheng Huang, Hongli Xu

    Abstract: Voting plays a central role in bringing crowd wisdom to collective decision making, meanwhile data privacy has been a common ethical/legal issue in eliciting preferences from individuals. This work studies the problem of aggregating individual's voting data under the local differential privacy setting, where usefulness and soundness of the aggregated scores are of major concern. One naive approach…

    Submitted 13 August, 2019; originally announced August 2019.

  23. arXiv:1806.03237  [pdf]

    cs.CY cs.NI

    A Wireless Multimedia Sensor Network Platform for Environmental Event Detection Dedicated to Precision Agriculture

    Authors: Hongling Shi, Kun Mean Hou, Xunxing Diao, Liu Xing, Jian-Jin Li, Christophe De Vaulx

    Abstract: Precision agriculture has been considered as a new technique to improve agricultural production and support sustainable development by preserving planet resource and minimizing pollution. By monitoring different parameters of interest in a cultivated field, wireless sensor network (WSN) enables real-time decision making with regard to issues such as management of water resources for irrigation, ch…

    Submitted 15 May, 2018; originally announced June 2018.

    Journal ref: New and Smart Information Communication Science and Technology to Support Sustainable Development (NICST 2013), Sep 2013, Clermont-Ferrand, France

  24. arXiv:1804.08010  [pdf, other]

    cs.AI cs.CV

    Multi-Modal Coreference Resolution with the Correlation between Space Structures

    Authors: Qibin Zheng, Xingchun Diao, Jianjun Cao, Xiaolei Zhou, Yi Liu, Hongmei Li

    Abstract: Multi-modal data is becoming more common in big data background. Finding the semantically similar objects from different modality is one of the heart problems of multi-modal learning. Most of the current methods try to learn the inter-modal correlation with extrinsic supervised information, while intrinsic structural information of each modality is neglected. The performance of these methods heavi…

    Submitted 1 September, 2018; v1 submitted 21 April, 2018; originally announced April 2018.

    Comments: 9 pages, 6 figures