Showing 1–50 of 65 results for author: Bi, Y

  1. arXiv:2406.04100  [pdf, other]

    cs.CV cs.RO

    Class-Aware Cartilage Segmentation for Autonomous US-CT Registration in Robotic Intercostal Ultrasound Imaging

    Authors: Zhongliang Jiang, Yunfeng Kang, Yuan Bi, Xuesong Li, Chenyang Li, Nassir Navab

    Abstract: Ultrasound imaging has been widely used in clinical examinations owing to the advantages of being portable, real-time, and radiation-free. Considering the potential of extensive deployment of autonomous examination systems in hospitals, robotic US imaging has attracted increased attention. However, due to the inter-patient variations, it is still challenging to have an optimal path for each patien…

    Submitted 6 June, 2024; originally announced June 2024.

  2. arXiv:2405.09552  [pdf, other]

    eess.IV cs.AI cs.CV

    ODFormer: Semantic Fundus Image Segmentation Using Transformer for Optic Nerve Head Detection

    Authors: Jiayi Wang, Yi-An Mao, Xiaoyu Ma, Sicen Guo, Yuting Shao, Xiao Lv, Wenting Han, Mark Christopher, Linda M. Zangwill, Yanlong Bi, Rui Fan

    Abstract: Optic nerve head (ONH) detection has been a crucial area of study in ophthalmology for years. However, the significant discrepancy between fundus image datasets, each generated using a single type of fundus camera, poses challenges to the generalizability of ONH detection approaches developed based on semantic segmentation networks. Despite the numerous recent advancements in general-purpose seman…

    Submitted 2 June, 2024; v1 submitted 15 April, 2024; originally announced May 2024.

  3. arXiv:2404.09927  [pdf, other]

    cs.RO cs.LG

    Autonomous Path Planning for Intercostal Robotic Ultrasound Imaging Using Reinforcement Learning

    Authors: Yuan Bi, Cheng Qian, Zhicheng Zhang, Nassir Navab, Zhongliang Jiang

    Abstract: Ultrasound (US) has been widely used in daily clinical practice for screening internal organs and guiding interventions. However, due to the acoustic shadow cast by the subcutaneous rib cage, the US examination for thoracic application is still challenging. To fully cover and reconstruct the region of interest in US for diagnosis, an intercostal scanning path is necessary. To tackle this challenge…

    Submitted 15 April, 2024; originally announced April 2024.

  4. arXiv:2404.09681  [pdf, other]

    cs.CR

    An Empirical Study of Open Edge Computing Platforms: Ecosystem, Usage, and Security Risks

    Authors: Yu Bi, Mingshuo Yang, Yong Fang, Xianghang Mi, Shanqing Guo, Shujun Tang, Haixin Duan

    Abstract: Emerging in recent years, open edge computing platforms (OECPs) claim large-scale edge nodes, the extensive usage and adoption, as well as the openness to any third parties to join as edge nodes. For instance, OneThingCloud, a major OECP operated in China, advertises 5 million edge nodes, 70TB bandwidth, and 1,500PB storage. However, little information is publicly available for such OECPs with reg…

    Submitted 15 April, 2024; originally announced April 2024.

  5. arXiv:2404.05130  [pdf, other]

    cs.CR

    Enabling Privacy-Preserving Cyber Threat Detection with Federated Learning

    Authors: Yu Bi, Yekai Li, Xuan Feng, Xianghang Mi

    Abstract: Despite achieving good performance and wide adoption, machine learning based security detection models (e.g., malware classifiers) are subject to concept drift and evasive evolution of attackers, which renders up-to-date threat data as a necessity. However, due to enforcement of various privacy protection regulations (e.g., GDPR), it is becoming increasingly challenging or even prohibitive for sec…

    Submitted 7 April, 2024; originally announced April 2024.

  6. arXiv:2403.05530  [pdf, other]

    cs.CL cs.AI

    Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

    Authors: Gemini Team, Petko Georgiev, Ving Ian Lei, Ryan Burnell, Libin Bai, Anmol Gulati, Garrett Tanzer, Damien Vincent, Zhufeng Pan, Shibo Wang, Soroosh Mariooryad, Yifan Ding, Xinyang Geng, Fred Alcober, Roy Frostig, Mark Omernick, Lexi Walker, Cosmin Paduraru, Christina Sorokin, Andrea Tacchetti, Colin Gaffney, Samira Daruki, Olcan Sercinoglu, Zach Gleicher, Juliette Love, et al. (1092 additional authors not shown)

    Abstract: In this report, we introduce the Gemini 1.5 family of models, representing the next generation of highly compute-efficient multimodal models capable of recalling and reasoning over fine-grained information from millions of tokens of context, including multiple long documents and hours of video and audio. The family includes two new models: (1) an updated Gemini 1.5 Pro, which exceeds the February…

    Submitted 14 June, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

  7. arXiv:2403.03186  [pdf, other]

    cs.AI

    Cradle: Empowering Foundation Agents Towards General Computer Control

    Authors: Weihao Tan, Wentao Zhang, Xinrun Xu, Haochong Xia, Ziluo Ding, Boyu Li, Bohan Zhou, Junpeng Yue, Jiechuan Jiang, Yewen Li, Ruyi An, Molei Qin, Chuqiao Zong, Longtao Zheng, Yujie Wu, Xiaoqiang Chai, Yifei Bi, Tianbao Xie, Pengjie Gu, Xiyun Li, Ceyao Zhang, Long Tian, Chaojie Wang, Xinrun Wang, Börje F. Karlsson, et al. (3 additional authors not shown)

    Abstract: Despite the success in specific scenarios, existing foundation agents still struggle to generalize across various virtual scenarios, mainly due to the dramatically different encapsulations of environments with manually designed observation and action spaces. To handle this issue, we propose the General Computer Control (GCC) setting to restrict foundation agents to interact with software through t…

    Submitted 2 July, 2024; v1 submitted 5 March, 2024; originally announced March 2024.

  8. arXiv:2402.08893  [pdf, other]

    cs.SI physics.soc-ph

    Inconsistency of evaluation metrics in link prediction

    Authors: Yilin Bi, Xinshan Jiao, Yan-Li Lee, Tao Zhou

    Abstract: Link prediction is a paradigmatic and challenging problem in network science, which aims to predict missing links, future links and temporal links based on known topology. Along with the increasing number of link prediction algorithms, a critical yet previously ignored risk is that the evaluation metrics for algorithm performance are usually chosen at will. This paper implements extensive experime…

    Submitted 24 February, 2024; v1 submitted 13 February, 2024; originally announced February 2024.

    Comments: 20 pages, 9 figures

  9. arXiv:2402.02189  [pdf, other]

    cs.IT eess.SP

    DoF Analysis for (M, N)-Channels through a Number-Filling Puzzle

    Authors: Yue Bi, Yue Wu, Cunqing Hua

    Abstract: We consider a $\sf K$ user interference network with general connectivity, described by a matrix $\mat{N}$, and general message flows, described by a matrix $\mat{M}$. Previous studies have demonstrated that the standard interference scheme (IA) might not be optimal for networks with sparse connectivity. In this paper, we formalize a general IA coding scheme and an intuitive number-filling puzzle…

    Submitted 3 February, 2024; originally announced February 2024.

  10. arXiv:2401.03673  [pdf, other]

    cs.SI physics.data-an

    Comparing discriminating abilities of evaluation metrics in link prediction

    Authors: Xinshan Jiao, Shuyan Wan, Qian Liu, Yilin Bi, Yan-Li Lee, En Xu, Dong Hao, Tao Zhou

    Abstract: Link prediction aims to predict the potential existence of links between two unconnected nodes within a network based on the known topological characteristics. Evaluation metrics are used to assess the effectiveness of algorithms in link prediction. The discriminating ability of these evaluation metrics is vitally important for accurately evaluating link prediction algorithms. In this study, we pr…

    Submitted 8 January, 2024; originally announced January 2024.

  11. arXiv:2401.02376  [pdf, other]

    cs.RO

    Machine Learning in Robotic Ultrasound Imaging: Challenges and Perspectives

    Authors: Yuan Bi, Zhongliang Jiang, Felix Duelmer, Dianye Huang, Nassir Navab

    Abstract: This article reviews the recent advances in intelligent robotic ultrasound (US) imaging systems. We commence by presenting the commonly employed robotic mechanisms and control techniques in robotic US imaging, along with their clinical applications. Subsequently, we focus on the deployment of machine learning techniques in the development of robotic sonographers, emphasizing crucial developments a…

    Submitted 4 January, 2024; originally announced January 2024.

    Comments: Accepted by Annual Review of Control, Robotics, and Autonomous Systems

  12. arXiv:2312.15389  [pdf, other]

    eess.IV cs.CV

    TJDR: A High-Quality Diabetic Retinopathy Pixel-Level Annotation Dataset

    Authors: Jingxin Mao, Xiaoyu Ma, Yanlong Bi, Rongqing Zhang

    Abstract: Diabetic retinopathy (DR), as a debilitating ocular complication, necessitates prompt intervention and treatment. Despite the effectiveness of artificial intelligence in aiding DR grading, the progression of research toward enhancing the interpretability of DR grading through precise lesion segmentation faces a severe hindrance due to the scarcity of pixel-level annotated DR datasets. To mitigate…

    Submitted 23 December, 2023; originally announced December 2023.

  13. arXiv:2312.11805  [pdf, other]

    cs.CL cs.AI cs.CV

    Gemini: A Family of Highly Capable Multimodal Models

    Authors: Gemini Team, Rohan Anil, Sebastian Borgeaud, Jean-Baptiste Alayrac, Jiahui Yu, Radu Soricut, Johan Schalkwyk, Andrew M. Dai, Anja Hauth, Katie Millican, David Silver, Melvin Johnson, Ioannis Antonoglou, Julian Schrittwieser, Amelia Glaese, Jilin Chen, Emily Pitler, Timothy Lillicrap, Angeliki Lazaridou, Orhan Firat, James Molloy, Michael Isard, Paul R. Barham, Tom Hennigan, Benjamin Lee, et al. (1325 additional authors not shown)

    Abstract: This report introduces a new family of multimodal models, Gemini, that exhibit remarkable capabilities across image, audio, video, and text understanding. The Gemini family consists of Ultra, Pro, and Nano sizes, suitable for applications ranging from complex reasoning tasks to on-device memory-constrained use-cases. Evaluation on a broad range of benchmarks shows that our most-capable Gemini Ultr…

    Submitted 17 June, 2024; v1 submitted 18 December, 2023; originally announced December 2023.

  14. arXiv:2312.10997  [pdf, other]

    cs.CL cs.AI

    Retrieval-Augmented Generation for Large Language Models: A Survey

    Authors: Yunfan Gao, Yun Xiong, Xinyu Gao, Kangxiang Jia, Jinliu Pan, Yuxi Bi, Yi Dai, Jiawei Sun, Meng Wang, Haofen Wang

    Abstract: Large Language Models (LLMs) showcase impressive capabilities but encounter challenges like hallucination, outdated knowledge, and non-transparent, untraceable reasoning processes. Retrieval-Augmented Generation (RAG) has emerged as a promising solution by incorporating knowledge from external databases. This enhances the accuracy and credibility of the generation, particularly for knowledge-inten…

    Submitted 27 March, 2024; v1 submitted 18 December, 2023; originally announced December 2023.

    Comments: Ongoing Work

  15. arXiv:2310.15598  [pdf, other]

    cs.IT

    Coded Computing for Half-Duplex Wireless Distributed Computing Systems via Interference Alignment

    Authors: Youlong Wu, Zhenhao Huang, Kai Yuan, Shuai Ma, Yue Bi

    Abstract: Distributed computing frameworks such as MapReduce and Spark are often used to process large-scale data computing jobs. In wireless scenarios, exchanging data among distributed nodes would seriously suffer from the communication bottleneck due to limited communication resources such as bandwidth and power. To address this problem, we propose a coded parallel computing (CPC) scheme for distributed…

    Submitted 24 October, 2023; originally announced October 2023.

    Comments: 17 pages, 6 figures

  16. arXiv:2309.08160  [pdf, other]

    eess.IV cs.CV

    Cross-Modal Synthesis of Structural MRI and Functional Connectivity Networks via Conditional ViT-GANs

    Authors: Yuda Bi, Anees Abrol, Jing Sui, Vince Calhoun

    Abstract: The cross-modal synthesis between structural magnetic resonance imaging (sMRI) and functional network connectivity (FNC) is a relatively unexplored area in medical imaging, especially with respect to schizophrenia. This study employs conditional Vision Transformer Generative Adversarial Networks (cViT-GANs) to generate FNC data based on sMRI inputs. After training on a comprehensive dataset that i…

    Submitted 15 September, 2023; originally announced September 2023.

  17. arXiv:2307.05609  [pdf, other]

    cs.NI

    Virtual Network Embedding without Explicit Virtual Network Specification

    Authors: Jiangnan Cheng, Yingjie Bi, Ao Tang

    Abstract: Network virtualization enables Internet service providers to run multiple heterogeneous and dedicated network architectures for different customers on a shared substrate. In existing works on virtual network embedding (VNE), each customer formulates a virtual network request (VNR) where a virtual network (VN) is required. Motivated by a concrete example where VN is not a proper VNR formulation to…

    Submitted 10 July, 2023; originally announced July 2023.

  18. arXiv:2307.03705  [pdf, other]

    cs.RO cs.AI

    Intelligent Robotic Sonographer: Mutual Information-based Disentangled Reward Learning from Few Demonstrations

    Authors: Zhongliang Jiang, Yuan Bi, Mingchuan Zhou, Ying Hu, Michael Burke, Nassir Navab

    Abstract: Ultrasound (US) imaging is widely used for biometric measurement and diagnosis of internal organs due to the advantages of being real-time and radiation-free. However, due to inter-operator variations, resulting images highly depend on the experience of sonographers. This work proposes an intelligent robotic sonographer to autonomously "explore" target anatomies and navigate a US probe to a releva…

    Submitted 29 November, 2023; v1 submitted 7 July, 2023; originally announced July 2023.

  19. arXiv:2307.03698  [pdf, other]

    eess.IV cs.CV cs.RO

    Motion Magnification in Robotic Sonography: Enabling Pulsation-Aware Artery Segmentation

    Authors: Dianye Huang, Yuan Bi, Nassir Navab, Zhongliang Jiang

    Abstract: Ultrasound (US) imaging is widely used for diagnosing and monitoring arterial diseases, mainly due to the advantages of being non-invasive, radiation-free, and real-time. In order to provide additional information to assist clinicians in diagnosis, the tubular structures are often segmented from US images. To improve the artery segmentation accuracy and stability during scans, this work presents a…

    Submitted 7 July, 2023; originally announced July 2023.

    Comments: Accepted Paper IROS 2023

  20. arXiv:2307.01383  [pdf, other]

    cs.CV cs.AI q-bio.QM

    Depth video data-enabled predictions of longitudinal dairy cow body weight using thresholding and Mask R-CNN algorithms

    Authors: Ye Bi, Leticia M. Campos, Jin Wang, Haipeng Yu, Mark D. Hanigan, Gota Morota

    Abstract: Monitoring cow body weight is crucial to support farm management decisions due to its direct relationship with the growth, nutritional status, and health of dairy cows. Cow body weight is a repeated trait, however, the majority of previous body weight prediction research only used data collected at a single point in time. Furthermore, the utility of deep learning-based segmentation for body weight…

    Submitted 3 July, 2023; originally announced July 2023.

  21. arXiv:2305.11338  [pdf, other]

    cs.CV

    Coordinated Transformer with Position & Sample-aware Central Loss for Anatomical Landmark Detection

    Authors: Qikui Zhu, Yihui Bi, Danxin Wang, Xiangpeng Chu, Jie Chen, Yanqing Wang

    Abstract: Heatmap-based anatomical landmark detection is still facing two unresolved challenges: 1) inability to accurately evaluate the distribution of heatmap; 2) inability to effectively exploit global spatial structure information. To address the computational inability challenge, we propose a novel position-aware and sample-aware central loss. Specifically, our central loss can absorb position informat…

    Submitted 18 May, 2023; originally announced May 2023.

  22. arXiv:2305.08228  [pdf, other]

    eess.IV cs.CV cs.RO

    Skeleton Graph-based Ultrasound-CT Non-rigid Registration

    Authors: Zhongliang Jiang, Xuesong Li, Chenyu Zhang, Yuan Bi, Walter Stechele, Nassir Navab

    Abstract: Autonomous ultrasound (US) scanning has attracted increased attention, and it has been seen as a potential solution to overcome the limitations of conventional US examinations, such as inter-operator variations. However, it is still challenging to autonomously and accurately transfer a planned scan trajectory on a generic atlas to the current setup for different patients, particularly for thorax a…

    Submitted 14 May, 2023; originally announced May 2023.

    Comments: online video: https://www.youtube.com/watch?v=LkSHL7FJ8eU

  23. arXiv:2305.02086  [pdf, other]

    cs.CV

    Revisiting the Encoding of Satellite Image Time Series

    Authors: Xin Cai, Yaxin Bi, Peter Nicholl, Roy Sterritt

    Abstract: Satellite Image Time Series (SITS) representation learning is complex due to high spatiotemporal resolutions, irregular acquisition times, and intricate spatiotemporal interactions. These challenges result in specialized neural network architectures tailored for SITS analysis. The field has witnessed promising results achieved by pioneering researchers, but transferring the latest advances or esta…

    Submitted 8 September, 2023; v1 submitted 3 May, 2023; originally announced May 2023.

  24. arXiv:2303.16362  [pdf, ps, other]

    cs.SE

    Benchmarking Software Vulnerability Detection Techniques: A Survey

    Authors: Yingzhou Bi, Jiangtao Huang, Penghui Liu, Lianmei Wang

    Abstract: Software vulnerabilities can have serious consequences, which is why many techniques have been proposed to defend against them. Among these, vulnerability detection techniques are a major area of focus. However, there is a lack of a comprehensive approach for benchmarking these proposed techniques. In this paper, we present the first survey that comprehensively investigates and summarizes the curr…

    Submitted 28 March, 2023; originally announced March 2023.

  25. arXiv:2303.12649  [pdf, other]

    eess.IV cs.CV

    MI-SegNet: Mutual Information-Based US Segmentation for Unseen Domain Generalization

    Authors: Yuan Bi, Zhongliang Jiang, Ricarda Clarenbach, Reza Ghotbi, Angelos Karlas, Nassir Navab

    Abstract: Generalization capabilities of learning-based medical image segmentation across domains are currently limited by the performance degradation caused by the domain shift, particularly for ultrasound (US) imaging. The quality of US images heavily relies on carefully tuned acoustic parameters, which vary across sonographers, machines, and settings. To improve the generalizability on US images across d…

    Submitted 6 February, 2024; v1 submitted 22 March, 2023; originally announced March 2023.

  26. arXiv:2303.09012  [pdf, other]

    eess.IV cs.CV

    Exploring the Power of Generative Deep Learning for Image-to-Image Translation and MRI Reconstruction: A Cross-Domain Review

    Authors: Yuda Bi

    Abstract: Deep learning has become a prominent computational modeling tool in the areas of computer vision and image processing in recent years. This research comprehensively analyzes the different deep-learning methods used for image-to-image translation and reconstruction in the natural and medical imaging domains. We examine the famous deep learning frameworks, such as convolutional neural networks and g…

    Submitted 15 March, 2023; originally announced March 2023.

  27. BDTS: Blockchain-based Data Trading System

    Authors: Erya Jiang, Bo Qin, Qin Wang, Qianhong Wu, Sanxi Li, Wenchang Shi, Yingxin Bi, Wenyi Tang

    Abstract: Trading data through blockchain platforms is hard to achieve fair exchange. Reasons come from two folds: Firstly, guaranteeing fairness between sellers and consumers is a challenging task as the deception of any participating parties is risk-free. This leads to the second issue where judging the behavior of data executors (such as cloud service providers) among distrustful parties is impr…

    Submitted 31 October, 2023; v1 submitted 17 November, 2022; originally announced November 2022.

    Comments: ICICS 2023 (Best Paper Award)

    Journal ref: International Conference on Information and Communications Security, pp. 645-664. Singapore: Springer Nature Singapore, 2023

  28. arXiv:2211.06726   

    cs.CV

    MultiCrossViT: Multimodal Vision Transformer for Schizophrenia Prediction using Structural MRI and Functional Network Connectivity Data

    Authors: Yuda Bi, Anees Abrol, Zening Fu, Vince Calhoun

    Abstract: Vision Transformer (ViT) is a pioneering deep learning framework that can address real-world computer vision issues, such as image classification and object recognition. Importantly, ViTs are proven to outperform traditional deep learning models, such as convolutional neural networks (CNNs). Relatively recently, a number of ViT mutations have been transplanted into the field of medical imaging, th…

    Submitted 5 March, 2023; v1 submitted 12 November, 2022; originally announced November 2022.

    Comments: I submitted the wrong paper

  29. A New Interference-Alignment Scheme for Wireless MapReduce

    Authors: Yue Bi, Michèle Wigger, Yue Wu

    Abstract: We consider a full-duplex wireless Distributed Computing (DC) system under the MapReduce framework. New upper and lower bounds on the optimal tradeoff between Normalized Delivery Time (NDT) and computation load are presented. The upper bound strictly improves over the previous reported upper bounds and is based on a novel interference alignment (IA) scheme tailored to the interference cancellation…

    Submitted 6 May, 2023; v1 submitted 12 October, 2022; originally announced October 2022.

  30. arXiv:2209.13233  [pdf, other]

    cs.NE cs.AI cs.CV cs.LG

    Genetic Programming-Based Evolutionary Deep Learning for Data-Efficient Image Classification

    Authors: Ying Bi, Bing Xue, Mengjie Zhang

    Abstract: Data-efficient image classification is a challenging task that aims to solve image classification using small training data. Neural network-based deep learning methods are effective for image classification, but they typically require large-scale training data and have major limitations such as requiring expertise to design network architectures and having poor interpretability. Evolutionary deep…

    Submitted 27 September, 2022; originally announced September 2022.

    Comments: Accepted by IEEE Transactions on Evolutionary Computation

    Journal ref: IEEE Transactions on Evolutionary Computation, 2022, https://ieeexplore.ieee.org/document/9919314

  31. arXiv:2209.07590   

    eess.IV cs.CV cs.LG q-bio.NC

    Prediction of Gender from Longitudinal MRI data via Deep Learning on Adolescent Data Reveals Unique Patterns Associated with Brain Structure and Change over a Two-year Period

    Authors: Yuda Bi, Anees Abrol, Zening Fu, Jiayu Chen, Jingyu Liu, Vince Calhoun

    Abstract: Deep learning algorithms for predicting neuroimaging data have shown considerable promise in various applications. Prior work has demonstrated that deep learning models that take advantage of the data's 3D structure can outperform standard machine learning on several learning tasks. However, most prior research in this area has focused on neuroimaging data from adults. Within the Adolescent Brain…

    Submitted 5 March, 2023; v1 submitted 15 September, 2022; originally announced September 2022.

    Comments: I submitted the wrong paper

  32. arXiv:2209.06399  [pdf, other]

    cs.NE cs.AI cs.CV cs.LG

    A Survey on Evolutionary Computation for Computer Vision and Image Analysis: Past, Present, and Future Trends

    Authors: Ying Bi, Bing Xue, Pablo Mesejo, Stefano Cagnoni, Mengjie Zhang

    Abstract: Computer vision (CV) is a big and important field in artificial intelligence covering a wide range of applications. Image analysis is a major task in CV aiming to extract, analyse and understand the visual content of images. However, image-related tasks are very challenging due to many factors, e.g., high variations across images, high dimensionality, domain expertise requirement, and image distor…

    Submitted 13 September, 2022; originally announced September 2022.

    Comments: Conditionally accepted by IEEE Transactions on Evolutionary Computation

    Journal ref: IEEE Transactions on Evolutionary Computation, 2022, https://ieeexplore.ieee.org/document/9943992/

  33. arXiv:2208.07503  [pdf, other]

    cs.CV

    Color Image Edge Detection using Multi-scale and Multi-directional Gabor filter

    Authors: Yunhong Li, Yuandong Bi, Weichuan Zhang, Jie Ren, Jinni Chen

    Abstract: In this paper, a color edge detection method is proposed where multi-scale Gabor filters are used to obtain edges from input color images. The main advantage of the proposed method is that high edge detection accuracy is attained while maintaining good noise robustness. The proposed method consists of three aspects: First, the RGB color image is converted to CIE L*a*b* space because of its wide…

    Submitted 15 August, 2022; originally announced August 2022.

  34. Precise Repositioning of Robotic Ultrasound: Improving Registration-based Motion Compensation using Ultrasound Confidence Optimization

    Authors: Zhongliang Jiang, Nehil Danis, Yuan Bi, Mingchuan Zhou, Markus Kroenke, Thomas Wendler, Nassir Navab

    Abstract: Robotic ultrasound (US) imaging has been seen as a promising solution to overcome the limitations of free-hand US examinations, i.e., inter-operator variability. However, the fact that robotic US systems cannot react to subject movements during scans limits their clinical acceptance. Regarding human sonographers, they often react to patient movements by repositioning the probe or even restarting t…

    Submitted 5 September, 2022; v1 submitted 10 August, 2022; originally announced August 2022.

    Comments: The paper has been accepted by IEEE TIM. Video: https://www.youtube.com/watch?v=MUtgSXS7EZI

  35. arXiv:2208.03526  [pdf, other]

    cs.CV

    Multiplex-detection Based Multiple Instance Learning Network for Whole Slide Image Classification

    Authors: Zhikang Wang, Yue Bi, Tong Pan, Xiaoyu Wang, Chris Bain, Richard Bassed, Seiya Imoto, Jianhua Yao, Jiangning Song

    Abstract: Multiple instance learning (MIL) is a powerful approach to classify whole slide images (WSIs) for diagnostic pathology. A fundamental challenge of MIL on WSI classification is to discover the critical instances that trigger the bag label. However, previous methods are primarily designed under the independent and identical distribution hypothesis (i.i.d.), ignoring either the corre…

    Submitted 31 August, 2022; v1 submitted 6 August, 2022; originally announced August 2022.

  36. arXiv:2205.06676  [pdf, other]

    eess.IV cs.AI cs.CV cs.RO

    VesNet-RL: Simulation-based Reinforcement Learning for Real-World US Probe Navigation

    Authors: Yuan Bi, Zhongliang Jiang, Yuan Gao, Thomas Wendler, Angelos Karlas, Nassir Navab

    Abstract: Ultrasound (US) is one of the most common medical imaging modalities since it is radiation-free, low-cost, and real-time. In freehand US examinations, sonographers often navigate a US probe to visualize standard examination planes with rich diagnostic information. However, reproducibility and stability of the resulting images often suffer from intra- and inter-operator variation. Reinforcement lea…

    Submitted 10 May, 2022; originally announced May 2022.

    Comments: Directly accepted by IEEE RAL after the first round of review. Video: https://www.youtube.com/watch?v=bzCO07Hquj8 Codes: https://github.com/yuan-12138/VesNet-RL

  37. arXiv:2203.16149  [pdf, other]

    cs.CV cs.LG

    Tampered VAE for Improved Satellite Image Time Series Classification

    Authors: Xin Cai, Yaxin Bi, Peter Nicholl

    Abstract: The unprecedented availability of spatial and temporal high-resolution satellite image time series (SITS) for crop type mapping is believed to necessitate deep learning architectures to accommodate challenges arising from both dimensions. Recent state-of-the-art deep learning models have shown promising results by stacking spatial and temporal encoders. However, we present a Pyramid Time-Series Tr…

    Submitted 30 March, 2022; originally announced March 2022.

  38. arXiv:2201.11149  [pdf, other]

    cs.IT cs.DC

    DoF of a Cooperative X-Channel with an Application to Distributed Computing

    Authors: Yue Bi, Michèle Wigger, Philippe Ciblat, Yue Wu

    Abstract: We consider a cooperative X-channel with $\sf K$ transmitters (Txs) and $\sf K$ receivers (Rxs) where Txs and Rxs are gathered into groups of size $\sf r$ respectively. Txs belonging to the same group cooperate to jointly transmit a message to each of the $\sf K- \sf r$ Rxs in all other groups, and each Rx individually decodes all its intended messages. By introducing a new interference alignment…

    Submitted 9 March, 2022; v1 submitted 26 January, 2022; originally announced January 2022.

  39. Task-wise Split Gradient Boosting Trees for Multi-center Diabetes Prediction

    Authors: Mingcheng Chen, Zhenghui Wang, Zhiyun Zhao, Weinan Zhang, Xiawei Guo, Jian Shen, Yanru Qu, Jieli Lu, Min Xu, Yu Xu, Tiange Wang, Mian Li, Wei-Wei Tu, Yong Yu, Yufang Bi, Weiqing Wang, Guang Ning

    Abstract: Diabetes prediction is an important data science application in the social healthcare domain. There exist two main challenges in the diabetes prediction task: data heterogeneity since demographic and metabolic data are of different types, data insufficiency since the number of diabetes cases in a single medical center is usually limited. To tackle the above challenges, we employ gradient boosting…

    Submitted 16 August, 2021; originally announced August 2021.

    Comments: 11 pages (2 pages of supplementary), 10 figures, 7 tables. Accepted by ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD 2021)

  40. Deformation-Aware Robotic 3D Ultrasound

    Authors: Zhongliang Jiang, Yue Zhou, Yuan Bi, Mingchuan Zhou, Thomas Wendler, Nassir Navab

    Abstract: Tissue deformation in ultrasound (US) imaging leads to geometrical errors when measuring tissues due to the pressure exerted by probes. Such deformation has an even larger effect on 3D US volumes as the correct compounding is limited by the inconsistent location and geometry. This work proposes a patient-specified stiffness-based method to correct the tissue deformations in robotic 3D US acquisiti…

    Submitted 18 July, 2021; originally announced July 2021.

    Comments: Accepted for publication in IEEE Robotics and Automation Letters; Video: https://www.youtube.com/watch?v=MlZtugQ2cvQ

    Journal ref: IEEE Robotics and Automation Letters 2021

  41. arXiv:2106.09556  [pdf, other]

    stat.ML cs.LG

    A Deep Reinforcement Learning Approach towards Pendulum Swing-up Problem based on TF-Agents

    Authors: Yifei Bi, Xinyi Chen, Caihui Xiao

    Abstract: Adapting the idea of training CartPole with a Deep Q-learning agent, we are able to find a promising result that prevents the pole from falling down. The capacity of reinforcement learning (RL) to learn from the interaction between the environment and agent provides an optimal control strategy. In this paper, we aim to solve the classic pendulum swing-up problem that making the learned pendulum to be…

    Submitted 17 June, 2021; originally announced June 2021.

  42. arXiv:2105.08232  [pdf, other

    math.OC cs.LG stat.ML

    Sharp Restricted Isometry Property Bounds for Low-rank Matrix Recovery Problems with Corrupted Measurements

    Authors: Ziye Ma, Yingjie Bi, Javad Lavaei, Somayeh Sojoudi

    Abstract: In this paper, we study a general low-rank matrix recovery problem with linear measurements corrupted by some noise. The objective is to understand under what conditions on the restricted isometry property (RIP) of the problem local search methods can find the ground truth with a small error. By analyzing the landscape of the non-convex problem, we first propose a global guarantee on the maximum d…

    Submitted 25 July, 2023; v1 submitted 17 May, 2021; originally announced May 2021.

  43. arXiv:2105.06709  [pdf, other

    cs.LG cs.CE

    Learning Unknown from Correlations: Graph Neural Network for Inter-novel-protein Interaction Prediction

    Authors: Guofeng Lv, Zhiqiang Hu, Yanguang Bi, Shaoting Zhang

    Abstract: The study of multi-type Protein-Protein Interaction (PPI) is fundamental for understanding biological processes from a systematic perspective and revealing disease mechanisms. Existing methods suffer from significant performance degradation when tested on unseen datasets. In this paper, we investigate the problem and find that it is mainly attributed to the poor performance for inter-novel-protein…

    Submitted 1 June, 2021; v1 submitted 14 May, 2021; originally announced May 2021.

    Comments: 10 pages (3 pages appendix), 2 figures, Accepted by Conference IJCAI 2021, which is its extended version

  44. arXiv:2103.04533  [pdf, other

    cs.CR

    Volcano: Stateless Cache Side-channel Attack by Exploiting Mesh Interconnect

    Authors: Junpeng Wan, Yanxiang Bi, Zhe Zhou, Zhou Li

    Abstract: Cache side-channel attacks pose severe security threats in settings where a CPU is shared across users, e.g., in the cloud. The existing attacks rely on sensing the micro-architectural state changes made by victims, and this assumption can be invalidated by combining spatial (e.g., Intel CAT) and temporal isolation (e.g., time protection). In this work, we advance the state of cache side-chann…

    Submitted 7 March, 2021; originally announced March 2021.

  45. arXiv:2102.02063  [pdf

    cs.SD cs.LG physics.app-ph

    Acoustic Structure Inverse Design and Optimization Using Deep Learning

    Authors: Xuecong Sun, Han Jia, Yuzhen Yang, Han Zhao, Yafeng Bi, Zhaoyong Sun, Jun Yang

    Abstract: From ancient to modern times, acoustic structures have been used to control the propagation of acoustic waves. However, the design of acoustic structures has largely remained a time-consuming and computational resource-consuming iterative process. In recent years, Deep Learning has attracted unprecedented attention for its ability to tackle hard problems with huge datasets, which has achieved s…

    Submitted 15 April, 2021; v1 submitted 29 January, 2021; originally announced February 2021.

  46. Learning and Sharing: A Multitask Genetic Programming Approach to Image Feature Learning

    Authors: Ying Bi, Bing Xue, Mengjie Zhang

    Abstract: Using evolutionary computation algorithms to solve multiple tasks with knowledge sharing is a promising approach. Image feature learning can be considered as a multitask problem because different tasks may have a similar feature space. Genetic programming (GP) has been successfully applied to image feature learning for classification. However, most of the existing GP methods solve one task, indepe…

    Submitted 18 December, 2020; v1 submitted 17 December, 2020; originally announced December 2020.

    Comments: Submitted to IEEE Transactions on Evolutionary Computation

    Report number: 2012.09444

    Journal ref: IEEE Transactions on Evolutionary Computation, 2021

  47. arXiv:2012.07585  [pdf, other

    cs.LG

    Building Deep Learning Models to Predict Mortality in ICU Patients

    Authors: Huachuan Wang, Yuanfei Bi

    Abstract: Mortality prediction in intensive care units is considered one of the critical steps for efficiently treating patients in serious condition. As a result, various prediction models have been developed to address this problem based on modern electronic healthcare records. However, it becomes increasingly challenging to model such tasks as time series variables because some laboratory test results su…

    Submitted 11 December, 2020; originally announced December 2020.

  48. arXiv:2008.06179  [pdf, other

    cs.CV cs.IR

    A Multimodal Late Fusion Model for E-Commerce Product Classification

    Authors: Ye Bi, Shuo Wang, Zhongrui Fan

    Abstract: The cataloging of product listings is a fundamental problem for most e-commerce platforms. Despite promising results obtained by unimodal-based methods, it can be expected that their performance can be further boosted by the consideration of multimodal product information. In this study, we investigated a multimodal late fusion approach based on text and image modalities to categorize e-commerce p…

    Submitted 13 August, 2020; originally announced August 2020.

    Comments: 4 pages, SIGIR 2020 E-commerce Workshop Data Challenge Technical Report

  49. arXiv:2008.06176  [pdf, other

    cs.IR cs.CL

    A Hybrid BERT and LightGBM based Model for Predicting Emotion GIF Categories on Twitter

    Authors: Ye Bi, Shuo Wang, Zhongrui Fan

    Abstract: Animated Graphics Interchange Format (GIF) images have been widely used on social media as an intuitive way of expressing emotion. Given their expressiveness, GIFs offer a more nuanced and precise way to convey emotions. In this paper, we present our solution for the EmotionGIF 2020 challenge, the shared task of SocialNLP 2020. To recommend GIF categories for unlabeled tweets, we regarded thi…

    Submitted 13 August, 2020; originally announced August 2020.

    Comments: 4 pages, ACL 2020 EmotionGIF Challenge Technical Report

  50. arXiv:2008.04579  [pdf, other

    cs.IR cs.SI

    DREAM: A Dynamic Relational-Aware Model for Social Recommendation

    Authors: Liqiang Song, Ye Bi, Mengqiu Yao, Zhenyu Wu, Jianming Wang, Jing Xiao

    Abstract: Social connections play a vital role in improving the performance of recommendation systems (RS). However, incorporating social information into RS is challenging. Most existing models consider social influences only within a given session, ignoring that both users' preferences and their friends' influences are evolving. Moreover, in the real world, social relations are sparse. Modeling dynamic influence…

    Submitted 11 August, 2020; originally announced August 2020.

    Comments: 5 pages, accepted by CIKM 2020 Short Paper Session