Skip to main content

Showing 1–50 of 114 results for author: Shen, B

  1. arXiv:2407.00814  [pdf, other

    cs.NI cs.AI

    Privacy-Aware Spectrum Pricing and Power Control Optimization for LEO Satellite Internet-of-Things

    Authors: Bowen Shen, Kwok-Yan Lam, Feng Li

    Abstract: Low earth orbit (LEO) satellite systems play an important role in next generation communication networks due to their ability to provide extensive global coverage with guaranteed communications in remote areas and isolated areas where base stations cannot be cost-efficiently deployed. With the pervasive adoption of LEO satellite systems, especially in the LEO Internet-of-Things (IoT) scenarios, th… ▽ More

    Submitted 1 April, 2024; originally announced July 2024.

  2. arXiv:2406.17349  [pdf, other

    cs.CR cs.CV

    Semantic Deep Hiding for Robust Unlearnable Examples

    Authors: Ruohan Meng, Chenyu Yi, Yi Yu, Siyuan Yang, Bingquan Shen, Alex C. Kot

    Abstract: Ensuring data privacy and protection has become paramount in the era of deep learning. Unlearnable examples are proposed to mislead the deep learning models and prevent data from unauthorized exploration by adding small perturbations to data. However, such perturbations (e.g., noise, texture, color change) predominantly impact low-level features, making them vulnerable to common countermeasures. I… ▽ More

    Submitted 25 June, 2024; originally announced June 2024.

    Comments: Accepted by TIFS 2024

  3. arXiv:2406.14758  [pdf, ps, other

    cs.AI

    Compliance Cards: Computational Artifacts for Automated AI Regulation Compliance

    Authors: Bill Marino, Preslav Aleksandrov, Carwyn Rahman, Yulu Pi, Bill Shen, Rui-jie Yew, Nicholas D. Lane

    Abstract: As the artificial intelligence (AI) supply chain grows more complex, AI systems and models are increasingly likely to incorporate externally-sourced ingredients such as datasets and other models. In such cases, determining whether or not an AI system or model complies with the EU AI Act will require gathering compliance-related metadata about both the AI system or model at-large as well as those e… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

  4. arXiv:2406.07003  [pdf, other

    cs.SE

    GraphCoder: Enhancing Repository-Level Code Completion via Code Context Graph-based Retrieval and Language Model

    Authors: Wei Liu, Ailun Yu, Daoguang Zan, Bo Shen, Wei Zhang, Haiyan Zhao, Zhi Jin, Qianxiang Wang

    Abstract: The performance of repository-level code completion depends upon the effective leverage of both general and repository-specific knowledge. Despite the impressive capability of code LLMs in general code completion tasks, they often exhibit less satisfactory performance on repository-level completion due to the lack of repository-specific knowledge in these LLMs. To address this problem, we propose… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

  5. arXiv:2406.04902  [pdf, other

    cs.ET

    Beyond Data, Towards Sustainability: A Sydney Case Study on Urban Digital Twins

    Authors: Ammar Sohail, Bojie Shen, Muhammad Aamir Cheema, Mohammed Eunus Ali, Anwaar Ulhaq, Muhammad Ali Babar, Asama Qureshi

    Abstract: As urban areas grapple with unprecedented challenges stemming from population growth and climate change, the emergence of urban digital twins offers a promising solution. This paper presents a case study focusing on Sydney's urban digital twin, a virtual replica integrating diverse real-time and historical data, including weather, crime, emissions, and traffic. Through advanced visualization and d… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

  6. arXiv:2406.03792  [pdf, other

    cs.CL

    Light-PEFT: Lightening Parameter-Efficient Fine-Tuning via Early Pruning

    Authors: Naibin Gu, Peng Fu, Xiyu Liu, Bowen Shen, Zheng Lin, Weiping Wang

    Abstract: Parameter-efficient fine-tuning (PEFT) has emerged as the predominant technique for fine-tuning in the era of large language models. However, existing PEFT methods still have inadequate training efficiency. Firstly, the utilization of large-scale foundation models during the training process is excessively redundant for certain fine-tuning tasks. Secondly, as the model size increases, the growth i… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: Findings of ACL 2024

  7. arXiv:2405.12754  [pdf, other

    astro-ph.SR cs.AI cs.LG physics.space-ph

    Neural Operator for Accelerating Coronal Magnetic Field Model

    Authors: Yutao Du, Qin Li, Raghav Gnanasambandam, Mengnan Du, Haimin Wang, Bo Shen

    Abstract: Studying the sun's outer atmosphere is challenging due to its complex magnetic fields impacting solar activities. Magnetohydrodynamics (MHD) simulations help model these interactions but are extremely time-consuming (usually on a scale of days). Our research applies the Fourier Neural Operator (FNO) to accelerate the coronal magnetic field modeling, specifically, the Bifrost MHD model. We apply Te… ▽ More

    Submitted 26 June, 2024; v1 submitted 21 May, 2024; originally announced May 2024.

  8. arXiv:2405.10216  [pdf, other

    cs.LG cs.AI eess.SP

    Low-Rank Adaptation of Time Series Foundational Models for Out-of-Domain Modality Forecasting

    Authors: Divij Gupta, Anubhav Bhatti, Suraj Parmar, Chen Dan, Yuwei Liu, Bingjie Shen, San Lee

    Abstract: Low-Rank Adaptation (LoRA) is a widely used technique for fine-tuning large pre-trained or foundational models across different modalities and tasks. However, its application to time series data, particularly within foundational models, remains underexplored. This paper examines the impact of LoRA on contemporary time series foundational models: Lag-Llama, MOIRAI, and Chronos. We demonstrate LoRA'… ▽ More

    Submitted 16 May, 2024; originally announced May 2024.

    Comments: 5 pages, 3 figures. This work has been submitted to the ACM for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  9. arXiv:2405.06995  [pdf, other

    cs.SD cs.CV cs.MM eess.AS

    Benchmarking Cross-Domain Audio-Visual Deception Detection

    Authors: Xiaobao Guo, Zitong Yu, Nithish Muthuchamy Selvaraj, Bingquan Shen, Adams Wai-Kin Kong, Alex C. Kot

    Abstract: Automated deception detection is crucial for assisting humans in accurately assessing truthfulness and identifying deceptive behavior. Conventional contact-based techniques, like polygraph devices, rely on physiological signals to determine the authenticity of an individual's statements. Nevertheless, recent developments in automated deception detection have demonstrated that multimodal features d… ▽ More

    Submitted 11 May, 2024; originally announced May 2024.

    Comments: 10 pages

  10. arXiv:2405.01825  [pdf, other

    cs.CV

    Improving Concept Alignment in Vision-Language Concept Bottleneck Models

    Authors: Nithish Muthuchamy Selvaraj, Xiaobao Guo, Bingquan Shen, Adams Wai-Kin Kong, Alex Kot

    Abstract: Concept Bottleneck Models (CBM) map the input image to a high-level human-understandable concept space and then make class predictions based on these concepts. Recent approaches automate the construction of CBM by prompting Large Language Models (LLM) to generate text concepts and then use Vision Language Models (VLM) to obtain concept scores to train a CBM. However, it is desired to build CBMs wi… ▽ More

    Submitted 2 May, 2024; originally announced May 2024.

  11. arXiv:2405.01714  [pdf, other

    cs.LG cs.AI

    Interpretable Vital Sign Forecasting with Model Agnostic Attention Maps

    Authors: Yuwei Liu, Chen Dan, Anubhav Bhatti, Bingjie Shen, Divij Gupta, Suraj Parmar, San Lee

    Abstract: Sepsis is a leading cause of mortality in intensive care units (ICUs), representing a substantial medical challenge. The complexity of analyzing diverse vital signs to predict sepsis further aggravates this issue. While deep learning techniques have been advanced for early sepsis prediction, their 'black-box' nature obscures the internal logic, impairing interpretability in critical settings like… ▽ More

    Submitted 21 May, 2024; v1 submitted 2 May, 2024; originally announced May 2024.

    Comments: 8 pages, 4 figures

  12. arXiv:2404.18439  [pdf, other

    cs.CV cs.RO

    $ν$-DBA: Neural Implicit Dense Bundle Adjustment Enables Image-Only Driving Scene Reconstruction

    Authors: Yunxuan Mao, Bingqi Shen, Yifei Yang, Kai Wang, Rong Xiong, Yiyi Liao, Yue Wang

    Abstract: The joint optimization of the sensor trajectory and 3D map is a crucial characteristic of bundle adjustment (BA), essential for autonomous driving. This paper presents $ν$-DBA, a novel framework implementing geometric dense bundle adjustment (DBA) using 3D neural implicit surfaces for map parametrization, which optimizes both the map surface and trajectory poses using geometric error guided by den… ▽ More

    Submitted 29 April, 2024; originally announced April 2024.

  13. arXiv:2404.11987  [pdf, other

    cs.CV

    MultiPhys: Multi-Person Physics-aware 3D Motion Estimation

    Authors: Nicolas Ugrinovic, Boxiao Pan, Georgios Pavlakos, Despoina Paschalidou, Bokui Shen, Jordi Sanchez-Riera, Francesc Moreno-Noguer, Leonidas Guibas

    Abstract: We introduce MultiPhys, a method designed for recovering multi-person motion from monocular videos. Our focus lies in capturing coherent spatial placement between pairs of individuals across varying degrees of engagement. MultiPhys, being physically aware, exhibits robustness to jittering and occlusions, and effectively eliminates penetration issues between the two individuals. We devise a pipelin… ▽ More

    Submitted 18 April, 2024; originally announced April 2024.

  14. arXiv:2404.08947  [pdf, other

    cs.SE

    Zero-Shot Code Representation Learning via Prompt Tuning

    Authors: Nan Cui, Xiaodong Gu, Beijun Shen

    Abstract: Learning code representations has been the core prerequisite of many software engineering tasks such as code clone detection and code generation. State-of-the-art program representation techniques mainly utilize pre-trained language models (PLMs) such as CodeBERT. A Transformer encoder is firstly pre-trained on a large-scale code corpus to acquire general knowledge about source code. The pre-train… ▽ More

    Submitted 13 April, 2024; originally announced April 2024.

    Comments: arXiv admin note: substantial text overlap with arXiv:2204.08360

  15. arXiv:2404.06819  [pdf, other

    cs.CR cs.DB

    Enc2DB: A Hybrid and Adaptive Encrypted Query Processing Framework

    Authors: Hui Li, Jingwen Shi, Qi Tian, Zheng Li, Yan Fu, Bingqing Shen, Yaofeng Tu

    Abstract: As cloud computing gains traction, data owners are outsourcing their data to cloud service providers (CSPs) for Database Service (DBaaS), bringing in a deviation of data ownership and usage, and intensifying privacy concerns, especially with potential breaches by hackers or CSP insiders. To address that, encrypted database services propose encrypting every tuple and query statement before submitti… ▽ More

    Submitted 10 April, 2024; originally announced April 2024.

    Comments: 33 pages,33 figures, DASAFAA24

  16. arXiv:2403.16443  [pdf, other

    cs.CL cs.AI cs.SE

    CodeS: Natural Language to Code Repository via Multi-Layer Sketch

    Authors: Daoguang Zan, Ailun Yu, Wei Liu, Dong Chen, Bo Shen, Wei Li, Yafen Yao, Yongshun Gong, Xiaolin Chen, Bei Guan, Zhiguang Yang, Yongji Wang, Qianxiang Wang, Lizhen Cui

    Abstract: The impressive performance of large language models (LLMs) on code-related tasks has shown the potential of fully automated software development. In light of this, we introduce a new software engineering task, namely Natural Language to code Repository (NL2Repo). This task aims to generate an entire code repository from its natural language requirements. To address this task, we propose a simple y… ▽ More

    Submitted 25 March, 2024; originally announced March 2024.

    Comments: https://github.com/NL2Code/CodeS

  17. arXiv:2403.12032  [pdf, other

    cs.CV cs.GR

    Generic 3D Diffusion Adapter Using Controlled Multi-View Editing

    Authors: Hansheng Chen, Ruoxi Shi, Yulin Liu, Bokui Shen, Jiayuan Gu, Gordon Wetzstein, Hao Su, Leonidas Guibas

    Abstract: Open-domain 3D object synthesis has been lagging behind image synthesis due to limited data and higher computational complexity. To bridge this gap, recent works have investigated multi-view diffusion but often fall short in either 3D consistency, visual quality, or efficiency. This paper proposes MVEdit, which functions as a 3D counterpart of SDEdit, employing ancestral sampling to jointly denois… ▽ More

    Submitted 19 March, 2024; v1 submitted 18 March, 2024; originally announced March 2024.

    Comments: V2 note: Fix missing acknowledgements. Project page: https://lakonik.github.io/mvedit

  18. arXiv:2403.10717  [pdf, other

    cs.LG cs.AI cs.CR

    Backdoor Secrets Unveiled: Identifying Backdoor Data with Optimized Scaled Prediction Consistency

    Authors: Soumyadeep Pal, Yuguang Yao, Ren Wang, Bingquan Shen, Sijia Liu

    Abstract: Modern machine learning (ML) systems demand substantial training data, often resorting to external sources. Nevertheless, this practice renders them vulnerable to backdoor poisoning attacks. Prior backdoor defense strategies have primarily focused on the identification of backdoored models or poisoned data characteristics, typically operating under the assumption of access to clean data. In this w… ▽ More

    Submitted 15 March, 2024; originally announced March 2024.

    Comments: The Twelfth International Conference on Learning Representations (ICLR 2024)

  19. arXiv:2403.04261  [pdf

    cs.AI cs.CL cs.LG

    Advancing Biomedical Text Mining with Community Challenges

    Authors: Hui Zong, Rongrong Wu, Jiaxue Cha, Erman Wu, Jiakun Li, Liang Tao, Zuofeng Li, Buzhou Tang, Bairong Shen

    Abstract: The field of biomedical research has witnessed a significant increase in the accumulation of vast amounts of textual data from various sources such as scientific literatures, electronic health records, clinical trial reports, and social media. However, manually processing and analyzing these extensive and complex resources is time-consuming and inefficient. To address this challenge, biomedical te… ▽ More

    Submitted 7 March, 2024; originally announced March 2024.

  20. arXiv:2402.10688  [pdf, other

    cs.CL

    Towards Uncovering How Large Language Model Works: An Explainability Perspective

    Authors: Haiyan Zhao, Fan Yang, Bo Shen, Himabindu Lakkaraju, Mengnan Du

    Abstract: Large language models (LLMs) have led to breakthroughs in language tasks, yet the internal mechanisms that enable their remarkable generalization and reasoning abilities remain opaque. This lack of transparency presents challenges such as hallucinations, toxicity, and misalignment with human values, hindering the safe and beneficial deployment of LLMs. This paper aims to uncover the mechanisms und… ▽ More

    Submitted 15 April, 2024; v1 submitted 16 February, 2024; originally announced February 2024.

    Comments: 8 pages, 2 figures

  21. arXiv:2312.06663  [pdf, other

    cs.CV cs.GR

    CAD: Photorealistic 3D Generation via Adversarial Distillation

    Authors: Ziyu Wan, Despoina Paschalidou, Ian Huang, Hongyu Liu, Bokui Shen, Xiaoyu Xiang, Jing Liao, Leonidas Guibas

    Abstract: The increased demand for 3D data in AR/VR, robotics and gaming applications, gave rise to powerful generative pipelines capable of synthesizing high-quality 3D objects. Most of these models rely on the Score Distillation Sampling (SDS) algorithm to optimize a 3D representation such that the rendered image maintains a high likelihood as evaluated by a pre-trained diffusion model. However, finding a… ▽ More

    Submitted 11 December, 2023; originally announced December 2023.

    Comments: Project page: http://raywzy.com/CAD/

  22. arXiv:2312.01307  [pdf, other

    cs.RO cs.CV

    SAGE: Bridging Semantic and Actionable Parts for GEneralizable Manipulation of Articulated Objects

    Authors: Haoran Geng, Songlin Wei, Congyue Deng, Bokui Shen, He Wang, Leonidas Guibas

    Abstract: To interact with daily-life articulated objects of diverse structures and functionalities, understanding the object parts plays a central role in both user instruction comprehension and task execution. However, the possible discordance between the semantic meaning and physics functionalities of the parts poses a challenge for designing a general system. To address this problem, we propose SAGE, a… ▽ More

    Submitted 30 March, 2024; v1 submitted 3 December, 2023; originally announced December 2023.

  23. arXiv:2311.04770  [pdf, other

    cs.LG cs.AI

    Vital Sign Forecasting for Sepsis Patients in ICUs

    Authors: Anubhav Bhatti, Yuwei Liu, Chen Dan, Bingjie Shen, San Lee, Yonghwan Kim, Jang Yong Kim

    Abstract: Sepsis and septic shock are a critical medical condition affecting millions globally, with a substantial mortality rate. This paper uses state-of-the-art deep learning (DL) architectures to introduce a multi-step forecasting system to predict vital signs indicative of septic shock progression in Intensive Care Units (ICUs). Our approach utilizes a short window of historical vital sign data to fore… ▽ More

    Submitted 8 November, 2023; originally announced November 2023.

    Comments: 4 pages, 3 Figures

  24. arXiv:2311.02787  [pdf, other

    cs.RO cs.AI

    Make a Donut: Hierarchical EMD-Space Planning for Zero-Shot Deformable Manipulation with Tools

    Authors: Yang You, Bokui Shen, Congyue Deng, Haoran Geng, Songlin Wei, He Wang, Leonidas Guibas

    Abstract: Deformable object manipulation stands as one of the most captivating yet formidable challenges in robotics. While previous techniques have predominantly relied on learning latent dynamics through demonstrations, typically represented as either particles or images, there exists a pertinent limitation: acquiring suitable demonstrations, especially for long-horizon tasks, can be elusive. Moreover, ba… ▽ More

    Submitted 24 March, 2024; v1 submitted 5 November, 2023; originally announced November 2023.

    Comments: 8 pages

  25. arXiv:2311.02373  [pdf, other

    cs.LG

    From Trojan Horses to Castle Walls: Unveiling Bilateral Data Poisoning Effects in Diffusion Models

    Authors: Zhuoshi Pan, Yuguang Yao, Gaowen Liu, Bingquan Shen, H. Vicky Zhao, Ramana Rao Kompella, Sijia Liu

    Abstract: While state-of-the-art diffusion models (DMs) excel in image generation, concerns regarding their security persist. Earlier research highlighted DMs' vulnerability to data poisoning attacks, but these studies placed stricter requirements than conventional methods like `BadNets' in image classification. This is because the art necessitates modifications to the diffusion training and sampling proced… ▽ More

    Submitted 15 June, 2024; v1 submitted 4 November, 2023; originally announced November 2023.

    Comments: 9 pages, 5 figures, 4 tables

  26. arXiv:2311.00735  [pdf

    cs.LG cs.CV

    PET Tracer Conversion among Brain PET via Variable Augmented Invertible Network

    Authors: Bohui Shen, Wei Zhang, Xubiao Liu, Pengfei Yu, Shirui Jiang, Xinchong Shi, Xiangsong Zhang, Xiaoyu Zhou, Weirui Zhang, Bingxuan Li, Qiegen Liu

    Abstract: Positron emission tomography (PET) serves as an essential tool for diagnosis of encephalopathy and brain science research. However, it suffers from the limited choice of tracers. Nowadays, with the wide application of PET imaging in neuropsychiatric treatment, 6-18F-fluoro-3, 4-dihydroxy-L-phenylalanine (DOPA) has been found to be more effective than 18F-labeled fluorine-2-deoxyglucose (FDG) in th… ▽ More

    Submitted 15 November, 2023; v1 submitted 1 November, 2023; originally announced November 2023.

    MSC Class: 68T01

  27. arXiv:2310.01885  [pdf

    eess.IV cs.LG q-bio.NC

    Synthetic CT Generation via Variant Invertible Network for All-digital Brain PET Attenuation Correction

    Authors: Yu Guan, Bohui Shen, Xinchong Shi, Xiangsong Zhang, Bingxuan Li, Qiegen Liu

    Abstract: Attenuation correction (AC) is essential for the generation of artifact-free and quantitatively accurate positron emission tomography (PET) images. However, AC of PET faces challenges including inter-scan motion and erroneous transformation of structural voxel-intensities to PET attenuation-correction factors. Nowadays, the problem of AC for quantitative PET have been solved to a large extent afte… ▽ More

    Submitted 3 October, 2023; originally announced October 2023.

  28. arXiv:2309.15487  [pdf, other

    cs.CV

    Tackling VQA with Pretrained Foundation Models without Further Training

    Authors: Alvin De Jun Tan, Bingquan Shen

    Abstract: Large language models (LLMs) have achieved state-of-the-art results in many natural language processing tasks. They have also demonstrated ability to adapt well to different tasks through zero-shot or few-shot settings. With the capability of these LLMs, researchers have looked into how to adopt them for use with Visual Question Answering (VQA). Many methods require further training to align the i… ▽ More

    Submitted 27 September, 2023; originally announced September 2023.

  29. arXiv:2309.05361  [pdf

    physics.plasm-ph cs.AI cs.LG

    Cross-tokamak Disruption Prediction based on Physics-Guided Feature Extraction and domain adaptation

    Authors: Chengshuo Shen, Wei Zheng, Bihao Guo, Yonghua Ding, Dalong Chen, Xinkun Ai, Fengming Xue, Yu Zhong, Nengchao Wang, Biao Shen, Binjia Xiao, Zhongyong Chen, Yuan Pan, J-TEXT team

    Abstract: The high acquisition cost and the significant demand for disruptive discharges for data-driven disruption prediction models in future tokamaks pose an inherent contradiction in disruption prediction research. In this paper, we demonstrated a novel approach to predict disruption in a future tokamak using only a few discharges. The first step is to use the existing understanding of physics to extrac… ▽ More

    Submitted 1 November, 2023; v1 submitted 11 September, 2023; originally announced September 2023.

    Comments: 17 pages, 9 figures

  30. arXiv:2308.16824  [pdf, other

    cs.CL cs.AI cs.PL cs.SE

    Can Programming Languages Boost Each Other via Instruction Tuning?

    Authors: Daoguang Zan, Ailun Yu, Bo Shen, Jiaxin Zhang, Taihong Chen, Bing Geng, Bei Chen, Jichuan Ji, Yafen Yao, Yongji Wang, Qianxiang Wang

    Abstract: When human programmers have mastered a programming language, it would be easier when they learn a new programming language. In this report, we focus on exploring whether programming languages can boost each other during the instruction fine-tuning phase of code large language models. We conduct extensive experiments of 8 popular programming languages (Python, JavaScript, TypeScript, C, C++, Java,… ▽ More

    Submitted 3 September, 2023; v1 submitted 31 August, 2023; originally announced August 2023.

    Comments: Work in progress

  31. arXiv:2308.08961  [pdf

    cs.SE

    On the Evaluation of Neural Code Translation: Taxonomy and Benchmark

    Authors: Mingsheng Jiao, Tingrui Yu, Xuan Li, Guanjie Qiu, Xiaodong Gu, Beijun Shen

    Abstract: In recent years, neural code translation has gained increasing attention. While most of the research focuses on improving model architectures and training processes, we notice that the evaluation process and benchmark for code translation models are severely limited: they primarily treat source code as natural languages and provide a holistic accuracy score while disregarding the full spectrum of… ▽ More

    Submitted 17 August, 2023; originally announced August 2023.

    Comments: accepted by ASE2023

  32. arXiv:2308.04041  [pdf, other

    cs.AI cs.CL

    InfeRE: Step-by-Step Regex Generation via Chain of Inference

    Authors: Shuai Zhang, Xiaodong Gu, Yuting Chen, Beijun Shen

    Abstract: Automatically generating regular expressions (abbrev. regexes) from natural language description (NL2RE) has been an emerging research area. Prior studies treat regex as a linear sequence of tokens and generate the final expressions autoregressively in a single pass. They did not take into account the step-by-step internal text-matching processes behind the final results. This significantly hinder… ▽ More

    Submitted 8 August, 2023; originally announced August 2023.

    Comments: This paper has been accepted by ASE'23

  33. arXiv:2308.03556  [pdf, other

    cs.IT eess.SP

    Joint Device Identification, Channel Estimation, and Signal Detection for LEO Satellite-Enabled Random Access

    Authors: Boxiao Shen, Yongpeng Wu, Wenjun Zhang, Symeon Chatzinotas, Björn Ottersten

    Abstract: This paper investigates joint device identification, channel estimation, and signal detection for LEO satellite-enabled grant-free random access, where a multiple-input multipleoutput (MIMO) system with orthogonal time-frequency space modulation (OTFS) is utilized to combat the dynamics of the terrestrial-satellite link (TSL). We divide the receiver structure into three modules: first, a linear mo… ▽ More

    Submitted 7 August, 2023; originally announced August 2023.

    Comments: This paper has been accepted for presentation at the IEEE GLOBECOM 2023

  34. arXiv:2307.16342  [pdf, other

    cs.LG cs.AI cs.CR

    Proof-of-Federated-Learning-Subchain: Free Partner Selection Subchain Based on Federated Learning

    Authors: Boyang Li, Bingyu Shen, Qing Lu, Taeho Jung, Yiyu Shi

    Abstract: The continuous thriving of the Blockchain society motivates research in novel designs of schemes supporting cryptocurrencies. Previously multiple Proof-of-Deep-Learning(PoDL) consensuses have been proposed to replace hashing with useful work such as deep learning model training tasks. The energy will be more efficiently used while maintaining the ledger. However deep learning models are problem-sp… ▽ More

    Submitted 30 July, 2023; originally announced July 2023.

    Comments: 7 pages, 7 figures

  35. arXiv:2307.15055  [pdf, other

    cs.CV

    PointOdyssey: A Large-Scale Synthetic Dataset for Long-Term Point Tracking

    Authors: Yang Zheng, Adam W. Harley, Bokui Shen, Gordon Wetzstein, Leonidas J. Guibas

    Abstract: We introduce PointOdyssey, a large-scale synthetic dataset, and data generation framework, for the training and evaluation of long-term fine-grained tracking algorithms. Our goal is to advance the state-of-the-art by placing emphasis on long videos with naturalistic motion. Toward the goal of naturalism, we animate deformable characters using real-world motion capture data, we build 3D scenes to m… ▽ More

    Submitted 27 July, 2023; originally announced July 2023.

  36. arXiv:2307.14936  [pdf, other

    cs.CL cs.AI cs.LG cs.PL cs.SE

    PanGu-Coder2: Boosting Large Language Models for Code with Ranking Feedback

    Authors: Bo Shen, Jiaxin Zhang, Taihong Chen, Daoguang Zan, Bing Geng, An Fu, Muhan Zeng, Ailun Yu, Jichuan Ji, Jingyang Zhao, Yuenan Guo, Qianxiang Wang

    Abstract: Large Language Models for Code (Code LLM) are flourishing. New and powerful models are released on a weekly basis, demonstrating remarkable performance on the code generation task. Various approaches have been proposed to boost the code generation performance of pre-trained Code LLMs, such as supervised fine-tuning, instruction tuning, reinforcement learning, etc. In this paper, we propose a novel… ▽ More

    Submitted 27 July, 2023; originally announced July 2023.

    Comments: Preprint

  37. arXiv:2306.01683  [pdf, other

    cs.LG cs.AI q-bio.BM

    Balancing Exploration and Exploitation: Disentangled $β$-CVAE in De Novo Drug Design

    Authors: Guang Jun Nicholas Ang, De Tao Irwin Chin, Bingquan Shen

    Abstract: Deep generative models have recently emerged as a promising de novo drug design method. In this respect, deep generative conditional variational autoencoder (CVAE) models are a powerful approach for generating novel molecules with desired drug-like properties. However, molecular graph-based models with disentanglement and multivariate explicit latent conditioning have not been fully elucidated. To… ▽ More

    Submitted 17 August, 2023; v1 submitted 2 June, 2023; originally announced June 2023.

  38. arXiv:2305.16727  [pdf, other

    cs.CV cs.AI

    A Novel real-time arrhythmia detection model using YOLOv8

    Authors: Guang Jun Nicholas Ang, Aritejh Kr Goil, Henryk Chan, Jieyi Jeric Lew, Xin Chun Lee, Raihan Bin Ahmad Mustaffa, Timotius Jason, Ze Ting Woon, Bingquan Shen

    Abstract: In a landscape characterized by heightened connectivity and mobility, coupled with a surge in cardiovascular ailments, the imperative to curtail healthcare expenses through remote monitoring of cardiovascular health has become more pronounced. The accurate detection and classification of cardiac arrhythmias are pivotal for diagnosing individuals with heart irregularities. This study underscores th… ▽ More

    Submitted 7 January, 2024; v1 submitted 26 May, 2023; originally announced May 2023.

  39. arXiv:2305.16315  [pdf, other

    cs.CV

    NAP: Neural 3D Articulation Prior

    Authors: Jiahui Lei, Congyue Deng, Bokui Shen, Leonidas Guibas, Kostas Daniilidis

    Abstract: We propose Neural 3D Articulation Prior (NAP), the first 3D deep generative model to synthesize 3D articulated object models. Despite the extensive research on generating 3D objects, compositions, or scenes, there remains a lack of focus on capturing the distribution of articulated objects, a common object category for human and robot interaction. To generate articulated objects, we first design a… ▽ More

    Submitted 25 May, 2023; originally announced May 2023.

    Comments: project page: https://www.cis.upenn.edu/~leijh/projects/nap

  40. arXiv:2305.16314  [pdf, other

    cs.CV

    Banana: Banach Fixed-Point Network for Pointcloud Segmentation with Inter-Part Equivariance

    Authors: Congyue Deng, Jiahui Lei, Bokui Shen, Kostas Daniilidis, Leonidas Guibas

    Abstract: Equivariance has gained strong interest as a desirable network property that inherently ensures robust generalization. However, when dealing with complex systems such as articulated objects or multi-object scenes, effectively capturing inter-part transformations poses a challenge, as it becomes entangled with the overall structure and local transformations. The interdependence of part assignment a… ▽ More

    Submitted 26 May, 2023; v1 submitted 25 May, 2023; originally announced May 2023.

  41. arXiv:2305.08446  [pdf, other

    cs.AI cs.RO

    Tracking Progress in Multi-Agent Path Finding

    Authors: Bojie Shen, Zhe Chen, Muhammad Aamir Cheema, Daniel D. Harabor, Peter J. Stuckey

    Abstract: Multi-Agent Path Finding (MAPF) is an important core problem for many new and emerging industrial applications. Many works appear on this topic each year, and a large number of substantial advancements and performance improvements have been reported. Yet measuring overall progress in MAPF is difficult: there are many potential competitors, and the computational burden for comprehensive experimenta… ▽ More

    Submitted 15 May, 2023; originally announced May 2023.

  42. arXiv:2304.14356  [pdf, other

    cs.RO

    S$^2$MAT: Simultaneous and Self-Reinforced Mapping and Tracking in Dynamic Urban Scenariosorcing Framework for Simultaneous Mapping and Tracking in Unbounded Urban Environments

    Authors: Tingxiang Fan, Bowen Shen, Yinqiang Zhang, Chuye Zhang, Lei Yang, Hua Chen, Wei Zhang, Jia Pan

    Abstract: Despite the increasing prevalence of robots in daily life, their navigation capabilities are still limited to environments with prior knowledge, such as a global map. To fully unlock the potential of robots, it is crucial to enable them to navigate in large-scale unknown and changing unstructured scenarios. This requires the robot to construct an accurate static map in real-time as it explores, wh… ▽ More

    Submitted 20 November, 2023; v1 submitted 27 April, 2023; originally announced April 2023.

    Comments: homepage: https://sites.google.com/view/smat-nav

  43. arXiv:2304.07704  [pdf, other

    cs.CR

    A Survey of Access Control Misconfiguration Detection Techniques

    Authors: Bingyu Shen

    Abstract: Access control mechanisms have been adopted in many real-world systems to control resource sharing for the principals in the system. An error in the access control policy (misconfiguration) can easily cause severe data leakage and system exploitation. Researchers have developed several methodologies to detect the access control misconfigurations through data mining, testing, and verification for v… ▽ More

    Submitted 16 April, 2023; originally announced April 2023.

    Comments: 12 pages

  44. arXiv:2304.02163  [pdf, other

    cs.CV cs.AI cs.GR cs.RO

    GINA-3D: Learning to Generate Implicit Neural Assets in the Wild

    Authors: Bokui Shen, Xinchen Yan, Charles R. Qi, Mahyar Najibi, Boyang Deng, Leonidas Guibas, Yin Zhou, Dragomir Anguelov

    Abstract: Modeling the 3D world from sensor data for simulation is a scalable way of developing testing and validation environments for robotic learning problems such as autonomous driving. However, manually creating or re-creating real-world-like environments is difficult, expensive, and not scalable. Recent generative model techniques have shown promising progress to address such challenges by learning 3D… ▽ More

    Submitted 28 August, 2023; v1 submitted 4 April, 2023; originally announced April 2023.

    Comments: Accepted by CVPR 2023; Our WOD-ObjectAsset can be accessed through waymo.com/open

  45. arXiv:2303.12745  [pdf, other

    cs.CV cs.AI

    Audio-Visual Deception Detection: DOLOS Dataset and Parameter-Efficient Crossmodal Learning

    Authors: Xiaobao Guo, Nithish Muthuchamy Selvaraj, Zitong Yu, Adams Wai-Kin Kong, Bingquan Shen, Alex Kot

    Abstract: Deception detection in conversations is a challenging yet important task, having pivotal applications in many fields such as credibility assessment in business, multimedia anti-frauds, and custom security. Despite this, deception detection research is hindered by the lack of high-quality deception datasets, as well as the difficulties of learning multimodal features effectively. To address this is… ▽ More

    Submitted 3 August, 2023; v1 submitted 9 March, 2023; originally announced March 2023.

    Comments: 11 pages, 6 figures

  46. arXiv:2303.09800  [pdf, other

    cs.CV cs.AI cs.RO

    GOOD: General Optimization-based Fusion for 3D Object Detection via LiDAR-Camera Object Candidates

    Authors: Bingqi Shen, Shuwei Dai, Yuyin Chen, Rong Xiong, Yue Wang, Yanmei Jiao

    Abstract: 3D object detection serves as the core basis of the perception tasks in autonomous driving. Recent years have seen the rapid progress of multi-modal fusion strategies for more robust and accurate 3D object detection. However, current researches for robust fusion are all learning-based frameworks, which demand a large amount of training data and are inconvenient to implement in new scenes. In this… ▽ More

    Submitted 17 March, 2023; originally announced March 2023.

  47. arXiv:2303.06815  [pdf, other

    cs.LG stat.ML

    On Model Compression for Neural Networks: Framework, Algorithm, and Convergence Guarantee

    Authors: Chenyang Li, Jihoon Chung, Biao Cai, Haimin Wang, Xianlian Zhou, Bo Shen

    Abstract: Model compression is a crucial part of deploying neural networks (NNs), especially when the memory and storage of computing devices are limited in many applications. This paper focuses on two model compression techniques: low-rank approximation and weight pruning in neural networks, which are very popular nowadays. However, training NN with low-rank approximation and weight pruning always suffers… ▽ More

    Submitted 4 January, 2024; v1 submitted 12 March, 2023; originally announced March 2023.

    Comments: 43 pages

  48. arXiv:2302.14314  [pdf, other

    cs.SD eess.AS

    Adapter Incremental Continual Learning of Efficient Audio Spectrogram Transformers

    Authors: Nithish Muthuchamy Selvaraj, Xiaobao Guo, Adams Kong, Bingquan Shen, Alex Kot

    Abstract: Continual learning involves training neural networks incrementally for new tasks while retaining the knowledge of previous tasks. However, efficiently fine-tuning the model for sequential tasks with minimal computational resources remains a challenge. In this paper, we propose Task Incremental Continual Learning (TI-CL) of audio classifiers with both parameter-efficient and compute-efficient Audio… ▽ More

    Submitted 2 January, 2024; v1 submitted 28 February, 2023; originally announced February 2023.

  49. arXiv:2302.05727  [pdf, other

    cs.CV

    Flexible-modal Deception Detection with Audio-Visual Adapter

    Authors: Zhaoxu Li, Zitong Yu, Nithish Muthuchamy Selvaraj, Xiaobao Guo, Bingquan Shen, Adams Wai-Kin Kong, Alex Kot

    Abstract: Detecting deception by human behaviors is vital in many fields such as custom security and multimedia anti-fraud. Recently, audio-visual deception detection attracts more attention due to its better performance than using only a single modality. However, in real-world multi-modal settings, the integrity of data can be an issue (e.g., sometimes only partial modalities are available). The missing mo… ▽ More

    Submitted 11 February, 2023; originally announced February 2023.

  50. CoderEval: A Benchmark of Pragmatic Code Generation with Generative Pre-trained Models

    Authors: Hao Yu, Bo Shen, Dezhi Ran, Jiaxin Zhang, Qi Zhang, Yuchi Ma, Guangtai Liang, Ying Li, Qianxiang Wang, Tao Xie

    Abstract: Code generation models based on the pre-training and fine-tuning paradigm have been increasingly attempted by both academia and industry, resulting in well-known industrial models such as Codex, CodeGen, and PanGu-Coder. To evaluate the effectiveness of these models, multiple existing benchmarks are proposed, including only cases of generating a standalone function, i.e., a function that may invok… ▽ More

    Submitted 23 February, 2024; v1 submitted 1 February, 2023; originally announced February 2023.

    Journal ref: ICSE (2024)