Showing 1–50 of 170 results for author: Pang, C

  1. arXiv:2406.15774  [pdf, other]

    cs.RO

    Observation Time Difference: an Online Dynamic Objects Removal Method for Ground Vehicles

    Authors: Rongguang Wu, Chenglin Pang, Xuankang Wu, Zheng Fang

    Abstract: In the process of urban environment mapping, the sequential accumulations of dynamic objects will leave a large number of traces in the map. These traces will usually have bad influences on the localization accuracy and navigation performance of the robot. Therefore, dynamic objects removal plays an important role in creating a clean map. However, conventional dynamic objects removal methods usuall…

    Submitted 22 June, 2024; originally announced June 2024.

  2. arXiv:2406.13243  [pdf, ps, other]

    cs.IT

    Abelian Group Codes for Classical and Classical-Quantum Channels: One-shot and Asymptotic Rate Bounds

    Authors: James Chin-Jen Pang, Sandeep Pradhan, Hessam Mahdavifar

    Abstract: We study the problem of transmission of information over classical and classical-quantum channels in the one-shot regime where the underlying codes are constrained to be group codes. In the achievability part, we introduce a new input probability distribution that incorporates the encoding homomorphism and the underlying channel law. Using a random coding argument, we characterize the performance…

    Submitted 19 June, 2024; originally announced June 2024.

    Comments: 41 pages

  3. arXiv:2406.09317  [pdf, other]

    eess.IV cs.CV

    Common and Rare Fundus Diseases Identification Using Vision-Language Foundation Model with Knowledge of Over 400 Diseases

    Authors: Meng Wang, Tian Lin, Aidi Lin, Kai Yu, Yuanyuan Peng, Lianyu Wang, Cheng Chen, Ke Zou, Huiyu Liang, Man Chen, Xue Yao, Meiqin Zhang, Binwei Huang, Chaoxin Zheng, Peixin Zhang, Wei Chen, Yilong Luo, Yifan Chen, Honghe Xia, Tingkun Shi, Qi Zhang, Jinming Guo, Xiaolin Chen, Jingcheng Wang, Yih Chung Tham , et al. (24 additional authors not shown)

    Abstract: Previous foundation models for retinal images were pre-trained with limited disease categories and knowledge base. Here we introduce RetiZero, a vision-language foundation model that leverages knowledge from over 400 fundus diseases. For RetiZero's pre-training, we compiled 341,896 fundus images paired with text descriptions, sourced from public datasets, ophthalmic literature, and online resources…

    Submitted 30 June, 2024; v1 submitted 13 June, 2024; originally announced June 2024.

  4. arXiv:2406.04113  [pdf, other]

    cs.CL

    Uncovering Limitations of Large Language Models in Information Seeking from Tables

    Authors: Chaoxu Pang, Yixuan Cao, Chunhao Yang, Ping Luo

    Abstract: Tables are recognized for their high information density and widespread usage, serving as essential sources of information. Seeking information from tables (TIS) is a crucial capability for Large Language Models (LLMs), serving as the foundation of knowledge-based Q&A systems. However, this field presently suffers from an absence of thorough and reliable evaluation. This paper introduces a more re…

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: Findings of ACL 2024

  5. arXiv:2405.12376  [pdf]

    physics.optics physics.app-ph

    A silicon photonics waveguide-coupled colloidal quantum dot photodiode sensitive beyond 1.6 um

    Authors: Chao Pang, Yu-Hao Deng, Ezat Kheradmand, Luis Moreno Hagelsieb, Yujie Guo, David Cheyns, Pieter Geiregat, Zeger Hens, Dries Van Thourhout

    Abstract: Silicon photonics faces a persistent challenge in extending photodetection capabilities beyond the 1.6 um wavelength range, primarily due to the lack of appropriate epitaxial materials. Colloidal quantum dots (QDs) present a promising solution here, offering distinct advantages such as infrared wavelength tunability, cost-effectiveness, and facile deposition. Their unique properties position them…

    Submitted 28 May, 2024; v1 submitted 20 May, 2024; originally announced May 2024.

  6. arXiv:2405.07765  [pdf, other]

    cs.CL

    TANQ: An open domain dataset of table answered questions

    Authors: Mubashara Akhtar, Chenxi Pang, Andreea Marzoca, Yasemin Altun, Julian Martin Eisenschlos

    Abstract: Language models, potentially augmented with tool usage such as retrieval, are becoming the go-to means of answering questions. Understanding and answering questions in real-world settings often requires retrieving information from different sources, processing and aggregating data to extract insights, and presenting complex findings in the form of structured artifacts such as novel tables, charts, or i…

    Submitted 13 May, 2024; originally announced May 2024.

    Comments: 10 pages

  7. arXiv:2404.16157  [pdf, ps, other]

    math.PR math.AP

    Convergence of stochastic integrals with applications to transport equations and conservation laws with noise

    Authors: Kenneth H. Karlsen, Peter H. C. Pang

    Abstract: Convergence of stochastic integrals driven by Wiener processes $W_n$, with $W_n \to W$ almost surely in $C_t$, is crucial in analyzing SPDEs. Our focus is on the convergence of the form $\int_0^T V_n\, \mathrm{d} W_n \to \int_0^T V\, \mathrm{d} W$, where $\{V_n\}$ is bounded in $L^p(Ω\times [0,T];X)$ for a Banach space $X$ and some finite $p > 2$. This is challenging when $V_n$ converges to $V$ we…

    Submitted 24 April, 2024; originally announced April 2024.

    Comments: 31 pages

    MSC Class: Primary: 60H15; 60G46; Secondary: 60F25

  8. arXiv:2403.20213  [pdf, other]

    cs.CV

    H2RSVLM: Towards Helpful and Honest Remote Sensing Large Vision Language Model

    Authors: Chao Pang, Jiang Wu, Jiayu Li, Yi Liu, Jiaxing Sun, Weijia Li, Xingxing Weng, Shuai Wang, Litong Feng, Gui-Song Xia, Conghui He

    Abstract: Generic large Vision-Language Models (VLMs) are rapidly developing, but still perform poorly in the Remote Sensing (RS) domain, which is due to the unique and specialized nature of RS imagery and the comparatively limited spatial perception of current VLMs. Existing Remote Sensing specific Vision Language Models (RSVLMs) still have considerable potential for improvement, primarily owing to the lack…

    Submitted 29 March, 2024; originally announced March 2024.

    Comments: Equal contribution: Chao Pang, Jiang Wu; Corresponding author: Gui-Song Xia, Conghui He

  9. arXiv:2403.19427  [pdf]

    physics.optics

    Dynamic Phase Enabled Topological Mode Steering in Composite Su-Schrieffer-Heeger Waveguide Arrays

    Authors: Min Tang, Chi Pang, Christian N. Saggau, Haiyun Dong, Ching Hua Lee, Ronny Thomale, Sebastian Klembt, Ion Cosma Fulga, Jeroen Van Den Brink, Yana Vaynzof, Oliver G. Schmidt, Jiawei Wang, Libo Ma

    Abstract: Topological boundary states localize at interfaces whenever the interface implies a change of the associated topological invariant encoded in the geometric phase. The generically present dynamic phase, however, which is energy and time dependent, has been known to be non-universal, and hence not to intertwine with any topological geometric phase. Using the example of topological zero modes in comp…

    Submitted 28 March, 2024; originally announced March 2024.

  10. arXiv:2402.19386  [pdf, ps, other]

    math.AP math.PR

    The viscous variational wave equation with transport noise

    Authors: Peter H. C. Pang

    Abstract: This article considers the variational wave equation with viscosity and transport noise as a system of three coupled nonlinear stochastic partial differential equations. We prove pathwise global existence, uniqueness, and temporal continuity of solutions to this system in $L^2_x$. Martingale solutions are extracted from a two-level Galerkin approximation via the Skorokhod--Jakubowski theorem. We u…

    Submitted 29 February, 2024; originally announced February 2024.

    Comments: 40 pages

    MSC Class: Primary: 35R60; 35F55; Secondary: 35D30

  11. arXiv:2402.18132  [pdf, other]

    cs.CV cs.NE

    Understanding the Role of Pathways in a Deep Neural Network

    Authors: Lei Lyu, Chen Pang, Jihua Wang

    Abstract: Deep neural networks have demonstrated superior performance in artificial intelligence applications, but the opaqueness of their inner working mechanism is one major drawback in their application. The prevailing unit-based interpretation is a statistical observation of stimulus-response data, which fails to show a detailed internal process of inherent mechanisms of neural networks. In this work, w…

    Submitted 28 February, 2024; originally announced February 2024.

  12. RadarMOSEVE: A Spatial-Temporal Transformer Network for Radar-Only Moving Object Segmentation and Ego-Velocity Estimation

    Authors: Changsong Pang, Xieyuanli Chen, Yimin Liu, Huimin Lu, Yuwei Cheng

    Abstract: Moving object segmentation (MOS) and Ego velocity estimation (EVE) are vital capabilities for mobile systems to achieve full autonomy. Several approaches have attempted to achieve MOSEVE using a LiDAR sensor. However, LiDAR sensors are typically expensive and susceptible to adverse weather conditions. Instead, millimeter-wave radar (MWR) has gained popularity in robotics and autonomous driving for…

    Submitted 22 February, 2024; originally announced February 2024.

    Comments: Accepted at AAAI-24

    Journal ref: Proceedings of the AAAI Conference on Artificial Intelligence.38(2024)4424-4432

  13. arXiv:2402.10629  [pdf, ps, other]

    hep-ph

    M1 Radiative and spin-nonflip $ππ$ transitions of $B_c$ states in the Cornell potential model

    Authors: Zhi-bin Gao, Yan-yue Fan, Hao Chen, Cheng-qun Pang

    Abstract: In this paper, we mainly predict the rates of M1 radiative and spin-nonflip $ππ$ transitions of $B_{c}$-meson under the non-relativistic Cornell potential model with a screening potential effect. We employ the numerical wave function to determine the M1 radiative transition widths of $B_c$ excited states and utilize the Kuang-Yan proposed method for the spin-nonflip $ππ$ transitions among $B_c$ st…

    Submitted 16 February, 2024; originally announced February 2024.

    Comments: 4 figures. arXiv admin note: text overlap with arXiv:2205.05950 by other authors

  14. arXiv:2402.04400  [pdf, other]

    cs.LG cs.AI cs.CY

    CEHR-GPT: Generating Electronic Health Records with Chronological Patient Timelines

    Authors: Chao Pang, Xinzhuo Jiang, Nishanth Parameshwar Pavinkurve, Krishna S. Kalluri, Elise L. Minto, Jason Patterson, Linying Zhang, George Hripcsak, Gamze Gürsoy, Noémie Elhadad, Karthik Natarajan

    Abstract: Synthetic Electronic Health Records (EHR) have emerged as a pivotal tool in advancing healthcare applications and machine learning models, particularly for researchers without direct access to healthcare data. Although existing methods, like rule-based approaches and generative adversarial networks (GANs), generate synthetic data that resembles real-world EHR data, these methods often use a tabula…

    Submitted 5 May, 2024; v1 submitted 6 February, 2024; originally announced February 2024.

  15. arXiv:2401.10752  [pdf, other]

    cs.CV

    HiCD: Change Detection in Quality-Varied Images via Hierarchical Correlation Distillation

    Authors: Chao Pang, Xingxing Weng, Jiang Wu, Qiang Wang, Gui-Song Xia

    Abstract: Advanced change detection techniques primarily target image pairs of equal and high quality. However, variations in imaging conditions and platforms frequently lead to image pairs with distinct qualities: one image being high-quality, while the other being low-quality. These disparities in image quality present significant challenges for understanding image pairs semantically and extracting change…

    Submitted 19 January, 2024; originally announced January 2024.

    Comments: accepted by TGRS

  16. arXiv:2401.09085  [pdf]

    physics.optics

    3D orientation super-resolution spatial-frequency-shift microscopy

    Authors: Xiaowei Liu, Mingwei Tang, Ning Zhou, Chenlei Pang, Zhong Wen, Xu Liu, Qing Yang

    Abstract: Super-resolution mapping of the 3D orientation of fluorophores reveals the alignment of biological structures where the fluorophores are tightly attached, and thus plays a vital role in studying the organization and dynamics of bio-complexes. However, current super-resolution imaging techniques are either limited to 2D orientation mapping or suffer from slow speed and the requirement of special la…

    Submitted 22 January, 2024; v1 submitted 17 January, 2024; originally announced January 2024.

    Comments: 22 pages, 5 figures

  17. arXiv:2401.07607  [pdf]

    cond-mat.mtrl-sci

    SnS2 thin film with in-situ and controllable Sb doping via atomic layer deposition for optoelectronic applications

    Authors: Dong-Ho Shin, Jun Yang, Samik Mukherjee, Amin Bahrami, Sebastian Lehmann, Noushin Nasiri, Fabian Krahl, Chi Pang, Angelika Wrzesińska-Lashkova, Yana Vaynzof, Steve Wohlrab, Alexey Popov, Kornelius Nielsch

    Abstract: SnS2 stands out as a highly promising two-dimensional material with significant potential for applications in the field of electronics. Numerous attempts have been undertaken to modulate the physical properties of SnS2 by doping with various metal ions. Here, we deposited a series of Sb-doped SnS2 via atomic layer deposition (ALD) super-cycle process and compared its crystallinity, composition, an…

    Submitted 15 January, 2024; originally announced January 2024.

    Comments: 18 pages, 5 Figures, Journal

  18. arXiv:2312.17077  [pdf, ps, other]

    math.NA math.PR

    Projected Langevin Monte Carlo algorithms in non-convex and super-linear setting

    Authors: Chenxu Pang, Xiaojie Wang, Yue Wu

    Abstract: It is of significant interest in many applications to sample from a high-dimensional target distribution $π$ with the density $π(\text{d} x) \propto e^{-U(x)} (\text{d} x) $, based on the temporal discretization of the Langevin stochastic differential equations (SDEs). In this paper, we propose an explicit projected Langevin Monte Carlo (PLMC) algorithm with non-convex potential $U$ and super-line…

    Submitted 1 January, 2024; v1 submitted 28 December, 2023; originally announced December 2023.

    Comments: 31 pages, 6 figures

    MSC Class: 60H35; 65C05; 65C30

  19. arXiv:2312.14557  [pdf, other]

    cs.CL

    Aurora:Activating Chinese chat capability for Mixtral-8x7B sparse Mixture-of-Experts through Instruction-Tuning

    Authors: Rongsheng Wang, Haoming Chen, Ruizhe Zhou, Yaofei Duan, Kunyan Cai, Han Ma, Jiaxi Cui, Jian Li, Patrick Cheong-Iao Pang, Yapeng Wang, Tao Tan

    Abstract: Existing research has demonstrated that refining large language models (LLMs) through the utilization of machine-generated instruction-following data empowers these models to exhibit impressive zero-shot capabilities for novel tasks, without requiring human-authored instructions. In this paper, we systematically investigate, preprocess, and integrate three Chinese instruction-following datasets wi…

    Submitted 1 January, 2024; v1 submitted 22 December, 2023; originally announced December 2023.

    Comments: 10 pages, 2 figures

  20. arXiv:2312.11805  [pdf, other]

    cs.CL cs.AI cs.CV

    Gemini: A Family of Highly Capable Multimodal Models

    Authors: Gemini Team, Rohan Anil, Sebastian Borgeaud, Jean-Baptiste Alayrac, Jiahui Yu, Radu Soricut, Johan Schalkwyk, Andrew M. Dai, Anja Hauth, Katie Millican, David Silver, Melvin Johnson, Ioannis Antonoglou, Julian Schrittwieser, Amelia Glaese, Jilin Chen, Emily Pitler, Timothy Lillicrap, Angeliki Lazaridou, Orhan Firat, James Molloy, Michael Isard, Paul R. Barham, Tom Hennigan, Benjamin Lee , et al. (1325 additional authors not shown)

    Abstract: This report introduces a new family of multimodal models, Gemini, that exhibit remarkable capabilities across image, audio, video, and text understanding. The Gemini family consists of Ultra, Pro, and Nano sizes, suitable for applications ranging from complex reasoning tasks to on-device memory-constrained use-cases. Evaluation on a broad range of benchmarks shows that our most-capable Gemini Ultr…

    Submitted 17 June, 2024; v1 submitted 18 December, 2023; originally announced December 2023.

  21. arXiv:2312.06682  [pdf, other]

    cs.AI cs.LG

    Learning to Denoise Unreliable Interactions for Link Prediction on Biomedical Knowledge Graph

    Authors: Tengfei Ma, Yujie Chen, Wen Tao, Dashun Zheng, Xuan Lin, Patrick Cheong-lao Pang, Yiping Liu, Yijun Wang, Bosheng Song, Xiangxiang Zeng

    Abstract: Link prediction in biomedical knowledge graphs (KGs) aims at predicting unknown interactions between entities, including drug-target interaction (DTI) and drug-drug interaction (DDI), which is critical for drug discovery and therapeutics. Previous methods prefer to utilize the rich semantic relations and topological structure of the KG to predict missing links, yielding promising outcomes. However…

    Submitted 9 December, 2023; originally announced December 2023.

  22. arXiv:2311.09258  [pdf, other]

    physics.optics physics.app-ph

    Single-Chip Silicon Photonic Processor for Analog Optical and Microwave Signals

    Authors: Hong Deng, Jing Zhang, Emadreza Soltanian, Xiangfeng Chen, Chao Pang, Nicolas Vaissiere, Delphine Neel, Joan Ramirez, Jean Decobert, Nishant Singh, Guy Torfs, Gunther Roelkens, Wim Bogaerts

    Abstract: The explosion of data volume in communications, AI training, and cloud computing requires efficient data handling, which is typically stored as digital electrical information and transmitted as wireless radio frequency (RF) signals or light waves in optical fibres. Today's communications systems mostly treat the RF and optical signals separately, which results in unnecessary conversion losses and…

    Submitted 14 November, 2023; originally announced November 2023.

  23. arXiv:2310.17901  [pdf, other]

    cs.LG stat.ML

    Improving the Knowledge Gradient Algorithm

    Authors: Yang Le, Gao Siyang, Ho Chin Pang

    Abstract: The knowledge gradient (KG) algorithm is a popular policy for the best arm identification (BAI) problem. It is built on the simple idea of always choosing the measurement that yields the greatest expected one-step improvement in the estimate of the best mean of the arms. In this research, we show that this policy has limitations, so that the algorithm is not asymptotically optimal. We next provide a…

    Submitted 27 October, 2023; originally announced October 2023.

    Comments: 32 pages, 42 figures

  24. arXiv:2310.05066  [pdf, other]

    cs.CL cs.LG

    Guideline Learning for In-context Information Extraction

    Authors: Chaoxu Pang, Yixuan Cao, Qiang Ding, Ping Luo

    Abstract: Large language models (LLMs) can perform a new task by merely conditioning on task instructions and a few input-output examples, without optimizing any parameters. This is called In-Context Learning (ICL). In-context Information Extraction (IE) has recently garnered attention in the research community. However, the performance of In-context IE generally lags behind the state-of-the-art supervised…

    Submitted 21 October, 2023; v1 submitted 8 October, 2023; originally announced October 2023.

    Comments: EMNLP 2023 main conference

  25. arXiv:2310.02815  [pdf, other]

    cs.CV cs.RO eess.IV

    CoBEV: Elevating Roadside 3D Object Detection with Depth and Height Complementarity

    Authors: Hao Shi, Chengshan Pang, Jiaming Zhang, Kailun Yang, Yuhao Wu, Huajian Ni, Yining Lin, Rainer Stiefelhagen, Kaiwei Wang

    Abstract: Roadside camera-driven 3D object detection is a crucial task in intelligent transportation systems, which extends the perception range beyond the limitations of vision-centric vehicles and enhances road safety. While previous studies have limitations in using only depth or height information, we find both depth and height matter and they are in fact complementary. The depth feature encompasses pre…

    Submitted 17 October, 2023; v1 submitted 4 October, 2023; originally announced October 2023.

    Comments: The source code will be made publicly available at https://github.com/MasterHow/CoBEV

  26. arXiv:2309.02208  [pdf, ps, other]

    math.NA math.AP

    Convergent finite difference schemes for stochastic transport equations

    Authors: Ulrik S. Fjordholm, Kenneth H. Karlsen, Peter H. C. Pang

    Abstract: We present difference schemes for stochastic transport equations with low-regularity velocity fields. We establish $L^2$ stability and convergence of the difference approximations under conditions that are less strict than those required for deterministic transport equations. The $L^2$ estimate, crucial for the analysis, is obtained through a discrete duality argument and a comprehensive examinati…

    Submitted 3 July, 2024; v1 submitted 5 September, 2023; originally announced September 2023.

    Comments: 42 pages; adjustments in Section 2.2, other typos amended

    MSC Class: 60H15; 65M12; 60H50; 65M80

  27. Linear implicit approximations of invariant measures of semi-linear SDEs with non-globally Lipschitz coefficients

    Authors: Chenxu Pang, Xiaojie Wang, Yue Wu

    Abstract: This article investigates the weak approximation towards the invariant measure of semi-linear stochastic differential equations (SDEs) under non-globally Lipschitz coefficients. For this purpose, we propose a linear-theta-projected Euler (LTPE) scheme, which also admits an invariant measure, to handle the potential influence of the linear stiffness. Under certain assumptions, both the SDE and the…

    Submitted 17 September, 2023; v1 submitted 24 August, 2023; originally announced August 2023.

    Comments: 37 pages, 7 figures

    MSC Class: 60H35; 37M25; 65C30

    Journal ref: Journal of Complexity, Volume 83, August 2024, 101842

  28. arXiv:2307.10512  [pdf, other]

    cs.CL cs.AI

    IvyGPT: InteractiVe Chinese pathwaY language model in medical domain

    Authors: Rongsheng Wang, Yaofei Duan, ChanTong Lam, Jiexi Chen, Jiangsheng Xu, Haoming Chen, Xiaohong Liu, Patrick Cheong-Iao Pang, Tao Tan

    Abstract: General large language models (LLMs) such as ChatGPT have shown remarkable success. However, such LLMs have not been widely adopted for medical purposes, due to poor accuracy and inability to provide medical advice. We propose IvyGPT, an LLM based on LLaMA that is trained and fine-tuned with high-quality medical question-answer (QA) instances and Reinforcement Learning from Human Feedback (RLHF).…

    Submitted 19 July, 2023; originally announced July 2023.

    Comments: 5 pages, 3 figures

  29. arXiv:2305.12992  [pdf, ps, other]

    math.NA math.PR

    Antithetic multilevel Monte Carlo method for approximations of SDEs with non-globally Lipschitz continuous coefficients

    Authors: Chenxu Pang, Xiaojie Wang

    Abstract: In the field of computational finance, it is common for the quantity of interest to be expected values of functions of random variables via stochastic differential equations (SDEs). For SDEs with globally Lipschitz coefficients and commutative diffusion coefficients, the explicit Milstein scheme, relying on only Brownian increments and thus easily implementable, can be combined with the multilevel…

    Submitted 22 May, 2023; originally announced May 2023.

    Comments: 39 pages, 4 figures

    MSC Class: 65C05; 60H15; 65C30

  30. arXiv:2305.07328  [pdf, other]

    cs.CV

    Configurable Spatial-Temporal Hierarchical Analysis for Flexible Video Anomaly Detection

    Authors: Kai Cheng, Xinhua Zeng, Yang Liu, Tian Wang, Chengxin Pang, Jing Teng, Zhaoyang Xia, Jing Liu

    Abstract: Video anomaly detection (VAD) is a vital task with great practical applications in industrial surveillance, security system, and traffic control. Unlike previous unsupervised VAD methods that adopt a fixed structure to learn normality without considering different detection demands, we design a spatial-temporal hierarchical architecture (STHA) as a configurable architecture to flexibly detect diff…

    Submitted 12 May, 2023; originally announced May 2023.

    Comments: submitted to IEEE TCSVT, under peer review

  31. arXiv:2304.03981  [pdf, other]

    cs.LG cs.CV

    Uncertainty-inspired Open Set Learning for Retinal Anomaly Identification

    Authors: Meng Wang, Tian Lin, Lianyu Wang, Aidi Lin, Ke Zou, Xinxing Xu, Yi Zhou, Yuanyuan Peng, Qingquan Meng, Yiming Qian, Guoyao Deng, Zhiqun Wu, Junhong Chen, Jianhong Lin, Mingzhi Zhang, Weifang Zhu, Changqing Zhang, Daoqiang Zhang, Rick Siow Mong Goh, Yong Liu, Chi Pui Pang, Xinjian Chen, Haoyu Chen, Huazhu Fu

    Abstract: Failure to recognize samples from the classes unseen during training is a major limitation of artificial intelligence in the real-world implementation for recognition and classification of retinal anomalies. We established an uncertainty-inspired open-set (UIOS) model, which was trained with fundus images of 9 retinal conditions. Besides assessing the probability of each category, UIOS also calcul…

    Submitted 29 August, 2023; v1 submitted 8 April, 2023; originally announced April 2023.

  32. StoryChat: Designing a Narrative-Based Viewer Participation Tool for Live Streaming Chatrooms

    Authors: Ryan Yen, Li Feng, Brinda Mehra, Ching Christie Pang, Siying Hu, Zhicong Lu

    Abstract: Live streaming platforms and existing viewer participation tools enable users to interact and engage with an online community, but the anonymity and scale of chat usually result in the spread of negative comments. However, only a few existing moderation tools investigate the influence of proactive moderation on viewers' engagement and prosocial behavior. To address this, we developed StoryChat, a…

    Submitted 7 April, 2023; originally announced April 2023.

  33. arXiv:2303.09511  [pdf, other]

    cs.IT eess.SP

    Capacity-achieving Polar-based Codes with Sparsity Constraints on the Generator Matrices

    Authors: James Chin-Jen Pang, Hessam Mahdavifar, S. Sandeep Pradhan

    Abstract: In this paper, we leverage polar codes and the well-established channel polarization to design capacity-achieving codes with a certain constraint on the weights of all the columns in the generator matrix (GM) while having a low-complexity decoding algorithm. We first show that given a binary-input memoryless symmetric (BMS) channel $W$ and a constant $s \in (0, 1]$, there exists a polarization ker…

    Submitted 16 March, 2023; originally announced March 2023.

    Comments: 31 pages, single column. arXiv admin note: substantial text overlap with arXiv:2012.13977

  34. Harms from Increasingly Agentic Algorithmic Systems

    Authors: Alan Chan, Rebecca Salganik, Alva Markelius, Chris Pang, Nitarshan Rajkumar, Dmitrii Krasheninnikov, Lauro Langosco, Zhonghao He, Yawen Duan, Micah Carroll, Michelle Lin, Alex Mayhew, Katherine Collins, Maryam Molamohammadi, John Burden, Wanru Zhao, Shalaleh Rismani, Konstantinos Voudouris, Umang Bhatt, Adrian Weller, David Krueger, Tegan Maharaj

    Abstract: Research in Fairness, Accountability, Transparency, and Ethics (FATE) has established many sources and forms of algorithmic harm, in domains as diverse as health care, finance, policing, and recommendations. Much work remains to be done to mitigate the serious harms of these systems, particularly those disproportionately affecting marginalized communities. Despite these ongoing harms, new systems…

    Submitted 11 May, 2023; v1 submitted 20 February, 2023; originally announced February 2023.

    Comments: Accepted at FAccT 2023

  35. arXiv:2302.05582  [pdf, other]

    eess.AS cs.CL cs.SD cs.SE

    ASDF: A Differential Testing Framework for Automatic Speech Recognition Systems

    Authors: Daniel Hao Xian Yuen, Andrew Yong Chen Pang, Zhou Yang, Chun Yong Chong, Mei Kuan Lim, David Lo

    Abstract: Recent years have witnessed wider adoption of Automated Speech Recognition (ASR) techniques in various domains. Consequently, evaluating and enhancing the quality of ASR systems is of great importance. This paper proposes ASDF, an Automated Speech Recognition Differential Testing Framework for testing ASR systems. ASDF extends an existing ASR testing tool, the CrossASR++, which synthesizes test ca…

    Submitted 10 February, 2023; originally announced February 2023.

    Comments: Accepted by ICST 2023 Tool Demo Track

  36. arXiv:2302.04456  [pdf, other]

    cs.SD cs.AI cs.CL cs.MM eess.AS

    ERNIE-Music: Text-to-Waveform Music Generation with Diffusion Models

    Authors: Pengfei Zhu, Chao Pang, Yekun Chai, Lei Li, Shuohuan Wang, Yu Sun, Hao Tian, Hua Wu

    Abstract: In recent years, the burgeoning interest in diffusion models has led to significant advances in image and speech generation. Nevertheless, the direct synthesis of music waveforms from unrestricted textual prompts remains a relatively underexplored domain. In response to this lacuna, this paper introduces a pioneering contribution in the form of a text-to-waveform music generation model, underpinne…

    Submitted 21 September, 2023; v1 submitted 9 February, 2023; originally announced February 2023.

    Comments: Accepted by AACL demo 2023

  37. arXiv:2301.11495  [pdf, other]

    cs.CV

    Skeleton-based Action Recognition through Contrasting Two-Stream Spatial-Temporal Networks

    Authors: Chen Pang, Xuequan Lu, Lei Lyu

    Abstract: For pursuing accurate skeleton-based action recognition, most prior methods use the strategy of combining Graph Convolution Networks (GCNs) with attention-based methods in a serial way. However, they regard the human skeleton as a complete graph, resulting in fewer variations between different actions (e.g., the connection between the elbow and head in action ``clapping hands''). For this, we propo…

    Submitted 26 January, 2023; originally announced January 2023.

    Comments: 14 pages, 9 figures

  38. arXiv:2301.10922  [pdf, other]

    cs.CV

    Detecting Building Changes with Off-Nadir Aerial Images

    Authors: Chao Pang, Jiang Wu, Jian Ding, Can Song, Gui-Song Xia

    Abstract: The tilted viewing nature of the off-nadir aerial images brings severe challenges to the building change detection (BCD) problem: the mismatch of the nearby buildings and the semantic ambiguity of the building facades. To tackle these challenges, we present a multi-task guided change detection network model, named as MTGCD-Net. The proposed model approaches the specific BCD problem by designing th…

    Submitted 25 January, 2023; originally announced January 2023.

    Journal ref: SCIENCE CHINA Information Sciences (SCIS) 2023

  39. arXiv:2301.06349  [pdf, ps, other]

    math.AP math.PR

    Second order commutator estimates in renormalisation theory for SPDEs with gradient-type noise

    Authors: Peter H. C. Pang

    Abstract: An important step in standard renormalisation arguments involves convolution against a standard mollifier. As pointed out in (Punshon-Smith--Smith 2018), this generates second order commutator terms in equations with gradient-type noise. These are commutators similar to commutators in the well-known ``folklore lemma" of Di Perna--Lions (Di Perna--Lions 1989, Lemma II.1), but not covered by standard…

    Submitted 16 January, 2023; originally announced January 2023.

    Comments: 10 pages, proceedings of HYP2022

    MSC Class: 35-06; 35A25; 35R60; 60H15

  40. arXiv:2301.06096   

    math.PR math.AP

    Weak convergence of stochastic integrals

    Authors: Kenneth H. Karlsen, Peter H. C. Pang

    Abstract: The convergence of stochastic integrals driven by a sequence of Wiener processes $W_n\to W$ (with convergence in $C_t$) is crucial in the analysis of stochastic partial differential equations (SPDEs). The convergence we focus on in this paper is of the form $\int_0^T V_n\, {\rm d} W_n \to \int_0^T V\,{\rm d} W$, where $V_n$ takes values in $L^p([0,T];X)$ for some finite $p\ge 2$ and a Banach space…

    Submitted 23 August, 2023; v1 submitted 15 January, 2023; originally announced January 2023.

    Comments: This paper was withdrawn due to an error in the proof of the main theorem

    MSC Class: Primary: 60H15; 60G46; Secondary: 60F25

  41. arXiv:2212.10505  [pdf, other]

    cs.CL cs.AI cs.CV

    DePlot: One-shot visual language reasoning by plot-to-table translation

    Authors: Fangyu Liu, Julian Martin Eisenschlos, Francesco Piccinno, Syrine Krichene, Chenxi Pang, Kenton Lee, Mandar Joshi, Wenhu Chen, Nigel Collier, Yasemin Altun

    Abstract: Visual language such as charts and plots is ubiquitous in the human world. Comprehending plots and charts requires strong reasoning skills. Prior state-of-the-art (SOTA) models require at least tens of thousands of training examples and their reasoning capabilities are still very limited, especially on complex human-written queries. This paper presents the first one-shot solution to visual languag…

    Submitted 23 May, 2023; v1 submitted 20 December, 2022; originally announced December 2022.

    Comments: ACL 2023 (Findings)

  42. arXiv:2212.09662  [pdf, other]

    cs.CL cs.AI cs.CV

    MatCha: Enhancing Visual Language Pretraining with Math Reasoning and Chart Derendering

    Authors: Fangyu Liu, Francesco Piccinno, Syrine Krichene, Chenxi Pang, Kenton Lee, Mandar Joshi, Yasemin Altun, Nigel Collier, Julian Martin Eisenschlos

    Abstract: Visual language data such as plots, charts, and infographics are ubiquitous in the human world. However, state-of-the-art vision-language models do not perform well on these data. We propose MatCha (Math reasoning and Chart derendering pretraining) to enhance visual language models' capabilities in jointly modeling charts/plots and language data. Specifically, we propose several pretraining tasks…

    Submitted 23 May, 2023; v1 submitted 19 December, 2022; originally announced December 2022.

    Comments: ACL 2023

  43. arXiv:2212.06742  [pdf, other]

    cs.CL cs.LG cs.PL cs.SE

    ERNIE-Code: Beyond English-Centric Cross-lingual Pretraining for Programming Languages

    Authors: Yekun Chai, Shuohuan Wang, Chao Pang, Yu Sun, Hao Tian, Hua Wu

    Abstract: Software engineers working with the same programming language (PL) may speak different natural languages (NLs) and vice versa, erecting huge barriers to communication and working efficiency. Recent studies have demonstrated the effectiveness of generative pre-training in computer programs, yet they are always English-centric. In this work, we step towards bridging the gap between multilingual NLs…

    Submitted 19 May, 2023; v1 submitted 13 December, 2022; originally announced December 2022.

    Comments: Accepted at ACL 2023 (Findings)

  44. arXiv:2212.01575  [pdf]

    cs.LG q-bio.BM

    Multi-view deep learning based molecule design and structural optimization accelerates the SARS-CoV-2 inhibitor discovery

    Authors: Chao Pang, Yu Wang, Yi Jiang, Ruheng Wang, Ran Su, Leyi Wei

    Abstract: In this work, we propose MEDICO, a Multi-viEw Deep generative model for molecule generation, structural optimization, and the SARS-CoV-2 Inhibitor disCOvery. To the best of our knowledge, MEDICO is the first-of-its-kind graph generative model that can generate molecular graphs similar to the structure of targeted molecules, with a multi-view representation learning framework to sufficiently and a…

    Submitted 3 December, 2022; originally announced December 2022.

  45. arXiv:2211.14921  [pdf]

    physics.med-ph

    Padded Helmet Shell Covers in American Football: A Comprehensive Laboratory Evaluation with Preliminary On-Field Findings

    Authors: Nicholas J. Cecchi, Ashlyn A. Callan, Landon P. Watson, Yuzhe Liu, Xianghao Zhan, Ramanand V. Vegesna, Collin Pang, Enora Le Flao, Gerald A. Grant, Michael M. Zeineh, David B. Camarillo

    Abstract: Protective headgear effects measured in the laboratory may not always translate to the field. In this study, we evaluated the impact attenuation capabilities of a commercially available padded helmet shell cover in the laboratory and field. In the laboratory, we evaluated the efficacy of the padded helmet shell cover in attenuating impact magnitude across six impact locations and three impact velo…

    Submitted 27 November, 2022; originally announced November 2022.

    Comments: 49 references, 8 figures

  46. arXiv:2211.09023  [pdf, ps, other]

    hep-ph

    Can the three new states around 2.2 GeV assign to $ω(3D)$

    Authors: Ya-rong Wang, Yang Ma, Cheng-qun Pang

    Abstract: Recently, the BESIII Collaboration reported three resonances: $X(2232)$ with $M = 2232 \pm 19 \pm 27$ MeV and $Γ = 93 \pm 53 \pm 20$ MeV, $X(2200)$ with $M = 2200 \pm 11 \pm 17$ MeV and $Γ = 74 \pm 20 \pm 24$ MeV, and $X(2222)$ with $M = 2222 \pm 7 \pm 2$ MeV and $Γ = 59 \pm 30 \pm 6$ MeV. The mass spectrum of the $ω$ meson family is studied utilizing the modified God…

    Submitted 15 November, 2022; originally announced November 2022.

    Comments: 5 pages, 2 figures

  47. arXiv:2211.07454  [pdf, other]

    cs.CV

    LGN-Net: Local-Global Normality Network for Video Anomaly Detection

    Authors: Mengyang Zhao, Xinhua Zeng, Yang Liu, Jing Liu, Di Li, Xing Hu, Chengxin Pang

    Abstract: Video anomaly detection (VAD) has been intensively studied for years because of its potential applications in intelligent video systems. Existing unsupervised VAD methods tend to learn normality from training sets consisting of only normal videos and regard instances deviating from such normality as anomalies. However, they often consider only local or global normality in the temporal dimension. S…

    Submitted 8 January, 2023; v1 submitted 14 November, 2022; originally announced November 2022.

  48. Global existence of dissipative solutions to the Camassa--Holm equation with transport noise

    Authors: Luca Galimberti, Helge Holden, Kenneth H. Karlsen, Peter H. C. Pang

    Abstract: We consider a nonlinear stochastic partial differential equation (SPDE) that takes the form of the Camassa--Holm equation perturbed by a convective, position-dependent, noise term. We establish the first global-in-time existence result for dissipative weak martingale solutions to this SPDE, with general finite-energy initial data. The solution is obtained as the limit of classical solutions to par…

    Submitted 1 January, 2024; v1 submitted 13 November, 2022; originally announced November 2022.

    Comments: 86 pages

    MSC Class: Primary: 35R60; 35G25; Secondary: 35A01; 35D30

  49. arXiv:2211.03885  [pdf, other]

    cs.CV eess.IV

    Learned Smartphone ISP on Mobile GPUs with Deep Learning, Mobile AI & AIM 2022 Challenge: Report

    Authors: Andrey Ignatov, Radu Timofte, Shuai Liu, Chaoyu Feng, Furui Bai, Xiaotao Wang, Lei Lei, Ziyao Yi, Yan Xiang, Zibin Liu, Shaoqing Li, Keming Shi, Dehui Kong, Ke Xu, Minsu Kwon, Yaqi Wu, Jiesi Zheng, Zhihao Fan, Xun Wu, Feng Zhang, Albert No, Minhyeok Cho, Zewen Chen, Xiaze Zhang, Ran Li , et al. (13 additional authors not shown)

    Abstract: The role of mobile cameras has increased dramatically over the past few years, leading to more and more research in automatic image quality enhancement and RAW photo processing. In this Mobile AI challenge, the target was to develop an efficient end-to-end AI-based image signal processing (ISP) pipeline replacing the standard mobile ISPs that can run on modern smartphone GPUs using TensorFlow Lite. Th…

    Submitted 7 November, 2022; originally announced November 2022.

  50. arXiv:2211.03545  [pdf, other]

    eess.AS cs.CL cs.SD

    ERNIE-SAT: Speech and Text Joint Pretraining for Cross-Lingual Multi-Speaker Text-to-Speech

    Authors: Xiaoran Fan, Chao Pang, Tian Yuan, He Bai, Renjie Zheng, Pengfei Zhu, Shuohuan Wang, Junkun Chen, Zeyu Chen, Liang Huang, Yu Sun, Hua Wu

    Abstract: Speech representation learning has improved both speech understanding and speech synthesis tasks for a single language. However, its ability in cross-lingual scenarios has not been explored. In this paper, we extend the pretraining method for cross-lingual multi-speaker speech synthesis tasks, including cross-lingual multi-speaker voice cloning and cross-lingual multi-speaker speech editing. We prop…

    Submitted 4 December, 2022; v1 submitted 7 November, 2022; originally announced November 2022.