Showing 151–200 of 1,383 results for author: Pan, J

  1. arXiv:2312.13680  [pdf, other]

    cs.AI

    HGE: Embedding Temporal Knowledge Graphs in a Product Space of Heterogeneous Geometric Subspaces

    Authors: Jiaxin Pan, Mojtaba Nayyeri, Yinan Li, Steffen Staab

    Abstract: Temporal knowledge graphs represent temporal facts $(s,p,o,τ)$ relating a subject $s$ and an object $o$ via a relation label $p$ at time $τ$, where $τ$ could be a time point or time interval. Temporal knowledge graphs may exhibit static temporal patterns at distinct points in time and dynamic temporal patterns between different timestamps. In order to learn a rich set of static and dynamic tempora…

    Submitted 25 December, 2023; v1 submitted 21 December, 2023; originally announced December 2023.

    Comments: The 38th Annual AAAI Conference on Artificial Intelligence (AAAI'24)

  2. arXiv:2312.12030  [pdf, other]

    cs.CV cs.AI

    Towards Accurate Guided Diffusion Sampling through Symplectic Adjoint Method

    Authors: Jiachun Pan, Hanshu Yan, Jun Hao Liew, Jiashi Feng, Vincent Y. F. Tan

    Abstract: Training-free guided sampling in diffusion models leverages off-the-shelf pre-trained networks, such as an aesthetic evaluation model, to guide the generation process. Current training-free guided sampling algorithms obtain the guidance energy function based on a one-step estimate of the clean image. However, since the off-the-shelf pre-trained networks are trained on clean images, the one-step es…

    Submitted 19 December, 2023; originally announced December 2023.

  3. arXiv:2312.10997  [pdf, other]

    cs.CL cs.AI

    Retrieval-Augmented Generation for Large Language Models: A Survey

    Authors: Yunfan Gao, Yun Xiong, Xinyu Gao, Kangxiang Jia, Jinliu Pan, Yuxi Bi, Yi Dai, Jiawei Sun, Meng Wang, Haofen Wang

    Abstract: Large Language Models (LLMs) showcase impressive capabilities but encounter challenges like hallucination, outdated knowledge, and non-transparent, untraceable reasoning processes. Retrieval-Augmented Generation (RAG) has emerged as a promising solution by incorporating knowledge from external databases. This enhances the accuracy and credibility of the generation, particularly for knowledge-inten…

    Submitted 27 March, 2024; v1 submitted 18 December, 2023; originally announced December 2023.

    Comments: Ongoing Work

  4. arXiv:2312.08994  [pdf, other]

    cs.LG cs.AR

    PANDA: Architecture-Level Power Evaluation by Unifying Analytical and Machine Learning Solutions

    Authors: Qijun Zhang, Shiyu Li, Guanglei Zhou, Jingyu Pan, Chen-Chia Chang, Yiran Chen, Zhiyao Xie

    Abstract: Power efficiency is a critical design objective in modern microprocessor design. To evaluate the impact of architectural-level design decisions, an accurate yet efficient architecture-level power model is desired. However, widely adopted data-independent analytical power models like McPAT and Wattch have been criticized for their unreliable accuracy. While some machine learning (ML) methods have b…

    Submitted 14 December, 2023; originally announced December 2023.

    Journal ref: IEEE/ACM International Conference on Computer-Aided Design (ICCAD) 2023

  5. arXiv:2312.07839  [pdf, ps, other]

    math.ST cs.LG math.PR stat.ML

    Minimax-optimal estimation for sparse multi-reference alignment with collision-free signals

    Authors: Subhro Ghosh, Soumendu Sundar Mukherjee, Jing Bin Pan

    Abstract: The Multi-Reference Alignment (MRA) problem aims at the recovery of an unknown signal from repeated observations under the latent action of a group of cyclic isometries, in the presence of additive noise of high intensity $σ$. It is a more tractable version of the celebrated cryo EM model. In the crucial high noise regime, it is known that its sample complexity scales as $σ^6$. Recent investigatio…

    Submitted 12 December, 2023; originally announced December 2023.

  6. Discovery and Timing of Millisecond Pulsars in the Globular Cluster M5 (NGC 5904) with FAST and Arecibo

    Authors: Lei Zhang, Paulo C. C. Freire, Alessandro Ridolfi, Zhichen Pan, Jiaqi Zhao, Craig O. Heinke, Jianxing Chen, Mario Cadelano, Cristina Pallanca, Xian Hou, Xiaoting Fu, Shi Dai, Erbil Gugercinoglu, Meng Guo, Jason Hessels, Jiale Hu, Guodong Li, Mengmeng Ni, Jingshan Pan, Scott M. Ransom, Qitong Ruan, Ingrid Stairs, Chao-Wei Tsai, Pei Wang, Long Wang, et al. (7 additional authors not shown)

    Abstract: We report on a comprehensive multi-wavelength study of the pulsars in the globular cluster (GC) M5, including the discovery of M5G, a new compact non-eclipsing "black widow" pulsar. Thanks to the analysis of 34 years of radio data taken with the FAST and Arecibo telescopes, we obtained new phase-connected timing solutions for four pulsars in the clusters and improved those of the other three known…

    Submitted 10 December, 2023; originally announced December 2023.

    Journal ref: ApJS, 2023, 269:56

  7. arXiv:2312.05177  [pdf, other]

    astro-ph.CO astro-ph.GA gr-qc

    Compressed baryon acoustic oscillation analysis is robust to modified-gravity models

    Authors: Jiaming Pan, Dragan Huterer, Felipe Andrade-Oliveira, Camille Avestruz

    Abstract: We study the robustness of the baryon acoustic oscillation (BAO) analysis to the underlying cosmological model. We focus on testing the standard BAO analysis that relies on the use of a template. These templates are constructed assuming a fixed fiducial cosmological model and used to extract the location of the acoustic peaks. Such "compressed analysis" had been shown to be unbiased when applied t…

    Submitted 26 June, 2024; v1 submitted 8 December, 2023; originally announced December 2023.

    Comments: 30 pages, 8 Figures, published version

    Journal ref: Journal of Cosmology and Astroparticle Physics 06 (2024) 051

  8. arXiv:2312.04965  [pdf, other]

    cs.CV cs.AI cs.CL

    Inversion-Free Image Editing with Natural Language

    Authors: Sihan Xu, Yidong Huang, Jiayi Pan, Ziqiao Ma, Joyce Chai

    Abstract: Despite recent advances in inversion-based editing, text-guided image manipulation remains challenging for diffusion models. The primary bottlenecks include 1) the time-consuming nature of the inversion process; 2) the struggle to balance consistency with accuracy; 3) the lack of compatibility with efficient consistency sampling methods used in consistency models. To address the above issues, we s…

    Submitted 7 December, 2023; originally announced December 2023.

    Comments: Project Page: https://sled-group.github.io/InfEdit/

  9. arXiv:2312.03788  [pdf, other]

    cs.LG cs.CL

    SmoothQuant+: Accurate and Efficient 4-bit Post-Training Weight Quantization for LLM

    Authors: Jiayi Pan, Chengcan Wang, Kaifu Zheng, Yangguang Li, Zhenyu Wang, Bin Feng

    Abstract: Large language models (LLMs) have shown remarkable capabilities in various tasks. However, their huge model size and the consequent demand for computational and memory resources also pose challenges to model deployment. Currently, 4-bit post-training quantization (PTQ) has achieved some success in LLMs, reducing the memory footprint by approximately 75% compared to FP16 models, albeit with some acc…

    Submitted 6 December, 2023; originally announced December 2023.

  10. arXiv:2312.01837  [pdf, other]

    cs.CL

    Prompting Disentangled Embeddings for Knowledge Graph Completion with Pre-trained Language Model

    Authors: Yuxia Geng, Jiaoyan Chen, Yuhang Zeng, Zhuo Chen, Wen Zhang, Jeff Z. Pan, Yuxiang Wang, Xiaoliang Xu

    Abstract: Both graph structures and textual information play a critical role in Knowledge Graph Completion (KGC). With the success of Pre-trained Language Models (PLMs) such as BERT, they have been applied for text encoding for KGC. However, the current methods mostly prefer to fine-tune PLMs, leading to huge training costs and limited scalability to larger PLMs. In contrast, we propose to utilize prompts a…

    Submitted 4 December, 2023; originally announced December 2023.

    Comments: under review

  11. arXiv:2312.01677   

    cs.CV

    Multi-task Image Restoration Guided By Robust DINO Features

    Authors: Xin Lin, Chao Ren, Kelvin C. K. Chan, Lu Qi, Jinshan Pan, Ming-Hsuan Yang

    Abstract: Multi-task image restoration has gained significant interest due to its inherent versatility and efficiency compared to its single-task counterpart. Despite its potential, performance degradation is observed with an increase in the number of tasks, primarily attributed to the distinct nature of each restoration task. Addressing this challenge, we introduce DINO-IR, a novel multi-ta…

    Submitted 5 December, 2023; v1 submitted 4 December, 2023; originally announced December 2023.

    Comments: Some important information need to add

  12. arXiv:2312.01674  [pdf, other]

    cs.LG

    EDALearn: A Comprehensive RTL-to-Signoff EDA Benchmark for Democratized and Reproducible ML for EDA Research

    Authors: Jingyu Pan, Chen-Chia Chang, Zhiyao Xie, Yiran Chen

    Abstract: The application of Machine Learning (ML) in Electronic Design Automation (EDA) for Very Large-Scale Integration (VLSI) design has garnered significant research attention. Despite the requirement for extensive datasets to build effective ML models, most studies are limited to smaller, internally generated datasets due to the lack of comprehensive public resources. In response, we introduce EDALearn…

    Submitted 4 December, 2023; originally announced December 2023.

    Comments: 8 pages

  13. arXiv:2311.17532  [pdf, other]

    cs.CV

    Weakly-Supervised Emotion Transition Learning for Diverse 3D Co-speech Gesture Generation

    Authors: Xingqun Qi, Jiahao Pan, Peng Li, Ruibin Yuan, Xiaowei Chi, Mengfei Li, Wenhan Luo, Wei Xue, Shanghang Zhang, Qifeng Liu, Yike Guo

    Abstract: Generating vivid and emotional 3D co-speech gestures is crucial for virtual avatar animation in human-machine interaction applications. While the existing methods enable generating the gestures to follow a single emotion label, they overlook that long gesture sequence modeling with emotion transition is more practical in real scenes. In addition, the lack of large-scale available datasets with emo…

    Submitted 27 March, 2024; v1 submitted 29 November, 2023; originally announced November 2023.

    Comments: Accepted by CVPR 2024

  14. arXiv:2311.17455  [pdf, other]

    quant-ph physics.atom-ph physics.optics

    Experimental Generation of Spin-Photon Entanglement in Silicon Carbide

    Authors: Ren-Zhou Fang, Xiao-Yi Lai, Tao Li, Ren-Zhu Su, Bo-Wei Lu, Chao-Wei Yang, Run-Ze Liu, Yu-Kun Qiao, Cheng Li, Zhi-Gang He, Jia Huang, Hao Li, Li-Xing You, Yong-Heng Huo, Xiao-Hui Bao, Jian-Wei Pan

    Abstract: A solid-state approach for quantum networks is advantageous, as it allows the integration of nanophotonics to enhance the photon emission and the utilization of weakly coupled nuclear spins for long-lived storage. Silicon carbide, specifically point defects within it, shows great promise in this regard due to the ease of availability and well-established nanofabrication techniques. Despite remark…

    Submitted 29 November, 2023; originally announced November 2023.

    Comments: 8 pages in total, 4 figures in the main text, 1 figure in the supplemental material

  15. arXiv:2311.17366  [pdf, other]

    cs.CV

    Generative Hierarchical Temporal Transformer for Hand Action Recognition and Motion Prediction

    Authors: Yilin Wen, Hao Pan, Takehiko Ohkawa, Lei Yang, Jia Pan, Yoichi Sato, Taku Komura, Wenping Wang

    Abstract: We present a novel framework that concurrently tackles hand action recognition and 3D future hand motion prediction. While previous works focus on either recognition or prediction, we propose a generative Transformer VAE architecture to jointly capture both aspects, facilitating realistic motion prediction by leveraging the short-term hand motion and long-term action consistency observed across ti…

    Submitted 24 December, 2023; v1 submitted 29 November, 2023; originally announced November 2023.

  16. arXiv:2311.15309  [pdf, other]

    eess.IV

    Deep Refinement-Based Joint Source Channel Coding over Time-Varying Channels

    Authors: Junyu Pan, Hanlei Li, Guangyi Zhang, Yunlong Cai, Guanding Yu

    Abstract: In recent developments, deep learning (DL)-based joint source-channel coding (JSCC) for wireless image transmission has made significant strides in performance enhancement. Nonetheless, the majority of existing DL-based JSCC methods are tailored for scenarios featuring stable channel conditions, notably a fixed signal-to-noise ratio (SNR). This specialization poses a limitation, as their performan…

    Submitted 26 November, 2023; originally announced November 2023.

  17. arXiv:2311.11866  [pdf, other]

    cs.CY cs.AI

    Analyzing Emissions and Energy Efficiency at Unsignalized Real-world Intersections Under Mixed Traffic Control

    Authors: Michael Villarreal, Dawei Wang, Jia Pan, Weizi Li

    Abstract: Greenhouse gas emissions have dramatically risen since the early 1900s with U.S. transportation generating 28% of U.S. emissions. As such, there is interest in reducing transportation-related emissions. Specifically, sustainability research has sprouted around signalized intersections as intersections allow different streams of traffic to cross and change directions. Recent research has developed…

    Submitted 17 January, 2024; v1 submitted 20 November, 2023; originally announced November 2023.

    Comments: Accepted to 4th IEEE Forum for Innovative Sustainable Transportation Systems

  18. arXiv:2311.11551  [pdf, other]

    cs.CL

    Adapt in Contexts: Retrieval-Augmented Domain Adaptation via In-Context Learning

    Authors: Quanyu Long, Wenya Wang, Sinno Jialin Pan

    Abstract: Large language models (LLMs) have showcased their capability with few-shot inference known as in-context learning. However, in-domain demonstrations are not always readily available in real scenarios, leading to cross-domain in-context learning. Besides, LLMs are still facing challenges in long-tail knowledge in unseen and unfamiliar domains. The above limitations demonstrate the necessity of Unsu…

    Submitted 20 November, 2023; originally announced November 2023.

    Comments: EMNLP 2023

  19. arXiv:2311.11347  [pdf, other]

    cs.RO

    Large-scale Mixed Traffic Control Using Dynamic Vehicle Routing and Privacy-Preserving Crowdsourcing

    Authors: Dawei Wang, Weizi Li, Jia Pan

    Abstract: Controlling and coordinating urban traffic flow through robot vehicles is emerging as a novel transportation paradigm for the future. While this approach garners growing attention from researchers and practitioners, effectively managing and coordinating large-scale mixed traffic remains a challenge. We introduce an effective framework for large-scale mixed traffic control via privacy-preserving cr…

    Submitted 19 November, 2023; originally announced November 2023.

    Comments: Accepted to IEEE Internet of Things Journal

  20. arXiv:2311.10776  [pdf, other]

    cs.IR cs.AI

    Chemist-X: Large Language Model-empowered Agent for Reaction Condition Recommendation in Chemical Synthesis

    Authors: Kexin Chen, Junyou Li, Kunyi Wang, Yuyang Du, Jiahui Yu, Jiamin Lu, Lanqing Li, Jiezhong Qiu, Jianzhang Pan, Yi Huang, Qun Fang, Pheng Ann Heng, Guangyong Chen

    Abstract: Recent AI research plots a promising future of automatic chemical reactions within the chemistry society. This study proposes Chemist-X, a transformative AI agent that automates the reaction condition recommendation (RCR) task in chemical synthesis with retrieval-augmented generation (RAG) technology. To emulate expert chemists' strategies when solving RCR tasks, Chemist-X utilizes advanced RAG sc…

    Submitted 4 April, 2024; v1 submitted 15 November, 2023; originally announced November 2023.

  21. Shifting to Machine Supervision: Annotation-Efficient Semi and Self-Supervised Learning for Automatic Medical Image Segmentation and Classification

    Authors: Pranav Singh, Raviteja Chukkapalli, Shravan Chaudhari, Luoyao Chen, Mei Chen, Jinqian Pan, Craig Smuda, Jacopo Cirrone

    Abstract: Advancements in clinical treatment are increasingly constrained by the limitations of supervised learning techniques, which depend heavily on large volumes of annotated data. The annotation process is not only costly but also demands substantial time from clinical specialists. Addressing this issue, we introduce the S4MI (Self-Supervision and Semi-Supervision for Medical Imaging) pipeline, a novel…

    Submitted 17 May, 2024; v1 submitted 16 November, 2023; originally announced November 2023.

    Comments: Seventeen pages (incl. references), five figures, and one table. Accepted and published in Scientific Reports 14.1 (2024): 10820

    Journal ref: Singh, P., Chukkapalli, R., Chaudhari, S. et al. Shifting to machine supervision: annotation-efficient semi and self-supervised learning for automatic medical image segmentation and classification. Sci Rep 14, 10820 (2024)

  22. arXiv:2311.08347  [pdf, other]

    quant-ph

    High-efficiency single-photon source above the loss-tolerant threshold for efficient linear optical quantum computing

    Authors: Xing Ding, Yong-Peng Guo, Mo-Chi Xu, Run-Ze Liu, Geng-Yan Zou, Jun-Yi Zhao, Zhen-Xuan Ge, Qi-Hang Zhang, Hua-Liang Liu, Lin-Jun Wang, Ming-Cheng Chen, Hui Wang, Yu-Ming He, Yong-Heng Huo, Chao-Yang Lu, Jian-Wei Pan

    Abstract: Photon loss is the biggest enemy for scalable photonic quantum information processing. This problem can be tackled by using quantum error correction, provided that the overall photon loss is below a threshold of 1/3. However, all reported on-demand and indistinguishable single-photon sources still fall short of this threshold. Here, by using tailor shaped laser pulse excitation on a high-quantum e…

    Submitted 28 November, 2023; v1 submitted 14 November, 2023; originally announced November 2023.

  23. arXiv:2311.07547  [pdf, other]

    cs.CV cs.AI cs.CL cs.MM

    GPT-4V(ision) as A Social Media Analysis Engine

    Authors: Hanjia Lyu, Jinfa Huang, Daoan Zhang, Yongsheng Yu, Xinyi Mou, Jinsheng Pan, Zhengyuan Yang, Zhongyu Wei, Jiebo Luo

    Abstract: Recent research has offered insights into the extraordinary capabilities of Large Multimodal Models (LMMs) in various general vision and language tasks. There is growing interest in how LMMs perform in more specialized domains. Social media content, inherently multimodal, blends text, images, videos, and sometimes audio. Understanding social multimedia content remains a challenging problem for con…

    Submitted 13 November, 2023; originally announced November 2023.

  24. arXiv:2311.05261  [pdf]

    cs.CR

    RAGLog: Log Anomaly Detection using Retrieval Augmented Generation

    Authors: Jonathan Pan, Swee Liang Wong, Yidi Yuan

    Abstract: The ability to detect log anomalies from system logs is a vital activity needed to ensure cyber resiliency of systems. It is applied for fault identification or to facilitate cyber investigation and digital forensics. However, as logs belonging to different systems and components differ significantly, performing such analysis is humanly challenging owing to the volume, variety and velocity…

    Submitted 9 November, 2023; originally announced November 2023.

    Comments: arXiv admin note: substantial text overlap with arXiv:2203.10960

  25. arXiv:2311.02248  [pdf, other]

    cs.CL cs.AI eess.AS

    COSMIC: Data Efficient Instruction-tuning For Speech In-Context Learning

    Authors: Jing Pan, Jian Wu, Yashesh Gaur, Sunit Sivasankaran, Zhuo Chen, Shujie Liu, Jinyu Li

    Abstract: We present a cost-effective method to integrate speech into a large language model (LLM), resulting in a Contextual Speech Model with Instruction-following/in-context-learning Capabilities (COSMIC) multi-modal LLM. Using GPT-3.5, we generate Speech Comprehension Test Question-Answer (SQA) pairs from speech transcriptions for supervised instruction tuning. With under 30 million trainable parameters…

    Submitted 14 June, 2024; v1 submitted 3 November, 2023; originally announced November 2023.

  26. arXiv:2311.00047  [pdf, other]

    cs.AI cs.CL cs.CV cs.LG

    Grounding Visual Illusions in Language: Do Vision-Language Models Perceive Illusions Like Humans?

    Authors: Yichi Zhang, Jiayi Pan, Yuchen Zhou, Rui Pan, Joyce Chai

    Abstract: Vision-Language Models (VLMs) are trained on vast amounts of data captured by humans emulating our understanding of the world. However, human perception of reality isn't always faithful to the physical world, a phenomenon known as visual illusions. This raises a key question: do VLMs have similar kinds of illusions as humans do, or do they faithfully learn to represent reality? To investigate this questio…

    Submitted 31 October, 2023; originally announced November 2023.

    Comments: Accepted at EMNLP 2023 main conference

  27. arXiv:2310.19294  [pdf, other]

    physics.optics quant-ph

    Dual-comb spectroscopy over 100km open-air path

    Authors: Jin-Jian Han, Wei Zhong, Ruo-Can Zhao, Ting Zeng, Min Li, Jian Lu, Xin-Xin Peng, Xi-Ping Shi, Qin Yin, Yong Wang, Ali Esamdin, Qi Shen, Jian-Yu Guan, Lei Hou, Ji-Gang Ren, Jian-Jun Jia, Yu Wang, Hai-Feng Jiang, XiangHui Xue, Qiang Zhang, Xian-Kang Dou, Jian-Wei Pan

    Abstract: Satellite-based greenhouse gases (GHG) sensing technologies play a critical role in the study of global carbon emissions and climate change. However, none of the existing satellite-based GHG sensing technologies can achieve the measurement of broad bandwidth, high temporal-spatial resolution, and high sensitivity at the same time. Recently, dual-comb spectroscopy (DCS) has been proposed as a super…

    Submitted 31 October, 2023; v1 submitted 30 October, 2023; originally announced October 2023.

    Comments: 24 pages, 6 figures

  28. arXiv:2310.19019  [pdf, other]

    cs.CL cs.AI

    TeacherLM: Teaching to Fish Rather Than Giving the Fish, Language Modeling Likewise

    Authors: Nan He, Hanyu Lai, Chenyang Zhao, Zirui Cheng, Junting Pan, Ruoyu Qin, Ruofan Lu, Rui Lu, Yunchen Zhang, Gangming Zhao, Zhaohui Hou, Zhiyuan Huang, Shaoqing Lu, Ding Liang, Mingjie Zhan

    Abstract: Large Language Models (LLMs) exhibit impressive reasoning and data augmentation capabilities in various NLP tasks. However, what about small models? In this work, we propose TeacherLM-7.1B, capable of annotating relevant fundamentals, chain of thought, and common mistakes for most NLP samples, which makes annotation more than just an answer, thus allowing other models to learn "why" instead of jus…

    Submitted 15 July, 2024; v1 submitted 29 October, 2023; originally announced October 2023.

    Comments: 5 figures, 15 pages

  29. arXiv:2310.18292  [pdf, other]

    quant-ph

    Twin-field quantum key distribution with local frequency reference

    Authors: Jiu-Peng Chen, Fei Zhou, Chi Zhang, Cong Jiang, Fa-Xi Chen, Jia Huang, Hao Li, Li-Xing You, Xiang-Bin Wang, Yang Liu, Qiang Zhang, Jian-Wei Pan

    Abstract: Twin-field quantum key distribution (TF-QKD) overcomes the linear rate-loss limit, which promises a boost of secure key rate over long distance. However, the complexity of eliminating the frequency differences between the independent laser sources hinders its practical application. Here, taking the saturated absorption spectroscopy of acetylene as an absolute reference, we propose and demonstrate…

    Submitted 27 October, 2023; originally announced October 2023.

    Comments: 13 pages, 5 figures, 7 tables

    Journal ref: Phys. Rev. Lett. 132, 260802 (2024)

  30. arXiv:2310.17924  [pdf, other]

    cs.CL

    SOUL: Towards Sentiment and Opinion Understanding of Language

    Authors: Yue Deng, Wenxuan Zhang, Sinno Jialin Pan, Lidong Bing

    Abstract: Sentiment analysis is a well-established natural language processing task, with sentiment polarity classification being one of its most popular and representative tasks. However, despite the success of pre-trained language models in this area, they often fall short of capturing the broader complexities of sentiment analysis. To address this issue, we propose a new task called Sentiment and Opinion…

    Submitted 27 October, 2023; originally announced October 2023.

    Comments: EMNLP 2023 Main Conference, Short Paper

  31. arXiv:2310.17113  [pdf, other]

    physics.ins-det physics.optics quant-ph

    Compact free-running InGaAs/InP single-photon detector with 40% detection efficiency and 2.3 kcps dark count rate

    Authors: Qi Xu, Chao Yu, Wei Chen, Jianglin Zhao, Dajian Cui, Jun Zhang, Jian-Wei Pan

    Abstract: Free-running InGaAs/InP single-photon detectors (SPDs) based on negative-feedback avalanche diodes (NFADs) are the key components for applications requiring asynchronous single-photon detection in the near-infrared region. From the perspective of practical applications, the features of SPDs in terms of high photon detection efficiency (PDE), low noise, large sensitive area, and compactness are hig…

    Submitted 25 October, 2023; originally announced October 2023.

    Comments: 7 pages, 7 figures. Accepted for publication in the IEEE Journal of Selected Topics in Quantum Electronics

    Journal ref: IEEE Journal of Selected Topics in Quantum Electronics 30, 6400107 (2024)

  32. arXiv:2310.16713  [pdf, other]

    cs.CL cs.AI

    SkyMath: Technical Report

    Authors: Liu Yang, Haihua Yang, Wenjun Cheng, Lei Lin, Chenxia Li, Yifu Chen, Lunan Liu, Jianfei Pan, Tianwen Wei, Biye Li, Liang Zhao, Lijie Wang, Bo Zhu, Guoliang Li, Xuejie Wu, Xilin Luo, Rui Hu

    Abstract: Large language models (LLMs) have shown great potential to solve varieties of natural language processing (NLP) tasks, including mathematical reasoning. In this work, we present SkyMath, a large language model for mathematics with 13 billion parameters. By applying self-compare fine-tuning, we have enhanced mathematical reasoning abilities of Skywork-13B-Base remarkably. On GSM8K, SkyMath outperfo…

    Submitted 26 October, 2023; v1 submitted 25 October, 2023; originally announced October 2023.

  33. arXiv:2310.15590  [pdf, other]

    cs.CR cs.CV

    Facial Data Minimization: Shallow Model as Your Privacy Filter

    Authors: Yuwen Pu, Jiahao Chen, Jiayu Pan, Hao li, Diqun Yan, Xuhong Zhang, Shouling Ji

    Abstract: Face recognition service has been used in many fields and brings much convenience to people. However, once the user's facial data is transmitted to a service provider, the user will lose control of his/her private data. In recent years, there exist various security and privacy issues due to the leakage of facial data. Although many privacy-preserving methods have been proposed, they usually fail w…

    Submitted 12 November, 2023; v1 submitted 24 October, 2023; originally announced October 2023.

    Comments: 14 pages, 11 figures

  34. arXiv:2310.15539  [pdf, other]

    cs.CL cs.AI

    SteloCoder: a Decoder-Only LLM for Multi-Language to Python Code Translation

    Authors: Jialing Pan, Adrien Sadé, Jin Kim, Eric Soriano, Guillem Sole, Sylvain Flamant

    Abstract: With the recent focus on Large Language Models (LLMs), both StarCoder (Li et al., 2023) and Code Llama (Rozière et al., 2023) have demonstrated remarkable performance in code generation. However, there is still a need for improvement in code translation functionality with efficient training techniques. In response to this, we introduce SteloCoder, a decoder-only StarCoder-based LLM designed specif…

    Submitted 15 December, 2023; v1 submitted 24 October, 2023; originally announced October 2023.

  35. arXiv:2310.14050  [pdf, other]

    cs.CL

    Code-Switching with Word Senses for Pretraining in Neural Machine Translation

    Authors: Vivek Iyer, Edoardo Barba, Alexandra Birch, Jeff Z. Pan, Roberto Navigli

    Abstract: Lexical ambiguity is a significant and pervasive challenge in Neural Machine Translation (NMT), with many state-of-the-art (SOTA) NMT systems struggling to handle polysemous words (Campolungo et al., 2022). The same holds for the NMT pretraining paradigm of denoising synthetic "code-switched" text (Pan et al., 2021; Iyer et al., 2023), where word senses are ignored in the noising stage -- leading…

    Submitted 21 October, 2023; originally announced October 2023.

    Comments: EMNLP (Findings) 2023 Long Paper

  36. arXiv:2310.14024  [pdf, other]

    cond-mat.quant-gas cond-mat.supr-con quant-ph

    Observation and quantification of pseudogap in unitary Fermi gases

    Authors: Xi Li, Shuai Wang, Xiang Luo, Yu-Yang Zhou, Ke Xie, Hong-Chi Shen, Yu-Zhao Nie, Qijin Chen, Hui Hu, Yu-Ao Chen, Xing-Can Yao, Jian-Wei Pan

    Abstract: The nature of pseudogap lies at the heart of strongly-interacting superconductivity and superfluidity. With known pairing interactions, unitary Fermi gases provide an ideal testbed to verify whether a pseudogap can arise from many-body pairing. Here we report the observation of the long-sought pair-fluctuation-driven pseudogap in homogeneous unitary Fermi gases of lithium-6 atoms, by precisely mea…

    Submitted 21 October, 2023; originally announced October 2023.

  37. arXiv:2310.14021  [pdf, other]

    cs.DB

    Survey of Vector Database Management Systems

    Authors: James Jie Pan, Jianguo Wang, Guoliang Li

    Abstract: There are now over 20 commercial vector database management systems (VDBMSs), all produced within the past five years. But embedding-based retrieval has been studied for over ten years, and similarity search a staggering half century and more. Driving this shift from algorithms to systems are new data intensive applications, notably large language models, that demand vast stores of unstructured da…

    Submitted 21 October, 2023; originally announced October 2023.

    Comments: 25 pages

  38. arXiv:2310.12008  [pdf, other]

    cs.CL cs.AI

    Multi-view Contrastive Learning for Entity Typing over Knowledge Graphs

    Authors: Zhiwei Hu, Víctor Gutiérrez-Basulto, Zhiliang Xiang, Ru Li, Jeff Z. Pan

    Abstract: Knowledge graph entity typing (KGET) aims at inferring plausible types of entities in knowledge graphs. Existing approaches to KGET focus on how to better encode the knowledge provided by the neighbors and types of an entity into its representation. However, they ignore the semantic knowledge provided by the way in which types can be clustered together. In this paper, we propose a novel method cal…

    Submitted 18 October, 2023; originally announced October 2023.

    Comments: Accepted at EMNLP 2023 Main

  39. arXiv:2310.11676  [pdf, other]

    cs.LG cs.AI

    PREM: A Simple Yet Effective Approach for Node-Level Graph Anomaly Detection

    Authors: Junjun Pan, Yixin Liu, Yizhen Zheng, Shirui Pan

    Abstract: Node-level graph anomaly detection (GAD) plays a critical role in identifying anomalous nodes from graph-structured data in various domains such as medicine, social networks, and e-commerce. However, challenges have arisen due to the diversity of anomalies and the dearth of labeled data. Existing methodologies - reconstruction-based and contrastive learning - while effective, often suffer from eff…

    Submitted 27 November, 2023; v1 submitted 17 October, 2023; originally announced October 2023.

    Comments: Accepted by IEEE International Conference of Data Mining 2023 (ICDM 2023)

  40. arXiv:2310.11037  [pdf, ps, other]

    cs.IT

    Sampling for Remote Estimation of the Wiener Process over an Unreliable Channel

    Authors: Jiayu Pan, Yin Sun, Ness B. Shroff

    Abstract: In this paper, we study a sampling problem where a source takes samples from a Wiener process and transmits them through a wireless channel to a remote estimator. Due to channel fading, interference, and potential collisions, the packet transmissions are unreliable and could take random time durations. Our objective is to devise an optimal causal sampling policy that minimizes the long-term averag…

    Submitted 18 October, 2023; v1 submitted 17 October, 2023; originally announced October 2023.

    Comments: Accepted by ACM Sigmetrics, will appear in ACM POMACS journal

  41. arXiv:2310.10365  [pdf, other]

    quant-ph cond-mat.mes-hall

    Berry Curvature and Bulk-Boundary Correspondence from Transport Measurement for Photonic Chern Bands

    Authors: Chao Chen, Run-Ze Liu, Jizhou Wu, Zu-En Su, Xing Ding, Jian Qin, Lin Wang, Wei-Wei Zhang, Yu He, Xi-Lin Wang, Chao-Yang Lu, Li Li, Barry C. Sanders, Xiong-Jun Liu, Jian-Wei Pan

    Abstract: Berry curvature is a fundamental element to characterize topological quantum physics, while a full measurement of Berry curvature in momentum space was not reported for topological states. Here we achieve two-dimensional Berry curvature reconstruction in a photonic quantum anomalous Hall system via Hall transport measurement of a momentum-resolved wave packet. Integrating measured Berry curvature…

    Submitted 16 October, 2023; originally announced October 2023.

    Journal ref: Phys. Rev. Lett. 131, 133601 (25 September 2023)

  42. arXiv:2310.09824  [pdf, other]

    cs.RO

    Overconstrained Robotic Limb with Energy-Efficient, Omni-directional Locomotion

    Authors: Ronghan Xu, Jiayi Yin, Shihao Feng, Bangchao Huang, Haoran Sun, Jia Pan, Fang Wan, Chaoyang Song

    Abstract: This paper studies the design, modeling, and control of a novel quadruped, featuring overconstrained robotic limbs employing the Bennett linkage for motion and power transmission. The modular limb design allows the robot to morph into reptile- or mammal-inspired forms. In contrast to the prevailing focus on planar limbs, this research delves into the classical overconstrained linkages, which have…

    Submitted 3 February, 2024; v1 submitted 15 October, 2023; originally announced October 2023.

    Comments: 19 pages, 13 figures, 2 tables

  43. arXiv:2310.08276  [pdf, other]

    cs.CV cs.AI

    Direction-Oriented Visual-semantic Embedding Model for Remote Sensing Image-text Retrieval

    Authors: Qing Ma, Jiancheng Pan, Cong Bai

    Abstract: Image-text retrieval has developed rapidly in recent years. However, it is still a challenge in remote sensing due to visual-semantic imbalance, which leads to incorrect matching of non-semantic visual and textual features. To solve this problem, we propose a novel Direction-Oriented Visual-semantic Embedding Model (DOVE) to mine the relationship between vision and language. Our highlight is to co…

    Submitted 23 January, 2024; v1 submitted 12 October, 2023; originally announced October 2023.

    Comments: 14 pages, 11 figures

  44. arXiv:2310.06654  [pdf, other]

    cs.RO cs.CV

    Evaluating Explanation Methods for Vision-and-Language Navigation

    Authors: Guanqi Chen, Lei Yang, Guanhua Chen, Jia Pan

    Abstract: The ability to navigate robots with natural language instructions in an unknown environment is a crucial step for achieving embodied artificial intelligence (AI). With the improving performance of deep neural models proposed in the field of vision-and-language navigation (VLN), it is equally interesting to know what information the models utilize for their decision-making in the navigation tasks.…

    Submitted 10 October, 2023; originally announced October 2023.

    Comments: Accepted by ECAI 2023

  45. arXiv:2310.06474  [pdf, other]

    cs.CL

    Multilingual Jailbreak Challenges in Large Language Models

    Authors: Yue Deng, Wenxuan Zhang, Sinno Jialin Pan, Lidong Bing

    Abstract: While large language models (LLMs) exhibit remarkable capabilities across a wide range of tasks, they pose potential safety concerns, such as the "jailbreak" problem, wherein malicious instructions can manipulate LLMs to exhibit undesirable behavior. Although several preventive measures have been developed to mitigate the potential risks associated with LLMs, they have primarily focused on Engli…

    Submitted 3 March, 2024; v1 submitted 10 October, 2023; originally announced October 2023.

    Comments: ICLR 2024

  46. arXiv:2310.05128  [pdf, other]

    cs.CL cs.AI cs.LG

    Instances and Labels: Hierarchy-aware Joint Supervised Contrastive Learning for Hierarchical Multi-Label Text Classification

    Authors: Simon Yu, Jie He, Víctor Gutiérrez-Basulto, Jeff Z. Pan

    Abstract: Hierarchical multi-label text classification (HMTC) aims at utilizing a label hierarchy in multi-label classification. Recent approaches to HMTC deal with the problem of imposing an over-constrained premise on the output space by using contrastive learning on generated samples in a semi-supervised manner to bring text and label embeddings closer. However, the generation of samples tends to introdu…

    Submitted 19 June, 2024; v1 submitted 8 October, 2023; originally announced October 2023.

    Comments: 18 pages; 10 figures. Published as a conference paper at EMNLP 2023 Findings (Long Paper). Code and data available at https://github.com/simonucl/HJCL

  47. arXiv:2310.04747  [pdf, other]

    cs.CV cs.AI

    Towards Dynamic and Small Objects Refinement for Unsupervised Domain Adaptative Nighttime Semantic Segmentation

    Authors: Jingyi Pan, Sihang Li, Yucheng Chen, Jinjing Zhu, Lin Wang

    Abstract: Nighttime semantic segmentation plays a crucial role in practical applications, such as autonomous driving, where it frequently encounters difficulties caused by inadequate illumination conditions and the absence of well-annotated datasets. Moreover, semantic segmentation models trained on daytime datasets often face difficulties in generalizing effectively to nighttime conditions. Unsupervised do…

    Submitted 14 March, 2024; v1 submitted 7 October, 2023; originally announced October 2023.

  48. arXiv:2310.04721  [pdf, other]

    cs.CV

    Memory-Constrained Semantic Segmentation for Ultra-High Resolution UAV Imagery

    Authors: Qi Li, Jiaxin Cai, Yuanlong Yu, Jason Gu, Jia Pan, Wenxi Liu

    Abstract: Amidst the swift advancements in photography and sensor technologies, high-definition cameras have become commonplace in the deployment of Unmanned Aerial Vehicles (UAVs) for diverse operational purposes. Within the domain of UAV imagery analysis, the segmentation of ultra-high resolution images emerges as a substantial and intricate challenge, especially when grappling with the constraints impose…

    Submitted 7 October, 2023; originally announced October 2023.

  49. arXiv:2310.04400  [pdf, other]

    cs.LG cs.IR

    On the Embedding Collapse when Scaling up Recommendation Models

    Authors: Xingzhuo Guo, Junwei Pan, Ximei Wang, Baixu Chen, Jie Jiang, Mingsheng Long

    Abstract: Recent advances in foundation models have led to a promising trend of developing large recommendation models to leverage vast amounts of available data. Still, mainstream models remain embarrassingly small in size and naïve enlarging does not lead to sufficient performance gain, suggesting a deficiency in the model scalability. In this paper, we identify the embedding collapse phenomenon as the in…

    Submitted 6 June, 2024; v1 submitted 6 October, 2023; originally announced October 2023.

    Comments: ICML 2024 Accepted

  50. arXiv:2310.04399  [pdf, other]

    cs.CL

    Improving Stability in Simultaneous Speech Translation: A Revision-Controllable Decoding Approach

    Authors: Junkun Chen, Jian Xue, Peidong Wang, Jing Pan, Jinyu Li

    Abstract: Simultaneous Speech-to-Text translation serves a critical role in real-time crosslingual communication. Despite the advancements in recent years, challenges remain in achieving stability in the translation process, a concern primarily manifested in the flickering of partial results. In this paper, we propose a novel revision-controllable method designed to address this issue. Our method introduces…

    Submitted 6 October, 2023; originally announced October 2023.

    Comments: Accepted by ASRU 2023