Skip to main content

Showing 1–50 of 340 results for author: Lu, K

  1. arXiv:2407.10671  [pdf, other

    cs.CL cs.AI

    Qwen2 Technical Report

    Authors: An Yang, Baosong Yang, Binyuan Hui, Bo Zheng, Bowen Yu, Chang Zhou, Chengpeng Li, Chengyuan Li, Dayiheng Liu, Fei Huang, Guanting Dong, Haoran Wei, Huan Lin, Jialong Tang, Jialin Wang, Jian Yang, Jianhong Tu, Jianwei Zhang, Jianxin Ma, Jin Xu, Jingren Zhou, Jinze Bai, Jinzheng He, Junyang Lin, Kai Dang , et al. (34 additional authors not shown)

    Abstract: This report introduces the Qwen2 series, the latest addition to our large language models and large multimodal models. We release a comprehensive suite of foundational and instruction-tuned language models, encompassing a parameter range from 0.5 to 72 billion, featuring dense models and a Mixture-of-Experts model. Qwen2 surpasses most prior open-weight models, including its predecessor Qwen1.5, a… ▽ More

    Submitted 16 July, 2024; v1 submitted 15 July, 2024; originally announced July 2024.

    Comments: 25 pages, 1 figure

  2. arXiv:2407.09932  [pdf, other

    quant-ph

    Quantum Clock Synchronization Network with Silicon-chip Dual-Pumped Entangled Photon Source

    Authors: J. A. Li, H. Han, X. P. Huang, B. Y. Tang, K. Guo, J. Q. Huang, S. Y. Xiong, W. R. Yu, Z. J. Zhang, J. B. Yang, B. Liu, H. Chen, Z. K. Lu

    Abstract: In this paper, we propose a quantum clock synchronization (QCS) network scheme with silicon-chip dual-pumped entangled photon source. This scheme couples two pump beams into the silicon-based waveguide, where degenerate and non-degenerate spontaneous four-wave mixing (SFWM) occurs, generating entanglement between one signal channel and three idler channels. The entangled photons are distributed to… ▽ More

    Submitted 13 July, 2024; originally announced July 2024.

  3. arXiv:2407.09886  [pdf, other

    eess.AS cs.CL cs.SD

    Speech-Copilot: Leveraging Large Language Models for Speech Processing via Task Decomposition, Modularization, and Program Generation

    Authors: Chun-Yi Kuan, Chih-Kai Yang, Wei-Ping Huang, Ke-Han Lu, Hung-yi Lee

    Abstract: In this work, we introduce Speech-Copilot, a modular framework for instruction-oriented speech-processing tasks that minimizes human effort in toolset construction. Unlike end-to-end methods using large audio-language models, Speech-Copilot builds speech processing-specific toolsets by analyzing pre-collected task instructions and breaking tasks into manageable sub-tasks. It features a flexible ag… ▽ More

    Submitted 13 July, 2024; originally announced July 2024.

    Comments: 8 pages, 2 figures

  4. arXiv:2407.06957  [pdf, other

    eess.AS cs.CL cs.CY

    Listen and Speak Fairly: A Study on Semantic Gender Bias in Speech Integrated Large Language Models

    Authors: Yi-Cheng Lin, Tzu-Quan Lin, Chih-Kai Yang, Ke-Han Lu, Wei-Chih Chen, Chun-Yi Kuan, Hung-yi Lee

    Abstract: Speech Integrated Large Language Models (SILLMs) combine large language models with speech perception to perform diverse tasks, such as emotion recognition to speaker verification, demonstrating universal audio understanding capability. However, these models may amplify biases present in training data, potentially leading to biased access to information for marginalized groups. This work introduce… ▽ More

    Submitted 9 July, 2024; originally announced July 2024.

  5. arXiv:2407.05414  [pdf, other

    astro-ph.GA astro-ph.HE

    Velocity-Resolved Ionization Mapping of Broad Line Region. I. Insights into Diverse Geometry and Kinematics

    Authors: Sha-Sha Li, Hai-Cheng Feng, H. T. Liu, J. M. Bai, Xiang Ji, Cheng Cheng, Kai-Xing Lu, Jian-Guo Wang, Rui Li

    Abstract: Broad emission lines of active galactic nuclei (AGNs) originate from the broad-line region (BLR), consisting of dense gas clouds in orbit around an accreting supermassive black hole. Understanding the geometry and kinematics of the region is crucial for gaining insights into the physics and evolution of AGNs. Conventional velocity-resolved reverberation mapping may face challenges in disentangling… ▽ More

    Submitted 7 July, 2024; originally announced July 2024.

    Comments: 20 pages, 10 figures, Accepted by ApJ

  6. arXiv:2407.04294  [pdf, other

    cs.CR

    SQLaser: Detecting DBMS Logic Bugs with Clause-Guided Fuzzing

    Authors: Jin Wei, Ping Chen, Kangjie Lu, Jun Dai, Xiaoyan Sun

    Abstract: Database Management Systems (DBMSs) are vital components in modern data-driven systems. Their complexity often leads to logic bugs, which are implementation errors within the DBMSs that can lead to incorrect query results, data exposure, unauthorized access, etc., without necessarily causing visible system failures. Existing detection employs two strategies: rule-based bug detection and coverage-g… ▽ More

    Submitted 5 July, 2024; originally announced July 2024.

  7. arXiv:2406.18871  [pdf, other

    eess.AS cs.CL

    DeSTA: Enhancing Speech Language Models through Descriptive Speech-Text Alignment

    Authors: Ke-Han Lu, Zhehuai Chen, Szu-Wei Fu, He Huang, Boris Ginsburg, Yu-Chiang Frank Wang, Hung-yi Lee

    Abstract: Recent speech language models (SLMs) typically incorporate pre-trained speech models to extend the capabilities from large language models (LLMs). In this paper, we propose a Descriptive Speech-Text Alignment approach that leverages speech captioning to bridge the gap between speech and text modalities, enabling SLMs to interpret and generate comprehensive natural language descriptions, thereby fa… ▽ More

    Submitted 26 June, 2024; originally announced June 2024.

    Comments: Accepted to Interspeech 2024

  8. arXiv:2406.14024  [pdf, other

    cs.CL

    LLM Critics Help Catch Bugs in Mathematics: Towards a Better Mathematical Verifier with Natural Language Feedback

    Authors: Bofei Gao, Zefan Cai, Runxin Xu, Peiyi Wang, Ce Zheng, Runji Lin, Keming Lu, Dayiheng Liu, Chang Zhou, Wen Xiao, Junjie Hu, Tianyu Liu, Baobao Chang

    Abstract: Mathematical verfier achieves success in mathematical reasoning tasks by validating the correctness of solutions. However, existing verifiers are trained with binary classification labels, which are not informative enough for the model to accurately assess the solutions. To mitigate the aforementioned insufficiency of binary labels, we introduce step-wise natural language feedbacks as rationale la… ▽ More

    Submitted 8 July, 2024; v1 submitted 20 June, 2024; originally announced June 2024.

    Comments: 9 pages

  9. arXiv:2406.13542  [pdf, other

    cs.CL cs.AI cs.LG

    Self-play with Execution Feedback: Improving Instruction-following Capabilities of Large Language Models

    Authors: Guanting Dong, Keming Lu, Chengpeng Li, Tingyu Xia, Bowen Yu, Chang Zhou, Jingren Zhou

    Abstract: One core capability of large language models (LLMs) is to follow natural language instructions. However, the issue of automatically constructing high-quality training data to enhance the complex instruction-following abilities of LLMs without manual annotation remains unresolved. In this paper, we introduce AutoIF, the first scalable and reliable method for automatically generating instruction-fol… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

  10. arXiv:2406.05067  [pdf, ps, other

    math.QA math-ph math.RT

    Affine $\imath$quantum groups and twisted Yangians in Drinfeld presentations

    Authors: Kang Lu, Weiqiang Wang, Weinan Zhang

    Abstract: We formulate a family of algebras, twisted Yangians (of split type) in current generators and relations, via a degeneration of the Drinfeld presentation of affine $\imath$quantum groups (associated with split Satake diagrams). These new algebras admit PBW type bases and are shown to be a deformation of twisted current algebras; presentations for twisted current algebras are also provided. For type… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

    Comments: 33 pages

    MSC Class: 17B37

  11. arXiv:2406.03797  [pdf, other

    astro-ph.GA

    Morpho-Photometric Classification of KiDS DR5 Sources Based on Neural Networks: A Comprehensive Star-Quasar-Galaxy Catalog

    Authors: Hai-Cheng Feng, Rui Li, Nicola R. Napolitano, Sha-Sha Li, J. M. Bai, Ran Li, H. T. Liu, Kai-Xing Lu, Mario Radovich, Huan-Yuan Shan, Jian-Guo Wang, Wen-Zhe Xi, Ling-Hua Xie, Yang-Wei Zhang

    Abstract: We present a novel multimodal neural network for classifying astronomical sources in multiband ground-based observations, from optical to near infrared, to separate sources in stars, galaxies and quasars. Our approach combines a convolutional neural network branch for learning morphological features from $r$-band images with an artificial neural network branch for extracting spectral energy distri… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: 18 pages, 12 figures, 2 tables, Submitted to ApJS

  12. arXiv:2406.02069  [pdf, other

    cs.CL cs.AI

    PyramidKV: Dynamic KV Cache Compression based on Pyramidal Information Funneling

    Authors: Zefan Cai., Yichi Zhang, Bofei Gao, Yuliang Liu, Tianyu Liu, Keming Lu, Wayne Xiong, Yue Dong, Baobao Chang, Junjie Hu, Wen Xiao

    Abstract: In this study, we investigate whether attention-based information flow inside large language models (LLMs) is aggregated through noticeable patterns for long context processing. Our observations reveal that LLMs aggregate information through Pyramidal Information Funneling where attention is scattering widely in lower layers, progressively consolidating within specific contexts, and ultimately foc… ▽ More

    Submitted 16 June, 2024; v1 submitted 4 June, 2024; originally announced June 2024.

  13. arXiv:2406.01252  [pdf, other

    cs.CL cs.AI stat.ML

    Towards Scalable Automated Alignment of LLMs: A Survey

    Authors: Boxi Cao, Keming Lu, Xinyu Lu, Jiawei Chen, Mengjie Ren, Hao Xiang, Peilin Liu, Yaojie Lu, Ben He, Xianpei Han, Le Sun, Hongyu Lin, Bowen Yu

    Abstract: Alignment is the most critical step in building large language models (LLMs) that meet human needs. With the rapid development of LLMs gradually surpassing human capabilities, traditional alignment methods based on human-annotation are increasingly unable to meet the scalability demands. Therefore, there is an urgent need to explore new sources of automated alignment signals and technical approach… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

  14. arXiv:2405.17931  [pdf, other

    cs.CL cs.LG

    Online Merging Optimizers for Boosting Rewards and Mitigating Tax in Alignment

    Authors: Keming Lu, Bowen Yu, Fei Huang, Yang Fan, Runji Lin, Chang Zhou

    Abstract: Effectively aligning Large Language Models (LLMs) with human-centric values while preventing the degradation of abilities acquired through Pre-training and Supervised Fine-tuning (SFT) poses a central challenge in Reinforcement Learning from Human Feedback (RLHF). In this paper, we first discover that interpolating RLHF and SFT model parameters can adjust the trade-off between human preference and… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

  15. arXiv:2405.10639  [pdf, other

    math.CO

    Note on the union-closed sets conjecture and Reimer's average set size theorem

    Authors: Kengbo Lu, Abigail Raz

    Abstract: The Union-Closed Sets Conjecture, often attributed to Péter Frankl in 1979, remains an open problem in discrete mathematics. It posits that for any finite family of sets $S\neq\{\emptyset\}$, if the union of any two sets in the family is also in the family, then $\underline{\text{there must exist an element that belongs to at least half of the member sets}}$. We will refer to the underlined text a… ▽ More

    Submitted 29 May, 2024; v1 submitted 17 May, 2024; originally announced May 2024.

    MSC Class: 05D05

  16. arXiv:2405.05481  [pdf, other

    quant-ph

    Achieving millisecond coherence fluxonium through overlap Josephson junctions

    Authors: Fei Wang, Kannan Lu, Huijuan Zhan, Lu Ma, Feng Wu, Hantao Sun, Hao Deng, Yang Bai, Feng Bao, Xu Chang, Ran Gao, Xun Gao, Guicheng Gong, Lijuan Hu, Ruizi Hu, Honghong Ji, Xizheng Ma, Liyong Mao, Zhijun Song, Chengchun Tang, Hongcheng Wang, Tenghui Wang, Ziang Wang, Tian Xia, Hongxin Xu , et al. (10 additional authors not shown)

    Abstract: Fluxonium qubits are recognized for their high coherence times and high operation fidelities, attributed to their unique design incorporating over 100 Josephson junctions per superconducting loop. However, this complexity poses significant fabrication challenges, particularly in achieving high yield and junction uniformity with traditional methods. Here, we introduce an overlap process for Josephs… ▽ More

    Submitted 8 May, 2024; originally announced May 2024.

  17. arXiv:2405.01561  [pdf

    cs.SE cs.AI cs.CY

    Rapid Mobile App Development for Generative AI Agents on MIT App Inventor

    Authors: Jaida Gao, Calab Su, Etai Miller, Kevin Lu, Yu Meng

    Abstract: The evolution of Artificial Intelligence (AI) stands as a pivotal force shaping our society, finding applications across diverse domains such as education, sustainability, and safety. Leveraging AI within mobile applications makes it easily accessible to the public, catalyzing its transformative potential. In this paper, we present a methodology for the rapid development of AI agent applications u… ▽ More

    Submitted 31 March, 2024; originally announced May 2024.

    Journal ref: Journal of advances in information science and technology 2(3) 1-8, March 2024

  18. arXiv:2405.00626  [pdf, other

    stat.ME

    SARMA: Scalable Low-Rank High-Dimensional Autoregressive Moving Averages via Tensor Decomposition

    Authors: Feiqing Huang, Kexin Lu, Yao Zheng

    Abstract: Existing models for high-dimensional time series are overwhelmingly developed within the finite-order vector autoregressive (VAR) framework, whereas the more flexible vector autoregressive moving averages (VARMA) have been much less considered. This paper introduces a high-dimensional model for capturing VARMA dynamics, namely the Scalable ARMA (SARMA) model, by combining novel reparameterization… ▽ More

    Submitted 1 May, 2024; originally announced May 2024.

  19. arXiv:2404.13492  [pdf, other

    math.NA math-ph nlin.SI

    Discrete non-commutative hungry Toda lattice and its application in matrix computation

    Authors: Zheng Wang, Shi-Hao Li, Kang-Ya Lu, Jian-Qing Sun

    Abstract: In this paper, we plan to show an eigenvalue algorithm for block Hessenberg matrices by using the idea of non-commutative integrable systems and matrix-valued orthogonal polynomials. We introduce adjacent families of matrix-valued $θ$-deformed bi-orthogonal polynomials, and derive corresponding discrete non-commutative hungry Toda lattice from discrete spectral transformations for polynomials. It… ▽ More

    Submitted 20 April, 2024; originally announced April 2024.

    Comments: 24 pages, 2 figures. Comments are welcome

  20. arXiv:2403.13438  [pdf, other

    cs.CV

    SpatialPIN: Enhancing Spatial Reasoning Capabilities of Vision-Language Models through Prompting and Interacting 3D Priors

    Authors: Chenyang Ma, Kai Lu, Ta-Ying Cheng, Niki Trigoni, Andrew Markham

    Abstract: Current state-of-the-art spatial reasoning-enhanced VLMs are trained to excel at spatial visual question answering (VQA). However, we believe that higher-level 3D-aware tasks, such as articulating dynamic scene changes and motion planning, require a fundamental and explicit 3D understanding beyond current spatial VQA datasets. In this work, we present SpatialPIN, a framework designed to enhance th… ▽ More

    Submitted 6 June, 2024; v1 submitted 18 March, 2024; originally announced March 2024.

    Comments: Project Page: https://dannymcy.github.io/zeroshot_task_hallucination/

  21. arXiv:2403.09747  [pdf, other

    cs.CL cs.AI

    Re-Search for The Truth: Multi-round Retrieval-augmented Large Language Models are Strong Fake News Detectors

    Authors: Guanghua Li, Wensheng Lu, Wei Zhang, Defu Lian, Kezhong Lu, Rui Mao, Kai Shu, Hao Liao

    Abstract: The proliferation of fake news has had far-reaching implications on politics, the economy, and society at large. While Fake news detection methods have been employed to mitigate this issue, they primarily depend on two essential elements: the quality and relevance of the evidence, and the effectiveness of the verdict prediction mechanism. Traditional methods, which often source information from st… ▽ More

    Submitted 13 March, 2024; originally announced March 2024.

  22. arXiv:2403.08164  [pdf, other

    cs.SD cs.LG eess.AS

    EM-TTS: Efficiently Trained Low-Resource Mongolian Lightweight Text-to-Speech

    Authors: Ziqi Liang, Haoxiang Shi, Jiawei Wang, Keda Lu

    Abstract: Recently, deep learning-based Text-to-Speech (TTS) systems have achieved high-quality speech synthesis results. Recurrent neural networks have become a standard modeling technique for sequential data in TTS systems and are widely used. However, training a TTS model which includes RNN components requires powerful GPU performance and takes a long time. In contrast, CNN-based sequence synthesis techn… ▽ More

    Submitted 17 March, 2024; v1 submitted 12 March, 2024; originally announced March 2024.

    Comments: Accepted by the 27th IEEE International Conference on Computer Supported Cooperative Work in Design (IEEE CSCWD 2024). arXiv admin note: substantial text overlap with arXiv:2211.01948

  23. arXiv:2403.06946  [pdf, other

    cs.CV

    Split to Merge: Unifying Separated Modalities for Unsupervised Domain Adaptation

    Authors: Xinyao Li, Yuke Li, Zhekai Du, Fengling Li, Ke Lu, Jingjing Li

    Abstract: Large vision-language models (VLMs) like CLIP have demonstrated good zero-shot learning performance in the unsupervised domain adaptation task. Yet, most transfer approaches for VLMs focus on either the language or visual branches, overlooking the nuanced interplay between both modalities. In this work, we introduce a Unified Modality Separation (UniMoS) framework for unsupervised domain adaptatio… ▽ More

    Submitted 11 March, 2024; originally announced March 2024.

    Comments: CVPR 2024 camera ready

  24. arXiv:2403.05062  [pdf, other

    cs.CV

    Agile Multi-Source-Free Domain Adaptation

    Authors: Xinyao Li, Jingjing Li, Fengling Li, Lei Zhu, Ke Lu

    Abstract: Efficiently utilizing rich knowledge in pretrained models has become a critical topic in the era of large models. This work focuses on adaptively utilizing knowledge from multiple source-pretrained models to an unlabeled target domain without accessing the source data. Despite being a practically useful setting, existing methods require extensive parameter tuning over each source model, which is c… ▽ More

    Submitted 8 March, 2024; originally announced March 2024.

    Comments: Accepted to AAAI2024

  25. arXiv:2403.02899  [pdf, other

    cs.AI

    Domain-Agnostic Mutual Prompting for Unsupervised Domain Adaptation

    Authors: Zhekai Du, Xinyao Li, Fengling Li, Ke Lu, Lei Zhu, Jingjing Li

    Abstract: Conventional Unsupervised Domain Adaptation (UDA) strives to minimize distribution discrepancy between domains, which neglects to harness rich semantics from data and struggles to handle complex domain shifts. A promising technique is to leverage the knowledge of large-scale pre-trained vision-language models for more guided adaptation. Despite some endeavors, current methods often learn textual p… ▽ More

    Submitted 5 March, 2024; originally announced March 2024.

  26. arXiv:2402.11354  [pdf, other

    cs.LG cs.AI cs.CV cs.DB cs.DS

    Probabilistic Routing for Graph-Based Approximate Nearest Neighbor Search

    Authors: Kejing Lu, Chuan Xiao, Yoshiharu Ishikawa

    Abstract: Approximate nearest neighbor search (ANNS) in high-dimensional spaces is a pivotal challenge in the field of machine learning. In recent years, graph-based methods have emerged as the superior approach to ANNS, establishing a new state of the art. Although various optimizations for graph-based ANNS have been introduced, they predominantly rely on heuristic methods that lack formal theoretical back… ▽ More

    Submitted 10 July, 2024; v1 submitted 17 February, 2024; originally announced February 2024.

    Comments: Source code is available at https://github.com/ICML2024-code/PEOs

  27. arXiv:2402.07792  [pdf, other

    cs.LG cs.DC

    Empowering Federated Learning for Massive Models with NVIDIA FLARE

    Authors: Holger R. Roth, Ziyue Xu, Yuan-Ting Hsieh, Adithya Renduchintala, Isaac Yang, Zhihong Zhang, Yuhong Wen, Sean Yang, Kevin Lu, Kristopher Kersten, Camir Ricketts, Daguang Xu, Chester Chen, Yan Cheng, Andrew Feng

    Abstract: In the ever-evolving landscape of artificial intelligence (AI) and large language models (LLMs), handling and leveraging data effectively has become a critical challenge. Most state-of-the-art machine learning algorithms are data-centric. However, as the lifeblood of model performance, necessary data cannot always be centralized due to various factors such as privacy, regulation, geopolitics, copy… ▽ More

    Submitted 12 February, 2024; originally announced February 2024.

  28. arXiv:2401.17837  [pdf, ps, other

    eess.SY

    Safe Reinforcement Learning-Based Eco-Driving Control for Mixed Traffic Flows With Disturbances

    Authors: Ke Lu, Dongjun Li, Qun Wang, Kaidi Yang, Lin Zhao, Ziyou Song

    Abstract: This paper presents a safe learning-based eco-driving framework tailored for mixed traffic flows, which aims to optimize energy efficiency while guaranteeing safety during real-system operations. Even though reinforcement learning (RL) is capable of optimizing energy efficiency in intricate environments, it is challenged by safety requirements during the training process. The lack of safety guaran… ▽ More

    Submitted 31 January, 2024; originally announced January 2024.

  29. arXiv:2401.15967  [pdf, other

    cs.CR cs.SE

    INSTILLER: Towards Efficient and Realistic RTL Fuzzing

    Authors: Gen Zhang, Pengfei Wang, Tai Yue, Danjun Liu, Yubei Guo, Kai Lu

    Abstract: Bugs exist in hardware, such as CPU. Unlike software bugs, these hardware bugs need to be detected before deployment. Previous fuzzing work in CPU bug detection has several disadvantages, e.g., the length of RTL input instructions keeps growing, and longer inputs are ineffective for fuzzing. In this paper, we propose INSTILLER (Instruction Distiller), an RTL fuzzer based on ant colony optimization… ▽ More

    Submitted 29 January, 2024; originally announced January 2024.

    Journal ref: IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems 2024

  30. MobFuzz: Adaptive Multi-objective Optimization in Gray-box Fuzzing

    Authors: Gen Zhang, Pengfei Wang, Tai Yue, Xiangdong Kong, Shan Huang, Xu Zhou, Kai Lu

    Abstract: Coverage-guided gray-box fuzzing (CGF) is an efficient software testing technique. There are usually multiple objectives to optimize in CGF. However, existing CGF methods cannot successfully find the optimal values for multiple objectives simultaneously. In this paper, we propose a gray-box fuzzer for multi-objective optimization (MOO) called MobFuzz. We model the multi-objective optimization proc… ▽ More

    Submitted 29 January, 2024; originally announced January 2024.

    Journal ref: Network and Distributed Systems Security (NDSS) Symposium 2022

  31. arXiv:2401.15603  [pdf, other

    cs.LG cs.SI

    Improving Expressive Power of Spectral Graph Neural Networks with Eigenvalue Correction

    Authors: Kangkang Lu, Yanhua Yu, Hao Fei, Xuan Li, Zixuan Yang, Zirui Guo, Meiyu Liang, Mengran Yin, Tat-Seng Chua

    Abstract: In recent years, spectral graph neural networks, characterized by polynomial filters, have garnered increasing attention and have achieved remarkable performance in tasks such as node classification. These models typically assume that eigenvalues for the normalized Laplacian matrix are distinct from each other, thus expecting a polynomial filter to have a high fitting ability. However, this paper… ▽ More

    Submitted 18 March, 2024; v1 submitted 28 January, 2024; originally announced January 2024.

    Comments: Accepted by AAAI-24

  32. Color Maker: a Mixed-Initiative Approach to Creating Accessible Color Maps

    Authors: Amey Salvi, Kecheng Lu, Michael E. Papka, Yunhai Wang, Khairi Reda

    Abstract: Quantitative data is frequently represented using color, yet designing effective color mappings is a challenging task, requiring one to balance perceptual standards with personal color preference. Current design tools either overwhelm novices with complexity or offer limited customization options. We present ColorMaker, a mixed-initiative approach for creating colormaps. ColorMaker combines fluid… ▽ More

    Submitted 26 January, 2024; originally announced January 2024.

    Comments: To appear at the ACM CHI '24 Conference on Human Factors in Computing Systems

  33. arXiv:2401.14776  [pdf, other

    math.OC

    Online Distributed Optimization with Clipped Stochastic Gradients: High Probability Bound of Regrets

    Authors: Yuchen Yang, Kaihong Lu, Long Wang

    Abstract: In this paper, the problem of distributed optimization is studied via a network of agents. Each agent only has access to a stochastic gradient of its own objective function in the previous time, and can communicate with its neighbors via a network. To handle this problem, an online distributed clipped stochastic gradient descent algorithm is proposed. Dynamic regrets are used to capture the perfor… ▽ More

    Submitted 26 January, 2024; originally announced January 2024.

  34. arXiv:2401.13714  [pdf, other

    cs.CV cs.LG

    Value-Driven Mixed-Precision Quantization for Patch-Based Inference on Microcontrollers

    Authors: Wei Tao, Shenglin He, Kai Lu, Xiaoyang Qu, Guokuan Li, Jiguang Wan, Jianzong Wang, Jing Xiao

    Abstract: Deploying neural networks on microcontroller units (MCUs) presents substantial challenges due to their constrained computation and memory resources. Previous researches have explored patch-based inference as a strategy to conserve memory without sacrificing model accuracy. However, this technique suffers from severe redundant computation overhead, leading to a substantial increase in execution lat… ▽ More

    Submitted 23 January, 2024; originally announced January 2024.

    Comments: Accepted by the 27th Design, Automation and Test in Europe Conference (DATE 2024)

  35. arXiv:2401.12474  [pdf, other

    cs.CL cs.LG

    Large Language Models are Superpositions of All Characters: Attaining Arbitrary Role-play via Self-Alignment

    Authors: Keming Lu, Bowen Yu, Chang Zhou, Jingren Zhou

    Abstract: Considerable efforts have been invested in augmenting the role-playing proficiency of open-source large language models (LLMs) by emulating proprietary counterparts. Nevertheless, we posit that LLMs inherently harbor role-play capabilities, owing to the extensive knowledge of characters and potential dialogues ingrained in their vast training corpora. Thus, in this study, we introduce Ditto, a sel… ▽ More

    Submitted 22 January, 2024; originally announced January 2024.

  36. Correcting the Contamination of Second-order Spectra: Improving Hα Measurements in Reverberation Mapping Campaigns

    Authors: Wen-Zhe Xi, Kai-Xing Lu, Hai-Cheng Feng, Sha-Sha Li, Jin-Ming Bai, Rui-Lei Zhou, Hong-Tao Liu, Jian-Guo Wang

    Abstract: Long-term spectroscopic monitoring campaigns on active galactic nuclei (AGNs) provide a wealth of information about its interior structure and kinematics. However, a number of the observations suffer from the contamination of second-order spectra (SOS) which will introduce some undesirable uncertainties at the red side of the spectra. In this paper, we test the effect of SOS and propose a method t… ▽ More

    Submitted 22 January, 2024; originally announced January 2024.

    Journal ref: Res.Astron.Astrophys.23(2023)125021

  37. arXiv:2401.11476  [pdf, ps, other

    math.GR

    Finite solvable tidy Groups whose orders are divisible by two primes

    Authors: Nicolas F. Beike, Rachel Carleton, David G. Costanzo, Colin Heath, Mark L. Lewis, Kaiwen Lu, Jamie D. Pearce

    Abstract: In this paper, we investigate finite solvable tidy groups. We classify the tidy $\{ p, q \}$-groups. Combining this with a previous result, we are able to characterize the finite tidy solvable groups. Using this characterization, we bound the Fitting height of finite tidy solvable groups and we prove that the quotients of finite tidy solvable groups are tidy.

    Submitted 21 January, 2024; originally announced January 2024.

    MSC Class: Primary: 20D10 Secondary: 20D20

  38. arXiv:2401.00273  [pdf, ps, other

    eess.AS cs.CL

    Investigating Zero-Shot Generalizability on Mandarin-English Code-Switched ASR and Speech-to-text Translation of Recent Foundation Models with Self-Supervision and Weak Supervision

    Authors: Chih-Kai Yang, Kuan-Po Huang, Ke-Han Lu, Chun-Yi Kuan, Chi-Yuan Hsiao, Hung-yi Lee

    Abstract: This work evaluated several cutting-edge large-scale foundation models based on self-supervision or weak supervision, including SeamlessM4T, SeamlessM4T v2, and Whisper-large-v3, on three code-switched corpora. We found that self-supervised models can achieve performances close to the supervised model, indicating the effectiveness of multilingual self-supervised pre-training. We also observed that… ▽ More

    Submitted 30 December, 2023; originally announced January 2024.

    Comments: Submitted to ICASSP 2024 Self-supervision in Audio, Speech and Beyond workshop

  39. arXiv:2312.08610  [pdf, other

    eess.AS cs.SD

    A computationally efficient semi-blind source separation based approach for nonlinear echo cancellation based on an element-wise iterative source steering

    Authors: Kunxing Lu, Xianrui Wang, Tetsuya Ueda, Shoji Makino, Jingdong Chen

    Abstract: While the semi-blind source separation-based acoustic echo cancellation (SBSS-AEC) has received much research attention due to its promising performance during double-talk compared to the traditional adaptive algorithms, it suffers from system latency and nonlinear distortions. To circumvent these drawbacks, the recently developed ideas on convolutive transfer function (CTF) approximation and nonl… ▽ More

    Submitted 13 December, 2023; originally announced December 2023.

  40. arXiv:2312.00346  [pdf, other

    stat.ME

    Supervised Factor Modeling for High-Dimensional Linear Time Series

    Authors: Feiqing Huang, Kexin Lu, Guodong Li

    Abstract: Motivated by Tucker tensor decomposition, this paper imposes low-rank structures to the column and row spaces of coefficient matrices in a multivariate infinite-order vector autoregression (VAR), which leads to a supervised factor model with two factor modelings being conducted to responses and predictors simultaneously. Interestingly, the stationarity condition implies an intrinsic weak group spa… ▽ More

    Submitted 30 November, 2023; originally announced December 2023.

  41. arXiv:2311.16373  [pdf, ps, other

    math.RT math-ph math.QA

    Twisted super Yangians of type AIII and their representations

    Authors: Kang Lu

    Abstract: We study the super analogue of the Molev-Ragoucy reflection algebras, which we call twisted super Yangians of type AIII, and classify their finite-dimensional irreducible representations. These superalgebras are coideal subalgebras of the super Yangian $\mathscr{Y}(\mathfrak{gl}_{m|n})$ and are associated with symmetric pairs of type AIII in Cartan's classification. We establish the Schur-Weyl typ… ▽ More

    Submitted 27 November, 2023; originally announced November 2023.

    Comments: 42 pages. This is a preliminary version

  42. arXiv:2311.12058  [pdf, other

    cs.CV

    FlashOcc: Fast and Memory-Efficient Occupancy Prediction via Channel-to-Height Plugin

    Authors: Zichen Yu, Changyong Shu, Jiajun Deng, Kangjie Lu, Zongdai Liu, Jiangyong Yu, Dawei Yang, Hui Li, Yan Chen

    Abstract: Given the capability of mitigating the long-tail deficiencies and intricate-shaped absence prevalent in 3D object detection, occupancy prediction has become a pivotal component in autonomous driving systems. However, the procession of three-dimensional voxel-level representations inevitably introduces large overhead in both memory and computation, obstructing the deployment of to-date occupancy pr… ▽ More

    Submitted 18 November, 2023; originally announced November 2023.

    Comments: 10 pages, 4 figures

  43. arXiv:2311.08981  [pdf, other

    cs.CL

    Speculative Contrastive Decoding

    Authors: Hongyi Yuan, Keming Lu, Fei Huang, Zheng Yuan, Chang Zhou

    Abstract: Large language models~(LLMs) exhibit exceptional performance in language tasks, yet their auto-regressive inference is limited due to high computational requirements and is sub-optimal due to the exposure bias. Inspired by speculative decoding and contrastive decoding, we introduce Speculative Contrastive Decoding~(SCD), a straightforward yet powerful decoding approach that leverages predictions f… ▽ More

    Submitted 13 March, 2024; v1 submitted 15 November, 2023; originally announced November 2023.

    Comments: Revised version

  44. arXiv:2311.08692  [pdf, other

    cs.CL cs.LG

    Routing to the Expert: Efficient Reward-guided Ensemble of Large Language Models

    Authors: Keming Lu, Hongyi Yuan, Runji Lin, Junyang Lin, Zheng Yuan, Chang Zhou, Jingren Zhou

    Abstract: The complementary potential of Large Language Models (LLM) assumes off-the-shelf LLMs have heterogeneous expertise in a wide range of domains and tasks so that an ensemble of LLMs can achieve consistently better performance. Existing ensemble methods for LLMs mainly focus on reward model ranking of outputs, leading to significant computation overhead. To combat this issue, we revisit the complemen… ▽ More

    Submitted 14 November, 2023; originally announced November 2023.

  45. arXiv:2311.08182  [pdf, other

    cs.CL cs.LG

    Self-Evolved Diverse Data Sampling for Efficient Instruction Tuning

    Authors: Shengguang Wu, Keming Lu, Benfeng Xu, Junyang Lin, Qi Su, Chang Zhou

    Abstract: Enhancing the instruction-following ability of Large Language Models (LLMs) primarily demands substantial instruction-tuning datasets. However, the sheer volume of these imposes a considerable computational burden and annotation cost. To investigate a label-efficient instruction tuning method that allows the model itself to actively sample subsets that are equally or even more effective, we introd… ▽ More

    Submitted 14 November, 2023; originally announced November 2023.

  46. arXiv:2311.06530  [pdf, other

    cs.SE cs.AI cs.CL cs.CR

    Exploring ChatGPT's Capabilities on Vulnerability Management

    Authors: Peiyu Liu, Junming Liu, Lirong Fu, Kangjie Lu, Yifan Xia, Xuhong Zhang, Wenzhi Chen, Haiqin Weng, Shouling Ji, Wenhai Wang

    Abstract: Recently, ChatGPT has attracted great attention from the code analysis domain. Prior works show that ChatGPT has the capabilities of processing foundational code analysis tasks, such as abstract syntax tree generation, which indicates the potential of using ChatGPT to comprehend code syntax and static behaviors. However, it is unclear whether ChatGPT can complete more complicated real-world vulner… ▽ More

    Submitted 20 June, 2024; v1 submitted 11 November, 2023; originally announced November 2023.

    Comments: Accepted by USENIX Security 2024

  47. arXiv:2311.05282  [pdf, other

    physics.optics eess.SP

    Empowering high-dimensional optical fiber communications with integrated photonic processors

    Authors: Kaihang Lu, Zengqi Chen, Hao Chen, Wu Zhou, Zunyue Zhang, Hon Ki Tsang, Yeyu Tong

    Abstract: Mode division multiplexing (MDM) in optical fibers enables multichannel capabilities for various applications, including data transmission, quantum networks, imaging, and sensing. However, MDM optical fiber systems, usually necessities bulk-optics approaches for launching different orthogonal fiber modes into the multimode optical fiber, and multiple-input multiple-output digital electronic signal… ▽ More

    Submitted 9 November, 2023; originally announced November 2023.

  48. arXiv:2310.18999  [pdf, other

    cs.CV

    DynPoint: Dynamic Neural Point For View Synthesis

    Authors: Kaichen Zhou, Jia-Xing Zhong, Sangyun Shin, Kai Lu, Yiyuan Yang, Andrew Markham, Niki Trigoni

    Abstract: The introduction of neural radiance fields has greatly improved the effectiveness of view synthesis for monocular videos. However, existing algorithms face difficulties when dealing with uncontrolled or lengthy scenarios, and require extensive training time specific to each new scenario. To tackle these limitations, we propose DynPoint, an algorithm designed to facilitate the rapid synthesis of no… ▽ More

    Submitted 18 January, 2024; v1 submitted 29 October, 2023; originally announced October 2023.

  49. arXiv:2310.14310  [pdf, other

    astro-ph.HE

    Multi-band analyses of the bright GRB 230812B and the associated SN2023pel

    Authors: T. Hussenot-Desenonges, T. Wouters, N. Guessoum, I. Abdi, A. Abulwfa, C. Adami, J. F. Agüí Fernández, T. Ahumada, V. Aivazyan, D. Akl, S. Anand, C. M. Andrade, S. Antier, S. A. Ata, P. D'Avanzo, Y. A. Azzam, A. Baransky, S. Basa, M. Blazek, P. Bendjoya, S. Beradze, P. Boumis, M. Bremer, R. Brivio, V. Buat , et al. (87 additional authors not shown)

    Abstract: GRB~230812B is a bright and relatively nearby ($z =0.36$) long gamma-ray burst (GRB) that has generated significant interest in the community and has thus been observed over the entire electromagnetic spectrum. We report over 80 observations in X-ray, ultraviolet, optical, infrared, and sub-millimeter bands from the GRANDMA (Global Rapid Advanced Network for Multi-messenger Addicts) network of obs… ▽ More

    Submitted 17 February, 2024; v1 submitted 22 October, 2023; originally announced October 2023.

  50. arXiv:2310.12501  [pdf, other

    astro-ph.HE hep-ph

    Inelastic Scattering of Dark Matter with Heavy Cosmic Rays

    Authors: Keyu Lu, Yue-Lin Sming Tsai, Qiang Yuan, Le Zhang

    Abstract: We investigate the impact of inelastic collisions between dark matter (DM) and heavy cosmic ray (CR) nuclei on CR propagation. We approximate the fragmentation cross-sections for DM-CR collisions using collider-measured proton-nuclei scattering cross-sections, allowing us to assess how these collisions affect the spectra of CR Boron and Carbon. We derive new CR spectra from DM-CR collisions by inc… ▽ More

    Submitted 7 June, 2024; v1 submitted 19 October, 2023; originally announced October 2023.

    Comments: 19 pages, 8 figures

    Journal ref: Research in Astronomy and Astrophysics 24, 065007 (2024)