Skip to main content

Showing 1–50 of 165 results for author: Song, Q

  1. arXiv:2407.11626  [pdf

    cs.LG cs.NE

    Dynamic Dimension Wrapping (DDW) Algorithm: A Novel Approach for Efficient Cross-Dimensional Search in Dynamic Multidimensional Spaces

    Authors: Dongnan Jin, Yali Liu, Qiuzhi Song, Xunju Ma, Yue Liu, Dehao Wu

    Abstract: In the real world, as the complexity of optimization problems continues to increase, there is an urgent need to research more efficient optimization methods. Current optimization algorithms excel in solving problems with a fixed number of dimensions. However, their efficiency in searching dynamic multi-dimensional spaces is unsatisfactory. In response to the challenge of cross-dimensional search i… ▽ More

    Submitted 16 July, 2024; originally announced July 2024.

  2. arXiv:2407.07735  [pdf, other

    cs.CV

    Protecting NeRFs' Copyright via Plug-And-Play Watermarking Base Model

    Authors: Qi Song, Ziyuan Luo, Ka Chun Cheung, Simon See, Renjie Wan

    Abstract: Neural Radiance Fields (NeRFs) have become a key method for 3D scene representation. With the rising prominence and influence of NeRF, safeguarding its intellectual property has become increasingly important. In this paper, we propose \textbf{NeRFProtector}, which adopts a plug-and-play strategy to protect NeRF's copyright during its creation. NeRFProtector utilizes a pre-trained watermarking base… ▽ More

    Submitted 10 July, 2024; originally announced July 2024.

    Comments: Accepted by ECCV2024

  3. arXiv:2407.06935  [pdf, other

    cs.LG stat.CO stat.ML

    Bayesian Federated Learning with Hamiltonian Monte Carlo: Algorithm and Theory

    Authors: Jiajun Liang, Qian Zhang, Wei Deng, Qifan Song, Guang Lin

    Abstract: This work introduces a novel and efficient Bayesian federated learning algorithm, namely, the Federated Averaging stochastic Hamiltonian Monte Carlo (FA-HMC), for parameter estimation and uncertainty quantification. We establish rigorous convergence guarantees of FA-HMC on non-iid distributed data sets, under the strong convexity and Hessian smoothness assumptions. Our analysis investigates the ef… ▽ More

    Submitted 9 July, 2024; originally announced July 2024.

  4. arXiv:2406.11707  [pdf, other

    cs.CR cs.CV cs.LG

    A First Physical-World Trajectory Prediction Attack via LiDAR-induced Deceptions in Autonomous Driving

    Authors: Yang Lou, Yi Zhu, Qun Song, Rui Tan, Chunming Qiao, Wei-Bin Lee, Jianping Wang

    Abstract: Trajectory prediction forecasts nearby agents' moves based on their historical trajectories. Accurate trajectory prediction is crucial for autonomous vehicles. Existing attacks compromise the prediction model of a victim AV by directly manipulating the historical trajectory of an attacker AV, which has limited real-world applicability. This paper, for the first time, explores an indirect attack ap… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: In Proceedings of the 33rd USENIX Security Symposium 2024

  5. arXiv:2406.06559  [pdf, other

    cs.CL cs.AI cs.LG

    Harnessing Business and Media Insights with Large Language Models

    Authors: Yujia Bao, Ankit Parag Shah, Neeru Narang, Jonathan Rivers, Rajeev Maksey, Lan Guan, Louise N. Barrere, Shelley Evenson, Rahul Basole, Connie Miao, Ankit Mehta, Fabien Boulay, Su Min Park, Natalie E. Pearson, Eldhose Joy, Tiger He, Sumiran Thakur, Koustav Ghosal, Josh On, Phoebe Morrison, Tim Major, Eva Siqi Wang, Gina Escobar, Jiaheng Wei, Tharindu Cyril Weerasooriya , et al. (8 additional authors not shown)

    Abstract: This paper introduces Fortune Analytics Language Model (FALM). FALM empowers users with direct access to comprehensive business analysis, including market trends, company performance metrics, and expert insights. Unlike generic LLMs, FALM leverages a curated knowledge base built from professional journalism, enabling it to deliver precise and in-depth answers to intricate business questions. Users… ▽ More

    Submitted 2 June, 2024; originally announced June 2024.

  6. arXiv:2404.11155  [pdf, other

    cs.CV

    HybriMap: Hybrid Clues Utilization for Effective Vectorized HD Map Construction

    Authors: Chi Zhang, Qi Song, Feifei Li, Yongquan Chen, Rui Huang

    Abstract: Constructing vectorized high-definition maps from surround-view cameras has garnered significant attention in recent years. However, the commonly employed multi-stage sequential workflow in prevailing approaches often leads to the loss of early-stage information, particularly in perspective-view features. Usually, such loss is observed as an instance missing or shape mismatching in the final birds… ▽ More

    Submitted 17 April, 2024; originally announced April 2024.

  7. arXiv:2403.19436  [pdf, other

    cs.HC

    "At the end of the day, I am accountable": Gig Workers' Self-Tracking for Multi-Dimensional Accountability Management

    Authors: Rie Helene Hernandez, Qiurong Song, Yubo Kou, Xinning Gui

    Abstract: Tracking is inherent in and central to the gig economy. Platforms track gig workers' performance through metrics such as acceptance rate and punctuality, while gig workers themselves engage in self-tracking. Although prior research has extensively examined how gig platforms track workers through metrics -- with some studies briefly acknowledging the phenomenon of self-tracking among workers -- the… ▽ More

    Submitted 28 March, 2024; originally announced March 2024.

    Comments: Accepted to CHI 2024

  8. arXiv:2403.10133  [pdf, other

    cs.CV

    E4C: Enhance Editability for Text-Based Image Editing by Harnessing Efficient CLIP Guidance

    Authors: Tianrui Huang, Pu Cao, Lu Yang, Chun Liu, Mengjie Hu, Zhiwei Liu, Qing Song

    Abstract: Diffusion-based image editing is a composite process of preserving the source image content and generating new content or applying modifications. While current editing approaches have made improvements under text guidance, most of them have only focused on preserving the information of the input image, disregarding the importance of editability and alignment to the target prompt. In this paper, we… ▽ More

    Submitted 15 March, 2024; originally announced March 2024.

  9. arXiv:2403.04279  [pdf, other

    cs.CV

    Controllable Generation with Text-to-Image Diffusion Models: A Survey

    Authors: Pu Cao, Feng Zhou, Qing Song, Lu Yang

    Abstract: In the rapidly advancing realm of visual generation, diffusion models have revolutionized the landscape, marking a significant shift in capabilities with their impressive text-guided generative functions. However, relying solely on text for conditioning these models does not fully cater to the varied and complex requirements of different applications and scenarios. Acknowledging this shortfall, a… ▽ More

    Submitted 7 March, 2024; originally announced March 2024.

    Comments: A collection of resources on controllable generation with text-to-image diffusion models: https://github.com/PRIV-Creation/Awesome-Controllable-T2I-Diffusion-Models

  10. arXiv:2403.03967  [pdf, other

    cs.LG cs.CR stat.ML

    Effect of Ambient-Intrinsic Dimension Gap on Adversarial Vulnerability

    Authors: Rajdeep Haldar, Yue Xing, Qifan Song

    Abstract: The existence of adversarial attacks on machine learning models imperceptible to a human is still quite a mystery from a theoretical perspective. In this work, we introduce two notions of adversarial attacks: natural or on-manifold attacks, which are perceptible by a human/oracle, and unnatural or off-manifold attacks, which are not. We argue that the existence of the off-manifold attacks is a nat… ▽ More

    Submitted 23 March, 2024; v1 submitted 6 March, 2024; originally announced March 2024.

    Comments: AISTATS 2024

  11. arXiv:2402.18811  [pdf, other

    cs.CV

    BFRFormer: Transformer-based generator for Real-World Blind Face Restoration

    Authors: Guojing Ge, Qi Song, Guibo Zhu, Yuting Zhang, Jinglu Chen, Miao Xin, Ming Tang, Jinqiao Wang

    Abstract: Blind face restoration is a challenging task due to the unknown and complex degradation. Although face prior-based methods and reference-based methods have recently demonstrated high-quality results, the restored images tend to contain over-smoothed results and lose identity-preserved details when the degradation is severe. It is observed that this is attributed to short-range dependencies, the in… ▽ More

    Submitted 28 February, 2024; originally announced February 2024.

    Comments: Accepted by ICASSP 2024

  12. arXiv:2402.14891  [pdf, other

    cs.CL cs.AI

    LLMBind: A Unified Modality-Task Integration Framework

    Authors: Bin Zhu, Munan Ning, Peng Jin, Bin Lin, Jinfa Huang, Qi Song, Junwu Zhang, Zhenyu Tang, Mingjun Pan, Xing Zhou, Li Yuan

    Abstract: In the multi-modal domain, the dependence of various models on specific input formats leads to user confusion and hinders progress. To address this challenge, we introduce \textbf{LLMBind}, a novel framework designed to unify a diverse array of multi-modal tasks. By harnessing a Mixture-of-Experts (MoE) Large Language Model (LLM), LLMBind processes multi-modal inputs and generates task-specific to… ▽ More

    Submitted 18 April, 2024; v1 submitted 22 February, 2024; originally announced February 2024.

  13. arXiv:2402.13435  [pdf, other

    cs.IR cs.LG

    Learning to Retrieve for Job Matching

    Authors: Jianqiang Shen, Yuchin Juan, Shaobo Zhang, Ping Liu, Wen Pu, Sriram Vasudevan, Qingquan Song, Fedor Borisyuk, Kay Qianqi Shen, Haichao Wei, Yunxiang Ren, Yeou S. Chiou, Sicong Kuang, Yuan Yin, Ben Zheng, Muchen Wu, Shaghayegh Gharghabi, Xiaoqing Wang, Huichao Xue, Qi Guo, Daniel Hewlett, Luke Simon, Liangjie Hong, Wenjing Zhang

    Abstract: Web-scale search systems typically tackle the scalability challenge with a two-step paradigm: retrieval and ranking. The retrieval step, also known as candidate selection, often involves extracting standardized entities, creating an inverted index, and performing term matching for retrieval. Such traditional methods require manual and time-consuming development of query models. In this paper, we d… ▽ More

    Submitted 20 February, 2024; originally announced February 2024.

  14. arXiv:2402.06859  [pdf, other

    cs.LG cs.AI cs.IR

    LiRank: Industrial Large Scale Ranking Models at LinkedIn

    Authors: Fedor Borisyuk, Mingzhou Zhou, Qingquan Song, Siyu Zhu, Birjodh Tiwana, Ganesh Parameswaran, Siddharth Dangi, Lars Hertel, Qiang Xiao, Xiaochen Hou, Yunbo Ouyang, Aman Gupta, Sheallika Singh, Dan Liu, Hailing Cheng, Lei Le, Jonathan Hung, Sathiya Keerthi, Ruoyan Wang, Fengyu Zhang, Mohit Kothari, Chen Zhu, Daqi Sun, Yun Dai, Xun Luan , et al. (9 additional authors not shown)

    Abstract: We present LiRank, a large-scale ranking framework at LinkedIn that brings to production state-of-the-art modeling architectures and optimization methods. We unveil several modeling improvements, including Residual DCN, which adds attention and residual connections to the famous DCNv2 architecture. We share insights into combining and tuning SOTA architectures to create a unified model, including… ▽ More

    Submitted 9 February, 2024; originally announced February 2024.

    ACM Class: H.3.3

  15. arXiv:2402.00743  [pdf, other

    cs.LG cs.CL stat.ML

    Theoretical Understanding of In-Context Learning in Shallow Transformers with Unstructured Data

    Authors: Yue Xing, Xiaofeng Lin, Chenheng Xu, Namjoon Suh, Qifan Song, Guang Cheng

    Abstract: Large language models (LLMs) are powerful models that can learn concepts at the inference stage via in-context learning (ICL). While theoretical studies, e.g., \cite{zhang2023trained}, attempt to explain the mechanism of ICL, they assume the input $x_i$ and the output $y_i$ of each demonstration example are in the same token (i.e., structured data). However, in real practice, the examples are usua… ▽ More

    Submitted 18 June, 2024; v1 submitted 1 February, 2024; originally announced February 2024.

  16. arXiv:2401.15248  [pdf, other

    cs.LG stat.ML

    Better Representations via Adversarial Training in Pre-Training: A Theoretical Perspective

    Authors: Yue Xing, Xiaofeng Lin, Qifan Song, Yi Xu, Belinda Zeng, Guang Cheng

    Abstract: Pre-training is known to generate universal representations for downstream tasks in large-scale deep learning such as large language models. Existing literature, e.g., \cite{kim2020adversarial}, empirically observe that the downstream tasks can inherit the adversarial robustness of the pre-trained model. We provide theoretical justifications for this robustness inheritance phenomenon. Our theoreti… ▽ More

    Submitted 26 January, 2024; originally announced January 2024.

    Comments: To appear in AISTATS2024

  17. arXiv:2401.09785  [pdf, other

    cs.CL

    Instant Answering in E-Commerce Buyer-Seller Messaging using Message-to-Question Reformulation

    Authors: Besnik Fetahu, Tejas Mehta, Qun Song, Nikhita Vedula, Oleg Rokhlenko, Shervin Malmasi

    Abstract: E-commerce customers frequently seek detailed product information for purchase decisions, commonly contacting sellers directly with extended queries. This manual response requirement imposes additional costs and disrupts buyer's shopping experience with response time fluctuations ranging from hours to days. We seek to automate buyer inquiries to sellers in a leading e-commerce store using a domain… ▽ More

    Submitted 30 January, 2024; v1 submitted 18 January, 2024; originally announced January 2024.

    Comments: Accepted at ECIR 2024

  18. arXiv:2401.09686  [pdf, other

    eess.AS cs.SD

    An Empirical Study on the Impact of Positional Encoding in Transformer-based Monaural Speech Enhancement

    Authors: Qiquan Zhang, Meng Ge, Hongxu Zhu, Eliathamby Ambikairajah, Qi Song, Zhaoheng Ni, Haizhou Li

    Abstract: Transformer architecture has enabled recent progress in speech enhancement. Since Transformers are position-agostic, positional encoding is the de facto standard component used to enable Transformers to distinguish the order of elements in a sequence. However, it remains unclear how positional encoding exactly impacts speech enhancement based on Transformer architectures. In this paper, we perform… ▽ More

    Submitted 13 February, 2024; v1 submitted 17 January, 2024; originally announced January 2024.

    Comments: Accepted by ICASSP 2024

  19. arXiv:2401.08067  [pdf

    cs.HC

    TrajVis: a visual clinical decision support system to translate artificial intelligence trajectory models in the precision management of chronic kidney disease

    Authors: Zuotian Li, Xiang Liu, Ziyang Tang, Pengyue Zhang, Nanxin Jin, Michael Eadon, Qianqian Song, Yingjie Chen, Jing Su

    Abstract: Objective: Our objective is to develop and validate TrajVis, an interactive tool that assists clinicians in using artificial intelligence (AI) models to leverage patients' longitudinal electronic medical records (EMR) for personalized precision management of chronic disease progression. Methods: We first perform requirement analysis with clinicians and data scientists to determine the visual analy… ▽ More

    Submitted 15 January, 2024; originally announced January 2024.

  20. arXiv:2401.06431  [pdf, other

    cs.CL cs.AI

    Human-AI Collaborative Essay Scoring: A Dual-Process Framework with LLMs

    Authors: Changrong Xiao, Wenxing Ma, Qingping Song, Sean Xin Xu, Kunpeng Zhang, Yufang Wang, Qi Fu

    Abstract: Receiving timely and personalized feedback is essential for second-language learners, especially when human instructors are unavailable. This study explores the effectiveness of Large Language Models (LLMs), including both proprietary and open-source models, for Automated Essay Scoring (AES). Through extensive experiments with public and private datasets, we find that while LLMs do not surpass con… ▽ More

    Submitted 14 June, 2024; v1 submitted 12 January, 2024; originally announced January 2024.

  21. arXiv:2401.04956  [pdf, other

    cs.CV cs.CR

    EmMixformer: Mix transformer for eye movement recognition

    Authors: Huafeng Qin, Hongyu Zhu, Xin Jin, Qun Song, Mounim A. El-Yacoubi, Xinbo Gao

    Abstract: Eye movement (EM) is a new highly secure biometric behavioral modality that has received increasing attention in recent years. Although deep neural networks, such as convolutional neural network (CNN), have recently achieved promising performance, current solutions fail to capture local and global temporal dependencies within eye movement data. To overcome this problem, we propose in this paper a… ▽ More

    Submitted 9 May, 2024; v1 submitted 10 January, 2024; originally announced January 2024.

  22. arXiv:2401.04429  [pdf, other

    cs.AI cs.MA

    i-Rebalance: Personalized Vehicle Repositioning for Supply Demand Balance

    Authors: Haoyang Chen, Peiyan Sun, Qiyuan Song, Wanyuan Wang, Weiwei Wu, Wencan Zhang, Guanyu Gao, Yan Lyu

    Abstract: Ride-hailing platforms have been facing the challenge of balancing demand and supply. Existing vehicle reposition techniques often treat drivers as homogeneous agents and relocate them deterministically, assuming compliance with the reposition. In this paper, we consider a more realistic and driver-centric scenario where drivers have unique cruising preferences and can decide whether to take the r… ▽ More

    Submitted 2 April, 2024; v1 submitted 9 January, 2024; originally announced January 2024.

  23. arXiv:2401.04044  [pdf, other

    cs.CL

    FFSplit: Split Feed-Forward Network For Optimizing Accuracy-Efficiency Trade-off in Language Model Inference

    Authors: Zirui Liu, Qingquan Song, Qiang Charles Xiao, Sathiya Keerthi Selvaraj, Rahul Mazumder, Aman Gupta, Xia Hu

    Abstract: The large number of parameters in Pretrained Language Models enhance their performance, but also make them resource-intensive, making it challenging to deploy them on commodity hardware like a single GPU. Due to the memory and power limitations of these devices, model compression techniques are often used to decrease both the model's size and its inference latency. This usually results in a trade-… ▽ More

    Submitted 8 January, 2024; originally announced January 2024.

  24. arXiv:2312.08195  [pdf, other

    cs.CV cs.AI cs.MM

    Concept-centric Personalization with Large-scale Diffusion Priors

    Authors: Pu Cao, Lu Yang, Feng Zhou, Tianrui Huang, Qing Song

    Abstract: Despite large-scale diffusion models being highly capable of generating diverse open-world content, they still struggle to match the photorealism and fidelity of concept-specific generators. In this work, we present the task of customizing large-scale diffusion priors for specific concepts as concept-centric personalization. Our goal is to generate high-quality concept-centric images while maintai… ▽ More

    Submitted 13 December, 2023; originally announced December 2023.

  25. arXiv:2312.02944  [pdf, other

    cs.RO eess.SY

    An alternating peak-optimization method for optimal trajectory generation of quadrotor drones

    Authors: Wytze A. B. de Vries, Ming Li, Qirui Song, Zhiyong Sun

    Abstract: In this paper, we propose an alternating optimization method to address a time-optimal trajectory generation problem. Different from the existing solutions, our approach introduces a new formulation that minimizes the overall trajectory running time while maintaining the polynomial smoothness constraints and incorporating hard limits on motion derivatives to ensure feasibility. To address this pro… ▽ More

    Submitted 5 December, 2023; originally announced December 2023.

  26. arXiv:2311.16576  [pdf, other

    cs.DC

    Wireless Powered Metaverse: Joint Task Scheduling and Trajectory Design for Multi-Devices and Multi-UAVs

    Authors: Xiaojie Wang, Jiameng Li, Zhaolong Ning, Qingyang Song, Lei Guo, Abbas Jamalipour

    Abstract: To support the running of human-centric metaverse applications on mobile devices, Unmanned Aerial Vehicle (UAV)-assisted Wireless Powered Mobile Edge Computing (WPMEC) is promising to compensate for limited computational capabilities and energy supplies of mobile devices. The high-speed computational processing demands and significant energy consumption of metaverse applications require joint reso… ▽ More

    Submitted 28 November, 2023; originally announced November 2023.

  27. arXiv:2311.05866  [pdf, other

    stat.ML cs.LG

    Fair Supervised Learning with A Simple Random Sampler of Sensitive Attributes

    Authors: Jinwon Sohn, Qifan Song, Guang Lin

    Abstract: As the data-driven decision process becomes dominating for industrial applications, fairness-aware machine learning arouses great attention in various areas. This work proposes fairness penalties learned by neural networks with a simple random sampler of sensitive attributes for non-discriminatory supervised learning. In contrast to many existing works that critically rely on the discreteness of s… ▽ More

    Submitted 9 March, 2024; v1 submitted 9 November, 2023; originally announced November 2023.

  28. arXiv:2310.16323  [pdf, other

    stat.ML cs.LG

    Personalized Federated X -armed Bandit

    Authors: Wenjie Li, Qifan Song, Jean Honorio

    Abstract: In this work, we study the personalized federated $\mathcal{X}$-armed bandit problem, where the heterogeneous local objectives of the clients are optimized simultaneously in the federated learning paradigm. We propose the \texttt{PF-PNE} algorithm with a unique double elimination strategy, which safely eliminates the non-optimal regions while encouraging federated collaboration through biased but… ▽ More

    Submitted 24 October, 2023; originally announced October 2023.

  29. arXiv:2310.16320  [pdf, other

    stat.ML cs.LG

    Enhancing Low-Precision Sampling via Stochastic Gradient Hamiltonian Monte Carlo

    Authors: Ziyi Wang, Yujie Chen, Qifan Song, Ruqi Zhang

    Abstract: Low-precision training has emerged as a promising low-cost technique to enhance the training efficiency of deep neural networks without sacrificing much accuracy. Its Bayesian counterpart can further provide uncertainty quantification and improved generalization accuracy. This paper investigates low-precision sampling via Stochastic Gradient Hamiltonian Monte Carlo (SGHMC) with low-precision and f… ▽ More

    Submitted 14 July, 2024; v1 submitted 24 October, 2023; originally announced October 2023.

  30. arXiv:2310.09603  [pdf, other

    eess.IV cs.CV

    B-Spine: Learning B-Spline Curve Representation for Robust and Interpretable Spinal Curvature Estimation

    Authors: Hao Wang, Qiang Song, Ruofeng Yin, Rui Ma, Yizhou Yu, Yi Chang

    Abstract: Spinal curvature estimation is important to the diagnosis and treatment of the scoliosis. Existing methods face several issues such as the need of expensive annotations on the vertebral landmarks and being sensitive to the image quality. It is challenging to achieve robust estimation and obtain interpretable results, especially for low-quality images which are blurry and hazy. In this paper, we pr… ▽ More

    Submitted 14 October, 2023; originally announced October 2023.

  31. arXiv:2309.15035  [pdf, ps, other

    math.AC cs.SC math.CO

    On the Reduced Gröbner Bases of Blockwise Determinantal Ideals

    Authors: Chenqi Mou, Qiuye Song

    Abstract: Blockwise determinantal ideals are those generated by the union of all the minors of specified sizes in certain blocks of a generic matrix, and they are the natural generalization of many existing determinantal ideals like the Schubert and ladder ones. In this paper we establish several criteria to verify whether the Gröbner bases of blockwise determinantal ideals with respect to (anti-)diagonal t… ▽ More

    Submitted 26 September, 2023; originally announced September 2023.

    MSC Class: 13P10 (Primary) 13C40; 05E14 (Secondary)

  32. arXiv:2309.04909  [pdf, ps, other

    cs.CR

    Bicoptor 2.0: Addressing Challenges in Probabilistic Truncation for Enhanced Privacy-Preserving Machine Learning

    Authors: Lijing Zhou, Qingrui Song, Su Zhang, Ziyu Wang, Xianggui Wang, Yong Li

    Abstract: This paper primarily focuses on analyzing the problems and proposing solutions for the probabilistic truncation protocol in existing PPML works from the perspectives of accuracy and efficiency. In terms of accuracy, we reveal that precision selections recommended in some of the existing works are incorrect. We conduct a thorough analysis of their open-source code and find that their errors were ma… ▽ More

    Submitted 6 March, 2024; v1 submitted 9 September, 2023; originally announced September 2023.

    Comments: 17 pages, 5 figures

  33. arXiv:2309.01885  [pdf, other

    stat.ML cs.CL cs.LG

    QuantEase: Optimization-based Quantization for Language Models

    Authors: Kayhan Behdin, Ayan Acharya, Aman Gupta, Qingquan Song, Siyu Zhu, Sathiya Keerthi, Rahul Mazumder

    Abstract: With the rising popularity of Large Language Models (LLMs), there has been an increasing interest in compression techniques that enable their efficient deployment. This study focuses on the Post-Training Quantization (PTQ) of LLMs. Drawing from recent advances, our work introduces QuantEase, a layer-wise quantization framework where individual layers undergo separate quantization. The problem is f… ▽ More

    Submitted 1 December, 2023; v1 submitted 4 September, 2023; originally announced September 2023.

  34. arXiv:2309.00154  [pdf, other

    cs.CY

    Learning From Peers: A Survey of Perception and Utilization of Online Peer Support Among Informal Dementia Caregivers

    Authors: Zhijun Yin, Lauren Stratton, Qingyuan Song, Congning Ni, Lijun Song, Patricia A. Commiskey, Qingxia Chen, Monica Moreno, Sam Fazio, Bradley A. Malin

    Abstract: Informal dementia caregivers are those who care for a person living with dementia (PLWD) without receiving payment (e.g., family members, friends, or other unpaid caregivers). These informal caregivers are subject to substantial mental, physical, and financial burdens. Online communities enable these caregivers to exchange caregiving strategies and communicate experiences with other caregivers who… ▽ More

    Submitted 31 August, 2023; originally announced September 2023.

  35. arXiv:2308.16453  [pdf, ps, other

    cs.CR cs.LG

    Listen to Minority: Encrypted Traffic Classification for Class Imbalance with Contrastive Pre-Training

    Authors: Xiang Li, Juncheng Guo, Qige Song, Jiang Xie, Yafei Sang, Shuyuan Zhao, Yongzheng Zhang

    Abstract: Mobile Internet has profoundly reshaped modern lifestyles in various aspects. Encrypted Traffic Classification (ETC) naturally plays a crucial role in managing mobile Internet, especially with the explosive growth of mobile apps using encrypted communication. Despite some existing learning-based ETC methods showing promising results, three-fold limitations still remain in real-world network enviro… ▽ More

    Submitted 6 September, 2023; v1 submitted 31 August, 2023; originally announced August 2023.

    Comments: Accepted by 2023 20th Annual IEEE International Conference on Sensing, Communication, and Networking, 9 pages, 6 figures

  36. Uncertainty-Encoded Multi-Modal Fusion for Robust Object Detection in Autonomous Driving

    Authors: Yang Lou, Qun Song, Qian Xu, Rui Tan, Jianping Wang

    Abstract: Multi-modal fusion has shown initial promising results for object detection of autonomous driving perception. However, many existing fusion schemes do not consider the quality of each fusion input and may suffer from adverse conditions on one or more sensors. While predictive uncertainty has been applied to characterize single-modal object detection performance at run time, incorporating uncertain… ▽ More

    Submitted 30 July, 2023; originally announced July 2023.

    Comments: In proceedings of the 26th European Conference on Artificial Intelligence ECAI 2023. 8 pages + 2 appendix pages

  37. arXiv:2307.16099  [pdf, other

    cs.LG stat.ML

    On Neural Network approximation of ideal adversarial attack and convergence of adversarial training

    Authors: Rajdeep Haldar, Qifan Song

    Abstract: Adversarial attacks are usually expressed in terms of a gradient-based operation on the input data and model, this results in heavy computations every time an attack is generated. In this work, we solidify the idea of representing adversarial attacks as a trainable function, without further gradient computation. We first motivate that the theoretical best attacks, under proper conditions, can be r… ▽ More

    Submitted 29 July, 2023; originally announced July 2023.

    MSC Class: 68T99; 62G20; 49K35; 34A34

  38. arXiv:2307.08252  [pdf, other

    cs.CV

    Large-Scale Person Detection and Localization using Overhead Fisheye Cameras

    Authors: Lu Yang, Liulei Li, Xueshi Xin, Yifan Sun, Qing Song, Wenguan Wang

    Abstract: Location determination finds wide applications in daily life. Instead of existing efforts devoted to localizing tourist photos captured by perspective cameras, in this article, we focus on devising person positioning solutions using overhead fisheye cameras. Such solutions are advantageous in large field of view (FOV), low cost, anti-occlusion, and unaggressive work mode (without the necessity of… ▽ More

    Submitted 17 July, 2023; originally announced July 2023.

    Comments: ICCV 2023. Project page: https://LOAFisheye.github.io

  39. arXiv:2306.13641  [pdf, other

    stat.ML cs.LG

    A New Paradigm for Generative Adversarial Networks based on Randomized Decision Rules

    Authors: Sehwan Kim, Qifan Song, Faming Liang

    Abstract: The Generative Adversarial Network (GAN) was recently introduced in the literature as a novel machine learning method for training generative models. It has many applications in statistics such as nonparametric clustering and nonparametric conditional independence tests. However, training the GAN is notoriously difficult due to the issue of mode collapse, which refers to the lack of diversity amon… ▽ More

    Submitted 23 June, 2023; originally announced June 2023.

  40. arXiv:2306.02283  [pdf, other

    stat.ML cs.LG

    Matrix Completion from General Deterministic Sampling Patterns

    Authors: Hanbyul Lee, Rahul Mazumder, Qifan Song, Jean Honorio

    Abstract: Most of the existing works on provable guarantees for low-rank matrix completion algorithms rely on some unrealistic assumptions such that matrix entries are sampled randomly or the sampling pattern has a specific structure. In this work, we establish theoretical guarantee for the exact and approximate low-rank matrix completion problems which can be applied to any deterministic sampling schemes.… ▽ More

    Submitted 4 June, 2023; originally announced June 2023.

  41. arXiv:2305.14146  [pdf, other

    cs.SE

    Industry Practices for Challenging Autonomous Driving Systems with Critical Scenarios

    Authors: Qunying Song, Emelie Engström, Per Runeson

    Abstract: Testing autonomous driving systems for safety and reliability is extremely complex. A primary challenge is identifying the relevant test scenarios, especially the critical ones that may expose hazards or risks of harm to autonomous vehicles and other road users. There are several proposed methods and tools for critical scenario identification, while the industry practices, such as the selection, i… ▽ More

    Submitted 23 May, 2023; originally announced May 2023.

    Comments: 29 pages, 3 figures, submitted to a journal

  42. arXiv:2305.08721  [pdf, other

    cs.CV

    Learning More Discriminative Local Descriptors for Few-shot Learning

    Authors: Qijun Song, Siyun Zhou, Liwei Xu

    Abstract: Few-shot learning for image classification comes up as a hot topic in computer vision, which aims at fast learning from a limited number of labeled images and generalize over the new tasks. In this paper, motivated by the idea of Fisher Score, we propose a Discriminative Local Descriptors Attention (DLDA) model that adaptively selects the representative local descriptors and does not introduce any… ▽ More

    Submitted 15 May, 2023; originally announced May 2023.

  43. arXiv:2305.08541  [pdf, other

    cs.SD eess.AS

    Ripple sparse self-attention for monaural speech enhancement

    Authors: Qiquan Zhang, Hongxu Zhu, Qi Song, Xinyuan Qian, Zhaoheng Ni, Haizhou Li

    Abstract: The use of Transformer represents a recent success in speech enhancement. However, as its core component, self-attention suffers from quadratic complexity, which is computationally prohibited for long speech recordings. Moreover, it allows each time frame to attend to all time frames, neglecting the strong local correlations of speech signals. This study presents a simple yet effective sparse self… ▽ More

    Submitted 15 May, 2023; originally announced May 2023.

    Comments: 5 pages, ICASSP 2023 published

  44. arXiv:2305.05838  [pdf, other

    cs.CV cs.MM

    Generative Steganographic Flow

    Authors: Ping Wei, Ge Luo, Qi Song, Xinpeng Zhang, Zhenxing Qian, Sheng Li

    Abstract: Generative steganography (GS) is a new data hiding manner, featuring direct generation of stego media from secret data. Existing GS methods are generally criticized for their poor performances. In this paper, we propose a novel flow based GS approach -- Generative Steganographic Flow (GSF), which provides direct generation of stego images without cover image. We take the stego image generation and… ▽ More

    Submitted 9 May, 2023; originally announced May 2023.

    Comments: The accepted paper in ICME 2022

  45. arXiv:2303.15878  [pdf, ps, other

    cs.NI

    Multidimensional Resource Fragmentation-Aware Virtual Network Embedding in MEC Systems Interconnected by Metro Optical Networks

    Authors: Yingying Guan, Qingyang Song, Weijing Qi, Ke Li, Lei Guo, Abbas Jamalipour

    Abstract: The increasing demand for diverse emerging applications has resulted in the interconnection of multi-access edge computing (MEC) systems via metro optical networks. To cater to these diverse applications, network slicing has become a popular tool for creating specialized virtual networks. However, resource fragmentation caused by uneven utilization of multidimensional resources can lead to reduced… ▽ More

    Submitted 28 March, 2023; originally announced March 2023.

  46. arXiv:2303.06548  [pdf, other

    cs.CV eess.IV

    CoT-MISR:Marrying Convolution and Transformer for Multi-Image Super-Resolution

    Authors: Mingming Xiu, Yang Nie, Qing Song, Chun Liu

    Abstract: As a method of image restoration, image super-resolution has been extensively studied at first. How to transform a low-resolution image to restore its high-resolution image information is a problem that researchers have been exploring. In the early physical transformation methods, the high-resolution pictures generated by these methods always have a serious problem of missing information, and the… ▽ More

    Submitted 11 March, 2023; originally announced March 2023.

  47. arXiv:2303.04030  [pdf, other

    stat.ML cs.AI cs.LG cs.SE

    PyXAB -- A Python Library for $\mathcal{X}$-Armed Bandit and Online Blackbox Optimization Algorithms

    Authors: Wenjie Li, Haoze Li, Jean Honorio, Qifan Song

    Abstract: We introduce a Python open-source library for $\mathcal{X}$-armed bandit and online blackbox optimization named PyXAB. PyXAB contains the implementations for more than 10 $\mathcal{X}$-armed bandit algorithms, such as HOO, StoSOO, HCT, and the most recent works GPO and VHCT. PyXAB also provides the most commonly-used synthetic objectives to evaluate the performance of different algorithms and the… ▽ More

    Submitted 7 March, 2023; originally announced March 2023.

  48. arXiv:2303.03166  [pdf, other

    cs.CV cs.AI

    Faster Learning of Temporal Action Proposal via Sparse Multilevel Boundary Generator

    Authors: Qing Song, Yang Zhou, Mengjie Hu, Chun Liu

    Abstract: Temporal action localization in videos presents significant challenges in the field of computer vision. While the boundary-sensitive method has been widely adopted, its limitations include incomplete use of intermediate and global information, as well as an inefficient proposal feature generator. To address these challenges, we propose a novel framework, Sparse Multilevel Boundary Generator (SMBG)… ▽ More

    Submitted 6 March, 2023; originally announced March 2023.

    Comments: 18 pages, 5 figures

  49. AutoML in The Wild: Obstacles, Workarounds, and Expectations

    Authors: Yuan Sun, Qiurong Song, Xinning Gui, Fenglong Ma, Ting Wang

    Abstract: Automated machine learning (AutoML) is envisioned to make ML techniques accessible to ordinary users. Recent work has investigated the role of humans in enhancing AutoML functionality throughout a standard ML workflow. However, it is also critical to understand how users adopt existing AutoML solutions in complex, real-world settings from a holistic perspective. To fill this gap, this study conduc… ▽ More

    Submitted 3 April, 2024; v1 submitted 21 February, 2023; originally announced February 2023.

    Comments: In Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems (CHI'23), April 23-28, 2023, Hamburg, Germany

  50. arXiv:2302.09693  [pdf, other

    stat.ML cs.LG

    mSAM: Micro-Batch-Averaged Sharpness-Aware Minimization

    Authors: Kayhan Behdin, Qingquan Song, Aman Gupta, Sathiya Keerthi, Ayan Acharya, Borja Ocejo, Gregory Dexter, Rajiv Khanna, David Durfee, Rahul Mazumder

    Abstract: Modern deep learning models are over-parameterized, where different optima can result in widely varying generalization performance. The Sharpness-Aware Minimization (SAM) technique modifies the fundamental loss function that steers gradient descent methods toward flatter minima, which are believed to exhibit enhanced generalization prowess. Our study delves into a specific variant of SAM known as… ▽ More

    Submitted 30 September, 2023; v1 submitted 19 February, 2023; originally announced February 2023.

    Comments: arXiv admin note: substantial text overlap with arXiv:2212.04343