Skip to main content

Showing 1–22 of 22 results for author: Ge, Q

  1. arXiv:2406.01195  [pdf, other

    cs.RO

    C$^3$P-VoxelMap: Compact, Cumulative and Coalescible Probabilistic Voxel Mapping

    Authors: Xu Yang, Wenhao Li, Qijie Ge, Lulu Suo, Weijie Tang, Zhengyu Wei, Longxiang Huang, Bo Wang

    Abstract: This work presents a compact, cumulative and coalescible probabilistic voxel mapping method to enhance performance, accuracy and memory efficiency in LiDAR odometry. Probabilistic voxel mapping requires storing past point clouds and re-iterating on them to update the uncertainty every iteration, which consumes large memory space and CPU cycles. To solve this problem, we propose a two-folded strate… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

  2. arXiv:2401.17633  [pdf, other

    cs.CL cs.AI

    Navigating the OverKill in Large Language Models

    Authors: Chenyu Shi, Xiao Wang, Qiming Ge, Songyang Gao, Xianjun Yang, Tao Gui, Qi Zhang, Xuanjing Huang, Xun Zhao, Dahua Lin

    Abstract: Large language models are meticulously aligned to be both helpful and harmless. However, recent research points to a potential overkill which means models may refuse to answer benign queries. In this paper, we investigate the factors for overkill by exploring how models handle and determine the safety of queries. Our findings reveal the presence of shortcuts within models, leading to an over-atten… ▽ More

    Submitted 31 January, 2024; originally announced January 2024.

  3. arXiv:2401.11458  [pdf, other

    cs.CL

    Linear Alignment: A Closed-form Solution for Aligning Human Preferences without Tuning and Feedback

    Authors: Songyang Gao, Qiming Ge, Wei Shen, Shihan Dou, Junjie Ye, Xiao Wang, Rui Zheng, Yicheng Zou, Zhi Chen, Hang Yan, Qi Zhang, Dahua Lin

    Abstract: The success of AI assistants based on Language Models (LLMs) hinges on Reinforcement Learning from Human Feedback (RLHF) to comprehend and align with user intentions. However, traditional alignment algorithms, such as PPO, are hampered by complex annotation and training requirements. This reliance limits the applicability of RLHF and hinders the development of professional assistants tailored to d… ▽ More

    Submitted 1 July, 2024; v1 submitted 21 January, 2024; originally announced January 2024.

    Comments: Accepted by ICML2024, I'm still preparing a better vision

  4. arXiv:2311.03275  [pdf, other

    cs.LG cs.SI

    HetCAN: A Heterogeneous Graph Cascade Attention Network with Dual-Level Awareness

    Authors: Zeyuan Zhao, Qingqing Ge, Anfeng Cheng, Yiding Liu, Xiang Li, Shuaiqiang Wang

    Abstract: Heterogeneous graph neural networks(HGNNs) have recently shown impressive capability in modeling heterogeneous graphs that are ubiquitous in real-world applications. Most existing methods for heterogeneous graphs mainly learn node embeddings by stacking multiple convolutional or attentional layers, which can be considered as capturing the high-order information from node-level aspect. However, dif… ▽ More

    Submitted 29 May, 2024; v1 submitted 6 November, 2023; originally announced November 2023.

    Comments: Accepted by ECML-PKDD 2024

  5. arXiv:2311.02116  [pdf, other

    cs.LG

    Resist Label Noise with PGM for Graph Neural Networks

    Authors: Qingqing Ge, Jianxiang Yu, Zeyuan Zhao, Xiang Li

    Abstract: While robust graph neural networks (GNNs) have been widely studied for graph perturbation and attack, those for label noise have received significantly less attention. Most existing methods heavily rely on the label smoothness assumption to correct noisy labels, which adversely affects their performance on heterophilous graphs. Further, they generally perform poorly in high noise-rate scenarios. T… ▽ More

    Submitted 2 November, 2023; originally announced November 2023.

  6. arXiv:2310.17394  [pdf, other

    cs.LG

    PSP: Pre-Training and Structure Prompt Tuning for Graph Neural Networks

    Authors: Qingqing Ge, Zeyuan Zhao, Yiding Liu, Anfeng Cheng, Xiang Li, Shuaiqiang Wang, Dawei Yin

    Abstract: Graph Neural Networks (GNNs) are powerful in learning semantics of graph data. Recently, a new paradigm "pre-train and prompt" has shown promising results in adapting GNNs to various tasks with less supervised data. The success of such paradigm can be attributed to the more consistent objectives of pre-training and task-oriented prompt tuning, where the pre-trained knowledge can be effectively tra… ▽ More

    Submitted 1 June, 2024; v1 submitted 26 October, 2023; originally announced October 2023.

  7. arXiv:2310.14152  [pdf, other

    cs.CL cs.LG

    Orthogonal Subspace Learning for Language Model Continual Learning

    Authors: Xiao Wang, Tianze Chen, Qiming Ge, Han Xia, Rong Bao, Rui Zheng, Qi Zhang, Tao Gui, Xuanjing Huang

    Abstract: Benefiting from massive corpora and advanced hardware, large language models (LLMs) exhibit remarkable capabilities in language understanding and generation. However, their performance degrades in scenarios where multiple tasks are encountered sequentially, also known as catastrophic forgetting. In this paper, we propose orthogonal low-rank adaptation (O-LoRA), a simple and efficient approach for… ▽ More

    Submitted 21 October, 2023; originally announced October 2023.

    Comments: EMNLP 2023 findings

  8. arXiv:2302.03498  [pdf, other

    cs.CL cs.SD eess.AS

    MAC: A unified framework boosting low resource automatic speech recognition

    Authors: Zeping Min, Qian Ge, Zhong Li, Weinan E

    Abstract: We propose a unified framework for low resource automatic speech recognition tasks named meta audio concatenation (MAC). It is easy to implement and can be carried out in extremely low resource environments. Mathematically, we give a clear description of MAC framework from the perspective of bayesian sampling. In this framework, we leverage a novel concatenative synthesis text-to-speech system to… ▽ More

    Submitted 15 February, 2023; v1 submitted 5 February, 2023; originally announced February 2023.

  9. Heterogeneous Graph Contrastive Learning with Meta-path Contexts and Adaptively Weighted Negative Samples

    Authors: Jianxiang Yu, Qingqing Ge, Xiang Li, Aoying Zhou

    Abstract: Heterogeneous graph contrastive learning has received wide attention recently. Some existing methods use meta-paths, which are sequences of object types that capture semantic relationships between objects, to construct contrastive views. However, most of them ignore the rich meta-path context information that describes how two objects are connected by meta-paths. Further, they fail to distinguish… ▽ More

    Submitted 5 April, 2024; v1 submitted 28 December, 2022; originally announced December 2022.

    Comments: This paper has been accepted by TKDE as a regular paper

  10. arXiv:2211.10039  [pdf, other

    cs.LG cs.AI

    Why the pseudo label based semi-supervised learning algorithm is effective?

    Authors: Zeping Min, Qian Ge, Cheng Tai

    Abstract: Recently, pseudo label based semi-supervised learning has achieved great success in many fields. The core idea of the pseudo label based semi-supervised learning algorithm is to use the model trained on the labeled data to generate pseudo labels on the unlabeled data, and then train a model to fit the previously generated pseudo labels. We give a theory analysis for why pseudo label based semi-sup… ▽ More

    Submitted 24 January, 2023; v1 submitted 18 November, 2022; originally announced November 2022.

  11. arXiv:2210.15285  [pdf, other

    cs.SD cs.CL eess.AS

    SAN: a robust end-to-end ASR model architecture

    Authors: Zeping Min, Qian Ge, Guanhua Huang

    Abstract: In this paper, we propose a novel Siamese Adversarial Network (SAN) architecture for automatic speech recognition, which aims at solving the difficulty of fuzzy audio recognition. Specifically, SAN constructs two sub-networks to differentiate the audio feature input and then introduces a loss to unify the output distribution of these sub-networks. Adversarial learning enables the network to captur… ▽ More

    Submitted 27 October, 2022; originally announced October 2022.

  12. arXiv:2210.13067  [pdf, other

    cs.SD eess.AS

    10 hours data is all you need

    Authors: Zeping Min, Qian Ge, Zhong Li

    Abstract: We propose a novel procedure to generate pseudo mandarin speech data named as CAMP (character audio mix up), which aims at generating audio from a character scale. We also raise a method for building a mandarin character scale audio database adaptive to CAMP named as META-AUDIO, which makes full use of audio data and can greatly increase the data diversity of the database. Experiments show that ou… ▽ More

    Submitted 24 October, 2022; originally announced October 2022.

  13. arXiv:2002.11847  [pdf, other

    cs.CL cs.LG cs.NE

    Echo State Neural Machine Translation

    Authors: Ankush Garg, Yuan Cao, Qi Ge

    Abstract: We present neural machine translation (NMT) models inspired by echo state network (ESN), named Echo State NMT (ESNMT), in which the encoder and decoder layer weights are randomly generated then fixed throughout training. We show that even with this extremely simple model construction and training procedure, ESNMT can already reach 70-80% quality of fully trainable baselines. We examine how spectra… ▽ More

    Submitted 26 February, 2020; originally announced February 2020.

  14. Relaxed Actor-Critic with Convergence Guarantees for Continuous-Time Optimal Control of Nonlinear Systems

    Authors: Jingliang Duan, Jie Li, Qiang Ge, Shengbo Eben Li, Monimoy Bujarbaruah, Fei Ma, Dezhao Zhang

    Abstract: This paper presents the Relaxed Continuous-Time Actor-critic (RCTAC) algorithm, a method for finding the nearly optimal policy for nonlinear continuous-time (CT) systems with known dynamics and infinite horizon, such as the path-tracking control of vehicles. RCTAC has several advantages over existing adaptive dynamic programming algorithms for CT systems. It does not require the ``admissibility" o… ▽ More

    Submitted 30 March, 2023; v1 submitted 11 September, 2019; originally announced September 2019.

    Journal ref: IEEE Transactions on Intelligent Vehicles, 2023 (Early Access)

  15. arXiv:1908.11200  [pdf, other

    cs.CY cs.LG

    A Concert-planning Tool for Independent Musicians by Machine Learning Models

    Authors: Xiaohan Yang, Qingyin Ge

    Abstract: Our project aims at helping independent musicians to plan their concerts based on the economies of agglomeration in the music industry. Initially, we planned to design an advisory tool for both concert pricing and location selection. Nonetheless, after implementing SGD linear regression and support vector regression models, we realized that concert price does not vary significantly according to di… ▽ More

    Submitted 29 August, 2019; originally announced August 2019.

  16. arXiv:1902.08295  [pdf, other

    cs.LG stat.ML

    Lingvo: a Modular and Scalable Framework for Sequence-to-Sequence Modeling

    Authors: Jonathan Shen, Patrick Nguyen, Yonghui Wu, Zhifeng Chen, Mia X. Chen, Ye Jia, Anjuli Kannan, Tara Sainath, Yuan Cao, Chung-Cheng Chiu, Yanzhang He, Jan Chorowski, Smit Hinsu, Stella Laurenzo, James Qin, Orhan Firat, Wolfgang Macherey, Suyog Gupta, Ankur Bapna, Shuyuan Zhang, Ruoming Pang, Ron J. Weiss, Rohit Prabhavalkar, Qiao Liang, Benoit Jacob , et al. (66 additional authors not shown)

    Abstract: Lingvo is a Tensorflow framework offering a complete solution for collaborative deep learning research, with a particular focus towards sequence-to-sequence models. Lingvo models are composed of modular building blocks that are flexible and easily extensible, and experiment configurations are centralized and highly customizable. Distributed training and quantized inference are supported directly w… ▽ More

    Submitted 21 February, 2019; originally announced February 2019.

  17. arXiv:1810.05345  [pdf, other

    cs.OS

    Time Protection: the Missing OS Abstraction

    Authors: Qian Ge, Yuval Yarom, Tom Chothia, Gernot Heiser

    Abstract: Timing channels enable data leakage that threatens the security of computer systems, from cloud platforms to smartphones and browsers executing untrusted third-party code. Preventing unauthorised information flow is a core duty of the operating system, however, present OSes are unable to prevent timing channels. We argue that OSes must provide time protection in addition to the established memory… ▽ More

    Submitted 15 October, 2018; v1 submitted 11 October, 2018; originally announced October 2018.

  18. arXiv:1612.04474  [pdf, other

    cs.CR

    Your Processor Leaks Information - and There's Nothing You Can Do About It

    Authors: Qian Ge, Yuval Yarom, Frank Li, Gernot Heiser

    Abstract: Timing channels are information flows, encoded in the relative timing of events, that bypass the system's protection mechanisms. Any microarchitectural state that depends on execution history and affects the rate of progress of later executions potentially establishes a timing channel, unless explicit steps are taken to close it. Such state includes CPU caches, TLBs, branch predictors and prefetch… ▽ More

    Submitted 14 September, 2017; v1 submitted 13 December, 2016; originally announced December 2016.

  19. arXiv:1312.3005  [pdf, ps, other

    cs.CL

    One Billion Word Benchmark for Measuring Progress in Statistical Language Modeling

    Authors: Ciprian Chelba, Tomas Mikolov, Mike Schuster, Qi Ge, Thorsten Brants, Phillipp Koehn, Tony Robinson

    Abstract: We propose a new benchmark corpus to be used for measuring progress in statistical language modeling. With almost one billion words of training data, we hope this benchmark will be useful to quickly evaluate novel language modeling techniques, and to compare their contribution when combined with other advanced techniques. We show performance of several well-known types of language models, with the… ▽ More

    Submitted 4 March, 2014; v1 submitted 10 December, 2013; originally announced December 2013.

    Comments: Accompanied by a code.google.com project allowing anyone to generate the benchmark data, and use it to compare their language model against the ones described in the paper

  20. arXiv:1105.5131  [pdf, ps, other

    cs.CC cs.DM math.CO

    Improved Inapproximability Results for Counting Independent Sets in the Hard-Core Model

    Authors: Andreas Galanis, Qi Ge, Daniel Stefankovic, Eric Vigoda, Linji Yang

    Abstract: We study the computational complexity of approximately counting the number of independent sets of a graph with maximum degree Delta. More generally, for an input graph G=(V,E) and an activity lambda>0, we are interested in the quantity Z_G(lambda) defined as the sum over independent sets I weighted as w(I) = lambda^|I|. In statistical physics, Z_G(lambda) is the partition function for the hard-c… ▽ More

    Submitted 11 December, 2012; v1 submitted 25 May, 2011; originally announced May 2011.

    Comments: to appear in Random Structures and Algorithms

    ACM Class: F.2.2; G.3

  21. arXiv:1009.5019  [pdf, ps, other

    cs.CC

    The Complexity of Counting Eulerian Tours in 4-Regular Graphs

    Authors: Qi Ge, Daniel Stefankovic

    Abstract: We investigate the complexity of counting Eulerian tours ({\sc #ET}) and its variations from two perspectives---the complexity of exact counting and the complexity w.r.t. approximation-preserving reductions (AP-reductions \cite{MR2044886}). We prove that {\sc #ET} is #P-complete even for planar 4-regular graphs. A closely related problem is that of counting A-trails ({\sc #A-trails}) in graphs w… ▽ More

    Submitted 25 September, 2010; originally announced September 2010.

  22. arXiv:0911.4732  [pdf, ps, other

    cs.DM cs.DS

    A graph polynomial for independent sets of bipartite graphs

    Authors: Qi Ge, Daniel Stefankovic

    Abstract: We introduce a new graph polynomial that encodes interesting properties of graphs, for example, the number of matchings and the number of perfect matchings. Most importantly, for bipartite graphs the polynomial encodes the number of independent sets (#BIS). We analyze the complexity of exact evaluation of the polynomial at rational points and show that for most points exact evaluation is #P-ha… ▽ More

    Submitted 10 February, 2010; v1 submitted 24 November, 2009; originally announced November 2009.