Showing 1–50 of 52 results for author: Meng, R

  1. arXiv:2406.17349  [pdf, other]

    cs.CR cs.CV

    Semantic Deep Hiding for Robust Unlearnable Examples

    Authors: Ruohan Meng, Chenyu Yi, Yi Yu, Siyuan Yang, Bingquan Shen, Alex C. Kot

    Abstract: Ensuring data privacy and protection has become paramount in the era of deep learning. Unlearnable examples are proposed to mislead the deep learning models and prevent data from unauthorized exploration by adding small perturbations to data. However, such perturbations (e.g., noise, texture, color change) predominantly impact low-level features, making them vulnerable to common countermeasures. I…

    Submitted 25 June, 2024; originally announced June 2024.

    Comments: Accepted by TIFS 2024

  2. arXiv:2406.06149  [pdf, other]

    cs.LG stat.ML

    Decoupled Marked Temporal Point Process using Neural Ordinary Differential Equations

    Authors: Yujee Song, Donghyun Lee, Rui Meng, Won Hwa Kim

    Abstract: A Marked Temporal Point Process (MTPP) is a stochastic process whose realization is a set of event-time data. MTPP is often used to understand complex dynamics of asynchronous temporal events such as money transactions, social media, healthcare, etc. Recent studies have utilized deep neural networks to capture complex temporal dependencies of events and generate embeddings that aptly represent the o…

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: 18 pages, 8 figures, The Twelfth International Conference on Learning Representations (ICLR 2024)

  3. arXiv:2405.19509  [pdf, other]

    cs.IT

    Leveraging partial stragglers within gradient coding

    Authors: Aditya Ramamoorthy, Ruoyu Meng, Vrinda S. Girimaji

    Abstract: Within distributed learning, workers typically compute gradients on their assigned dataset chunks and send them to the parameter server (PS), which aggregates them to compute either an exact or approximate version of $\nabla L$ (gradient of the loss function $L$). However, in large-scale clusters, many workers are slower than their promised speed or even failure-prone. A gradient coding solution i…

    Submitted 29 May, 2024; originally announced May 2024.

    Comments: 12 pages, 7 figures

  4. arXiv:2405.14597  [pdf, other]

    cs.LG cs.AI

    Integer Scale: A Free Lunch for Faster Fine-grained Quantization of LLMs

    Authors: Qingyuan Li, Ran Meng, Yiduo Li, Bo Zhang, Yifan Lu, Yerui Sun, Lin Ma, Yuchen Xie

    Abstract: We introduce Integer Scale, a novel post-training quantization scheme for large language models that effectively resolves the inference bottleneck in current fine-grained quantization approaches while maintaining similar accuracies. Integer Scale is a free lunch as it requires no extra calibration or fine-tuning which will otherwise incur additional costs. It can be used plug-and-play for most fin…

    Submitted 28 May, 2024; v1 submitted 23 May, 2024; originally announced May 2024.

  5. arXiv:2405.13179  [pdf, other]

    cs.CL

    RAG-RLRC-LaySum at BioLaySumm: Integrating Retrieval-Augmented Generation and Readability Control for Layman Summarization of Biomedical Texts

    Authors: Yuelyu Ji, Zhuochun Li, Rui Meng, Sonish Sivarajkumar, Yanshan Wang, Zeshui Yu, Hui Ji, Yushui Han, Hanyu Zeng, Daqing He

    Abstract: This paper introduces the RAG-RLRC-LaySum framework, designed to make complex biomedical research understandable to laymen through advanced Natural Language Processing (NLP) techniques. Our Retrieval Augmented Generation (RAG) solution, enhanced by a reranking method, utilizes multiple knowledge sources to ensure the precision and pertinence of lay summaries. Additionally, our Reinforcement Learni…

    Submitted 24 June, 2024; v1 submitted 21 May, 2024; originally announced May 2024.

  6. arXiv:2405.10960  [pdf, other]

    physics.med-ph cs.GR

    Optimizing Surgical Plans for Parenchyma-Sparing Liver Resections through Contour-Guided Resection and Surface Approximation

    Authors: Gabriella d'Albenzio, Ruoyan Meng, Davit Aghayan, Egidijus Pelanis, Rebecca Hisey, Sarkis Drejian, Åsmund Avdem Fretland, Ole Jakob Elle, Bjørn Edwin, Rafael Palomar

    Abstract: Objective: This study introduces a novel method for defining virtual resections in liver cancer surgery, aimed at enhancing the adaptability of parenchyma-sparing resection (PSR) plans. By comparing these with traditional anatomical resection (AR) plans, we explore the potential for optimization in surgical planning. Methods: Leveraging contours and spline surface approximations directly from the…

    Submitted 8 April, 2024; originally announced May 2024.

  7. arXiv:2404.13951  [pdf, other]

    cs.SE

    Program Environment Fuzzing

    Authors: Ruijie Meng, Gregory J. Duck, Abhik Roychoudhury

    Abstract: Computer programs are not executed in isolation, but rather interact with the execution environment which drives the program behaviours. Software validation and verification methods, such as greybox fuzzing, thus need to capture the effect of possibly complex environmental interactions, including files, databases, configurations, network sockets, human-user interactions, and more. Conventional app…

    Submitted 20 May, 2024; v1 submitted 22 April, 2024; originally announced April 2024.

    Comments: 13 pages, 5 figures, 5 tables

  8. arXiv:2404.02889  [pdf, other]

    cs.CR cs.CV

    Steganographic Passport: An Owner and User Verifiable Credential for Deep Model IP Protection Without Retraining

    Authors: Qi Cui, Ruohan Meng, Chaohui Xu, Chip-Hong Chang

    Abstract: Ensuring the legal usage of deep models is crucial to promoting trustable, accountable, and responsible artificial intelligence innovation. Current passport-based methods that obfuscate model functionality for license-to-use and ownership verifications suffer from capacity and quality constraints, as they require retraining the owner model for new users. They are also vulnerable to advanced Expand…

    Submitted 3 April, 2024; originally announced April 2024.

  9. arXiv:2402.01549   

    cs.IT math.CO quant-ph

    Quantum advantage in zero-error function computation with side information

    Authors: Ruoyu Meng, Aditya Ramamoorthy

    Abstract: We consider the problem of zero-error function computation with side information. Alice has a source $X$ and Bob has correlated source $Y$ and they can communicate via either a classical or a quantum channel. Bob wants to calculate $f(X,Y)$ with zero error. We aim to characterize the minimum amount of information that Alice needs to send to Bob for this to happen with zero-error. In the classical se…

    Submitted 4 March, 2024; v1 submitted 2 February, 2024; originally announced February 2024.

    Comments: We have realized an error in Claim 3

    MSC Class: 05

  10. arXiv:2312.06149  [pdf, other]

    cs.CL cs.AI

    Unlocking Anticipatory Text Generation: A Constrained Approach for Large Language Models Decoding

    Authors: Lifu Tu, Semih Yavuz, Jin Qu, Jiacheng Xu, Rui Meng, Caiming Xiong, Yingbo Zhou

    Abstract: Large Language Models (LLMs) have demonstrated a powerful ability for text generation. However, achieving optimal results with a given prompt or instruction can be challenging, especially for billion-sized models. Additionally, undesired behaviors such as toxicity or hallucinations can manifest. While much larger models (e.g., ChatGPT) may demonstrate strength in mitigating these issues, there is…

    Submitted 25 June, 2024; v1 submitted 11 December, 2023; originally announced December 2023.

  11. arXiv:2311.09550  [pdf, other]

    cs.LG cs.CL

    A Speed Odyssey for Deployable Quantization of LLMs

    Authors: Qingyuan Li, Ran Meng, Yiduo Li, Bo Zhang, Liang Li, Yifan Lu, Xiangxiang Chu, Yerui Sun, Yuchen Xie

    Abstract: The large language model era urges faster and less costly inference. Prior model compression works on LLMs tend to undertake a software-centric approach primarily focused on the simulated quantization performance. By neglecting the feasibility of deployment, these approaches are typically disabled in real practice. They used to drastically push down the quantization bit range for a reduced computa…

    Submitted 15 November, 2023; originally announced November 2023.

  12. arXiv:2309.08210  [pdf, other]

    cs.CL

    Investigating Answerability of LLMs for Long-Form Question Answering

    Authors: Meghana Moorthy Bhat, Rui Meng, Ye Liu, Yingbo Zhou, Semih Yavuz

    Abstract: As we embark on a new era of LLMs, it becomes increasingly crucial to understand their capabilities, limitations, and differences. Toward making further progress in this direction, we strive to build a deeper understanding of the gaps between massive LLMs (e.g., ChatGPT) and smaller yet effective open-source LLMs and their distilled counterparts. To this end, we specifically focus on long-form que…

    Submitted 15 September, 2023; originally announced September 2023.

  13. arXiv:2309.03450  [pdf, other]

    cs.CL cs.AI cs.LG

    XGen-7B Technical Report

    Authors: Erik Nijkamp, Tian Xie, Hiroaki Hayashi, Bo Pang, Congying Xia, Chen Xing, Jesse Vig, Semih Yavuz, Philippe Laban, Ben Krause, Senthil Purushwalkam, Tong Niu, Wojciech Kryściński, Lidiya Murakhovs'ka, Prafulla Kumar Choubey, Alex Fabbri, Ye Liu, Rui Meng, Lifu Tu, Meghana Bhat, Chien-Sheng Wu, Silvio Savarese, Yingbo Zhou, Shafiq Joty, Caiming Xiong

    Abstract: Large Language Models (LLMs) have become ubiquitous across various domains, transforming the way we interact with information and conduct research. However, most high-performing LLMs remain confined behind proprietary walls, hindering scientific progress. Most open-source LLMs, on the other hand, are limited in their ability to support longer sequence lengths, which is a key requirement for many t…

    Submitted 6 September, 2023; originally announced September 2023.

  14. arXiv:2308.12574  [pdf, other]

    cs.IR cs.AI

    Modeling Uncertainty and Using Post-fusion as Fallback Improves Retrieval Augmented Generation with LLMs

    Authors: Ye Liu, Semih Yavuz, Rui Meng, Meghana Moorthy, Shafiq Joty, Caiming Xiong, Yingbo Zhou

    Abstract: The integration of retrieved passages and large language models (LLMs), such as ChatGPTs, has significantly contributed to improving open-domain question answering. However, there is still a lack of exploration regarding the optimal approach for incorporating retrieved passages into the answer generation process. This paper aims to fill this gap by investigating different methods of combining retr…

    Submitted 7 April, 2024; v1 submitted 24 August, 2023; originally announced August 2023.

  15. arXiv:2308.08169  [pdf, other]

    cs.CL cs.AI

    Enhancing Performance on Seen and Unseen Dialogue Scenarios using Retrieval-Augmented End-to-End Task-Oriented System

    Authors: Jianguo Zhang, Stephen Roller, Kun Qian, Zhiwei Liu, Rui Meng, Shelby Heinecke, Huan Wang, Silvio Savarese, Caiming Xiong

    Abstract: End-to-end task-oriented dialogue (TOD) systems have achieved promising performance by leveraging sophisticated natural language understanding and natural language generation capabilities of pre-trained models. This work enables the TOD systems with more flexibility through a simple cache. The cache provides the flexibility to dynamically update the TOD systems and handle both existing and unseen…

    Submitted 16 August, 2023; originally announced August 2023.

    Comments: Accepted by SIGDIAL 2023 as a long paper

  16. arXiv:2307.10172  [pdf, other]

    cs.CL cs.AI

    DialogStudio: Towards Richest and Most Diverse Unified Dataset Collection for Conversational AI

    Authors: Jianguo Zhang, Kun Qian, Zhiwei Liu, Shelby Heinecke, Rui Meng, Ye Liu, Zhou Yu, Huan Wang, Silvio Savarese, Caiming Xiong

    Abstract: Despite advancements in conversational AI, language models encounter challenges to handle diverse conversational tasks, and existing dialogue dataset collections often lack diversity and comprehensiveness. To tackle these issues, we introduce DialogStudio: the largest and most diverse collection of dialogue datasets, unified under a consistent format while preserving their original information. Ou…

    Submitted 5 February, 2024; v1 submitted 19 July, 2023; originally announced July 2023.

    Comments: 17 pages, accepted by EACL 2024 Findings as a long paper. All datasets, licenses, codes, and models are available at https://github.com/salesforce/DialogStudio

  17. arXiv:2305.07789  [pdf, other]

    cs.CL cs.AI

    HPE: Answering Complex Questions over Text by Hybrid Question Parsing and Execution

    Authors: Ye Liu, Semih Yavuz, Rui Meng, Dragomir Radev, Caiming Xiong, Yingbo Zhou

    Abstract: The dominant paradigm of textual question answering systems is based on end-to-end neural networks, which excels at answering natural language questions but falls short on complex ones. This stands in contrast to the broad adaptation of semantic parsing approaches over structured data sources (e.g., relational database, knowledge graphs), that convert natural language questions to logical forms an…

    Submitted 5 January, 2024; v1 submitted 12 May, 2023; originally announced May 2023.

    Comments: EMNLP 2023 Findings

  18. arXiv:2305.04435  [pdf, ps, other]

    quant-ph cs.IT

    Communication complexity of entanglement assisted multi-party computation

    Authors: Ruoyu Meng, Aditya Ramamoorthy

    Abstract: We consider quantum and classical versions of a multi-party function computation problem with $n$ players, where players $2, \dots, n$ need to communicate appropriate information to player 1, so that a "generalized" inner product function with an appropriate promise can be calculated. The communication complexity of a protocol is the total number of bits that need to be communicated. When $n$ is prime…

    Submitted 5 February, 2024; v1 submitted 7 May, 2023; originally announced May 2023.

    Comments: Modified layout so that the reference is shown correctly

  19. arXiv:2305.02601  [pdf, other]

    cs.SE

    Greybox Fuzzing of Distributed Systems

    Authors: Ruijie Meng, George Pîrlea, Abhik Roychoudhury, Ilya Sergey

    Abstract: Grey-box fuzzing is the lightweight approach of choice for finding bugs in sequential programs. It provides a balance between efficiency and effectiveness by conducting a biased random search over the domain of program inputs using a feedback function from observed test executions. For distributed system testing, however, the state-of-practice is represented today by only black-box tools that do n…

    Submitted 12 August, 2023; v1 submitted 4 May, 2023; originally announced May 2023.

  20. arXiv:2304.13140  [pdf]

    cs.LG cs.CL

    ESimCSE Unsupervised Contrastive Learning Jointly with UDA Semi-Supervised Learning for Large Label System Text Classification Mode

    Authors: Ruan Lu, Zhou HangCheng, Ran Meng, Zhao Jin, Qin JiaoYu, Wei Feng, Wang ChenZi

    Abstract: The challenges faced by text classification with large tag systems in natural language processing tasks include multiple tag systems, uneven data distribution, and high noise. To address these problems, the ESimCSE unsupervised comparative learning and UDA semi-supervised comparative learning models are combined through the use of joint training techniques in the models. The ESimCSE model efficient…

    Submitted 18 April, 2023; originally announced April 2023.

    Comments: This paper contains 14 pages, 4 figures, 4 tables

  21. arXiv:2212.09877  [pdf, other]

    cs.CV

    LayoutDETR: Detection Transformer Is a Good Multimodal Layout Designer

    Authors: Ning Yu, Chia-Chih Chen, Zeyuan Chen, Rui Meng, Gang Wu, Paul Josel, Juan Carlos Niebles, Caiming Xiong, Ran Xu

    Abstract: Graphic layout designs play an essential role in visual communication. Yet handcrafting layout designs is skill-demanding, time-consuming, and non-scalable to batch production. Generative models emerge to make design automation scalable but it remains non-trivial to produce designs that comply with designers' multimodal desires, i.e., constrained by background images and driven by foreground conte…

    Submitted 24 March, 2023; v1 submitted 19 December, 2022; originally announced December 2022.

  22. arXiv:2212.08841  [pdf, other]

    cs.CL cs.IR

    AugTriever: Unsupervised Dense Retrieval by Scalable Data Augmentation

    Authors: Rui Meng, Ye Liu, Semih Yavuz, Divyansh Agarwal, Lifu Tu, Ning Yu, Jianguo Zhang, Meghana Bhat, Yingbo Zhou

    Abstract: Dense retrievers have made significant strides in text retrieval and open-domain question answering, even though most achievements were made possible only with large amounts of human supervision. In this work, we aim to develop unsupervised methods by proposing two methods that create pseudo query-document pairs and train dense retrieval models in an annotation-free and scalable manner: query extr…

    Submitted 7 March, 2023; v1 submitted 17 December, 2022; originally announced December 2022.

  23. arXiv:2211.10923  [pdf, other]

    cs.CV

    Traceable and Authenticable Image Tagging for Fake News Detection

    Authors: Ruohan Meng, Zhili Zhou, Qi Cui, Kwok-Yan Lam, Alex Kot

    Abstract: To prevent fake news images from misleading the public, it is desirable not only to verify the authenticity of news images but also to trace the source of fake news, so as to provide a complete forensic chain for reliable fake news detection. To simultaneously achieve the goals of authenticity verification and source tracing, we propose a traceable and authenticable image tagging approach that is…

    Submitted 20 November, 2022; originally announced November 2022.

  24. arXiv:2211.05165  [pdf, other]

    cs.CL cs.AI cs.PL

    Uni-Parser: Unified Semantic Parser for Question Answering on Knowledge Base and Database

    Authors: Ye Liu, Semih Yavuz, Rui Meng, Dragomir Radev, Caiming Xiong, Yingbo Zhou

    Abstract: Parsing natural language questions into executable logical forms is a useful and interpretable way to perform question answering on structured data such as knowledge bases (KB) or databases (DB). However, existing approaches on semantic parsing cannot adapt to both modalities, as they suffer from the exponential growth of the logical form candidates and can hardly generalize to unseen data. In thi…

    Submitted 9 November, 2022; originally announced November 2022.

    Comments: EMNLP 2022

  25. arXiv:2210.04206  [pdf, other]

    cs.CV

    Attention Diversification for Domain Generalization

    Authors: Rang Meng, Xianfeng Li, Weijie Chen, Shicai Yang, Jie Song, Xinchao Wang, Lei Zhang, Mingli Song, Di Xie, Shiliang Pu

    Abstract: Convolutional neural networks (CNNs) have demonstrated gratifying results at learning discriminative features. However, when applied to unseen domains, state-of-the-art models are usually prone to errors due to domain shift. After investigating this issue from the perspective of shortcut learning, we find the devils lie in the fact that models trained on different domains merely bias to different…

    Submitted 9 October, 2022; originally announced October 2022.

    Comments: ECCV 2022. Code available at https://github.com/hikvision-research/DomainGeneralization

    Journal ref: European Conference on Computer Vision (ECCV 2022)

  26. arXiv:2209.01326  [pdf]

    cs.CV cs.AI

    Continual Learning for Steganalysis

    Authors: Zihao Yin, Ruohan Meng, Zhili Zhou

    Abstract: To detect the existing steganographic algorithms, recent steganalysis methods usually train a Convolutional Neural Network (CNN) model on the dataset consisting of corresponding paired cover/stego-images. However, it is inefficient and impractical for those steganalysis tools to completely retrain the CNN model to make it effective against both the existing steganographic algorithms and a new emer…

    Submitted 3 September, 2022; originally announced September 2022.

  27. arXiv:2209.00871  [pdf]

    cs.RO

    3D Path Planning and Obstacle Avoidance Algorithms for Obstacle-Overcoming Robots

    Authors: Yuanhao Huang, Shi Huang, Hao Wang, Ruifeng Meng

    Abstract: This article introduces a multimodal motion planning (MMP) algorithm that combines three-dimensional (3-D) path planning and a DWA obstacle avoidance algorithm. The algorithms aim to plan the path and motion of obstacle-overcoming robots in complex unstructured scenes. A novel A-star algorithm is proposed to combine the characteristics of unstructured scenes and a strategy to switch it into a gree…

    Submitted 2 September, 2022; originally announced September 2022.

    Comments: 2nd IEEE International Conference on Electronic Communications, Internet of Things and Big Data Conference 2022 (IEEE ICEIB 2022)

  28. arXiv:2208.09606  [pdf, other]

    cs.CL

    General-to-Specific Transfer Labeling for Domain Adaptable Keyphrase Generation

    Authors: Rui Meng, Tong Wang, Xingdi Yuan, Yingbo Zhou, Daqing He

    Abstract: Training keyphrase generation (KPG) models requires a large amount of annotated data, which can be prohibitively expensive and often limited to specific domains. In this study, we first demonstrate that large distribution shifts among different domains severely hinder the transferability of KPG models. We then propose a three-stage pipeline, which gradually guides KPG models' learning focus from ge…

    Submitted 7 May, 2023; v1 submitted 20 August, 2022; originally announced August 2022.

    Comments: The submission has been accepted to the Findings of ACL 2023

  29. arXiv:2206.06620  [pdf, other]

    cs.CV

    Slimmable Domain Adaptation

    Authors: Rang Meng, Weijie Chen, Shicai Yang, Jie Song, Luojun Lin, Di Xie, Shiliang Pu, Xinchao Wang, Mingli Song, Yueting Zhuang

    Abstract: Vanilla unsupervised domain adaptation methods tend to optimize the model with fixed neural architecture, which is not very practical in real-world scenarios since the target data is usually processed by different resource-limited devices. It is therefore of great necessity to facilitate architecture adaptation across various devices. In this paper, we introduce a simple framework, Slimmable Domai…

    Submitted 14 June, 2022; originally announced June 2022.

    Comments: To appear in CVPR 2022. Code is coming soon: https://github.com/hikvision-research/SlimDA

    Journal ref: IEEE/CVF Computer Vision and Pattern Recognition Conference (CVPR), 2022

  30. arXiv:2205.10471  [pdf, other]

    cs.CL cs.AI cs.LG

    Retrieval-Augmented Multilingual Keyphrase Generation with Retriever-Generator Iterative Training

    Authors: Yifan Gao, Qingyu Yin, Zheng Li, Rui Meng, Tong Zhao, Bing Yin, Irwin King, Michael R. Lyu

    Abstract: Keyphrase generation is the task of automatically predicting keyphrases given a piece of long text. Despite its recent flourishing, keyphrase generation for non-English languages has not been extensively investigated. In this paper, we call attention to a new setting named multilingual keyphrase generation and we contribute two new datasets, EcommerceMKP and AcademicMKP, covering six languages. Technica…

    Submitted 1 June, 2022; v1 submitted 20 May, 2022; originally announced May 2022.

    Comments: NAACL 2022 (Findings)

  31. arXiv:2203.14474  [pdf, other]

    cs.CL

    Interpretable Research Replication Prediction via Variational Contextual Consistency Sentence Masking

    Authors: Tianyi Luo, Rui Meng, Xin Eric Wang, Yang Liu

    Abstract: Research Replication Prediction (RRP) is the task of predicting whether a published research result can be replicated or not. Building an interpretable neural text classifier for RRP promotes the understanding of why a research paper is predicted as replicable or non-replicable and therefore makes its real-world application more reliable and trustworthy. However, the prior works on model interpret…

    Submitted 27 March, 2022; originally announced March 2022.

  32. arXiv:2203.02051  [pdf, other]

    cs.LG cs.IT

    Compressed Predictive Information Coding

    Authors: Rui Meng, Tianyi Luo, Kristofer Bouchard

    Abstract: Unsupervised learning plays an important role in many fields, such as artificial intelligence, machine learning, and neuroscience. Compared to static data, methods for extracting low-dimensional structure for dynamic data are lagging. We developed a novel information-theoretic framework, Compressed Predictive Information Coding (CPIC), to extract useful representations from dynamic data. CPIC sele…

    Submitted 3 March, 2022; originally announced March 2022.

  33. arXiv:2112.08561  [pdf, other]

    cs.SD eess.AS

    EmotionBox: a music-element-driven emotional music generation system using Recurrent Neural Network

    Authors: Kaitong Zheng, Ruijie Meng, Chengshi Zheng, Xiaodong Li, Jinqiu Sang, Juanjuan Cai, Jie Wang

    Abstract: With the development of deep neural networks, automatic music composition has made great progress. Although emotional music can evoke listeners' different emotions and it is important for artistic expression, only a few studies have focused on generating emotional music. This paper presents EmotionBox, a music-element-driven emotional music generator that is capable of composing music given a sp…

    Submitted 15 December, 2021; originally announced December 2021.

  34. arXiv:2109.02312  [pdf, other]

    cs.SE

    Linear-time Temporal Logic guided Greybox Fuzzing

    Authors: Ruijie Meng, Zhen Dong, Jialin Li, Ivan Beschastnikh, Abhik Roychoudhury

    Abstract: Software model checking is a verification technique which is widely used for checking temporal properties of software systems. Even though it is a property verification technique, its common usage in practice is in "bug finding", that is, finding violations of temporal properties. Motivated by this observation and leveraging the recent progress in fuzzing, we build a greybox fuzzing framework to f…

    Submitted 19 April, 2022; v1 submitted 6 September, 2021; originally announced September 2021.

    Comments: To appear in International Conference on Software Engineering (ICSE) 2022

  35. arXiv:2106.13379  [pdf, other]

    cs.LG stat.ME stat.ML

    Bayesian Inference in High-Dimensional Time-Series with the Orthogonal Stochastic Linear Mixing Model

    Authors: Rui Meng, Kristofer Bouchard

    Abstract: Many modern time-series datasets contain large numbers of output response variables sampled for prolonged periods of time. For example, in neuroscience, the activities of hundreds to thousands of neurons are recorded during behaviors and in response to sensory stimuli. Multi-output Gaussian process models leverage the nonparametric nature of Gaussian processes to capture structure across multiple outputs. H…

    Submitted 12 March, 2022; v1 submitted 24 June, 2021; originally announced June 2021.

  36. arXiv:2106.00719  [pdf, other]

    cs.LG stat.ML

    Stochastic Collapsed Variational Inference for Structured Gaussian Process Regression Network

    Authors: Rui Meng, Herbie Lee, Kristofer Bouchard

    Abstract: This paper presents an efficient variational inference framework for deriving a family of structured Gaussian process regression network (SGPRN) models. The key idea is to incorporate auxiliary inducing variables in latent functions and jointly treat both the distributions of the inducing variables and hyper-parameters as variational parameters. Then we propose structured variable distributions a…

    Submitted 17 November, 2021; v1 submitted 1 June, 2021; originally announced June 2021.

  37. arXiv:2106.00130  [pdf, other]

    cs.CL

    Bringing Structure into Summaries: a Faceted Summarization Dataset for Long Scientific Documents

    Authors: Rui Meng, Khushboo Thaker, Lei Zhang, Yue Dong, Xingdi Yuan, Tong Wang, Daqing He

    Abstract: Faceted summarization provides briefings of a document from different perspectives. Readers can quickly comprehend the main points of a long document with the help of a structured outline. However, little research has been conducted on this subject, partially due to the lack of large-scale faceted summarization datasets. In this study, we present FacetSum, a faceted summarization benchmark built o…

    Submitted 22 June, 2021; v1 submitted 31 May, 2021; originally announced June 2021.

    Comments: Accepted at ACL 2021

  38. arXiv:2104.08729  [pdf, other]

    cs.CL cs.LG

    Unsupervised Deep Keyphrase Generation

    Authors: Xianjie Shen, Yinghan Wang, Rui Meng, Jingbo Shang

    Abstract: Keyphrase generation aims to summarize long documents with a collection of salient phrases. Deep neural models have demonstrated a remarkable success in this task, capable of predicting keyphrases that are even absent from a document. However, such abstractiveness is acquired at the expense of a substantial amount of annotated data. In this paper, we present a novel method for keyphrase generation…

    Submitted 18 April, 2021; originally announced April 2021.

  39. arXiv:2012.06711  [pdf, other]

    cs.CV

    Teacher-Student Asynchronous Learning with Multi-Source Consistency for Facial Landmark Detection

    Authors: Rongye Meng, Sanping Zhou, Xingyu Wan, Mengliu Li, Jinjun Wang

    Abstract: Due to the high annotation cost of large-scale facial landmark detection tasks in videos, a semi-supervised paradigm that uses self-training for mining high-quality pseudo-labels to participate in training has been proposed by researchers. However, self-training based methods often train with a gradually increasing number of samples, whose performances vary a lot depending on the number of pseudo-…

    Submitted 11 December, 2020; originally announced December 2020.

    Comments: second version

  40. arXiv:2010.00656  [pdf, other]

    cs.CL cs.HC

    Predicting User Engagement Status for Online Evaluation of Intelligent Assistants

    Authors: Rui Meng, Zhen Yue, Alyssa Glass

    Abstract: Evaluation of intelligent assistants in large-scale and online settings remains an open challenge. User behavior-based online evaluation metrics have demonstrated great effectiveness for monitoring large-scale web search and recommender systems. Therefore, we consider predicting user engagement status as the very first and critical step to online evaluation for intelligent assistants. In this work…

    Submitted 31 May, 2021; v1 submitted 1 October, 2020; originally announced October 2020.

    Comments: Paper has been accepted by ECIR 2021 (43rd edition of the annual European Conference on Information Retrieval)

  41. arXiv:2009.10229  [pdf, other]

    cs.CL

    An Empirical Study on Neural Keyphrase Generation

    Authors: Rui Meng, Xingdi Yuan, Tong Wang, Sanqiang Zhao, Adam Trischler, Daqing He

    Abstract: Recent years have seen a flourishing of neural keyphrase generation (KPG) works, including the release of several large-scale datasets and a host of new models to tackle them. Model performance on KPG tasks has increased significantly with evolving deep learning research. However, there lacks a comprehensive comparison among different model designs, and a thorough investigation on related factors…

    Submitted 15 April, 2021; v1 submitted 21 September, 2020; originally announced September 2020.

    Comments: NAACL 2021, added more results

  42. arXiv:2008.04882  [pdf, other]

    cs.LG stat.ML

    Spatiotemporal Attention for Multivariate Time Series Prediction and Interpretation

    Authors: Tryambak Gangopadhyay, Sin Yong Tan, Zhanhong Jiang, Rui Meng, Soumik Sarkar

    Abstract: Multivariate time series modeling and prediction problems are abundant in many machine learning application domains. Accurate interpretation of such prediction outcomes from a machine learning model that explicitly captures temporal correlations can significantly benefit the domain experts. In this context, temporal attention has been successfully applied to isolate the important time steps for th…

    Submitted 26 October, 2020; v1 submitted 11 August, 2020; originally announced August 2020.

  43. arXiv:2002.12580  [pdf, other]

    cs.CV

    Neural Inheritance Relation Guided One-Shot Layer Assignment Search

    Authors: Rang Meng, Weijie Chen, Di Xie, Yuan Zhang, Shiliang Pu

    Abstract: Layer assignment is seldom picked out as an independent research topic in neural architecture search. In this paper, for the first time, we systematically investigate the impact of different layer assignments to the network performance by building an architecture dataset of layer assignment on CIFAR-100. Through analyzing this dataset, we discover a neural inheritance relation among the networks w…

    Submitted 28 February, 2020; originally announced February 2020.

    Comments: AAAI 2020

  44. arXiv:1910.05843  [pdf, other]

    stat.ML cs.LG

    Regularized Sparse Gaussian Processes

    Authors: Rui Meng, Herbert Lee, Soper Braden, Priyadip Ray

    Abstract: Gaussian processes are a flexible Bayesian nonparametric modelling approach that has been widely applied but poses computational challenges. To address the poor scaling of exact inference methods, approximation methods based on sparse Gaussian processes (SGP) are attractive. An issue faced by SGP, especially in latent variable models, is the inefficient learning of the inducing inputs, which leads…

    Submitted 30 May, 2021; v1 submitted 13 October, 2019; originally announced October 2019.

  45. arXiv:1909.03590  [pdf, ps, other]

    cs.CL

    Does Order Matter? An Empirical Study on Generating Multiple Keyphrases as a Sequence

    Authors: Rui Meng, Xingdi Yuan, Tong Wang, Peter Brusilovsky, Adam Trischler, Daqing He

    Abstract: Recently, concatenating multiple keyphrases as a target sequence has been proposed as a new learning paradigm for keyphrase generation. Existing studies concatenate target keyphrases in different orders but no study has examined the effects of ordering on models' behavior. In this paper, we propose several orderings for concatenation and inspect the important factors for training a successful keyp…

    Submitted 28 February, 2022; v1 submitted 8 September, 2019; originally announced September 2019.

  46. arXiv:1909.00045  [pdf]

    cs.HC cs.CY

    An active smartphone authentication method based on daily cyclical activity

    Authors: Chunmin Mi, Runjie Xu, Ching-Torng Lin, Run Yu Meng

    Abstract: Smartphones have become an important tool for people's daily lives, which brings higher security requirements in high-risk application areas, for example, mobile payment. Although the combination of physical password, fingerprint and facial recognition has improved the security to a certain extent, there still exists a high risk of being decrypted. This paper attempts an algorithm which is more s…

    Submitted 8 March, 2020; v1 submitted 30 August, 2019; originally announced September 2019.

  47. Continuous Regular Functions

    Authors: Alexi Block Gorman, Philipp Hieronymi, Elliot Kaplan, Ruoyu Meng, Erik Walsberg, Zihe Wang, Ziqin Xiong, Hongru Yang

    Abstract: Following Chaudhuri, Sankaranarayanan, and Vardi, we say that a function $f:[0,1] \to [0,1]$ is $r$-regular if there is a Büchi automaton that accepts precisely the set of base $r \in \mathbb{N}$ representations of elements of the graph of $f$. We show that a continuous $r$-regular function $f$ is locally affine away from a nowhere dense, Lebesgue null, subset of $[0,1]$. As a corollary we establi…

    Submitted 13 February, 2020; v1 submitted 10 January, 2019; originally announced January 2019.

    Journal ref: Logical Methods in Computer Science, Volume 16, Issue 1 (February 14, 2020) lmcs:5301

  48. arXiv:1810.11193  [pdf, other]

    cs.CL cs.AI

    Integrating Transformer and Paraphrase Rules for Sentence Simplification

    Authors: Sanqiang Zhao, Rui Meng, Daqing He, Saptono Andi, Parmanto Bambang

    Abstract: Sentence simplification aims to reduce the complexity of a sentence while retaining its original meaning. Current models for sentence simplification adopted ideas from machine translation studies and implicitly learned simplification mapping rules from normal-simple sentence pairs. In this paper, we explore a novel model based on a multi-layer and multi-head attention architecture and we prop…

    Submitted 26 October, 2018; originally announced October 2018.

  49. arXiv:1810.05241  [pdf, other]

    cs.CL cs.LG

    One Size Does Not Fit All: Generating and Evaluating Variable Number of Keyphrases

    Authors: Xingdi Yuan, Tong Wang, Rui Meng, Khushboo Thaker, Peter Brusilovsky, Daqing He, Adam Trischler

    Abstract: Different texts shall by nature correspond to different number of keyphrases. This desideratum is largely missing from existing neural keyphrase generation models. In this study, we address this problem from both modeling and evaluation perspectives. We first propose a recurrent generative model that generates multiple keyphrases as delimiter-separated sequences. Generation diversity is further…

    Submitted 12 May, 2020; v1 submitted 11 October, 2018; originally announced October 2018.

    Comments: ACL 2020

  50. arXiv:1810.01641  [pdf, other]

    cs.CV

    PIRM Challenge on Perceptual Image Enhancement on Smartphones: Report

    Authors: Andrey Ignatov, Radu Timofte, Thang Van Vu, Tung Minh Luu, Trung X Pham, Cao Van Nguyen, Yongwoo Kim, Jae-Seok Choi, Munchurl Kim, Jie Huang, Jiewen Ran, Chen Xing, Xingguang Zhou, Pengfei Zhu, Mingrui Geng, Yawei Li, Eirikur Agustsson, Shuhang Gu, Luc Van Gool, Etienne de Stoutz, Nikolay Kobyshev, Kehui Nie, Yan Zhao, Gen Li, Tong Tong, et al. (23 additional authors not shown)

    Abstract: This paper reviews the first challenge on efficient perceptual image enhancement with the focus on deploying deep learning models on smartphones. The challenge consisted of two tracks. In the first one, participants were solving the classical image super-resolution problem with a bicubic downscaling factor of 4. The second track was aimed at real-world photo enhancement, and the goal was to map lo…

    Submitted 3 October, 2018; originally announced October 2018.