Showing 1–46 of 46 results for author: Qi, P

  1. arXiv:2405.15362  [pdf, other]

    cs.LG cs.CL cs.DC

    Pipeline Parallelism with Controllable Memory

    Authors: Penghui Qi, Xinyi Wan, Nyamdavaa Amar, Min Lin

    Abstract: Pipeline parallelism has been widely explored, but most existing schedules lack a systematic methodology. In this paper, we propose a framework to decompose pipeline schedules as repeating a building block and we show that the lifespan of the building block decides the peak activation memory of the pipeline schedule. Guided by the observations, we find that almost all existing pipeline schedules,…

    Submitted 10 June, 2024; v1 submitted 24 May, 2024; originally announced May 2024.

  2. arXiv:2403.03170  [pdf, other]

    cs.MM cs.AI cs.CL cs.CV cs.CY

    SNIFFER: Multimodal Large Language Model for Explainable Out-of-Context Misinformation Detection

    Authors: Peng Qi, Zehong Yan, Wynne Hsu, Mong Li Lee

    Abstract: Misinformation is a prevalent societal issue due to its potential high risks. Out-of-context (OOC) misinformation, where authentic images are repurposed with false text, is one of the easiest and most effective ways to mislead audiences. Current methods focus on assessing image-text consistency but lack convincing explanations for their judgments, which is essential for debunking misinformation. W…

    Submitted 5 March, 2024; originally announced March 2024.

    Comments: To appear in CVPR 2024

  3. arXiv:2403.01203  [pdf, other]

    cs.LG cs.CL cs.DB

    Pseudo-Label Calibration Semi-supervised Multi-Modal Entity Alignment

    Authors: Luyao Wang, Pengnian Qi, Xigang Bao, Chunlai Zhou, Biao Qin

    Abstract: Multi-modal entity alignment (MMEA) aims to identify equivalent entities between two multi-modal knowledge graphs for integration. Unfortunately, prior arts have attempted to improve the interaction and fusion of multi-modal information, which have overlooked the influence of modal-specific noise and the usage of labeled and unlabeled data in semi-supervised settings. In this work, we introduce a…

    Submitted 2 March, 2024; originally announced March 2024.

    Comments: accepted by AAAI2024

  4. arXiv:2401.10241  [pdf, other]

    cs.DC cs.AI cs.LG

    Zero Bubble Pipeline Parallelism

    Authors: Penghui Qi, Xinyi Wan, Guangxing Huang, Min Lin

    Abstract: Pipeline parallelism is one of the key components for large-scale distributed training, yet its efficiency suffers from pipeline bubbles which were deemed inevitable. In this work, we introduce a scheduling strategy that, to our knowledge, is the first to successfully achieve zero pipeline bubbles under synchronous training semantics. The key idea behind this improvement is to split the backward c…

    Submitted 30 November, 2023; originally announced January 2024.

  5. Bad Actor, Good Advisor: Exploring the Role of Large Language Models in Fake News Detection

    Authors: Beizhe Hu, Qiang Sheng, Juan Cao, Yuhui Shi, Yang Li, Danding Wang, Peng Qi

    Abstract: Detecting fake news requires both a delicate sense of diverse clues and a profound understanding of the real-world background, which remains challenging for detectors based on small language models (SLMs) due to their knowledge and capability limitations. Recent advances in large language models (LLMs) have shown remarkable performance in various tasks, but whether and how LLMs could help with fak…

    Submitted 22 January, 2024; v1 submitted 21 September, 2023; originally announced September 2023.

    Comments: 16 pages, 5 figures, and 9 tables. To appear at AAAI 2024

    Journal ref: AAAI 2024

  6. arXiv:2306.05241  [pdf, other]

    cs.MM

    Two Heads Are Better Than One: Improving Fake News Video Detection by Correlating with Neighbors

    Authors: Peng Qi, Yuyang Zhao, Yufeng Shen, Wei Ji, Juan Cao, Tat-Seng Chua

    Abstract: The prevalence of short video platforms has spawned a lot of fake news videos, which have stronger propagation ability than textual fake news. Thus, automatically detecting fake news videos has been an important countermeasure in practice. Previous works commonly verify each news video individually with multimodal information. Nevertheless, news videos from different perspectives regarding the sam…

    Submitted 8 June, 2023; originally announced June 2023.

    Comments: To appear in ACL 2023 Findings

  7. arXiv:2302.03242  [pdf, other]

    cs.CV cs.MM cs.SI

    Combating Online Misinformation Videos: Characterization, Detection, and Future Directions

    Authors: Yuyan Bu, Qiang Sheng, Juan Cao, Peng Qi, Danding Wang, Jintao Li

    Abstract: With information consumption via online video streaming becoming increasingly popular, misinformation video poses a new threat to the health of the online information ecosystem. Though previous studies have made much progress in detecting misinformation in text and image formats, video-based misinformation brings new and unique challenges to automatic detection systems: 1) high information heterog…

    Submitted 6 August, 2023; v1 submitted 6 February, 2023; originally announced February 2023.

    Comments: Accepted at ACM Multimedia 2023 (MM 2023). 11 pages, 4 figures, and 89 references

  8. arXiv:2212.11352  [pdf]

    physics.med-ph cs.LG

    Sensitivity analysis of biological washout and depth selection for a machine learning based dose verification framework in proton therapy

    Authors: Shixiong Yu, Yuxiang Liu, Zongsheng Hu, Haozhao Zhang, Pengyu Qi, Hao Peng

    Abstract: Dose verification based on proton-induced positron emitters is a promising quality assurance tool and may leverage the strength of artificial intelligence. To move a step closer towards practical application, the sensitivity analysis of two factors needs to be performed: biological washout and depth selection. A bi-directional recurrent neural network (RNN) model was developed. The trai…

    Submitted 21 December, 2022; originally announced December 2022.

  9. arXiv:2212.09912  [pdf, other]

    cs.CL

    Tokenization Consistency Matters for Generative Models on Extractive NLP Tasks

    Authors: Kaiser Sun, Peng Qi, Yuhao Zhang, Lan Liu, William Yang Wang, Zhiheng Huang

    Abstract: Generative models have been widely applied to solve extractive tasks, where parts of the input are extracted to form the desired output, and achieved significant success. For example, in extractive question answering (QA), generative models have constantly yielded state-of-the-art results. In this work, we identify the issue of tokenization inconsistency that is commonly neglected in training these…

    Submitted 24 October, 2023; v1 submitted 19 December, 2022; originally announced December 2022.

    Comments: Findings of EMNLP2023

  10. arXiv:2211.10973  [pdf, other]

    cs.MM

    FakeSV: A Multimodal Benchmark with Rich Social Context for Fake News Detection on Short Video Platforms

    Authors: Peng Qi, Yuyan Bu, Juan Cao, Wei Ji, Ruihao Shui, Junbin Xiao, Danding Wang, Tat-Seng Chua

    Abstract: Short video platforms have become an important channel for news sharing, but also a new breeding ground for fake news. To mitigate this problem, research of fake news video detection has recently received a lot of attention. Existing works face two roadblocks: the scarcity of comprehensive and large-scale datasets and insufficient utilization of multimodal information. Therefore, in this paper, we…

    Submitted 2 December, 2022; v1 submitted 20 November, 2022; originally announced November 2022.

    Comments: To appear in AAAI 2023 AISI track. This version contains appendix with additional details

  11. arXiv:2210.07126  [pdf, other]

    cs.CL cs.AI cs.HC

    Challenges in Explanation Quality Evaluation

    Authors: Hendrik Schuff, Heike Adel, Peng Qi, Ngoc Thang Vu

    Abstract: While much research focused on producing explanations, it is still unclear how the produced explanations' quality can be evaluated in a meaningful way. Today's predominant approach is to quantify explanations using proxy scores which compare explanations to (human-annotated) gold explanations. This approach assumes that explanations which reach higher proxy scores will also provide a greater benef…

    Submitted 9 March, 2023; v1 submitted 13 October, 2022; originally announced October 2022.

    Comments: 41 pages, 11 figures

  12. arXiv:2210.06633  [pdf, other]

    cs.IR cs.CL

    Language Agnostic Multilingual Information Retrieval with Contrastive Learning

    Authors: Xiyang Hu, Xinchi Chen, Peng Qi, Deguang Kong, Kunlun Liu, William Yang Wang, Zhiheng Huang

    Abstract: Multilingual information retrieval (IR) is challenging since annotated training data is costly to obtain in many languages. We present an effective method to train multilingual IR systems when only English IR training data and some parallel corpora between English and other languages are available. We leverage parallel and non-parallel corpora to improve the pretrained multilingual language models…

    Submitted 25 May, 2023; v1 submitted 12 October, 2022; originally announced October 2022.

    Comments: ACL Findings 2023

  13. arXiv:2208.02169  [pdf, other]

    cs.LG cs.CL

    SpanDrop: Simple and Effective Counterfactual Learning for Long Sequences

    Authors: Peng Qi, Guangtao Wang, Jing Huang

    Abstract: Distilling supervision signal from a long sequence to make predictions is a challenging task in machine learning, especially when not all elements in the input sequence contribute equally to the desired output. In this paper, we propose SpanDrop, a simple and effective data augmentation technique that helps models identify the true supervision signal in a long sequence with very few examples. By d…

    Submitted 3 August, 2022; originally announced August 2022.

    Comments: Peng Qi and Guangtao Wang contributed equally

  14. arXiv:2207.12021  [pdf, other]

    cs.CL

    Neural Generation Meets Real People: Building a Social, Informative Open-Domain Dialogue Agent

    Authors: Ethan A. Chi, Ashwin Paranjape, Abigail See, Caleb Chiam, Trenton Chang, Kathleen Kenealy, Swee Kiat Lim, Amelia Hardy, Chetanya Rastogi, Haojun Li, Alexander Iyabor, Yutong He, Hari Sowrirajan, Peng Qi, Kaushik Ram Sadagopan, Nguyet Minh Phu, Dilara Soylu, Jillian Tang, Avanika Narayan, Giovanni Campagna, Christopher D. Manning

    Abstract: We present Chirpy Cardinal, an open-domain social chatbot. Aiming to be both informative and conversational, our bot chats with users in an authentic, emotionally intelligent way. By integrating controlled neural generation with scaffolded, hand-written dialogue, we let both the user and bot take turns driving the conversation, producing an engaging and socially fluent experience. Deployed in the…

    Submitted 16 January, 2023; v1 submitted 25 July, 2022; originally announced July 2022.

    Comments: SIGDIAL '22

  15. arXiv:2203.09121  [pdf, other]

    cs.CV

    DRAG: Dynamic Region-Aware GCN for Privacy-Leaking Image Detection

    Authors: Guang Yang, Juan Cao, Qiang Sheng, Peng Qi, Xirong Li, Jintao Li

    Abstract: The daily practice of sharing images on social media raises a severe issue about privacy leakage. To address the issue, privacy-leaking image detection is studied recently, with the goal to automatically identify images that may leak privacy. Recent advance on this task benefits from focusing on crucial objects via pretrained object detectors and modeling their correlation. However, these methods…

    Submitted 17 March, 2022; originally announced March 2022.

    Comments: Accepted to AAAI-22, 9 pages

  16. arXiv:2203.00255  [pdf, other]

    cs.CL cs.LG

    Improving Time Sensitivity for Question Answering over Temporal Knowledge Graphs

    Authors: Chao Shang, Guangtao Wang, Peng Qi, Jing Huang

    Abstract: Question answering over temporal knowledge graphs (KGs) efficiently uses facts contained in a temporal KG, which records entity relations and when they occur in time, to answer natural language questions (e.g., "Who was the president of the US before Obama?"). These questions often involve three time-related challenges that previous work fails to adequately address: 1) questions often do not specif…

    Submitted 1 March, 2022; originally announced March 2022.

    Comments: 10 pages, 2 figures

    Journal ref: ACL 2022

  17. arXiv:2201.03014  [pdf, other]

    cs.CV cs.AI cs.LG

    Glance and Focus Networks for Dynamic Visual Recognition

    Authors: Gao Huang, Yulin Wang, Kangchen Lv, Haojun Jiang, Wenhui Huang, Pengfei Qi, Shiji Song

    Abstract: Spatial redundancy widely exists in visual recognition tasks, i.e., discriminative features in an image or video frame usually correspond to only a subset of pixels, while the remaining regions are irrelevant to the task at hand. Therefore, static models which process all the pixels with an equal amount of computation result in considerable redundancy in terms of time and space consumption. In thi…

    Submitted 4 August, 2022; v1 submitted 9 January, 2022; originally announced January 2022.

    Comments: Accepted by IEEE Transactions on Pattern Analysis and Machine Intelligence (T-PAMI). Journal version of arXiv:2010.05300 (NeurIPS 2020). The first two authors contributed equally

  18. arXiv:2110.10030  [pdf, other]

    cs.LG

    Accelerating Framework of Transformer by Hardware Design and Model Compression Co-Optimization

    Authors: Panjie Qi, Edwin Hsing-Mean Sha, Qingfeng Zhuge, Hongwu Peng, Shaoyi Huang, Zhenglun Kong, Yuhong Song, Bingbing Li

    Abstract: State-of-the-art Transformer-based models, with gigantic parameters, are difficult to be accommodated on resource constrained embedded devices. Moreover, with the development of technology, more and more embedded devices are available to run a Transformer model. For a Transformer model with different constraints (tight or loose), it can be deployed onto devices with different computing power. Howe…

    Submitted 19 October, 2021; originally announced October 2021.

    ACM Class: C.3; I.2

  19. arXiv:2110.01167  [pdf, other]

    cs.AI cs.LG

    Trustworthy AI: From Principles to Practices

    Authors: Bo Li, Peng Qi, Bo Liu, Shuai Di, Jingen Liu, Jiquan Pei, Jinfeng Yi, Bowen Zhou

    Abstract: The rapid development of Artificial Intelligence (AI) technology has enabled the deployment of various systems based on it. However, many current AI systems are found vulnerable to imperceptible attacks, biased against underrepresented groups, lacking in user privacy protection. These shortcomings degrade user experience and erode people's trust in all AI systems. In this review, we provide AI pra…

    Submitted 26 May, 2022; v1 submitted 3 October, 2021; originally announced October 2021.

  20. Improving Fake News Detection by Using an Entity-enhanced Framework to Fuse Diverse Multimodal Clues

    Authors: Peng Qi, Juan Cao, Xirong Li, Huan Liu, Qiang Sheng, Xiaoyue Mi, Qin He, Yongbiao Lv, Chenyang Guo, Yingchao Yu

    Abstract: Recently, fake news with text and images have achieved more effective diffusion than text-only fake news, raising a severe issue of multimodal fake news detection. Current studies on this issue have made significant contributions to developing multimodal models, but they are defective in modeling the multimodal content sufficiently. Most of them only preliminarily model the basic semantics of the…

    Submitted 23 August, 2021; originally announced August 2021.

    Comments: To appear in MM 2021 industrial track (long paper)

  21. arXiv:2108.02317  [pdf]

    eess.IV cs.CV physics.optics

    Efficient Fourier single-pixel imaging with Gaussian random sampling

    Authors: Ziheng Qiu, Xinyi Guo, Tianao Lu, Pan Qi, Zibang Zhang, Jingang Zhong

    Abstract: Fourier single-pixel imaging (FSI) is a branch of single-pixel imaging techniques. It uses Fourier basis patterns as structured patterns for spatial information acquisition in the Fourier domain. However, the spatial resolution of the image reconstructed by FSI mainly depends on the number of Fourier coefficients sampled. The reconstruction of a high-resolution image typically requires a number of…

    Submitted 28 June, 2021; originally announced August 2021.

  22. arXiv:2106.10401  [pdf]

    eess.SP cs.LG

    Parallel frequency function-deep neural network for efficient complex broadband signal approximation

    Authors: Zhi Zeng, Pengpeng Shi, Fulei Ma, Peihan Qi

    Abstract: A neural network is essentially a high-dimensional complex mapping model by adjusting network weights for feature fitting. However, the spectral bias in network training leads to unbearable training epochs for fitting the high-frequency components in broadband signals. To improve the fitting efficiency of high-frequency components, the PhaseDNN was proposed recently by combining complex frequency…

    Submitted 18 June, 2021; originally announced June 2021.

  23. arXiv:2105.06457  [pdf, ps, other]

    cs.CL cs.CY

    Conversational AI Systems for Social Good: Opportunities and Challenges

    Authors: Peng Qi, Jing Huang, Youzheng Wu, Xiaodong He, Bowen Zhou

    Abstract: Conversational artificial intelligence (ConvAI) systems have attracted much academic and commercial attention recently, making significant progress on both fronts. However, little existing work discusses how these systems can be developed and deployed for social good in real-world applications, with comprehensive case studies and analyses of pros and cons. In this paper, we briefly review the prog…

    Submitted 7 January, 2022; v1 submitted 13 May, 2021; originally announced May 2021.

  24. arXiv:2103.11794  [pdf, other]

    cs.CL cs.LG

    Graph Ensemble Learning over Multiple Dependency Trees for Aspect-level Sentiment Classification

    Authors: Xiaochen Hou, Peng Qi, Guangtao Wang, Rex Ying, Jing Huang, Xiaodong He, Bowen Zhou

    Abstract: Recent work on aspect-level sentiment classification has demonstrated the efficacy of incorporating syntactic structures such as dependency trees with graph neural networks (GNN), but these approaches are usually vulnerable to parsing errors. To better leverage syntactic information in the face of unavoidable errors, we propose a simple yet effective graph ensemble technique, GraphMerge, to make us…

    Submitted 12 March, 2021; originally announced March 2021.

    Comments: Accepted by NAACL 2021

  25. arXiv:2102.06336  [pdf, ps, other]

    cs.LG

    Dancing along Battery: Enabling Transformer with Run-time Reconfigurability on Mobile Devices

    Authors: Yuhong Song, Weiwen Jiang, Bingbing Li, Panjie Qi, Qingfeng Zhuge, Edwin Hsing-Mean Sha, Sakyasingha Dasgupta, Yiyu Shi, Caiwen Ding

    Abstract: A pruning-based AutoML framework for run-time reconfigurability, namely RT3, is proposed in this work. This enables Transformer-based large Natural Language Processing (NLP) models to be efficiently executed on resource-constrained mobile devices and reconfigured (i.e., switching models for dynamic hardware conditions) at run-time. Such reconfigurability is the key to save energy for battery-power…

    Submitted 11 February, 2021; originally announced February 2021.

    Comments: 7 pages, 5 figures

  26. arXiv:2012.13169  [pdf, other]

    cs.LG

    SCC: an efficient deep reinforcement learning agent mastering the game of StarCraft II

    Authors: Xiangjun Wang, Junxiao Song, Penghui Qi, Peng Peng, Zhenkun Tang, Wei Zhang, Weimin Li, Xiongjun Pi, Jujie He, Chao Gao, Haitao Long, Quan Yuan

    Abstract: AlphaStar, the AI that reaches GrandMaster level in StarCraft II, is a remarkable milestone demonstrating what deep reinforcement learning can achieve in complex Real-Time Strategy (RTS) games. However, the complexities of the game, algorithms and systems, and especially the tremendous amount of computation needed are big obstacles for the community to conduct further research in this direction. W…

    Submitted 9 June, 2021; v1 submitted 24 December, 2020; originally announced December 2020.

    Comments: ICML 2021 camera ready

  27. arXiv:2010.12527  [pdf, other]

    cs.CL

    Answering Open-Domain Questions of Varying Reasoning Steps from Text

    Authors: Peng Qi, Haejun Lee, Oghenetegiri "TG" Sido, Christopher D. Manning

    Abstract: We develop a unified system to answer directly from text open-domain questions that may require a varying number of retrieval steps. We employ a single multi-task transformer model to perform all the necessary subtasks -- retrieving supporting facts, reranking them, and predicting the answer from all retrieved documents -- in an iterative fashion. We avoid crucial assumptions of previous work that…

    Submitted 29 October, 2021; v1 submitted 23 October, 2020; originally announced October 2020.

    Comments: EMNLP 2021. Peng Qi, Haejun Lee, and TG Sido contributed equally

  28. arXiv:2008.12348  [pdf, other]

    cs.CL cs.AI

    Neural Generation Meets Real People: Towards Emotionally Engaging Mixed-Initiative Conversations

    Authors: Ashwin Paranjape, Abigail See, Kathleen Kenealy, Haojun Li, Amelia Hardy, Peng Qi, Kaushik Ram Sadagopan, Nguyet Minh Phu, Dilara Soylu, Christopher D. Manning

    Abstract: We present Chirpy Cardinal, an open-domain dialogue agent, as a research platform for the 2019 Alexa Prize competition. Building an open-domain socialbot that talks to real people is challenging - such a system must meet multiple user expectations such as broad world knowledge, conversational style, and emotional connection. Our socialbot engages users on their terms - prioritizing their interests…

    Submitted 5 September, 2020; v1 submitted 27 August, 2020; originally announced August 2020.

    Comments: Published in 3rd Proceedings of Alexa Prize (Alexa Prize 2019)

  29. arXiv:2008.09084  [pdf, other]

    cs.CL

    Do Syntax Trees Help Pre-trained Transformers Extract Information?

    Authors: Devendra Singh Sachan, Yuhao Zhang, Peng Qi, William Hamilton

    Abstract: Much recent work suggests that incorporating syntax information from dependency trees can improve task-specific transformer models. However, the effect of incorporating dependency tree information into pre-trained transformer models (e.g., BERT) remains unclear, especially given recent studies highlighting how these models implicitly encode syntax. In this work, we systematically study the utility…

    Submitted 26 January, 2021; v1 submitted 20 August, 2020; originally announced August 2020.

    Comments: EACL 2021. Code available at: https://github.com/DevSinghSachan/syntax-augmented-bert

  30. arXiv:2007.14640  [pdf, other]

    cs.CL

    Biomedical and Clinical English Model Packages in the Stanza Python NLP Library

    Authors: Yuhao Zhang, Yuhui Zhang, Peng Qi, Christopher D. Manning, Curtis P. Langlotz

    Abstract: We introduce biomedical and clinical English model packages for the Stanza Python NLP library. These packages offer accurate syntactic analysis and named entity recognition capabilities for biomedical and clinical text, by combining Stanza's fully neural architecture with a wide variety of open datasets as well as large-scale unsupervised biomedical and clinical text data. We show via extensive ex…

    Submitted 29 July, 2020; originally announced July 2020.

    Comments: Website: https://stanfordnlp.github.io/stanza/; demo page: http://stanza.run/bio

  31. arXiv:2006.05639  [pdf, other]

    cs.IR stat.ML

    Search-based User Interest Modeling with Lifelong Sequential Behavior Data for Click-Through Rate Prediction

    Authors: Pi Qi, Xiaoqiang Zhu, Guorui Zhou, Yujing Zhang, Zhe Wang, Lejian Ren, Ying Fan, Kun Gai

    Abstract: Rich user behavior data has been proven to be of great value for click-through rate prediction tasks, especially in industrial applications such as recommender systems and online advertising. Both industry and academy have paid much attention to this topic and propose different approaches to modeling with long sequential user behavior data. Among them, memory network based model MIMN proposed by A…

    Submitted 28 June, 2020; v1 submitted 9 June, 2020; originally announced June 2020.

    MSC Class: Machine Learning (stat.ML); Information Retrieval (cs.IR); Machine Learning (cs.LG) ACM Class: I.2.6

  32. arXiv:2004.14530  [pdf, other]

    cs.CL

    Stay Hungry, Stay Focused: Generating Informative and Specific Questions in Information-Seeking Conversations

    Authors: Peng Qi, Yuhao Zhang, Christopher D. Manning

    Abstract: We investigate the problem of generating informative questions in information-asymmetric conversations. Unlike previous work on question generation which largely assumes knowledge of what the answer might be, we are interested in the scenario where the questioner is not given the context from which answers are drawn, but must reason pragmatically about how to acquire new information, given the sha…

    Submitted 20 October, 2020; v1 submitted 29 April, 2020; originally announced April 2020.

    Comments: Findings of ACL: EMNLP 2020. Code available at: https://github.com/qipeng/stay-hungry-stay-focused

  33. arXiv:2003.07082  [pdf, other]

    cs.CL

    Stanza: A Python Natural Language Processing Toolkit for Many Human Languages

    Authors: Peng Qi, Yuhao Zhang, Yuhui Zhang, Jason Bolton, Christopher D. Manning

    Abstract: We introduce Stanza, an open-source Python natural language processing toolkit supporting 66 human languages. Compared to existing widely used toolkits, Stanza features a language-agnostic fully neural pipeline for text analysis, including tokenization, multi-word token expansion, lemmatization, part-of-speech and morphological feature tagging, dependency parsing, and named entity recognition. We…

    Submitted 23 April, 2020; v1 submitted 16 March, 2020; originally announced March 2020.

    Comments: ACL2020 System Demonstration. First two authors contribute equally. Website: https://stanfordnlp.github.io/stanza

  34. Exploring the Role of Visual Content in Fake News Detection

    Authors: Juan Cao, Peng Qi, Qiang Sheng, Tianyun Yang, Junbo Guo, Jintao Li

    Abstract: The increasing popularity of social media promotes the proliferation of fake news, which has caused significant negative societal effects. Therefore, fake news detection on social media has recently become an emerging research area of great concern. With the development of multimedia technology, fake news attempts to utilize multimedia content with images or videos to attract and mislead consumers…

    Submitted 10 March, 2020; originally announced March 2020.

    Comments: This is a preprint of a chapter published in Disinformation, Misinformation, and Fake News in Social Media: Emerging Research Challenges and Opportunities, edited by Kai, S., Suhang, W., Dongwon, L., Huan, L, 2020, Springer reproduced with permission of Springer Nature Switzerland AG. The final authenticated version is available online at: https://www.springer.com/gp/book/9783030426989. arXiv admin note: text overlap with arXiv:2001.00623, arXiv:1808.06686, arXiv:1903.00788 by other authors

    Journal ref: Disinformation, Misinformation, and Fake News in Social Media. 2020

  35. arXiv:1910.07000  [pdf, other]

    cs.CL

    Answering Complex Open-domain Questions Through Iterative Query Generation

    Authors: Peng Qi, Xiaowen Lin, Leo Mehr, Zijian Wang, Christopher D. Manning

    Abstract: It is challenging for current one-step retrieve-and-read question answering (QA) systems to answer questions like "Which novel by the author of 'Armada' will be adapted as a feature film by Steven Spielberg?" because the question seldom contains retrievable clues about the missing entity (here, the author). Answering such a question requires multi-hop reasoning where one must gather information ab…

    Submitted 15 October, 2019; originally announced October 2019.

    Comments: EMNLP-IJCNLP 2019. Xiaowen Lin, Leo Mehr, and Zijian Wang contributed equally. GitHub: https://github.com/qipeng/golden-retriever

  36. arXiv:1909.06020  [pdf]

    eess.SP cs.LG

    Spectrum Sensing Based on Deep Learning Classification for Cognitive Radios

    Authors: Shilian Zheng, Shichuan Chen, Peihan Qi, Huaji Zhou, Xiaoniu Yang

    Abstract: Spectrum sensing is a key technology for cognitive radios. We present spectrum sensing as a classification problem and propose a sensing method based on deep learning classification. We normalize the received signal power to overcome the effects of noise power uncertainty. We train the model with as many types of signals as possible as well as noise data to enable the trained network model to adap…

    Submitted 12 September, 2019; originally announced September 2019.

    Comments: Submitted to China Communications

  37. arXiv:1908.10818  [pdf, ps, other]

    cs.MM cs.SI

    False News Detection on Social Media

    Authors: Juan Cao, Qiang Sheng, Peng Qi, Lei Zhong, Yanyan Wang, Xueyao Zhang

    Abstract: Social media has become a major information platform where people consume and share news. However, it has also enabled the wide dissemination of false news, i.e., news posts published on social media that are verifiably false, causing significant negative effects on society. In order to help prevent further propagation of false news on social media, we set up this competition to motivate the devel…

    Submitted 28 August, 2019; originally announced August 2019.

    Comments: 4 pages

  38. arXiv:1908.04472  [pdf, other]

    cs.MM cs.IR cs.SI

    Exploiting Multi-domain Visual Information for Fake News Detection

    Authors: Peng Qi, Juan Cao, Tianyun Yang, Junbo Guo, Jintao Li

    Abstract: The increasing popularity of social media promotes the proliferation of fake news. With the development of multimedia technology, fake news attempts to utilize multimedia contents with images or videos to attract and mislead readers for rapid dissemination, which makes visual contents an important part of fake news. Fake-news images, images attached in fake news posts, include not only fake images…

    Submitted 12 August, 2019; originally announced August 2019.

    Comments: 10 pages, 9 figures, conference

  39. arXiv:1907.07352  [pdf, other]

    cs.CR cs.LG

    Dynamic Malware Analysis with Feature Engineering and Feature Learning

    Authors: Zhaoqi Zhang, Panpan Qi, Wei Wang

    Abstract: Dynamic malware analysis executes the program in an isolated environment and monitors its run-time behaviour (e.g. system API calls) for malware detection. This technique has been proven to be effective against various code obfuscation techniques and newly released ("zero-day") malware. However, existing works typically only consider the API name while ignoring the arguments, or require complex fe…

    Submitted 23 January, 2020; v1 submitted 17 July, 2019; originally announced July 2019.

  40. arXiv:1902.02441  [pdf, other]

    cs.LG cs.RO stat.ML

    Artificial Intelligence for Prosthetics - challenge solutions

    Authors: Łukasz Kidziński, Carmichael Ong, Sharada Prasanna Mohanty, Jennifer Hicks, Sean F. Carroll, Bo Zhou, Hongsheng Zeng, Fan Wang, Rongzhong Lian, Hao Tian, Wojciech Jaśkowski, Garrett Andersen, Odd Rune Lykkebø, Nihat Engin Toklu, Pranav Shyam, Rupesh Kumar Srivastava, Sergey Kolesnikov, Oleksii Hrinchuk, Anton Pechenko, Mattias Ljungström, Zhen Wang, Xu Hu, Zehong Hu, Minghui Qiu, Jun Huang, et al. (25 additional authors not shown)

    Abstract: In the NeurIPS 2018 Artificial Intelligence for Prosthetics challenge, participants were tasked with building a controller for a musculoskeletal model with a goal of matching a given time-varying velocity vector. Top participants were invited to describe their algorithms. In this work, we describe the challenge and present thirteen solutions that used deep reinforcement learning approaches. Many s…

    Submitted 6 February, 2019; originally announced February 2019.

  41. arXiv:1901.10457  [pdf, other]

    cs.CL

    Universal Dependency Parsing from Scratch

    Authors: Peng Qi, Timothy Dozat, Yuhao Zhang, Christopher D. Manning

    Abstract: This paper describes Stanford's system at the CoNLL 2018 UD Shared Task. We introduce a complete neural pipeline system that takes raw text as input, and performs all tasks required by the shared task, ranging from tokenization and sentence segmentation, to POS tagging and dependency parsing. Our single system submission achieved very competitive performance on big treebanks. Moreover, after fixin…

    Submitted 29 January, 2019; originally announced January 2019.

    Comments: In Proceedings of the CoNLL 2018 UD Shared Task. First three authors contributed roughly equally. Github repo: https://github.com/stanfordnlp/stanfordnlp Website: https://stanfordnlp.github.io/stanfordnlp/

  42. arXiv:1809.10185  [pdf, other]

    cs.CL

    Graph Convolution over Pruned Dependency Trees Improves Relation Extraction

    Authors: Yuhao Zhang, Peng Qi, Christopher D. Manning

    Abstract: Dependency trees help relation extraction models capture long-range relations between words. However, existing dependency-based models either neglect crucial information (e.g., negation) by pruning the dependency trees too aggressively, or are computationally inefficient because it is difficult to parallelize over different tree structures. We propose an extension of graph convolutional networks t…

    Submitted 26 September, 2018; originally announced September 2018.

    Comments: EMNLP 2018. Code available at: https://github.com/qipeng/gcn-over-pruned-trees

  43. arXiv:1809.09600  [pdf, other]

    cs.CL

    HotpotQA: A Dataset for Diverse, Explainable Multi-hop Question Answering

    Authors: Zhilin Yang, Peng Qi, Saizheng Zhang, Yoshua Bengio, William W. Cohen, Ruslan Salakhutdinov, Christopher D. Manning

    Abstract: Existing question answering (QA) datasets fail to train QA systems to perform complex reasoning and provide explanations for answers. We introduce HotpotQA, a new dataset with 113k Wikipedia-based question-answer pairs with four key features: (1) the questions require finding and reasoning over multiple supporting documents to answer; (2) the questions are diverse and not constrained to any pre-ex…

    Submitted 25 September, 2018; originally announced September 2018.

    Comments: EMNLP 2018 long paper. The first three authors contribute equally. Data, code, and blog posts available at https://hotpotqa.github.io/

  44. arXiv:1805.04623  [pdf, other]

    cs.CL

    Sharp Nearby, Fuzzy Far Away: How Neural Language Models Use Context

    Authors: Urvashi Khandelwal, He He, Peng Qi, Dan Jurafsky

    Abstract: We know very little about how neural language models (LM) use prior linguistic context. In this paper, we investigate the role of context in an LSTM LM, through ablation studies. Specifically, we analyze the increase in perplexity when prior context words are shuffled, replaced, or dropped. On two standard datasets, Penn Treebank and WikiText-2, we find that the model is capable of using about 200…

    Submitted 11 May, 2018; originally announced May 2018.

    Comments: ACL 2018

  45. arXiv:1705.04434  [pdf, other]

    cs.CL

    Arc-swift: A Novel Transition System for Dependency Parsing

    Authors: Peng Qi, Christopher D. Manning

    Abstract: Transition-based dependency parsers often need sequences of local shift and reduce operations to produce certain attachments. Correct individual decisions hence require global information about the sentence context and mistakes cause error propagation. This paper proposes a novel transition system, arc-swift, that enables direct attachments between tokens farther apart with a single transition. Th…

    Submitted 11 May, 2017; originally announced May 2017.

    Comments: Accepted at ACL 2017

  46. arXiv:1406.7806  [pdf, other]

    cs.CL cs.LG cs.NE stat.ML

    Building DNN Acoustic Models for Large Vocabulary Speech Recognition

    Authors: Andrew L. Maas, Peng Qi, Ziang Xie, Awni Y. Hannun, Christopher T. Lengerich, Daniel Jurafsky, Andrew Y. Ng

    Abstract: Deep neural networks (DNNs) are now a central component of nearly all state-of-the-art speech recognition systems. Building neural network acoustic models requires several design decisions including network architecture, size, and training loss function. This paper offers an empirical investigation on which aspects of DNN acoustic model design are most important for speech recognition system perfo…

    Submitted 20 January, 2015; v1 submitted 30 June, 2014; originally announced June 2014.