Showing 1–49 of 49 results for author: Du, P

  1. arXiv:2407.11335  [pdf, other]

    cs.CV

    LaMI-DETR: Open-Vocabulary Detection with Language Model Instruction

    Authors: Penghui Du, Yu Wang, Yifan Sun, Luting Wang, Yue Liao, Gang Zhang, Errui Ding, Yan Wang, Jingdong Wang, Si Liu

    Abstract: Existing methods enhance open-vocabulary object detection by leveraging the robust open-vocabulary recognition capabilities of Vision-Language Models (VLMs), such as CLIP. However, two main challenges emerge: (1) A deficiency in concept representation, where the category names in CLIP's text space lack textual and visual knowledge. (2) An overfitting tendency towards base categories, with the open vo…

    Submitted 15 July, 2024; originally announced July 2024.

    Comments: ECCV2024

  2. arXiv:2406.04785  [pdf, other]

    cs.DC

    Enabling Efficient Batch Serving for LMaaS via Generation Length Prediction

    Authors: Ke Cheng, Wen Hu, Zhi Wang, Peng Du, Jianguo Li, Sheng Zhang

    Abstract: Nowadays, large language models (LLMs) are published as a service and can be accessed by various applications via APIs, also known as language-model-as-a-service (LMaaS). Without knowing the generation length of requests, existing serving systems serve requests in a first-come, first-served (FCFS) manner with a fixed batch size, which leads to two problems that affect batch serving efficiency. Fir…

    Submitted 7 June, 2024; originally announced June 2024.

    Comments: 12 pages, 14 figures

  3. arXiv:2406.02822  [pdf, other]

    cs.RO

    W-RIZZ: A Weakly-Supervised Framework for Relative Traversability Estimation in Mobile Robotics

    Authors: Andre Schreiber, Arun N. Sivakumar, Peter Du, Mateus V. Gasparino, Girish Chowdhary, Katherine Driggs-Campbell

    Abstract: Successful deployment of mobile robots in unstructured domains requires an understanding of the environment and terrain to avoid hazardous areas, getting stuck, and colliding with obstacles. Traversability estimation--which predicts where in the environment a robot can travel--is one prominent approach that tackles this problem. Existing geometric methods may ignore important semantic consideratio…

    Submitted 4 June, 2024; originally announced June 2024.

    Comments: Accepted by RA-L. Code is available at https://github.com/andreschreiber/W-RIZZ

  4. LoS Sensing-based Channel Estimation in UAV-Assisted OFDM Systems

    Authors: Chaojin Qing, Zhiying Liu, Wenquan Hu, Yinjie Zhang, Xi Cai, Pengfei Du

    Abstract: In unmanned aerial vehicle (UAV)-assisted orthogonal frequency division multiplexing (OFDM) systems, the potential advantage of the line-of-sight (LoS) path, characterized by its high probability of existence, has not been fully harnessed, thereby impeding the improvement of channel estimation (CE) accuracy. Inspired by the ideas of integrated sensing and communication (ISAC), this letter develops…

    Submitted 22 February, 2024; originally announced April 2024.

  5. arXiv:2403.01928  [pdf, other]

    cs.RO

    ZSL-RPPO: Zero-Shot Learning for Quadrupedal Locomotion in Challenging Terrains using Recurrent Proximal Policy Optimization

    Authors: Yao Zhao, Tao Wu, Yijie Zhu, Xiang Lu, Jun Wang, Haitham Bou-Ammar, Xinyu Zhang, Peng Du

    Abstract: We present ZSL-RPPO, an improved zero-shot learning architecture that overcomes the limitations of teacher-student neural networks and enables generating robust, reliable, and versatile locomotion for quadrupedal robots in challenging terrains. We propose a new algorithm RPPO (Recurrent Proximal Policy Optimization) that directly trains recurrent neural network in partially observable environments…

    Submitted 4 March, 2024; originally announced March 2024.

  6. arXiv:2403.00561  [pdf, other]

    cs.CV cs.AI

    Multi-Task Learning Using Uncertainty to Weigh Losses for Heterogeneous Face Attribute Estimation

    Authors: Huaqing Yuan, Yi He, Peng Du, Lu Song

    Abstract: Face images contain a wide variety of attribute information. In this paper, we propose a generalized framework for joint estimation of ordinal and nominal attributes based on information sharing. We tackle the correlation problem between heterogeneous attributes using hard parameter sharing of shallow features, and trade-off multiple loss functions by considering homoskedastic uncertainty for each…

    Submitted 1 March, 2024; originally announced March 2024.

  7. arXiv:2402.11139  [pdf, other]

    cs.LG cs.AI

    LiGNN: Graph Neural Networks at LinkedIn

    Authors: Fedor Borisyuk, Shihai He, Yunbo Ouyang, Morteza Ramezani, Peng Du, Xiaochen Hou, Chengming Jiang, Nitin Pasumarthy, Priya Bannur, Birjodh Tiwana, Ping Liu, Siddharth Dangi, Daqi Sun, Zhoutao Pei, Xiao Shi, Sirou Zhu, Qianqi Shen, Kuang-Hsuan Lee, David Stein, Baolei Li, Haichao Wei, Amol Ghoting, Souvik Ghosh

    Abstract: In this paper, we present LiGNN, a deployed large-scale Graph Neural Networks (GNNs) Framework. We share our insight on developing and deployment of GNNs at large scale at LinkedIn. We present a set of algorithmic improvements to the quality of GNN representation learning including temporal graph architectures with long term losses, effective cold start solutions via graph densification, ID embedd…

    Submitted 16 February, 2024; originally announced February 2024.

  8. arXiv:2402.06329  [pdf]

    cs.CV eess.IV

    A Network for structural dense displacement based on 3D deformable mesh model and optical flow

    Authors: Peimian Du, Qicheng Guo, Yanru Li

    Abstract: This study proposes a Network to recognize displacement of a RC frame structure from a video by a monocular camera. The proposed Network consists of two modules which is FlowNet2 and POFRN-Net. FlowNet2 is used to generate dense optical flow as well as POFRN-Net is to extract pose parameter H. FlowNet2 convert two video frames into dense optical flow. POFRN-Net is inputted dense optical flow from…

    Submitted 9 February, 2024; originally announced February 2024.

    Comments: Paper for the 3rd International Competition for Structural Health Monitoring (IC-SHM 2022): 15 pages, 13 figures

  9. arXiv:2402.02547  [pdf]

    cs.AI cs.CL

    Integration of cognitive tasks into artificial general intelligence test for large models

    Authors: Youzhi Qu, Chen Wei, Penghui Du, Wenxin Che, Chi Zhang, Wanli Ouyang, Yatao Bian, Feiyang Xu, Bin Hu, Kai Du, Haiyan Wu, Jia Liu, Quanying Liu

    Abstract: During the evolution of large models, performance evaluation is necessarily performed to assess their capabilities and ensure safety before practical application. However, current model evaluations mainly rely on specific tasks and datasets, lacking a united framework for assessing the multidimensional intelligence of large models. In this perspective, we advocate for a comprehensive framework of…

    Submitted 5 March, 2024; v1 submitted 4 February, 2024; originally announced February 2024.

  10. arXiv:2401.11679  [pdf, other]

    physics.ao-ph cs.LG

    Simulating Nighttime Visible Satellite Imagery of Tropical Cyclones Using Conditional Generative Adversarial Networks

    Authors: Jinghuai Yao, Puyuan Du, Yucheng Zhao, Yubo Wang

    Abstract: Visible (VIS) imagery of satellites has various important applications in meteorology, including monitoring Tropical Cyclones (TCs). However, it is unavailable at night because of the lack of sunlight. This study presents a Conditional Generative Adversarial Networks (CGAN) model that generates highly accurate nighttime visible reflectance using infrared (IR) bands and sunlight direction parameter…

    Submitted 21 January, 2024; originally announced January 2024.

  11. arXiv:2312.10317  [pdf, other]

    cs.LG cs.AI q-bio.NC

    Spatial-Temporal DAG Convolutional Networks for End-to-End Joint Effective Connectivity Learning and Resting-State fMRI Classification

    Authors: Rui Yang, Wenrui Dai, Huajun She, Yiping P. Du, Dapeng Wu, Hongkai Xiong

    Abstract: Building comprehensive brain connectomes has proved of fundamental importance in resting-state fMRI (rs-fMRI) analysis. Based on the foundation of brain network, spatial-temporal-based graph convolutional networks have dramatically improved the performance of deep learning methods in rs-fMRI time series classification. However, existing works either pre-define the brain network as the correlation…

    Submitted 15 December, 2023; originally announced December 2023.

    Comments: Accepted by NeurIPS 2023 Temporal Graph Learning Workshop

  12. Nucleus-aware Self-supervised Pretraining Using Unpaired Image-to-image Translation for Histopathology Images

    Authors: Zhiyun Song, Penghui Du, Junpeng Yan, Kailu Li, Jianzhong Shou, Maode Lai, Yubo Fan, Yan Xu

    Abstract: Self-supervised pretraining attempts to enhance model performance by obtaining effective features from unlabeled data, and has demonstrated its effectiveness in the field of histopathology images. Despite its success, few works concentrate on the extraction of nucleus-level information, which is essential for pathologic analysis. In this work, we propose a novel nucleus-aware self-supervised pretr…

    Submitted 13 September, 2023; originally announced September 2023.

  13. arXiv:2308.14353  [pdf, other]

    cs.CL

    ZhuJiu: A Multi-dimensional, Multi-faceted Chinese Benchmark for Large Language Models

    Authors: Baoli Zhang, Haining Xie, Pengfan Du, Junhao Chen, Pengfei Cao, Yubo Chen, Shengping Liu, Kang Liu, Jun Zhao

    Abstract: The unprecedented performance of large language models (LLMs) requires comprehensive and accurate evaluation. We argue that for LLMs evaluation, benchmarks need to be comprehensive and systematic. To this end, we propose the ZhuJiu benchmark, which has the following strengths: (1) Multi-dimensional ability coverage: We comprehensively evaluate LLMs across 7 ability dimensions covering 51 tasks. Es…

    Submitted 28 August, 2023; originally announced August 2023.

  14. arXiv:2308.11874  [pdf, other]

    cs.CV

    Semi-Supervised Learning via Weight-aware Distillation under Class Distribution Mismatch

    Authors: Pan Du, Suyun Zhao, Zisen Sheng, Cuiping Li, Hong Chen

    Abstract: Semi-Supervised Learning (SSL) under class distribution mismatch aims to tackle a challenging problem wherein unlabeled data contain lots of unknown categories unseen in the labeled ones. In such mismatch scenarios, traditional SSL suffers severe performance damage due to the harmful invasion of the instances with unknown categories into the target classifier. In this study, by strict mathematical…

    Submitted 22 August, 2023; originally announced August 2023.

    Comments: ICCV 2023

  15. arXiv:2307.10730  [pdf, other]

    cs.IT eess.SP

    Joint Port Selection Based Channel Acquisition for FDD Cell-Free Massive MIMO

    Authors: Cheng Zhang, Pengguang Du, Minjie Ding, Yindi Jing, Yongming Huang

    Abstract: In frequency division duplexing (FDD) cell-free massive MIMO, the acquisition of the channel state information (CSI) is very challenging because of the large overhead required for the training and feedback of the downlink channels of multiple cooperating base stations (BSs). In this paper, for systems with partial uplink-downlink channel reciprocity, and a general spatial domain channel model with…

    Submitted 12 January, 2024; v1 submitted 20 July, 2023; originally announced July 2023.

    Comments: 15 pages, 11 figures. The paper has been accepted by IEEE TRANSACTIONS ON COMMUNICATIONS

  16. arXiv:2304.00367  [pdf, other]

    cs.RO cs.AI cs.MA

    Conveying Autonomous Robot Capabilities through Contrasting Behaviour Summaries

    Authors: Peter Du, Surya Murthy, Katherine Driggs-Campbell

    Abstract: As advances in artificial intelligence enable increasingly capable learning-based autonomous agents, it becomes more challenging for human observers to efficiently construct a mental model of the agent's behaviour. In order to successfully deploy autonomous agents, humans should not only be able to understand the individual limitations of the agents but also have insight on how they compare agains…

    Submitted 1 April, 2023; originally announced April 2023.

  17. arXiv:2304.00365  [pdf, other]

    cs.RO cs.AI cs.LG eess.SY

    Adaptive Failure Search Using Critical States from Domain Experts

    Authors: Peter Du, Katherine Driggs-Campbell

    Abstract: Uncovering potential failure cases is a crucial step in the validation of safety critical systems such as autonomous vehicles. Failure search may be done through logging substantial vehicle miles in either simulation or real world testing. Due to the sparsity of failure events, naive random search approaches require significant amounts of vehicle operation hours to find potential system weaknesses…

    Submitted 1 April, 2023; originally announced April 2023.

    Comments: Appears in IEEE ICRA 2021

  18. arXiv:2303.05892  [pdf, other]

    cs.CV

    Object-Aware Distillation Pyramid for Open-Vocabulary Object Detection

    Authors: Luting Wang, Yi Liu, Penghui Du, Zihan Ding, Yue Liao, Qiaosong Qi, Biaolong Chen, Si Liu

    Abstract: Open-vocabulary object detection aims to provide object detectors trained on a fixed set of object categories with the generalizability to detect objects described by arbitrary text queries. Previous methods adopt knowledge distillation to extract knowledge from Pretrained Vision-and-Language Models (PVLMs) and transfer it to detectors. However, due to the non-adaptive proposal cropping and single…

    Submitted 10 March, 2023; originally announced March 2023.

    Comments: Accepted by CVPR 2023

  19. arXiv:2212.14189  [pdf, other]

    cs.CY eess.SY

    High Resolution Modeling and Analysis of Cryptocurrency Mining's Impact on Power Grids: Carbon Footprint, Reliability, and Electricity Price

    Authors: Ali Menati, Xiangtian Zheng, Kiyeob Lee, Ranyu Shi, Pengwei Du, Chanan Singh, Le Xie

    Abstract: Blockchain technologies are considered one of the most disruptive innovations of the last decade, enabling secure decentralized trust-building. However, in recent years, with the rapid increase in the energy consumption of blockchain-based computations for cryptocurrency mining, there have been growing concerns about their sustainable operation in electric grids. This paper investigates the tri-fa…

    Submitted 14 April, 2023; v1 submitted 29 December, 2022; originally announced December 2022.

    Comments: This paper has been accepted for publication in the journal of "Advances in Applied Energy"

  20. arXiv:2211.02864  [pdf]

    cs.CL cs.SI

    BEKG: A Built Environment Knowledge Graph

    Authors: Xiaojun Yang, Haoyu Zhong, Penglin Du, Keyi Zhou, Xingjin Lai, Zhengdong Wang, Yik Lun Lau, Yangqiu Song, Liyaning Tang

    Abstract: Practices in the built environment have become more digitalized with the rapid development of modern design and construction technologies. However, the requirement of practitioners or scholars to gather complicated professional knowledge in the built environment has not been satisfied yet. In this paper, more than 80,000 paper abstracts in the built environment field were obtained to build a knowl…

    Submitted 5 November, 2022; originally announced November 2022.

  21. arXiv:2207.04028  [pdf, other]

    cs.CV cs.AI

    CoCAtt: A Cognitive-Conditioned Driver Attention Dataset (Supplementary Material)

    Authors: Yuan Shen, Niviru Wijayaratne, Pranav Sriram, Aamir Hasan, Peter Du, Katherine Driggs-Campbell

    Abstract: The task of driver attention prediction has drawn considerable interest among researchers in robotics and the autonomous vehicle industry. Driver attention prediction can play an instrumental role in mitigating and preventing high-risk events, like collisions and casualties. However, existing driver attention prediction models neglect the distraction state and intention of the driver, which can si…

    Submitted 8 July, 2022; originally announced July 2022.

    Comments: Supplementary Material for the main paper, "CoCAtt: A Cognitive-Conditioned Driver Attention Dataset". Accepted at ITSC2022

  22. arXiv:2207.01762  [pdf, other]

    cs.CL cs.AI cs.IR

    PReGAN: Answer Oriented Passage Ranking with Weakly Supervised GAN

    Authors: Pan Du, Jian-Yun Nie, Yutao Zhu, Hao Jiang, Lixin Zou, Xiaohui Yan

    Abstract: Beyond topical relevance, passage ranking for open-domain factoid question answering also requires a passage to contain an answer (answerability). While a few recent studies have incorporated some reading capability into a ranker to account for answerability, the ranker is still hindered by the noisy nature of the training data typically available in this area, which considers any passage containi…

    Submitted 4 July, 2022; originally announced July 2022.

  23. arXiv:2206.03950  [pdf, other]

    q-bio.NC cs.AI cs.LG

    Transfer learning to decode brain states reflecting the relationship between cognitive tasks

    Authors: Youzhi Qu, Xinyao Jian, Wenxin Che, Penghui Du, Kai Fu, Quanying Liu

    Abstract: Transfer learning improves the performance of the target task by leveraging the data of a specific source task: the closer the relationship between the source and the target tasks, the greater the performance improvement by transfer learning. In neuroscience, the relationship between cognitive tasks is usually represented by similarity of activated brain regions or neural representation. However,…

    Submitted 30 August, 2022; v1 submitted 7 June, 2022; originally announced June 2022.

  24. arXiv:2204.08939  [pdf, other]

    physics.med-ph cs.CV eess.IV physics.flu-dyn

    Deep learning-based surrogate model for 3-D patient-specific computational fluid dynamics

    Authors: Pan Du, Xiaozhi Zhu, Jian-Xun Wang

    Abstract: Optimization and uncertainty quantification have been playing an increasingly important role in computational hemodynamics. However, existing methods based on principled modeling and classic numerical techniques have faced significant challenges, particularly when it comes to complex 3D patient-specific shapes in the real world. First, it is notoriously challenging to parameterize the input space…

    Submitted 11 April, 2022; originally announced April 2022.

    Comments: 8 figures, 2 tables

  25. arXiv:2204.00976  [pdf, other]

    cs.LG cs.AI cs.CR cs.DC

    FedGBF: An efficient vertical federated learning framework via gradient boosting and bagging

    Authors: Yujin Han, Pan Du, Kai Yang

    Abstract: Federated learning, conducive to solving data privacy and security problems, has attracted increasing attention recently. However, the existing federated boosting model sequentially builds a decision tree model with the weak base learner, resulting in redundant boosting steps and high interactive communication costs. In contrast, the federated bagging model saves time by building multi-decision tr…

    Submitted 2 April, 2022; originally announced April 2022.

  26. arXiv:2203.02104  [pdf, other]

    cs.CV

    Interactive Image Synthesis with Panoptic Layout Generation

    Authors: Bo Wang, Tao Wu, Minfeng Zhu, Peng Du

    Abstract: Interactive image synthesis from user-guided input is a challenging task when users wish to control the scene structure of a generated image with ease. Although remarkable progress has been made on layout-based image synthesis approaches, in order to get realistic fake image in interactive scene, existing methods require high-precision inputs, which probably need adjustment several times and are un…

    Submitted 28 March, 2022; v1 submitted 3 March, 2022; originally announced March 2022.

    Comments: Accepted by CVPR 2022

  27. arXiv:2112.01215  [pdf]

    cs.NE stat.ML

    Adaptive Group Collaborative Artificial Bee Colony Algorithm

    Authors: Haiquan Wang, Hans-Dietrich Haasis, Panpan Du, Xiaobin Xu, Menghao Su, Shengjun Wen, Wenxuan Yue, Shanshan Zhang

    Abstract: As an effective algorithm for solving complex optimization problems, artificial bee colony (ABC) algorithm has shown to be competitive, but the same as other population-based algorithms, it is poor at balancing the abilities of global searching in the whole solution space (named as exploration) and quick searching in local solution space which is defined as exploitation. For improving the performa…

    Submitted 2 December, 2021; originally announced December 2021.

  28. arXiv:2111.10014  [pdf, other]

    cs.CV

    CoCAtt: A Cognitive-Conditioned Driver Attention Dataset

    Authors: Yuan Shen, Niviru Wijayaratne, Pranav Sriram, Aamir Hasan, Peter Du, Katie Driggs-Campbell

    Abstract: The task of driver attention prediction has drawn considerable interest among researchers in robotics and the autonomous vehicle industry. Driver attention prediction can play an instrumental role in mitigating and preventing high-risk events, like collisions and casualties. However, existing driver attention prediction models neglect the distraction state and intention of the driver, which can si…

    Submitted 23 November, 2021; v1 submitted 18 November, 2021; originally announced November 2021.

    Comments: 10 pages, 5 figures

  29. Contrastive Learning of User Behavior Sequence for Context-Aware Document Ranking

    Authors: Yutao Zhu, Jian-Yun Nie, Zhicheng Dou, Zhengyi Ma, Xinyu Zhang, Pan Du, Xiaochen Zuo, Hao Jiang

    Abstract: Context information in search sessions has proven to be useful for capturing user search intent. Existing studies explored user behavior sequences in sessions in different ways to enhance query suggestion or document ranking. However, a user behavior sequence has often been viewed as a definite and exact signal reflecting a user's behavior. In reality, it is highly variable: user's queries for the…

    Submitted 23 August, 2021; originally announced August 2021.

    Comments: Accepted by CIKM 2021

  30. arXiv:2108.07949  [pdf, other]

    cs.CV cs.AI

    DeepFake MNIST+: A DeepFake Facial Animation Dataset

    Authors: Jiajun Huang, Xueyu Wang, Bo Du, Pei Du, Chang Xu

    Abstract: The DeepFakes, which are the facial manipulation techniques, is the emerging threat to digital society. Various DeepFake detection methods and datasets are proposed for detecting such data, especially for face-swapping. However, recent researches less consider facial animation, which is also important in the DeepFake attack side. It tries to animate a face image with actions provided by a driving…

    Submitted 17 August, 2021; originally announced August 2021.

    Comments: 14 pages

  31. Proactive Retrieval-based Chatbots based on Relevant Knowledge and Goals

    Authors: Yutao Zhu, Jian-Yun Nie, Kun Zhou, Pan Du, Hao Jiang, Zhicheng Dou

    Abstract: A proactive dialogue system has the ability to proactively lead the conversation. Different from the general chatbots which only react to the user, proactive dialogue systems can be used to achieve some goals, e.g., to recommend some items to the user. Background knowledge is essential to enable smooth and natural transitions in dialogue. In this paper, we propose a new multi-task learning framewo…

    Submitted 17 July, 2021; originally announced July 2021.

    Comments: Accepted by SIGIR 2021

  32. arXiv:2105.08251  [pdf, other]

    cs.CL

    Emotion Eliciting Machine: Emotion Eliciting Conversation Generation based on Dual Generator

    Authors: Hao Jiang, Yutao Zhu, Xinyu Zhang, Zhicheng Dou, Pan Du, Te Pi, Yantao Jia

    Abstract: Recent years have witnessed great progress on building emotional chatbots. Tremendous methods have been proposed for chatbots to generate responses with given emotions. However, the emotion changes of the user during the conversation has not been fully explored. In this work, we study the problem of positive emotion elicitation, which aims to generate responses that can elicit positive emotion of…

    Submitted 17 May, 2021; originally announced May 2021.

  33. arXiv:2103.13584  [pdf, other]

    cs.CL

    BERT4SO: Neural Sentence Ordering by Fine-tuning BERT

    Authors: Yutao Zhu, Jian-Yun Nie, Kun Zhou, Shengchao Liu, Yabo Ling, Pan Du

    Abstract: Sentence ordering aims to arrange the sentences of a given text in the correct order. Recent work frames it as a ranking problem and applies deep neural networks to it. In this work, we propose a new method, named BERT4SO, by fine-tuning BERT for sentence ordering. We concatenate all sentences and compute their representations by using multiple special tokens and carefully designed segment (interv…

    Submitted 11 May, 2021; v1 submitted 24 March, 2021; originally announced March 2021.

  34. arXiv:2102.13034  [pdf, other]

    cs.AI cs.HC cs.RO

    AutoPreview: A Framework for Autopilot Behavior Understanding

    Authors: Yuan Shen, Niviru Wijayaratne, Peter Du, Shanduojiao Jiang, Katherine Driggs Campbell

    Abstract: The behavior of self driving cars may differ from people expectations, (e.g. an autopilot may unexpectedly relinquish control). This expectation mismatch can cause potential and existing users to distrust self driving technology and can increase the likelihood of accidents. We propose a simple but effective framework, AutoPreview, to enable consumers to preview a target autopilot potential actions…

    Submitted 25 February, 2021; originally announced February 2021.

    Comments: 7 pages, 5 figures, CHI 2021 Late breaking Work

    Journal ref: CHI Conference on Human Factors in Computing Systems Extended Abstracts (CHI '21 Extended Abstracts), May 8 to 13, 2021, Yokohama, Japan

  35. arXiv:2101.08426  [pdf, other]

    cs.CL

    Content Selection Network for Document-grounded Retrieval-based Chatbots

    Authors: Yutao Zhu, Jian-Yun Nie, Kun Zhou, Pan Du, Zhicheng Dou

    Abstract: Grounding human-machine conversation in a document is an effective way to improve the performance of retrieval-based chatbots. However, only a part of the document content may be relevant to help select the appropriate response at a round. It is thus crucial to select the part of document content relevant to the current conversation context. In this paper, we propose a document content selection n…

    Submitted 20 January, 2021; originally announced January 2021.

    Comments: ECIR 2021 Camera Ready

  36. Meta-Learning for Neural Relation Classification with Distant Supervision

    Authors: Zhenzhen Li, Jian-Yun Nie, Benyou Wang, Pan Du, Yuhan Zhang, Lixin Zou, Dongsheng Li

    Abstract: Distant supervision provides a means to create a large number of weakly labeled data at low cost for relation classification. However, the resulting labeled instances are very noisy, containing data with wrong labels. Many approaches have been proposed to select a subset of reliable instances for neural model training, but they still suffer from noisy labeling problem or underutilization of the we…

    Submitted 26 October, 2020; originally announced October 2020.

    Comments: 10 pages, 7 figures; corrected one encoding error in CIKM pdf

    Journal ref: In Proceedings of CIKM, pp. 815-824. 2020

  37. arXiv:2006.05018  [pdf]

    eess.IV cs.CV cs.LG

    Deep learning to estimate the physical proportion of infected region of lung for COVID-19 pneumonia with CT image set

    Authors: Wei Wu, Yu Shi, Xukun Li, Yukun Zhou, Peng Du, Shuangzhi Lv, Tingbo Liang, Jifang Sheng

    Abstract: Utilizing computed tomography (CT) images to quickly estimate the severity of cases with COVID-19 is one of the most straightforward and efficacious methods. Two tasks were studied in this present paper. One was to segment the mask of intact lung in case of pneumonia. Another was to generate the masks of regions infected by COVID-19. The masks of these two parts of images then were converted to co…

    Submitted 8 June, 2020; originally announced June 2020.

  38. arXiv:2004.05707  [pdf, other]

    cs.CL cs.LG stat.ML

    VGCN-BERT: Augmenting BERT with Graph Embedding for Text Classification

    Authors: Zhibin Lu, Pan Du, Jian-Yun Nie

    Abstract: Much progress has been made recently on text classification with methods based on neural networks. In particular, models using attention mechanism such as BERT have shown to have the capability of capturing the contextual information within a sentence or document. However, their ability of capturing the global information about the vocabulary of a language is more limited. This latter is the stren…

    Submitted 12 April, 2020; originally announced April 2020.

    Comments: 12 pages, 2 figures

    ACM Class: I.2.4; I.2.7

    Journal ref: in J. M. Jose et al. (Eds.): ECIR 2020, LNCS 12035, pp.369-382, 2020

  39. arXiv:2002.09334  [pdf]

    physics.med-ph cs.LG eess.IV

    Deep Learning System to Screen Coronavirus Disease 2019 Pneumonia

    Authors: Xiaowei Xu, Xiangao Jiang, Chunlian Ma, Peng Du, Xukun Li, Shuangzhi Lv, Liang Yu, Yanfei Chen, Junwei Su, Guanjing Lang, Yongtao Li, Hong Zhao, Kaijin Xu, Lingxiang Ruan, Wei Wu

    Abstract: We found that the real time reverse transcription-polymerase chain reaction (RT-PCR) detection of viral RNA from sputum or nasopharyngeal swab has a relatively low positive rate in the early stage to determine COVID-19 (named by the World Health Organization). The manifestations of computed tomography (CT) imaging of COVID-19 had their own characteristics, which are different from other types of v…

    Submitted 21 February, 2020; originally announced February 2020.

    Journal ref: Engineering, Volume 6, Issue 10, October 2020, Pages 1122-1129

  40. Discernible Image Compression

    Authors: Zhaohui Yang, Yunhe Wang, Chang Xu, Peng Du, Chao Xu, Chunjing Xu, Qi Tian

    Abstract: Image compression, as one of the fundamental low-level image processing tasks, is very essential for computer vision. Tremendous computing and storage resources can be preserved with a trivial amount of visual information. Conventional image compression methods tend to obtain compressed images by minimizing their appearance discrepancy with the corresponding original images, but pay little attenti…

    Submitted 7 September, 2020; v1 submitted 17 February, 2020; originally announced February 2020.

    Comments: Accepted by ACMMM 2020

  41. arXiv:1910.05599  [pdf, other]

    cs.RO cs.MA eess.SP

    Online monitoring for safe pedestrian-vehicle interactions

    Authors: Peter Du, Zhe Huang, Tianqi Liu, Ke Xu, Qichao Gao, Hussein Sibai, Katherine Driggs-Campbell, Sayan Mitra

    Abstract: As autonomous systems begin to operate amongst humans, methods for safe interaction must be investigated. We consider an example of a small autonomous vehicle in a pedestrian zone that must safely maneuver around people in a free-form fashion. We investigate two key questions: How can we effectively integrate pedestrian intent estimation into our autonomous stack. Can we develop an online monitori…

    Submitted 17 July, 2020; v1 submitted 12 October, 2019; originally announced October 2019.

    Comments: 15 pages, 5 figures

  42. arXiv:1910.02285  [pdf]

    eess.IV cs.CV cs.LG

    A Deep Learning System That Generates Quantitative CT Reports for Diagnosing Pulmonary Tuberculosis

    Authors: Wei Wu, Xukun Li, Peng Du, Guanjing Lang, Min Xu, Kaijin Xu, Lanjuan Li

    Abstract: We developed a deep learning model-based system to automatically generate a quantitative Computed Tomography (CT) diagnostic report for Pulmonary Tuberculosis (PTB) cases. 501 CT imaging datasets from 223 patients with active PTB were collected, and another 501 cases from a healthy population served as negative samples. 2884 lesions of PTB were carefully labeled and classified manually by profession…

    Submitted 5 October, 2019; originally announced October 2019.

  43. arXiv:1910.01557  [pdf, other]

    cs.RO

    CyPhyHouse: A Programming, Simulation, and Deployment Toolchain for Heterogeneous Distributed Coordination

    Authors: Ritwika Ghosh, Joao P. Jansch-Porto, Chiao Hsieh, Amelia Gosse, Minghao Jiang, Hebron Taylor, Peter Du, Sayan Mitra, Geir Dullerud

    Abstract: Programming languages, libraries, and development tools have transformed the application development processes for mobile computing and machine learning. This paper introduces the CyPhyHouse - a toolchain that aims to provide similar programming, debugging, and deployment benefits for distributed mobile robotic applications. Users can develop hardware-agnostic, distributed applications using the h…

    Submitted 10 October, 2019; v1 submitted 3 October, 2019; originally announced October 2019.

  44. arXiv:1908.01046  [pdf, other]

    cs.RO cs.AI cs.LG eess.SY stat.ML

    Adaptive Stress Testing with Reward Augmentation for Autonomous Vehicle Validation

    Authors: Anthony Corso, Peter Du, Katherine Driggs-Campbell, Mykel J. Kochenderfer

    Abstract: Determining possible failure scenarios is a critical step in the evaluation of autonomous vehicle systems. Real-world vehicle testing is commonly employed for autonomous vehicle validation, but the costs and time requirements are high. Consequently, simulation-driven methods such as Adaptive Stress Testing (AST) have been proposed to aid in validation. AST formulates the problem of finding the mos…

    Submitted 6 August, 2019; v1 submitted 2 August, 2019; originally announced August 2019.

    Comments: Appears in IEEE ITSC 2019

  45. arXiv:1905.13550  [pdf]

    cs.LG eess.SP stat.AP stat.ML

    A novel hybrid model based on multi-objective Harris hawks optimization algorithm for daily PM2.5 and PM10 forecasting

    Authors: Pei Du, Jianzhou Wang, Yan Hao, Tong Niu, Wendong Yang

    Abstract: High levels of air pollution may seriously affect people's living environment and even endanger their lives. In order to reduce air pollution concentrations, and warn the public before the occurrence of hazardous air pollutants, it is urgent to design an accurate and reliable air pollutant forecasting model. However, most previous research has many deficiencies, such as ignoring the importance of…

    Submitted 30 May, 2019; originally announced May 2019.

    Comments: 24 pages, 4 figures

    MSC Class: 68U20

  46. DivGraphPointer: A Graph Pointer Network for Extracting Diverse Keyphrases

    Authors: Zhiqing Sun, Jian Tang, Pan Du, Zhi-Hong Deng, Jian-Yun Nie

    Abstract: Keyphrase extraction from documents is useful to a variety of applications such as information retrieval and document summarization. This paper presents an end-to-end method called DivGraphPointer for extracting a set of diversified keyphrases from a document. DivGraphPointer combines the advantages of traditional graph-based ranking methods and recent neural network-based approaches. Specifically…

    Submitted 19 May, 2019; originally announced May 2019.

    Comments: Accepted to SIGIR 2019

  47. arXiv:1903.00066  [pdf, other]

    cs.IR cs.LG stat.ML

    A Long-Short Demands-Aware Model for Next-Item Recommendation

    Authors: Ting Bai, Pan Du, Wayne Xin Zhao, Ji-Rong Wen, Jian-Yun Nie

    Abstract: Recommending the right products is the central problem in recommender systems, but the right products should also be recommended at the right time to meet the demands of users, so as to maximize their values. Users' demands, implying strong purchase intents, can be the most useful way to promote product sales if well utilized. Previous recommendation models mainly focused on user's general intere…

    Submitted 12 February, 2019; originally announced March 2019.

  48. arXiv:1810.07260  [pdf]

    stat.AP cs.CR stat.ML

    Statistical Estimation of Malware Detection Metrics in the Absence of Ground Truth

    Authors: Pang Du, Zheyuan Sun, Huashan Chen, Jin-Hee Cho, Shouhuai Xu

    Abstract: The accurate measurement of security metrics is a critical research problem because an improper or inaccurate measurement process can ruin the usefulness of the metrics, no matter how well they are defined. This is a highly challenging problem particularly when the ground truth is unknown or noisy. In contrast to the well perceived importance of defining security metrics, the measurement of securi…

    Submitted 23 September, 2018; originally announced October 2018.

    Journal ref: IEEE T-IFS (2018)

  49. arXiv:1806.08485  [pdf, other]

    cs.GR cs.CV

    Shape-from-Mask: A Deep Learning Based Human Body Shape Reconstruction from Binary Mask Images

    Authors: Zhongping Ji, Xiao Qi, Yigang Wang, Gang Xu, Peng Du, Qing Wu

    Abstract: 3D content creation is referred to as one of the most fundamental tasks of computer graphics. And many 3D modeling algorithms from 2D images or curves have been developed over the past several decades. Designers are allowed to align some conceptual images or sketch some suggestive curves, from front, side, and top views, and then use them as references in constructing a 3D model automatically or m…

    Submitted 22 June, 2018; originally announced June 2018.

    Comments: 11 pages