Showing 1–50 of 54 results for author: Yao, R

  1. arXiv:2406.10057  [pdf, other]

    cs.CV cs.AI

    First Multi-Dimensional Evaluation of Flowchart Comprehension for Multimodal Large Language Models

    Authors: Enming Zhang, Ruobing Yao, Huanyong Liu, Junhui Yu, Jiale Wang

    Abstract: With the development of Multimodal Large Language Models (MLLMs) technology, its general capabilities are increasingly powerful. To evaluate the various abilities of MLLMs, numerous evaluation systems have emerged. But now there is still a lack of a comprehensive method to evaluate MLLMs in the tasks related to flowcharts, which are very important in daily life and work. We propose the first compr…

    Submitted 18 June, 2024; v1 submitted 14 June, 2024; originally announced June 2024.

  2. arXiv:2406.07952  [pdf, other]

    eess.IV cs.CV

    Spatial-Frequency Dual Progressive Attention Network For Medical Image Segmentation

    Authors: Zhenhuan Zhou, Along He, Yanlin Wu, Rui Yao, Xueshuo Xie, Tao Li

    Abstract: In medical images, various types of lesions often manifest significant differences in their shape and texture. Accurate medical image segmentation demands deep learning models with robust capabilities in multi-scale and boundary feature learning. However, previous networks still have limitations in addressing the above issues. Firstly, previous networks simultaneously fuse multi-level features or…

    Submitted 12 June, 2024; originally announced June 2024.

    Comments: 8 pages

  3. arXiv:2404.19401  [pdf, other]

    cs.CV

    UniFS: Universal Few-shot Instance Perception with Point Representations

    Authors: Sheng Jin, Ruijie Yao, Lumin Xu, Wentao Liu, Chen Qian, Ji Wu, Ping Luo

    Abstract: Instance perception tasks (object detection, instance segmentation, pose estimation, counting) play a key role in industrial applications of visual models. As supervised learning methods suffer from high labeling cost, few-shot learning methods which effectively learn from a limited number of labeled examples are desired. Existing few-shot learning methods primarily focus on a restricted set of ta…

    Submitted 30 April, 2024; originally announced April 2024.

  4. arXiv:2404.14701  [pdf, other]

    cs.LG

    Deep neural networks for choice analysis: Enhancing behavioral regularity with gradient regularization

    Authors: Siqi Feng, Rui Yao, Stephane Hess, Ricardo A. Daziano, Timothy Brathwaite, Joan Walker, Shenhao Wang

    Abstract: Deep neural networks (DNNs) frequently present behaviorally irregular patterns, significantly limiting their practical potentials and theoretical validity in travel behavior modeling. This study proposes strong and weak behavioral regularities as novel metrics to evaluate the monotonicity of individual demand functions (a.k.a. law of demand), and further designs a constrained optimization framewor…

    Submitted 22 April, 2024; originally announced April 2024.

  5. arXiv:2402.10834  [pdf, other]

    stat.AP cs.CY

    Agent-based Simulation Evaluation of CBD Tolling: A Case Study from New York City

    Authors: Qingnan Liang, Ruili Yao, Ruixuan Zhang, Zhibin Chen, Guoyuan Wu

    Abstract: Congestion tollings have been widely developed and adopted as an effective tool to mitigate urban traffic congestion and enhance transportation system sustainability. Nevertheless, these tolling schemes are often tailored on a city-by-city or even area-by-area basis, and the cost of conducting field experiments often makes the design and evaluation process challenging. In this work, we leverage MA…

    Submitted 16 February, 2024; originally announced February 2024.

    Comments: Accepted by 2024 IEEE Forum on Integrated and Sustainable Transportation Systems

  6. arXiv:2311.17629  [pdf, other]

    cs.CV

    Efficient Decoder for End-to-End Oriented Object Detection in Remote Sensing Images

    Authors: Jiaqi Zhao, Zeyu Ding, Yong Zhou, Hancheng Zhu, Wenliang Du, Rui Yao, Abdulmotaleb El Saddik

    Abstract: Object instances in remote sensing images often distribute with multi-orientations, varying scales, and dense distribution. These issues bring challenges to end-to-end oriented object detectors including multi-scale features alignment and a large number of queries. To address these limitations, we propose an end-to-end oriented detector equipped with an efficient decoder, which incorporates two te…

    Submitted 1 December, 2023; v1 submitted 29 November, 2023; originally announced November 2023.

    Comments: 11 pages, 7 figures, 13 tables

  7. arXiv:2310.19113  [pdf, other]

    cs.CV cs.AI eess.SP

    Dynamic V2X Autonomous Perception from Road-to-Vehicle Vision

    Authors: Jiayao Tan, Fan Lyu, Linyan Li, Fuyuan Hu, Tingliang Feng, Fenglei Xu, Rui Yao

    Abstract: Vehicle-to-everything (V2X) perception is an innovative technology that enhances vehicle perception accuracy, thereby elevating the security and reliability of autonomous systems. However, existing V2X perception methods focus on static scenes from mainly vehicle-based vision, which is constrained by sensor capabilities and communication loads. To adapt V2X perception models to dynamic scenes, we…

    Submitted 29 October, 2023; originally announced October 2023.

  8. arXiv:2310.16499  [pdf, other]

    cs.LG

    Data Optimization in Deep Learning: A Survey

    Authors: Ou Wu, Rujing Yao

    Abstract: Large-scale, high-quality data are considered an essential factor for the successful application of many deep learning techniques. Meanwhile, numerous real-world deep learning tasks still have to contend with the lack of sufficient amounts of high-quality data. Additionally, issues such as model robustness, fairness, and trustworthiness are also closely related to training data. Consequently, a hu…

    Submitted 25 October, 2023; originally announced October 2023.

  9. arXiv:2310.08285  [pdf, other]

    econ.GN cs.GT

    How would mobility-as-a-service (MaaS) platform survive as an intermediary? From the viewpoint of stability in many-to-many matching

    Authors: Rui Yao, Kenan Zhang

    Abstract: Mobility-as-a-service (MaaS) provides seamless door-to-door trips by integrating different transport modes. Although many MaaS platforms have emerged in recent years, most of them remain at a limited integration level. This study investigates the assignment and pricing problem for a MaaS platform as an intermediary in a multi-modal transportation network, which purchases capacity from service oper…

    Submitted 12 October, 2023; originally announced October 2023.

  10. arXiv:2308.14378  [pdf, other]

    cs.CV

    GKGNet: Group K-Nearest Neighbor based Graph Convolutional Network for Multi-Label Image Recognition

    Authors: Ruijie Yao, Sheng Jin, Lumin Xu, Wang Zeng, Wentao Liu, Chen Qian, Ping Luo, Ji Wu

    Abstract: Multi-Label Image Recognition (MLIR) is a challenging task that aims to predict multiple object labels in a single image while modeling the complex relationships between labels and image regions. Although convolutional neural networks and vision transformers have succeeded in processing images as regular grids of pixels or patches, these representations are sub-optimal for capturing irregular and…

    Submitted 28 August, 2023; originally announced August 2023.

  11. CT-Net: Arbitrary-Shaped Text Detection via Contour Transformer

    Authors: Zhiwen Shao, Yuchen Su, Yong Zhou, Fanrong Meng, Hancheng Zhu, Bing Liu, Rui Yao

    Abstract: Contour based scene text detection methods have rapidly developed recently, but still suffer from inaccurate frontend contour initialization, multi-stage error accumulation, or deficient local information aggregation. To tackle these limitations, we propose a novel arbitrary-shaped scene text detection framework named CT-Net by progressive contour regression with contour transformers. Specifically…

    Submitted 25 July, 2023; originally announced July 2023.

    Comments: This paper has been accepted by IEEE Transactions on Circuits and Systems for Video Technology

  12. arXiv:2306.08854  [pdf, other]

    cs.LG cs.AI stat.CO stat.ML

    A Gromov--Wasserstein Geometric View of Spectrum-Preserving Graph Coarsening

    Authors: Yifan Chen, Rentian Yao, Yun Yang, Jie Chen

    Abstract: Graph coarsening is a technique for solving large-scale graph problems by working on a smaller version of the original graph, and possibly interpolating the results back to the original graph. It has a long history in scientific computing and has recently gained popularity in machine learning, particularly in methods that preserve the graph spectrum. This work studies graph coarsening from a diffe…

    Submitted 15 June, 2023; originally announced June 2023.

    Comments: To appear at ICML 2023. Code is available at https://github.com/ychen-stat-ml/GW-Graph-Coarsening

  13. arXiv:2306.06624  [pdf, other]

    cs.CL

    RestGPT: Connecting Large Language Models with Real-World RESTful APIs

    Authors: Yifan Song, Weimin Xiong, Dawei Zhu, Wenhao Wu, Han Qian, Mingbo Song, Hailiang Huang, Cheng Li, Ke Wang, Rong Yao, Ye Tian, Sujian Li

    Abstract: Tool-augmented large language models (LLMs) have achieved remarkable progress in tackling a broad range of tasks. However, existing methods are mainly restricted to specifically designed tools and fail to fulfill complex instructions, having great limitations when confronted with real-world scenarios. In this paper, we explore a more realistic scenario by connecting LLMs with RESTful APIs, which a…

    Submitted 26 August, 2023; v1 submitted 11 June, 2023; originally announced June 2023.

    Comments: Add RestBench to evaluate RestGPT

  14. arXiv:2306.00127  [pdf, other]

    cs.LG cs.CR

    Surrogate Model Extension (SME): A Fast and Accurate Weight Update Attack on Federated Learning

    Authors: Junyi Zhu, Ruicong Yao, Matthew B. Blaschko

    Abstract: In Federated Learning (FL) and many other distributed training frameworks, collaborators can hold their private data locally and only share the network weights trained with the local data after multiple iterations. Gradient inversion is a family of privacy attacks that recovers data from its generated gradients. Seemingly, FL can provide a degree of protection against gradient inversion attacks on…

    Submitted 31 May, 2023; originally announced June 2023.

    Comments: Accepted at ICML 2023

  15. arXiv:2305.15583  [pdf, other]

    cs.CV

    Alleviating Exposure Bias in Diffusion Models through Sampling with Shifted Time Steps

    Authors: Mingxiao Li, Tingyu Qu, Ruicong Yao, Wei Sun, Marie-Francine Moens

    Abstract: Diffusion Probabilistic Models (DPM) have shown remarkable efficacy in the synthesis of high-quality images. However, their inference process characteristically requires numerous, potentially hundreds, of iterative steps, which could exaggerate the problem of exposure bias due to the training and inference discrepancy. Previous work has attempted to mitigate this issue by perturbing inputs during…

    Submitted 16 June, 2024; v1 submitted 24 May, 2023; originally announced May 2023.

    Comments: Accepted at International Conference on Learning Representations (ICLR2024); typo correction

  16. HybridPoint: Point Cloud Registration Based on Hybrid Point Sampling and Matching

    Authors: Yiheng Li, Canhui Tang, Runzhao Yao, Aixue Ye, Feng Wen, Shaoyi Du

    Abstract: Patch-to-point matching has become a robust way of point cloud registration. However, previous patch-matching methods employ superpoints with poor localization precision as nodes, which may lead to ambiguous patch partitions. In this paper, we propose a HybridPoint-based network to find more robust and accurate correspondences. Firstly, we propose to use salient points with prominent local feature…

    Submitted 23 April, 2023; v1 submitted 29 March, 2023; originally announced March 2023.

    Comments: Accepted by IEEE International Conference on Multimedia and Expo (ICME), 2023

  17. arXiv:2302.03931  [pdf, other]

    stat.ML cs.LG stat.ME

    Fast Linear Model Trees by PILOT

    Authors: Jakob Raymaekers, Peter J. Rousseeuw, Tim Verdonck, Ruicong Yao

    Abstract: Linear model trees are regression trees that incorporate linear models in the leaf nodes. This preserves the intuitive interpretation of decision trees and at the same time enables them to better capture linear relationships, which is hard for standard decision trees. But most existing methods for fitting linear model trees are time consuming and therefore not scalable to large data sets. In addit…

    Submitted 8 February, 2023; originally announced February 2023.

    Journal ref: Machine Learning, 2024

  18. arXiv:2211.12794  [pdf, ps, other]

    cs.IT eess.SP

    Zero Forcing Uplink Detection through Large-Scale RIS: System Performance and Phase Shift Design

    Authors: Nikolaos I. Miridakis, Theodoros A. Tsiftsis, Rugui Yao

    Abstract: A multiple-input multiple-output wireless communication system is analytically studied, which operates with the aid of a large-scale reconfigurable intelligent surface (LRIS). LRIS is equipped with multiple passive elements with discrete phase adjustment capabilities, and independent Rician fading conditions are assumed for both the transmitter-to-LRIS and LRIS-to-receiver links. A direct transcei…

    Submitted 23 November, 2022; originally announced November 2022.

    Comments: Accepted for publication to IEEE Transactions on Communications

  19. arXiv:2211.00168  [pdf, other]

    cs.CV cs.LG

    Improving Fairness in Image Classification via Sketching

    Authors: Ruichen Yao, Ziteng Cui, Xiaoxiao Li, Lin Gu

    Abstract: Fairness is a fundamental requirement for trustworthy and human-centered Artificial Intelligence (AI) system. However, deep neural networks (DNNs) tend to make unfair predictions when the training data are collected from different sub-populations with different attributes (i.e. color, sex, age), leading to biased DNN predictions. We notice that such a troubling phenomenon is often caused by data i…

    Submitted 31 October, 2022; originally announced November 2022.

    Comments: 8 pages, 2 figures. To appear in 2022 Trustworthy and Socially Responsible Machine Learning (TSRML 2022) co-located with NeurIPS 2022

  20. arXiv:2208.12042  [pdf, other]

    stat.ME cs.LG

    Efficient Truncated Linear Regression with Unknown Noise Variance

    Authors: Constantinos Daskalakis, Patroklos Stefanou, Rui Yao, Manolis Zampetakis

    Abstract: Truncated linear regression is a classical challenge in Statistics, wherein a label, $y = w^T x + \varepsilon$, and its corresponding feature vector, $x \in \mathbb{R}^k$, are only observed if the label falls in some subset $S \subseteq \mathbb{R}$; otherwise the existence of the pair $(x, y)$ is hidden from observation. Linear regression with truncated observations has remained a challenge, in it…

    Submitted 25 August, 2022; originally announced August 2022.

  21. TextDCT: Arbitrary-Shaped Text Detection via Discrete Cosine Transform Mask

    Authors: Yuchen Su, Zhiwen Shao, Yong Zhou, Fanrong Meng, Hancheng Zhu, Bing Liu, Rui Yao

    Abstract: Arbitrary-shaped scene text detection is a challenging task due to the variety of text changes in font, size, color, and orientation. Most existing regression based methods resort to regress the masks or contour points of text regions to model the text instances. However, regressing the complete masks requires high training complexity, and contour points are not sufficient to capture the details o…

    Submitted 27 June, 2022; originally announced June 2022.

    Comments: This paper has been accepted by IEEE Transactions on Multimedia

  22. arXiv:2203.15612  [pdf, other]

    cs.RO

    Three-Dimensional Spectrum Occupancy Measurement using UAV: Performance Analysis and Algorithm Design

    Authors: Zhiqing Wei, Rubing Yao, Jie Kang, Xu Chen, Huici Wu

    Abstract: Spectrum sharing, as an approach to significantly improve spectrum efficiency in the era of 6th generation mobile networks (6G), has attracted extensive attention. Radio Environment Map (REM) based low-complexity spectrum sharing is widely studied where the spectrum occupancy measurement (SOM) is vital to construct REM. The SOM in three-dimensional (3D) space is becoming increasingly essential to…

    Submitted 29 March, 2022; originally announced March 2022.

  23. arXiv:2112.14192   

    cs.IT cs.CL

    Robust Security Analysis Based on Random Geometry Theory for Satellite-Terrestrial-Vehicle Network

    Authors: Xudong Li, Ye Fan, Rugui Yao, Peng Wang, Nan Qi, Xiaoya Zuo

    Abstract: Driven by B5G and 6G technologies, multi-network fusion is an indispensable tendency for future communications. In this paper, we focus on and analyze the \emph{security performance} (SP) of the \emph{satellite-terrestrial downlink transmission} (STDT). Here, the STDT is composed of a satellite network and a vehicular network with a legitimate mobile receiver and an mobile eavesdropper distributin…

    Submitted 14 July, 2022; v1 submitted 28 December, 2021; originally announced December 2021.

    Comments: The theoretical analysis in the original manuscript is insufficient, and the system model is not convincing. With the consideration of these flaws, we decide to withdraw our work for further improvement

  24. arXiv:2109.09467  [pdf]

    cs.IT cs.CE

    Cooperative Anti-Jamming for UAV Networks: A Local Altruistic Game Approach

    Authors: Yueyue Su, Nan Qi, Zanqi Huang, Rugui Yao, Luliang Jia

    Abstract: To improve the anti-jamming ability of the UAV-aided communication systems, this paper investigates the channel selection optimization problem in face of both internal mutual interference and external malicious jamming. A cooperative anti-jamming method based on local altruistic is proposed to optimize UAVs' channel selection. Specifically, a Stackelberg game is modeled to formulate the confrontat…

    Submitted 12 September, 2021; originally announced September 2021.

    Comments: 14 pages, 8 figures

    MSC Class: 91A28

  25. arXiv:2107.11921  [pdf, other]

    cs.LG

    Compensation Learning

    Authors: Rujing Yao, Ou Wu

    Abstract: Weighting strategy prevails in machine learning. For example, a common approach in robust machine learning is to exert lower weights on samples which are likely to be noisy or quite hard. This study reveals another undiscovered strategy, namely, compensating. Various incarnations of compensating have been utilized but it has not been explicitly revealed. Learning with compensating is called compen…

    Submitted 4 January, 2022; v1 submitted 25 July, 2021; originally announced July 2021.

  26. arXiv:2106.13319  [pdf]

    cs.AI cs.LG physics.soc-ph stat.ME

    A variational autoencoder approach for choice set generation and implicit perception of alternatives in choice modeling

    Authors: Rui Yao, Shlomo Bekhor

    Abstract: This paper derives the generalized extreme value (GEV) model with implicit availability/perception (IAP) of alternatives and proposes a variational autoencoder (VAE) approach for choice set generation and implicit perception of alternatives. Specifically, the cross-nested logit (CNL) model with IAP is derived as an example of IAP-GEV models. The VAE approach is adapted to model the choice set gene…

    Submitted 18 June, 2021; originally announced June 2021.

  27. arXiv:2105.13078  [pdf]

    cs.DS

    A Dynamic Tree Algorithm for Peer-to-Peer Ride-sharing Matching

    Authors: Rui Yao, Shlomo Bekhor

    Abstract: On-demand peer-to-peer ride-sharing services provide flexible mobility options, and are expected to alleviate congestion by sharing empty car seats. An efficient matching algorithm is essential to the success of a ride-sharing system. The matching problem is related to the well-known dial-a-ride problem, which also tries to find the optimal pickup and delivery sequence for a given set of passenger…

    Submitted 27 May, 2021; originally announced May 2021.

    Comments: Accepted for publication on Networks and Spatial Economics

  28. arXiv:2104.13463  [pdf]

    cs.MA eess.SY

    A ridesharing simulation platform that considers dynamic supply-demand interactions

    Authors: Rui Yao, Shlomo Bekhor

    Abstract: This paper presents a new ridesharing simulation platform that accounts for dynamic driver supply and passenger demand, and complex interactions between drivers and passengers. The proposed simulation platform explicitly considers driver and passenger acceptance/rejection on the matching options, and cancellation before/after being matched. New simulation events, procedures and modules have been d…

    Submitted 15 May, 2022; v1 submitted 27 April, 2021; originally announced April 2021.

  29. arXiv:2104.02880  [pdf, ps, other]

    cs.DC

    Contingency Analysis Based on Partitioned and Parallel Holomorphic Embedding

    Authors: Rui Yao, Feng Qiu, Kai Sun

    Abstract: In the steady-state contingency analysis, the traditional Newton-Raphson method suffers from non-convergence issues when solving post-outage power flow problems, which hinders the integrity and accuracy of security assessment. In this paper, we propose a novel robust contingency analysis approach based on holomorphic embedding (HE). The HE-based simulator guarantees convergence if the true power f…

    Submitted 6 April, 2021; originally announced April 2021.

  30. arXiv:2104.02877  [pdf, ps, other]

    cs.CE

    Hybrid QSS and Dynamic Extended-Term Simulation Based on Holomorphic Embedding

    Authors: Rui Yao, Feng Qiu

    Abstract: Power system simulations that extend over a time period of minutes, hours, or even longer are called extended-term simulations. As power systems evolve into complex systems with increasing interdependencies and richer dynamic behaviors across a wide range of timescales, extended-term simulation is needed for many power system analysis tasks (e.g., resilience analysis, renewable energy integration,…

    Submitted 6 April, 2021; originally announced April 2021.

  31. Encoding Frequency Constraints in Preventive Unit Commitment Using Deep Learning with Region-of-Interest Active Sampling

    Authors: Yichen Zhang, Hantao Cui, Jianzhe Liu, Feng Qiu, Tianqi Hong, Rui Yao, Fangxing Li

    Abstract: With the increasing penetration of renewable energy, frequency response and its security are of significant concerns for reliable power system operations. Frequency-constrained unit commitment (FCUC) is proposed to address this challenge. Despite existing efforts in modeling frequency characteristics in unit commitment (UC), current strategies can only handle oversimplified low-order frequency res…

    Submitted 12 October, 2021; v1 submitted 18 February, 2021; originally announced February 2021.

  32. arXiv:2101.07897  [pdf, other]

    cs.CR cs.CY

    Safer Illinois and RokWall: Privacy Preserving University Health Apps for COVID-19

    Authors: Vikram Sharma Mailthody, James Wei, Nicholas Chen, Mohammad Behnia, Ruihao Yao, Qihao Wang, Vedant Agrawal, Churan He, Lijian Wang, Leihao Chen, Amit Agarwal, Edward Richter, Wen-Mei Hwu, Christopher W. Fletcher, Jinjun Xiong, Andrew Miller, Sanjay Patel

    Abstract: COVID-19 has fundamentally disrupted the way we live. Government bodies, universities, and companies worldwide are rapidly developing technologies to combat the COVID-19 pandemic and safely reopen society. Essential analytics tools such as contact tracing, super-spreader event detection, and exposure mapping require collecting and analyzing sensitive user information. The increasing use of such po…

    Submitted 17 March, 2021; v1 submitted 19 January, 2021; originally announced January 2021.

    Comments: Appears in the Workshop on Secure IT Technologies against COVID-19(CoronaDef) 2021

  33. CycleSegNet: Object Co-segmentation with Cycle Refinement and Region Correspondence

    Authors: Chi Zhang, Guankai Li, Guosheng Lin, Qingyao Wu, Rui Yao

    Abstract: Image co-segmentation is an active computer vision task that aims to segment the common objects from a set of images. Recently, researchers design various learning-based algorithms to undertake the co-segmentation task. The main difficulty in this task is how to effectively transfer information between images to make conditional predictions. In this paper, we present CycleSegNet, a novel framework…

    Submitted 2 June, 2021; v1 submitted 4 January, 2021; originally announced January 2021.

    Comments: Accept to TIP

  34. arXiv:2011.12354  [pdf, ps, other]

    eess.SY cs.MA

    PowerNet: Multi-agent Deep Reinforcement Learning for Scalable Powergrid Control

    Authors: Dong Chen, Kaian Chen, Zhaojian Li, Tianshu Chu, Rui Yao, Feng Qiu, Kaixiang Lin

    Abstract: This paper develops an efficient multi-agent deep reinforcement learning algorithm for cooperative controls in powergrids. Specifically, we consider the decentralized inverter-based secondary voltage control problem in distributed generators (DGs), which is first formulated as a cooperative multi-agent reinforcement learning (MARL) problem. We then propose a novel on-policy MARL algorithm, PowerNe…

    Submitted 31 July, 2021; v1 submitted 24 November, 2020; originally announced November 2020.

    Comments: 11 pages

  35. arXiv:2011.00518  [pdf, other]

    cs.IR cs.AI

    AI Marker-based Large-scale AI Literature Mining

    Authors: Rujing Yao, Yingchun Ye, Ji Zhang, Shuxiao Li, Ou Wu

    Abstract: The knowledge contained in academic literature is interesting to mine. Inspired by the idea of molecular markers tracing in the field of biochemistry, three named entities, namely, methods, datasets and metrics are used as AI markers for AI literature. These entities can be used to trace the research process described in the bodies of papers, which opens up new perspectives for seeking and mining…

    Submitted 2 November, 2020; v1 submitted 1 November, 2020; originally announced November 2020.

  36. arXiv:2010.13583  [pdf, other]

    cs.AI

    Method and Dataset Entity Mining in Scientific Literature: A CNN + Bi-LSTM Model with Self-attention

    Authors: Linlin Hou, Ji Zhang, Ou Wu, Ting Yu, Zhen Wang, Zhao Li, Jianliang Gao, Yingchun Ye, Rujing Yao

    Abstract: Literature analysis facilitates researchers to acquire a good understanding of the development of science and technology. The traditional literature analysis focuses largely on the literature metadata such as topics, authors, abstracts, keywords, references, etc., and little attention was paid to the main content of papers. In many scientific domains such as science, computing, engineering, etc.,…

    Submitted 27 January, 2021; v1 submitted 26 October, 2020; originally announced October 2020.

  37. arXiv:2007.13250  [pdf, other]

    cs.LG eess.SY stat.ML

    Deep Active Learning for Solvability Prediction in Power Systems

    Authors: Yichen Zhang, Jianzhe Liu, Feng Qiu, Tianqi Hong, Rui Yao

    Abstract: Traditional methods for solvability region analysis can only have inner approximations with inconclusive conservatism. Machine learning methods have been proposed to approach the real region. In this letter, we propose a deep active learning framework for power system solvability prediction. Compared with the passive learning methods where the training is performed after all instances are labeled,…

    Submitted 22 December, 2020; v1 submitted 26 July, 2020; originally announced July 2020.

  38. arXiv:2005.11195  [pdf]

    cs.DS

    A Dynamic Tree Algorithm for On-demand Peer-to-peer Ride-sharing Matching

    Authors: Rui Yao, Shlomo Bekhor

    Abstract: Innovative shared mobility services provide on-demand flexible mobility options and have the potential to alleviate traffic congestion. These attractive services are challenging from different perspectives. One major challenge in such systems is to find suitable ride-sharing matchings between drivers and passengers with respect to the system objective and constraints, and to provide optimal pickup…

    Submitted 22 May, 2020; originally announced May 2020.

    Comments: hEART 2020 : 9th Symposium of the European Association for Research in Transportation

  39. arXiv:2005.04463  [pdf, ps, other]

    cs.CV

    Vehicle Re-Identification Based on Complementary Features

    Authors: Cunyuan Gao, Yi Hu, Yi Zhang, Rui Yao, Yong Zhou, Jiaqi Zhao

    Abstract: In this work, we present our solution to the vehicle re-identification (vehicle Re-ID) track in AI City Challenge 2020 (AIC2020). The purpose of vehicle Re-ID is to retrieve the same vehicle appeared across multiple cameras, and it could make a great contribution to the Intelligent Traffic System(ITS) and smart city. Due to the vehicle's orientation, lighting and inter-class similarity, it is diff…

    Submitted 9 May, 2020; originally announced May 2020.

  40. Facial Action Unit Detection via Adaptive Attention and Relation

    Authors: Zhiwen Shao, Yong Zhou, Jianfei Cai, Hancheng Zhu, Rui Yao

    Abstract: Facial action unit (AU) detection is challenging due to the difficulty in capturing correlated information from subtle and dynamic AUs. Existing methods often resort to the localization of correlated regions of AUs, in which predefining local AU attentions by correlated facial landmarks often discards essential parts, or learning global attention maps often contains irrelevant areas. Furthermore,…

    Submitted 16 May, 2023; v1 submitted 5 January, 2020; originally announced January 2020.

    Comments: This paper has been accepted by IEEE Transactions on Image Processing (TIP)

  41. arXiv:1912.12395  [pdf, ps, other]

    eess.SP cs.MM

    OpenRadar: A Toolkit for Prototyping mmWave Radar Applications

    Authors: Arjun Gupta, Dashiell Kosaka, Edwin Pan, Jingning Tang, Ruihao Yao, Sanjay Patel

    Abstract: Millimeter-Wave (mmWave) radar sensors are gaining popularity for their robust sensing and increasing imaging capabilities. However, current radar signal processing is hardware specific, which makes it impossible to build sensor agnostic solutions. OpenRadar serves as an interface to prototype, research, and benchmark solutions in a modular manner. This enables creating software processing stacks…

    Submitted 27 December, 2019; originally announced December 2019.

    MSC Class: I.2.0; I.5.4; J.7

    ACM Class: I.2.0; I.5.4; J.7

  42. arXiv:1912.00398  [pdf, other]

    cs.CL cs.AI cs.LG

    Deep Human Answer Understanding for Natural Reverse QA

    Authors: Rujing Yao, Linlin Hou, Lei Yang, Jie Gui, Qing Yin, Ou Wu

    Abstract: This study focuses on a reverse question answering (QA) procedure, in which machines proactively raise questions and humans supply the answers. This procedure exists in many real human-machine interaction applications. However, a crucial problem in human-machine interaction is answer understanding. The existing solutions have relied on mandatory option term selection to avoid automatic answer unde…

    Submitted 28 November, 2020; v1 submitted 1 December, 2019; originally announced December 2019.

  43. arXiv:1911.13096  [pdf]

    cs.LG cs.CL cs.IR stat.ML

    Method and Dataset Mining in Scientific Papers

    Authors: Rujing Yao, Linlin Hou, Yingchun Ye, Ou Wu, Ji Zhang, Jian Wu

    Abstract: Literature analysis facilitates researchers better understanding the development of science and technology. The conventional literature analysis focuses on the topics, authors, abstracts, keywords, references, etc., and rarely pays attention to the content of papers. In the field of machine learning, the involved methods (M) and datasets (D) are key information in papers. The extraction and mining…

    Submitted 29 November, 2019; originally announced November 2019.

  44. arXiv:1910.13174  [pdf, other]

    cs.RO

    Autonomous UAV Landing System Based on Visual Navigation

    Authors: Zhixin Wu, Peng Han, Ruiwen Yao, Lei Qiao, Weidong Zhang, Tielong Shen, Min Sun, Yilong Zhu, Ming Liu, Rui Fan

    Abstract: In this paper, we present an autonomous unmanned aerial vehicle (UAV) landing system based on visual navigation. We design the landmark as a topological pattern in order to enable the UAV to distinguish the landmark from the environment easily. In addition, a dynamic thresholding method is developed for image binarization to improve detection efficiency. The relative distance in the horizontal pla…

    Submitted 29 October, 2019; originally announced October 2019.

    Comments: 6 pages, 13 figures, 2019 IEEE International Conference on Imaging Systems and Techniques (IST)

  45. arXiv:1910.13055  [pdf, other]

    cs.CV cs.LG cs.RO eess.IV

    PT-ResNet: Perspective Transformation-Based Residual Network for Semantic Road Image Segmentation

    Authors: Rui Fan, Yuan Wang, Lei Qiao, Ruiwen Yao, Peng Han, Weidong Zhang, Ioannis Pitas, Ming Liu

    Abstract: Semantic road region segmentation is a high-level task, which paves the way towards road scene understanding. This paper presents a residual network trained for semantic road segmentation. Firstly, we represent the projections of road disparities in the v-disparity map as a linear model, which can be estimated by optimizing the v-disparity map using dynamic programming. This linear model is then u…

    Submitted 28 October, 2019; originally announced October 2019.

    Comments: 5 pages, 5 figures, accepted by 2019 IEEE International Conference on Imaging Systems and Techniques (IST)

  46. arXiv:1909.05891  [pdf, other]

    cs.NI

    Traffic-aware Two-stage Queueing Communication Networks: Queue Analysis and Energy Saving

    Authors: Nan Qi, Nikolaos I. Miridakis, Ming Xiao, Theodoros A. Tsiftsis, Rugui Yao, Shi Jin

    Abstract: To boost energy saving for the general delay-tolerant IoT networks, a two-stage and single-relay queueing communication scheme is investigated. Concretely, a traffic-aware $N$-threshold and gated-service policy are applied at the relay. As two fundamental and significant performance metrics, the mean waiting time and long-term expected power consumption are explicitly derived and related with the…

    Submitted 14 February, 2020; v1 submitted 12 September, 2019; originally announced September 2019.

  47. arXiv:1904.09172  [pdf, other]

    cs.CV

    Video Object Segmentation and Tracking: A Survey

    Authors: Rui Yao, Guosheng Lin, Shixiong Xia, Jiaqi Zhao, Yong Zhou

    Abstract: Object segmentation and object tracking are fundamental research areas in the computer vision community. These two topics are difficult to handle some common challenges, such as occlusion, deformation, motion blur, and scale variation. The former contains heterogeneous object, interacting object, edge ambiguity, and shape complexity. And the latter suffers from difficulties in handling fast motion,…

    Submitted 26 April, 2019; v1 submitted 19 April, 2019; originally announced April 2019.

  48. arXiv:1903.02351  [pdf, other]

    cs.CV

    CANet: Class-Agnostic Segmentation Networks with Iterative Refinement and Attentive Few-Shot Learning

    Authors: Chi Zhang, Guosheng Lin, Fayao Liu, Rui Yao, Chunhua Shen

    Abstract: Recent progress in semantic segmentation is driven by deep Convolutional Neural Networks and large-scale labeled image datasets. However, data labeling for pixel-wise segmentation is tedious and costly. Moreover, a trained model can only make predictions within a set of pre-defined classes. In this paper, we present CANet, a class-agnostic segmentation network that performs few-shot segmentation o…

    Submitted 6 March, 2019; originally announced March 2019.

    Comments: Accepted to CVPR 2019

  49. arXiv:1707.00548  [pdf, other]

    cs.CV

    Efficient Eye Typing with 9-direction Gaze Estimation

    Authors: Chi Zhang, Rui Yao, Jinpeng Cai

    Abstract: Vision based text entry systems aim to help disabled people achieve text communication using eye movement. Most previous methods have employed an existing eye tracker to predict gaze direction and design an input method based upon that. However, these methods can result in eye tracking quality becoming easily affected by various factors and lengthy amounts of time for calibration. Our paper presen…

    Submitted 3 July, 2017; originally announced July 2017.

  50. arXiv:1705.01671  [pdf, other]

    cs.CE

    Towards Simulation and Risk Assessment of Weather-Related Cascading Outages

    Authors: Rui Yao, Kai Sun

    Abstract: Weather and environmental factors are verified to have played significant roles in historical major cascading outages and blackouts. Therefore, in the simulation and risk assessment of cascading outages in power systems, it is necessary to consider the weather and environmental effects. This paper proposes a method for the risk assessment of weather-related cascading outages. Based on the analysis…

    Submitted 3 May, 2017; originally announced May 2017.