Showing 1–47 of 47 results for author: Bao, R

  1. arXiv:2406.11190

    cs.CL cs.AI

    Aligning Large Language Models from Self-Reference AI Feedback with one General Principle

    Authors: Rong Bao, Rui Zheng, Shihan Dou, Xiao Wang, Enyu Zhou, Bo Wang, Qi Zhang, Liang Ding, Dacheng Tao

    Abstract: In aligning large language models (LLMs), utilizing feedback from existing advanced AI rather than humans is an important method to scale supervisory signals. However, it is highly challenging for AI to understand human intentions and societal values, and provide accurate preference feedback based on these. Current AI feedback methods rely on powerful LLMs, carefully designed specific principles t…

    Submitted 16 June, 2024; originally announced June 2024.

    Comments: 19 pages, 3 figures

  2. arXiv:2405.06275

    cs.CL

    Pruning as a Domain-specific LLM Extractor

    Authors: Nan Zhang, Yanchi Liu, Xujiang Zhao, Wei Cheng, Runxue Bao, Rui Zhang, Prasenjit Mitra, Haifeng Chen

    Abstract: Large Language Models (LLMs) have exhibited remarkable proficiency across a wide array of NLP tasks. However, the escalation in model size also engenders substantial deployment costs. While few efforts have explored model pruning techniques to reduce the size of LLMs, they mainly center on general or task-specific weights. This leads to suboptimal performance due to lacking specificity on the targ…

    Submitted 10 May, 2024; originally announced May 2024.

    Comments: NAACL 2024 Findings

  3. arXiv:2403.16055

    cs.CE

    Modal-adaptive Knowledge-enhanced Graph-based Financial Prediction from Monetary Policy Conference Calls with LLM

    Authors: Kun Ouyang, Yi Liu, Shicheng Li, Ruihan Bao, Keiko Harimoto, Xu Sun

    Abstract: Financial prediction from Monetary Policy Conference (MPC) calls is a new yet challenging task, which targets at predicting the price movement and volatility for specific financial assets by analyzing multimodal information including text, video, and audio. Although the existing work has achieved great success using cross-modal transformer blocks, it overlooks the potential external financial know…

    Submitted 21 April, 2024; v1 submitted 24 March, 2024; originally announced March 2024.

    Comments: Accepted by LREC Coling 2024 -FinNLP (oral)

  4. arXiv:2403.14729

    cs.CV cs.LG

    Auto-Train-Once: Controller Network Guided Automatic Network Pruning from Scratch

    Authors: Xidong Wu, Shangqian Gao, Zeyu Zhang, Zhenzhen Li, Runxue Bao, Yanfu Zhang, Xiaoqian Wang, Heng Huang

    Abstract: Current techniques for deep neural network (DNN) pruning often involve intricate multi-step processes that require domain-specific expertise, making their widespread adoption challenging. To address the limitation, the Only-Train-Once (OTO) and OTOv2 are proposed to eliminate the need for additional fine-tuning steps by directly training and compressing a general DNN from scratch. Nevertheless, th…

    Submitted 20 March, 2024; originally announced March 2024.

  5. arXiv:2402.16513

    physics.optics cs.ET physics.app-ph

    Photonic Neural Network Fabricated on Thin Film Lithium Niobate for High-Fidelity and Power-Efficient Matrix Computation

    Authors: Yong Zheng, Rongbo Wu, Yuan Ren, Rui Bao, Jian Liu, Yu Ma, Min Wang, Ya Cheng

    Abstract: Photonic neural networks (PNNs) have emerged as a promising platform to address the energy consumption issue that comes with the advancement of artificial intelligence technology, and thin film lithium niobate (TFLN) offers an attractive solution as a material platform mainly for its combined characteristics of low optical loss and large electro-optic (EO) coefficients. Here, we present the first…

    Submitted 26 February, 2024; originally announced February 2024.

    Comments: 27 pages, 10 figures

  6. arXiv:2402.11441

    cs.CL cs.AI cs.LG

    InfuserKI: Enhancing Large Language Models with Knowledge Graphs via Infuser-Guided Knowledge Integration

    Authors: Fali Wang, Runxue Bao, Suhang Wang, Wenchao Yu, Yanchi Liu, Wei Cheng, Haifeng Chen

    Abstract: Though Large Language Models (LLMs) have shown remarkable open-generation capabilities across diverse domains, they struggle with knowledge-intensive tasks. To alleviate this issue, knowledge integration methods have been proposed to enhance LLMs with domain-specific knowledge graphs using external modules. However, they suffer from data inefficiency as they require both known and unknown knowledg…

    Submitted 17 February, 2024; originally announced February 2024.

    Comments: 12 pages, 5 figures

  7. arXiv:2402.09345

    cs.LG cs.AI

    InfoRM: Mitigating Reward Hacking in RLHF via Information-Theoretic Reward Modeling

    Authors: Yuchun Miao, Sen Zhang, Liang Ding, Rong Bao, Lefei Zhang, Dacheng Tao

    Abstract: Despite the success of reinforcement learning from human feedback (RLHF) in aligning language models with human values, reward hacking, also termed reward overoptimization, remains a critical challenge. This issue primarily arises from reward misgeneralization, where reward models (RMs) compute reward using spurious features that are irrelevant to human preferences. In this work, we tackle this pr…

    Submitted 23 May, 2024; v1 submitted 14 February, 2024; originally announced February 2024.

    Comments: 35 pages, 28 figures

  8. arXiv:2402.01987

    cs.LG cs.AI

    Online Transfer Learning for RSV Case Detection

    Authors: Yiming Sun, Yuhe Gao, Runxue Bao, Gregory F. Cooper, Jessi Espino, Harry Hochheiser, Marian G. Michaels, John M. Aronis, Chenxi Song, Ye Ye

    Abstract: Transfer learning has become a pivotal technique in machine learning and has proven to be effective in various real-world applications. However, utilizing this technique for classification tasks with sequential data often faces challenges, primarily attributed to the scarcity of class labels. To address this challenge, we introduce Multi-Source Adaptive Weighting (MSAW), an online multi-source tra…

    Submitted 7 April, 2024; v1 submitted 2 February, 2024; originally announced February 2024.

    Comments: 10 pages, 2 figures

  9. arXiv:2311.06276

    eess.IV cs.CV

    Enhancing the machine vision performance with multi-spectral light sources

    Authors: Feng Zhang, Rui Bao, Congqi Dai, Wanlu Zhang, Shu Liu, Ruiqian Guo

    Abstract: This study mainly focuses on the performance of different multi-spectral light sources on different object colors in machine vision and tries to enhance machine vision with multi-spectral light sources. Using different color pencils as samples, by recognizing the collected images with two classical neural networks, AlexNet and VGG19, the performance was investigated under 35 different multi-spectr…

    Submitted 20 October, 2023; originally announced November 2023.

    Comments: 12 pages, 7 figures

  10. arXiv:2310.14152

    cs.CL cs.LG

    Orthogonal Subspace Learning for Language Model Continual Learning

    Authors: Xiao Wang, Tianze Chen, Qiming Ge, Han Xia, Rong Bao, Rui Zheng, Qi Zhang, Tao Gui, Xuanjing Huang

    Abstract: Benefiting from massive corpora and advanced hardware, large language models (LLMs) exhibit remarkable capabilities in language understanding and generation. However, their performance degrades in scenarios where multiple tasks are encountered sequentially, also known as catastrophic forgetting. In this paper, we propose orthogonal low-rank adaptation (O-LoRA), a simple and efficient approach for…

    Submitted 21 October, 2023; originally announced October 2023.

    Comments: EMNLP 2023 findings

  11. arXiv:2310.08459

    cs.LG cs.AI

    A Survey of Heterogeneous Transfer Learning

    Authors: Runxue Bao, Yiming Sun, Yuhe Gao, Jindong Wang, Qiang Yang, Haifeng Chen, Zhi-Hong Mao, Ye Ye

    Abstract: The application of transfer learning, an approach utilizing knowledge from a source domain to enhance model performance in a target domain, has seen a tremendous rise in recent years, underpinning many real-world scenarios. The key to its success lies in the shared common knowledge between the domains, a prerequisite in most transfer learning methodologies. These methods typically presuppose ident…

    Submitted 15 October, 2023; v1 submitted 12 October, 2023; originally announced October 2023.

  12. arXiv:2309.05608

    cs.CL cs.CE

    Incorporating Pre-trained Model Prompting in Multimodal Stock Volume Movement Prediction

    Authors: Ruibo Chen, Zhiyuan Zhang, Yi Liu, Ruihan Bao, Keiko Harimoto, Xu Sun

    Abstract: Multimodal stock trading volume movement prediction with stock-related news is one of the fundamental problems in the financial area. Existing multimodal works that train models from scratch face the problem of lacking universal knowledge when modeling financial news. In addition, the model's ability may be limited by the lack of domain-related knowledge due to insufficient data in the datasets. To…

    Submitted 11 September, 2023; originally announced September 2023.

    Comments: 9 pages, 3 figures, 7 tables. Accepted by 2023 KDD Workshop on Machine Learning in Finance

  13. A Framework for Migrating to Post-Quantum Cryptography: Security Dependency Analysis and Case Studies

    Authors: Khondokar Fida Hasan, Leonie Simpson, Mir Ali Rezazadeh Baee, Chadni Islam, Ziaur Rahman, Warren Armstrong, Praveen Gauravaram, Matthew McKague

    Abstract: Quantum computing is emerging as a significant threat to information protected by widely used cryptographic systems. Cryptographic methods, once deemed secure for decades, are now at risk of being compromised, posing a massive threat to the security of sensitive data and communications across enterprises worldwide. As a result, there is an urgent need to migrate to quantum-resistant cryptographic…

    Submitted 21 February, 2024; v1 submitted 12 July, 2023; originally announced July 2023.

    Comments: 24 Pages

  14. arXiv:2306.17257

    cs.LG cs.AI cs.CL

    Prediction of COVID-19 Patients' Emergency Room Revisit using Multi-Source Transfer Learning

    Authors: Yuelyu Ji, Yuhe Gao, Runxue Bao, Qi Li, Disheng Liu, Yiming Sun, Ye Ye

    Abstract: The coronavirus disease 2019 (COVID-19) has led to a global pandemic of significant severity. In addition to its high level of contagiousness, COVID-19 can have a heterogeneous clinical course, ranging from asymptomatic carriers to severe and potentially life-threatening health complications. Many patients have to revisit the emergency room (ER) within a short time after discharge, which significa…

    Submitted 29 June, 2023; originally announced June 2023.

    Comments: to appear at ICHI 2023

  15. arXiv:2304.09324

    eess.IV cs.CV

    Computer-Vision Benchmark Segment-Anything Model (SAM) in Medical Images: Accuracy in 12 Datasets

    Authors: Sheng He, Rina Bao, Jingpeng Li, Jeffrey Stout, Atle Bjornerud, P. Ellen Grant, Yangming Ou

    Abstract: Background: The segment-anything model (SAM), introduced in April 2023, shows promise as a benchmark model and a universal solution to segment various natural images. It comes without previously-required re-training or fine-tuning specific to each new dataset. Purpose: To test SAM's accuracy in various medical image segmentation tasks and investigate potential factors that may affect its accurac…

    Submitted 5 May, 2023; v1 submitted 18 April, 2023; originally announced April 2023.

    Comments: Technical Report

  16. arXiv:2304.01401

    eess.IV cs.CV

    U-Netmer: U-Net meets Transformer for medical image segmentation

    Authors: Sheng He, Rina Bao, P. Ellen Grant, Yangming Ou

    Abstract: The combination of the U-Net based deep learning models and Transformer is a new trend for medical image segmentation. U-Net can extract the detailed local semantic and texture information and Transformer can learn the long-range dependencies among pixels in the input image. However, directly adapting the Transformer for segmentation has a "token-flatten" problem (flattens the local patches into 1D…

    Submitted 3 April, 2023; originally announced April 2023.

    Comments: 10 pages, 5 figures, under review

  17. arXiv:2212.08568

    cs.CV cs.LG

    Biomedical image analysis competitions: The state of current participation practice

    Authors: Matthias Eisenmann, Annika Reinke, Vivienn Weru, Minu Dietlinde Tizabi, Fabian Isensee, Tim J. Adler, Patrick Godau, Veronika Cheplygina, Michal Kozubek, Sharib Ali, Anubha Gupta, Jan Kybic, Alison Noble, Carlos Ortiz de Solórzano, Samiksha Pachade, Caroline Petitjean, Daniel Sage, Donglai Wei, Elizabeth Wilden, Deepak Alapatt, Vincent Andrearczyk, Ujjwal Baid, Spyridon Bakas, Niranjan Balu, Sophia Bano, et al. (331 additional authors not shown)

    Abstract: The number of international benchmarking competitions is steadily increasing in various fields of machine learning (ML) research and practice. So far, however, little is known about the common practice as well as bottlenecks faced by the community in tackling the research questions posed. To shed light on the status quo of algorithm development in the specific field of biomedical imaging analysis,…

    Submitted 12 September, 2023; v1 submitted 16 December, 2022; originally announced December 2022.

  18. arXiv:2211.11843

    cs.RO

    SLUGBOT, an Aplysia-inspired Robotic Grasper for Studying Control

    Authors: Kevin Dai, Ravesh Sukhnandan, Michael Bennington, Karen Whirley, Ryan Bao, Lu Li, Jeffrey P. Gill, Hillel J. Chiel, Victoria A. Webster-Wood

    Abstract: Living systems can use a single periphery to perform a variety of tasks and adapt to a dynamic environment. This multifunctionality is achieved through the use of neural circuitry that adaptively controls the reconfigurable musculature. Current robotic systems struggle to flexibly adapt to unstructured environments. Through mimicry of the neuromechanical coupling seen in living organisms, robotic…

    Submitted 21 November, 2022; originally announced November 2022.

    Comments: Submitted and accepted to Living Machines 2022 conference

  19. Robust Lottery Tickets for Pre-trained Language Models

    Authors: Rui Zheng, Rong Bao, Yuhao Zhou, Di Liang, Sirui Wang, Wei Wu, Tao Gui, Qi Zhang, Xuanjing Huang

    Abstract: Recent works on Lottery Ticket Hypothesis have shown that pre-trained language models (PLMs) contain smaller matching subnetworks (winning tickets) which are capable of reaching accuracy comparable to the original models. However, these tickets are proved to be not robust to adversarial examples, and even worse than their PLM counterparts. To address this problem, we propose a novel method based on…

    Submitted 5 November, 2022; originally announced November 2022.

    Comments: ACL 2022. https://aclanthology.org/2022.acl-long.157

    MSC Class: 68-06

  20. arXiv:2211.01762

    q-fin.TR cs.LG

    Stock Trading Volume Prediction with Dual-Process Meta-Learning

    Authors: Ruibo Chen, Wei Li, Zhiyuan Zhang, Ruihan Bao, Keiko Harimoto, Xu Sun

    Abstract: Volume prediction is one of the fundamental objectives in the Fintech area, which is helpful for many downstream tasks, e.g., algorithmic trading. Previous methods mostly learn a universal model for different stocks. However, this kind of practice omits the specific characteristics of individual stocks by applying the same set of parameters for different stocks. On the other hand, learning differe…

    Submitted 11 October, 2022; originally announced November 2022.

    Comments: 16 pages, 3 figures, 5 tables. Published in ECML-PKDD 2022

  21. arXiv:2210.12689

    cs.CV

    Face Emotion Recognization Using Dataset Augmentation Based on Neural Network

    Authors: Mengyu Rao, Ruyi Bao, Liangshun Dong

    Abstract: Facial expression is one of the most external indications of a person's feelings and emotions. In daily conversation, according to the psychologist, only 7% and 38% of information is communicated through words and sounds respectively, while up to 55% is through facial expression. It plays an important role in coordinating interpersonal relationships. Ekman and Friesen recognized six essential emotio…

    Submitted 21 November, 2022; v1 submitted 23 October, 2022; originally announced October 2022.

    Comments: 5 pages, 8 figures, 3 tables

  22. arXiv:2208.10251

    cs.CL cs.AI

    Rethinking Textual Adversarial Defense for Pre-trained Language Models

    Authors: Jiayi Wang, Rongzhou Bao, Zhuosheng Zhang, Hai Zhao

    Abstract: Although pre-trained language models (PrLMs) have achieved significant success, recent studies demonstrate that PrLMs are vulnerable to adversarial attacks. By generating adversarial examples with slight perturbations on different levels (sentence / word / character), adversarial attacks can fool PrLMs to generate incorrect predictions, which questions the robustness of PrLMs. However, we find tha…

    Submitted 21 July, 2022; originally announced August 2022.

  23. Sampling Through the Lens of Sequential Decision Making

    Authors: Jason Xiaotian Dou, Alvin Qingkai Pan, Runxue Bao, Haiyi Harry Mao, Lei Luo, Zhi-Hong Mao

    Abstract: Sampling is ubiquitous in machine learning methodologies. Due to the growth of large datasets and model complexity, we want to learn and adapt the sampling process while training a representation. Towards achieving this grand goal, a variety of sampling techniques have been proposed. However, most of them either use a fixed sampling scheme or adjust the sampling scheme based on simple heuristics.…

    Submitted 13 December, 2022; v1 submitted 17 August, 2022; originally announced August 2022.

  24. arXiv:2208.07232

    q-fin.TR cs.LG

    Distributional Correlation--Aware Knowledge Distillation for Stock Trading Volume Prediction

    Authors: Lei Li, Zhiyuan Zhang, Ruihan Bao, Keiko Harimoto, Xu Sun

    Abstract: Traditional knowledge distillation in classification problems transfers the knowledge via class correlations in the soft label produced by teacher models, which are not available in regression problems like stock trading volume prediction. To remedy this, we present a novel distillation framework for training a light-weight student model to perform trading volume prediction given historical transa…

    Submitted 4 August, 2022; originally announced August 2022.

    Comments: ECML-PKDD 2022, our code and data will be available at https://github.com/lancopku/DCKD

  25. arXiv:2208.06058

    cs.LG stat.ML

    An Accelerated Doubly Stochastic Gradient Method with Faster Explicit Model Identification

    Authors: Runxue Bao, Bin Gu, Heng Huang

    Abstract: Sparsity regularized loss minimization problems play an important role in various fields including machine learning, data mining, and modern statistics. Proximal gradient descent method and coordinate descent method are the most popular approaches to solving the minimization problem. Although existing methods can achieve implicit model identification, aka support set identification, in a finite nu…

    Submitted 11 August, 2022; originally announced August 2022.

  26. arXiv:2206.09576

    cs.LG cs.AI math.OC

    FedSSO: A Federated Server-Side Second-Order Optimization Algorithm

    Authors: Xin Ma, Renyi Bao, Jinpeng Jiang, Yang Liu, Arthur Jiang, Jun Yan, Xin Liu, Zhisong Pan

    Abstract: In this work, we propose FedSSO, a server-side second-order optimization method for federated learning (FL). In contrast to previous works in this direction, we employ a server-side approximation for the Quasi-Newton method without requiring any training data from the clients. In this way, we not only shift the computation burden from clients to server, but also eliminate the additional communicat…

    Submitted 22 August, 2022; v1 submitted 20 June, 2022; originally announced June 2022.

  27. arXiv:2204.10981

    cs.LG cs.DC stat.ML

    Distributed Dynamic Safe Screening Algorithms for Sparse Regularization

    Authors: Runxue Bao, Xidong Wu, Wenhan Xian, Heng Huang

    Abstract: Distributed optimization has been widely used as one of the most efficient approaches for model training with massive samples. However, large-scale learning problems with both massive samples and high-dimensional features widely exist in the era of big data. Safe screening is a popular technique to speed up high-dimensional models by discarding the inactive features with zero coefficients. Neverth…

    Submitted 22 April, 2022; originally announced April 2022.

  28. arXiv:2203.11199

    cs.LG cs.CL cs.CR

    Distinguishing Non-natural from Natural Adversarial Samples for More Robust Pre-trained Language Model

    Authors: Jiayi Wang, Rongzhou Bao, Zhuosheng Zhang, Hai Zhao

    Abstract: Recently, the problem of robustness of pre-trained language models (PrLMs) has received increasing research interest. Latest studies on adversarial attacks achieve high attack success rates against PrLMs, claiming that PrLMs are not robust. However, we find that the adversarial samples that PrLMs fail are mostly non-natural and do not appear in reality. We question the validity of current evaluati…

    Submitted 19 March, 2022; originally announced March 2022.

    Comments: Accepted by findings of ACL 2022

  29. arXiv:2108.12848

    cs.CL

    Span Fine-tuning for Pre-trained Language Models

    Authors: Rongzhou Bao, Zhuosheng Zhang, Hai Zhao

    Abstract: Pre-trained language models (PrLM) have to carefully manage input units when training on a very large text with a vocabulary consisting of millions of words. Previous works have shown that incorporating span-level information over consecutive words in pre-training could further improve the performance of PrLMs. However, given that span-level clues are introduced and fixed in pre-training, previous…

    Submitted 15 September, 2021; v1 submitted 29 August, 2021; originally announced August 2021.

    Comments: Accepted by EMNLP 2021 Findings (early version)

  30. arXiv:2108.11318

    q-fin.ST cs.AI cs.LG

    Long-term, Short-term and Sudden Event: Trading Volume Movement Prediction with Graph-based Multi-view Modeling

    Authors: Liang Zhao, Wei Li, Ruihan Bao, Keiko Harimoto, Yunfang Wu, Xu Sun

    Abstract: Trading volume movement prediction is the key in a variety of financial applications. Despite its importance, there is little research on this topic because of its requirement for comprehensive understanding of information from different sources. For instance, the relation between multiple stocks, recent transaction data and suddenly released events are all essential for understanding trading market.…

    Submitted 22 August, 2021; originally announced August 2021.

    Comments: Accepted as a main track paper by IJCAI 21

  31. ASAT: Adaptively Scaled Adversarial Training in Time Series

    Authors: Zhiyuan Zhang, Wei Li, Ruihan Bao, Keiko Harimoto, Yunfang Wu, Xu Sun

    Abstract: Adversarial training is a method for enhancing neural networks to improve the robustness against adversarial examples. Besides the security concerns of potential adversarial examples, adversarial training can also improve the generalization ability of neural networks, train robust neural networks, and provide interpretability for neural networks. In this work, we introduce adversarial training in…

    Submitted 19 December, 2022; v1 submitted 19 August, 2021; originally announced August 2021.

    Comments: Accepted to Neurocomputing

    Journal ref: Neurocomputing 522 (2023) pp. 11-23

  32. arXiv:2105.14553

    cs.CL

    Defending Pre-trained Language Models from Adversarial Word Substitutions Without Performance Sacrifice

    Authors: Rongzhou Bao, Jiayi Wang, Hai Zhao

    Abstract: Pre-trained contextualized language models (PrLMs) have led to strong performance gains in downstream natural language understanding tasks. However, PrLMs can still be easily fooled by adversarial word substitution, which is one of the most challenging textual adversarial attack methods. Existing defence approaches suffer from notable performance loss and complexities. Thus, this paper presents a…

    Submitted 30 May, 2021; originally announced May 2021.

    Comments: Findings of ACL: ACL 2021

  33. arXiv:2103.12516

    cs.MM

    Edge-Cloud Collaboration Enabled Video Service Enhancement: A Hybrid Human-Artificial Intelligence Scheme

    Authors: Dapeng Wu, Ruili Bao, Zhidu Li, Honggang Wang, Ruyan Wang

    Abstract: In this paper, a video service enhancement strategy is investigated under an edge-cloud collaboration framework, where video caching and delivery decisions are made in the cloud and edge respectively. We aim to guarantee the user fairness in terms of video coding rate under statistical delay constraint and edge caching capacity constraint. A hybrid human-artificial intelligence approach is develop…

    Submitted 14 January, 2021; originally announced March 2021.

    Comments: This paper has been submitted to IEEE Transactions on Multimedia for review

  34. Stereo Camera Visual SLAM with Hierarchical Masking and Motion-state Classification at Outdoor Construction Sites Containing Large Dynamic Objects

    Authors: Runqiu Bao, Ren Komatsu, Renato Miyagusuku, Masaki Chino, Atsushi Yamashita, Hajime Asama

    Abstract: At modern construction sites, utilizing GNSS (Global Navigation Satellite System) to measure the real-time location and orientation (i.e. pose) of construction machines and navigate them is very common. However, GNSS is not always available. Replacing GNSS with on-board cameras and visual simultaneous localization and mapping (visual SLAM) to navigate the machines is a cost-effective solution. Nev…

    Submitted 16 January, 2021; originally announced January 2021.

    Comments: This is an Accepted Manuscript of an article published by Taylor & Francis in Advanced Robotics on Jan. 11th, 2021, available online: https://www.tandfonline.com/doi/full/10.1080/01691864.2020.1869586 [Article DOI:10.1080/01691864.2020.1869586]

    Journal ref: Advanced Robotics (2021) 1-14

  35. arXiv:2012.15070

    cs.CL

    Enhancing Pre-trained Language Model with Lexical Simplification

    Authors: Rongzhou Bao, Jiayi Wang, Zhuosheng Zhang, Hai Zhao

    Abstract: For both human readers and pre-trained language models (PrLMs), lexical diversity may lead to confusion and inaccuracy when understanding the underlying semantic meanings of given sentences. By substituting complex words with simple alternatives, lexical simplification (LS) is a recognized method to reduce such lexical diversity, and therefore to improve the understandability of sentences. In this…

    Submitted 30 December, 2020; originally announced December 2020.

  36. arXiv:2012.13489

    cs.LG

    Learning Robust Representation for Clustering through Locality Preserving Variational Discriminative Network

    Authors: Ruixuan Luo, Wei Li, Zhiyuan Zhang, Ruihan Bao, Keiko Harimoto, Xu Sun

    Abstract: Clustering is one of the fundamental problems in unsupervised learning. Recent deep learning based methods focus on learning clustering oriented representations. Among those methods, Variational Deep Embedding achieves great success in various clustering tasks by specifying a Gaussian Mixture prior to the latent space. However, VaDE suffers from two problems: 1) it is fragile to the input noise; 2…

    Submitted 10 March, 2021; v1 submitted 24 December, 2020; originally announced December 2020.

    Comments: Accepted by AAAI RSEML 2021

  37. arXiv:2012.03662

    cs.CV cs.CL

    Confidence-aware Non-repetitive Multimodal Transformers for TextCaps

    Authors: Zhaokai Wang, Renda Bao, Qi Wu, Si Liu

    Abstract: When describing an image, reading text in the visual scene is crucial to understand the key information. Recent work explores the TextCaps task, i.e. image captioning with reading Optical Character Recognition (OCR) tokens, which requires models to read text and cover them in generated captions. Existing approaches fail to generate accurate descriptions because of their (1) poor reading ability; (…

    Submitted 21 March, 2021; v1 submitted 7 December, 2020; originally announced December 2020.

    Comments: 9 pages; Accepted by AAAI 2021

  38. arXiv:2006.16433

    cs.LG stat.ME stat.ML

    Fast OSCAR and OWL Regression via Safe Screening Rules

    Authors: Runxue Bao, Bin Gu, Heng Huang

    Abstract: Ordered Weighted $L_{1}$ (OWL) regularized regression is a new regression analysis for high-dimensional sparse learning. Proximal gradient methods are used as standard approaches to solve OWL regression. However, it is still a burning issue to solve OWL regression due to considerable computational cost and memory usage when the feature or sample size is large. In this paper, we propose the first s…

    Submitted 19 October, 2021; v1 submitted 29 June, 2020; originally announced June 2020.

    Comments: Correct the error of the optimality conditions

  39. arXiv:2006.13693

    q-bio.PE cs.LG physics.soc-ph

    PECAIQR: A Model for Infectious Disease Applied to the Covid-19 Epidemic

    Authors: Richard Bao, August Chen, Jethin Gowda, Shiva Mudide

    Abstract: The Covid-19 pandemic has made clear the need to improve modern multivariate time-series forecasting models. Current state of the art predictions of future daily deaths and, especially, hospital resource usage have confidence intervals that are unacceptably wide. Policy makers and hospitals require accurate forecasts to make informed decisions on passing legislation and allocating resources. We us…

    Submitted 17 June, 2020; originally announced June 2020.

  40. arXiv:2004.00991

    q-bio.GN cs.PF

    Computational Performance of a Germline Variant Calling Pipeline for Next Generation Sequencing

    Authors: Jie Liu, Xiaotian Wu, Kai Zhang, Bing Liu, Renyi Bao, Xiao Chen, Yiran Cai, Yiming Shen, Xinjun He, Jun Yan, Weixing Ji

    Abstract: With the booming of next generation sequencing technology and its implementation in clinical practice and life science research, the need for faster and more efficient data analysis methods becomes pressing in the field of sequencing. Here we report on the evaluation of an optimized germline mutation calling pipeline, HummingBird, by assessing its performance against the widely accepted BWA-GATK p…

    Submitted 1 April, 2020; originally announced April 2020.

    Comments: 6 pages, 6 figures, 3 tables

    MSC Class: cs.PF; q-bio.GN ACM Class: C.4; D.4.8; J.3

  41. arXiv:2001.05272

    cs.CL cs.LG stat.ML

    FGN: Fusion Glyph Network for Chinese Named Entity Recognition

    Authors: Zhenyu Xuan, Rui Bao, Shengyi Jiang

    Abstract: Chinese NER is a challenging task. As pictographs, Chinese characters contain latent glyph information, which is often overlooked. In this paper, we propose the FGN, Fusion Glyph Network for Chinese NER. Except for adding glyph information, this method may also add extra interactive information with the fusion mechanism. The major innovations of FGN include: (1) a novel CNN structure called CGS-CN…

    Submitted 8 October, 2020; v1 submitted 15 January, 2020; originally announced January 2020.

  42. arXiv:1910.05078  [pdf, other]

    cs.CE q-fin.ST

    Incorporating Fine-grained Events in Stock Movement Prediction

    Authors: Deli Chen, Yanyan Zou, Keiko Harimoto, Ruihan Bao, Xuancheng Ren, Xu Sun

    Abstract: Considering event structure information has proven helpful in text-based stock movement prediction. However, existing works mainly adopt the coarse-grained events, which loses the specific semantic information of diverse event types. In this work, we propose to incorporate the fine-grained events in stock movement prediction. Firstly, we propose a professional finance event dictionary built by dom…

    Submitted 11 October, 2019; originally announced October 2019.

    Comments: Accepted by 2nd ECONLP workshop in EMNLP2019

  43. arXiv:1910.05032  [pdf, other]

    cs.CL

    Group, Extract and Aggregate: Summarizing a Large Amount of Finance News for Forex Movement Prediction

    Authors: Deli Chen, Shuming Ma, Keiko Harimoto, Ruihan Bao, Qi Su, Xu Sun

    Abstract: Incorporating related text information has proven successful in stock market prediction. However, it is a huge challenge to utilize texts in the enormous forex (foreign currency exchange) market because the associated texts are too redundant. In this work, we propose a BERT-based Hierarchical Aggregation Model to summarize a large amount of finance news to predict forex movement. We firstly group…

    Submitted 11 October, 2019; originally announced October 2019.

    Comments: Accepted by 2nd ECONLP workshop in EMNLP2019

  44. arXiv:1807.06093  [pdf, ps, other]

    cs.LG stat.ML

    Prognostics Estimations with Dynamic States

    Authors: Rong-Jing Bao, Hai-Jun Rong, Zhi-Xin Yang, Badong Chen

    Abstract: The health state assessment and remaining useful life (RUL) estimation play very important roles in prognostics and health management (PHM), owing to their abilities to reduce the maintenance and improve the safety of machines or equipment. However, they generally suffer from this problem of lacking prior knowledge to pre-define the exact failure thresholds for a machinery operating in a dynamic e…

    Submitted 23 September, 2018; v1 submitted 16 July, 2018; originally announced July 2018.

  45. arXiv:1805.07862  [pdf, other]

    cs.LG cs.CR cs.CV stat.ML

    Featurized Bidirectional GAN: Adversarial Defense via Adversarially Learned Semantic Inference

    Authors: Ruying Bao, Sihang Liang, Qingcan Wang

    Abstract: Deep neural networks have been demonstrated to be vulnerable to adversarial attacks, where small perturbations intentionally added to the original inputs can fool the classifier. In this paper, we propose a defense method, Featurized Bidirectional Generative Adversarial Networks (FBGAN), to extract the semantic features of the input and filter the non-semantic perturbation. FBGAN is pre-trained on…

    Submitted 29 September, 2018; v1 submitted 20 May, 2018; originally announced May 2018.

  46. arXiv:1802.00237  [pdf, other]

    cs.CV

    Face Aging with Contextual Generative Adversarial Nets

    Authors: Si Liu, Yao Sun, Defa Zhu, Renda Bao, Wei Wang, Xiangbo Shu, Shuicheng Yan

    Abstract: Face aging, which renders aging faces for an input face, has attracted extensive attention in the multimedia research. Recently, several conditional Generative Adversarial Nets (GANs) based methods have achieved great success. They can generate images fitting the real face distributions conditioned on each individual age group. However, these methods fail to capture the transition patterns, e.g.,…

    Submitted 1 February, 2018; originally announced February 2018.

    Comments: accepted at ACM Multimedia 2017

  47. arXiv:1611.09587  [pdf, other]

    cs.CV

    Surveillance Video Parsing with Single Frame Supervision

    Authors: Si Liu, Changhu Wang, Ruihe Qian, Han Yu, Renda Bao

    Abstract: Surveillance video parsing, which segments the video frames into several labels, e.g., face, pants, left-leg, has wide applications. However, pixel-wisely annotating all frames is tedious and inefficient. In this paper, we develop a Single frame Video Parsing (SVP) method which requires only one labeled frame per video in training stage. To parse one particular frame, the video segment preceding th…

    Submitted 29 November, 2016; originally announced November 2016.