Showing 1–21 of 21 results for author: Yeh, J

  1. arXiv:2407.08839  [pdf, other]

    cs.CR cs.AI cs.CV cs.LG

    A Survey on the Application of Generative Adversarial Networks in Cybersecurity: Prospective, Direction and Open Research Scopes

    Authors: Md Mashrur Arifin, Md Shoaib Ahmed, Tanmai Kumar Ghosh, Jun Zhuang, Jyh-haw Yeh

    Abstract: With the proliferation of Artificial Intelligence, there has been a massive increase in the amount of data required to be accumulated and disseminated digitally. As the data are available online in digital landscapes with complex and sophisticated infrastructures, it is crucial to implement various defense mechanisms based on cybersecurity. Generative Adversarial Networks (GANs), which are deep le…

    Submitted 11 July, 2024; originally announced July 2024.

  2. arXiv:2406.07117  [pdf, other]

    cs.AI cs.LG

    Augmenting Offline RL with Unlabeled Data

    Authors: Zhao Wang, Briti Gangopadhyay, Jia-Fong Yeh, Shingo Takamatsu

    Abstract: Recent advancements in offline Reinforcement Learning (Offline RL) have led to an increased focus on methods based on conservative policy updates to address the Out-of-Distribution (OOD) issue. These methods typically involve adding behavior regularization or modifying the critic learning objective, focusing primarily on states or actions with substantial dataset support. However, we challenge thi…

    Submitted 11 June, 2024; originally announced June 2024.

  3. arXiv:2406.07041  [pdf, other]

    cs.LG cs.AI

    Integrating Domain Knowledge for handling Limited Data in Offline RL

    Authors: Briti Gangopadhyay, Zhao Wang, Jia-Fong Yeh, Shingo Takamatsu

    Abstract: With the ability to learn from static datasets, Offline Reinforcement Learning (RL) emerges as a compelling avenue for real-world applications. However, state-of-the-art offline RL algorithms perform sub-optimally when confronted with limited data confined to specific regions within the state space. The performance degradation is attributed to the inability of offline RL algorithms to learn approp…

    Submitted 11 June, 2024; originally announced June 2024.

  4. arXiv:2406.00761  [pdf, other]

    cs.LG cs.AI

    Shared-unique Features and Task-aware Prioritized Sampling on Multi-task Reinforcement Learning

    Authors: Po-Shao Lin, Jia-Fong Yeh, Yi-Ting Chen, Winston H. Hsu

    Abstract: We observe that current state-of-the-art (SOTA) methods suffer from the performance imbalance issue when performing multi-task reinforcement learning (MTRL) tasks. While these methods may achieve impressive performance on average, they perform extremely poorly on a few tasks. To address this, we propose a new and effective method called STARS, which consists of two novel strategies: a shared-uniqu…

    Submitted 2 June, 2024; originally announced June 2024.

    Comments: The first two authors contribute equally

  5. arXiv:2405.16545  [pdf, other]

    cs.RO

    VICtoR: Learning Hierarchical Vision-Instruction Correlation Rewards for Long-horizon Manipulation

    Authors: Kuo-Han Hung, Pang-Chi Lo, Jia-Fong Yeh, Han-Yuan Hsu, Yi-Ting Chen, Winston H. Hsu

    Abstract: We study reward models for long-horizon manipulation tasks by learning from action-free videos and language instructions, which we term the visual-instruction correlation (VIC) problem. Recent advancements in cross-modality modeling have highlighted the potential of reward modeling through visual and language correlations. However, existing VIC methods face challenges in learning rewards for long-…

    Submitted 26 May, 2024; originally announced May 2024.

  6. arXiv:2403.18330  [pdf, other]

    cs.CV cs.LG

    Tracking-Assisted Object Detection with Event Cameras

    Authors: Ting-Kang Yen, Igor Morawski, Shusil Dangi, Kai He, Chung-Yi Lin, Jia-Fong Yeh, Hung-Ting Su, Winston Hsu

    Abstract: Event-based object detection has recently garnered attention in the computer vision community due to the exceptional properties of event cameras, such as high dynamic range and no motion blur. However, feature asynchronism and sparsity cause invisible objects due to no relative motion to the camera, posing a significant challenge in the task. Prior works have studied various memory mechanisms to p…

    Submitted 27 March, 2024; originally announced March 2024.

  7. arXiv:2402.03860  [pdf, other]

    cs.RO

    AED: Adaptable Error Detection for Few-shot Imitation Policy

    Authors: Jia-Fong Yeh, Kuo-Han Hung, Pang-Chi Lo, Chi-Ming Chung, Tsung-Han Wu, Hung-Ting Su, Yi-Ting Chen, Winston H. Hsu

    Abstract: We introduce a new task called Adaptable Error Detection (AED), which aims to identify behavior errors in few-shot imitation (FSI) policies based on visual observations in novel environments. The potential to cause serious damage to surrounding areas limits the application of FSI policies in real-world scenarios. Thus, a robust system is necessary to notify operators when FSI policies are inconsis…

    Submitted 25 May, 2024; v1 submitted 6 February, 2024; originally announced February 2024.

  8. Predicting Failure of P2P Lending Platforms through Machine Learning: The Case in China

    Authors: Jen-Yin Yeh, Hsin-Yu Chiu, Jhih-Huei Huang

    Abstract: This study employs machine learning models to predict the failure of Peer-to-Peer (P2P) lending platforms, specifically in China. By employing the filter method and wrapper method with forward selection and backward elimination, we establish a rigorous and practical procedure that ensures the robustness and importance of variables in predicting platform failures. The research identifies a set of r…

    Submitted 24 November, 2023; originally announced November 2023.

    Journal ref: Finance Research Letters Volume 59, January 2024, 104784

  9. arXiv:2304.04688  [pdf, other]

    cs.CV cs.AI

    Interaction-Aware Prompting for Zero-Shot Spatio-Temporal Action Detection

    Authors: Wei-Jhe Huang, Jheng-Hsien Yeh, Min-Hung Chen, Gueter Josmy Faure, Shang-Hong Lai

    Abstract: The goal of spatial-temporal action detection is to determine the time and place where each person's action occurs in a video and classify the corresponding action category. Most of the existing methods adopt fully-supervised learning, which requires a large amount of training data, making it very difficult to achieve zero-shot learning. In this paper, we propose to utilize a pre-trained visual-la…

    Submitted 20 September, 2023; v1 submitted 10 April, 2023; originally announced April 2023.

    Comments: Accepted by ICCV Workshop 2023 (What is Next in Multimodal Foundation Models?)

  10. arXiv:2303.16637  [pdf, other]

    cs.CV

    MuRAL: Multi-Scale Region-based Active Learning for Object Detection

    Authors: Yi-Syuan Liou, Tsung-Han Wu, Jia-Fong Yeh, Wen-Chin Chen, Winston H. Hsu

    Abstract: Obtaining large-scale labeled object detection dataset can be costly and time-consuming, as it involves annotating images with bounding boxes and class labels. Thus, some specialized active learning methods have been proposed to reduce the cost by selecting either coarse-grained samples or fine-grained instances from unlabeled data for labeling. However, the former approaches suffer from redundant…

    Submitted 29 March, 2023; originally announced March 2023.

  11. arXiv:2303.04027  [pdf, other]

    cs.MM cs.RO

    BIRD-PCC: Bi-directional Range Image-based Deep LiDAR Point Cloud Compression

    Authors: Chia-Sheng Liu, Jia-Fong Yeh, Hao Hsu, Hung-Ting Su, Ming-Sui Lee, Winston H. Hsu

    Abstract: The large amount of data collected by LiDAR sensors brings the issue of LiDAR point cloud compression (PCC). Previous works on LiDAR PCC have used range image representations and followed the predictive coding paradigm to create a basic prototype of a coding framework. However, their prediction methods give an inaccurate result due to the negligence of invalid pixels in range images and the omissi…

    Submitted 8 March, 2023; v1 submitted 7 March, 2023; originally announced March 2023.

    Comments: Accepted to ICASSP 2023

  12. arXiv:2212.08464  [pdf, other]

    cs.CV

    Free-form 3D Scene Inpainting with Dual-stream GAN

    Authors: Ru-Fen Jheng, Tsung-Han Wu, Jia-Fong Yeh, Winston H. Hsu

    Abstract: Nowadays, the need for user editing in a 3D scene has rapidly increased due to the development of AR and VR technology. However, the existing 3D scene completion task (and datasets) cannot suit the need because the missing regions in scenes are generated by the sensor limitation or object occlusion. Thus, we present a novel task named free-form 3D scene inpainting. Unlike scenes in previous 3D com…

    Submitted 16 December, 2022; originally announced December 2022.

    Comments: BMVC 2022

  13. arXiv:2210.03941  [pdf, other]

    cs.CV cs.CL

    Learning Fine-Grained Visual Understanding for Video Question Answering via Decoupling Spatial-Temporal Modeling

    Authors: Hsin-Ying Lee, Hung-Ting Su, Bing-Chen Tsai, Tsung-Han Wu, Jia-Fong Yeh, Winston H. Hsu

    Abstract: While recent large-scale video-language pre-training made great progress in video question answering, the design of spatial modeling of video-language models is less fine-grained than that of image-language models; existing practices of temporal modeling also suffer from weak and noisy alignment between modalities. To learn fine-grained visual understanding, we decouple spatial-temporal modeling a…

    Submitted 8 October, 2022; originally announced October 2022.

    Comments: BMVC 2022. Code is available at https://github.com/shinying/dest

  14. arXiv:2209.13274  [pdf, other]

    cs.RO cs.CV

    Orbeez-SLAM: A Real-time Monocular Visual SLAM with ORB Features and NeRF-realized Mapping

    Authors: Chi-Ming Chung, Yang-Che Tseng, Ya-Ching Hsu, Xiang-Qian Shi, Yun-Hung Hua, Jia-Fong Yeh, Wen-Chin Chen, Yi-Ting Chen, Winston H. Hsu

    Abstract: A spatial AI that can perform complex tasks through visual signals and cooperate with humans is highly anticipated. To achieve this, we need a visual SLAM that easily adapts to new scenes without pre-training and generates dense maps for downstream tasks in real-time. None of the previous learning-based and non-learning-based visual SLAMs satisfy all needs due to the intrinsic limitations of their…

    Submitted 31 January, 2023; v1 submitted 27 September, 2022; originally announced September 2022.

  15. arXiv:2206.09570  [pdf, other]

    cs.HC cs.CV

    Guardian Angel: A Novel Walking Aid for the Visually Impaired

    Authors: Ko-Wei Tai, HuaYen Lee, Hsin-Huei Chen, Jeng-Sheng Yeh, Ming Ouhyoung

    Abstract: This work introduces Guardian Angel, an Android App that assists visually impaired people to avoid danger in complex traffic environment. The system, consisting of object detection by pretrained YOLO model, distance estimation and moving direction estimation, provides information about surrounding vehicles and alarms users of potential danger without expensive special purpose device. With an exper…

    Submitted 20 June, 2022; originally announced June 2022.

    Comments: 2 pages, 1 figure

  16. arXiv:2112.02278  [pdf, other]

    cs.RO cs.AI

    Stage Conscious Attention Network (SCAN) : A Demonstration-Conditioned Policy for Few-Shot Imitation

    Authors: Jia-Fong Yeh, Chi-Ming Chung, Hung-Ting Su, Yi-Ting Chen, Winston H. Hsu

    Abstract: In few-shot imitation learning (FSIL), using behavioral cloning (BC) to solve unseen tasks with few expert demonstrations becomes a popular research direction. The following capabilities are essential in robotics applications: (1) Behaving in compound tasks that contain multiple stages. (2) Retrieving knowledge from few length-variant and misalignment demonstrations. (3) Learning from a different…

    Submitted 4 December, 2021; originally announced December 2021.

    Comments: Accepted by AAAI 2022, preprint version, first two authors contribute equally

  17. arXiv:2102.12152  [pdf, other]

    cs.CV

    Dual-Awareness Attention for Few-Shot Object Detection

    Authors: Tung-I Chen, Yueh-Cheng Liu, Hung-Ting Su, Yu-Cheng Chang, Yu-Hsiang Lin, Jia-Fong Yeh, Wen-Chin Chen, Winston H. Hsu

    Abstract: While recent progress has significantly boosted few-shot classification (FSC) performance, few-shot object detection (FSOD) remains challenging for modern learning systems. Existing FSOD systems follow FSC approaches, ignoring critical issues such as spatial variability and uncertain representations, and consequently result in low performance. Observing this, we propose a novel Dual-Awaren…

    Submitted 15 September, 2021; v1 submitted 24 February, 2021; originally announced February 2021.

    Journal ref: IEEE Transactions on Multimedia 2021

  18. DiPair: Fast and Accurate Distillation for Trillion-Scale Text Matching and Pair Modeling

    Authors: Jiecao Chen, Liu Yang, Karthik Raman, Michael Bendersky, Jung-Jung Yeh, Yun Zhou, Marc Najork, Danyang Cai, Ehsan Emadzadeh

    Abstract: Pre-trained models like BERT (Devlin et al., 2018) have dominated NLP / IR applications such as single sentence classification, text pair classification, and question answering. However, deploying these models in real systems is highly non-trivial due to their exorbitant computational costs. A common remedy to this is knowledge distillation (Hinton et al., 2015), leading to faster inference. Howev…

    Submitted 6 October, 2020; originally announced October 2020.

    Comments: 13 pages. Accepted to Findings of EMNLP 2020

  19. arXiv:2005.09218  [pdf, other]

    cs.LG stat.ML

    Large Margin Mechanism and Pseudo Query Set on Cross-Domain Few-Shot Learning

    Authors: Jia-Fong Yeh, Hsin-Ying Lee, Bing-Chen Tsai, Yi-Rong Chen, Ping-Chia Huang, Winston H. Hsu

    Abstract: In recent years, few-shot learning problems have received a lot of attention. While methods in most previous works were trained and tested on datasets in one single domain, cross-domain few-shot learning is a brand-new branch of few-shot learning problems, where models handle datasets in different domains between training and testing phases. In this paper, to solve the problem that the model is pr…

    Submitted 6 February, 2024; v1 submitted 19 May, 2020; originally announced May 2020.

    Comments: Full version of the CDFSL competition report (in CVPRW'20), archived

  20. arXiv:1912.06667  [pdf, other]

    stat.ML cs.LG q-bio.GN stat.AP stat.ME

    High dimensional precision medicine from patient-derived xenografts

    Authors: Naim U. Rashid, Daniel J. Luckett, Jingxiang Chen, Michael T. Lawson, Longshaokan Wang, Yunshu Zhang, Eric B. Laber, Yufeng Liu, Jen Jen Yeh, Donglin Zeng, Michael R. Kosorok

    Abstract: The complexity of human cancer often results in significant heterogeneity in response to treatment. Precision medicine offers potential to improve patient outcomes by leveraging this heterogeneity. Individualized treatment rules (ITRs) formalize precision medicine as maps from the patient covariate space into the space of allowable treatments. The optimal ITR is that which maximizes the mean of a…

    Submitted 13 December, 2019; originally announced December 2019.

  21. arXiv:1003.4065  [pdf]

    cs.OH cs.CL

    Plagiarism Detection using ROUGE and WordNet

    Authors: Chien-Ying Chen, Jen-Yuan Yeh, Hao-Ren Ke

    Abstract: With the arrival of digital era and Internet, the lack of information control provides an incentive for people to freely use any content available to them. Plagiarism occurs when users fail to credit the original owner for the content referred to, and such behavior leads to violation of intellectual property. Two main approaches to plagiarism detection are fingerprinting and term occurrence; howev…

    Submitted 22 March, 2010; originally announced March 2010.

    Journal ref: Journal of Computing, Volume 2, Issue 3, March 2010