Skip to main content

Showing 1–18 of 18 results for author: Hsiao, C

  1. arXiv:2407.01519  [pdf, other

    cs.CV

    DiffIR2VR-Zero: Zero-Shot Video Restoration with Diffusion-based Image Restoration Models

    Authors: Chang-Han Yeh, Chin-Yang Lin, Zhixiang Wang, Chi-Wei Hsiao, Ting-Hsuan Chen, Yu-Lun Liu

    Abstract: This paper introduces a method for zero-shot video restoration using pre-trained image restoration diffusion models. Traditional video restoration methods often need retraining for different settings and struggle with limited generalization across various degradation types and datasets. Our approach uses a hierarchical token merging strategy for keyframes and local frames, combined with a hybrid c… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

  2. arXiv:2404.15959  [pdf, other

    physics.geo-ph cs.LG

    Explainable AI models for predicting liquefaction-induced lateral spreading

    Authors: Cheng-Hsi Hsiao, Krishna Kumar, Ellen Rathje

    Abstract: Earthquake-induced liquefaction can cause substantial lateral spreading, posing threats to infrastructure. Machine learning (ML) can improve lateral spreading prediction models by capturing complex soil characteristics and site conditions. However, the "black box" nature of ML models can hinder their adoption in critical decision-making. This study addresses this limitation by using SHapley Additi… ▽ More

    Submitted 24 April, 2024; originally announced April 2024.

    Comments: to be published in "Frontiers in Built Environment"

  3. MENTOR: Multilingual tExt detectioN TOward leaRning by analogy

    Authors: Hsin-Ju Lin, Tsu-Chun Chung, Ching-Chun Hsiao, Pin-Yu Chen, Wei-Chen Chiu, Ching-Chun Huang

    Abstract: Text detection is frequently used in vision-based mobile robots when they need to interpret texts in their surroundings to perform a given task. For instance, delivery robots in multilingual cities need to be capable of doing multilingual text detection so that the robots can read traffic signs and road markings. Moreover, the target languages change from region to region, implying the need of eff… ▽ More

    Submitted 11 March, 2024; originally announced March 2024.

    Comments: 8 pages, 4 figures, published to IROS 2023

    Journal ref: 2023 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Detroit, MI, USA, 2023, pp. 3248-3255

  4. arXiv:2402.00362  [pdf

    physics.ao-ph cs.AI

    Climate Trends of Tropical Cyclone Intensity and Energy Extremes Revealed by Deep Learning

    Authors: Buo-Fu Chen, Boyo Chen, Chun-Min Hsiao, Hsu-Feng Teng, Cheng-Shang Lee, Hung-Chi Kuo

    Abstract: Anthropogenic influences have been linked to tropical cyclone (TC) poleward migration, TC extreme precipitation, and an increased proportion of major hurricanes [1, 2, 3, 4]. Understanding past TC trends and variability is critical for projecting future TC impacts on human society considering the changing climate [5]. However, past trends of TC structure/energy remain uncertain due to limited obse… ▽ More

    Submitted 1 February, 2024; originally announced February 2024.

    Comments: 41 pages

  5. arXiv:2401.05219  [pdf, other

    cs.CE cs.AI

    Distributed Monitoring for Data Distribution Shifts in Edge-ML Fraud Detection

    Authors: Nader Karayanni, Robert J. Shahla, Chieh-Lien Hsiao

    Abstract: The digital era has seen a marked increase in financial fraud. edge ML emerged as a promising solution for smartphone payment services fraud detection, enabling the deployment of ML models directly on edge devices. This approach enables a more personalized real-time fraud detection. However, a significant gap in current research is the lack of a robust system for monitoring data distribution shift… ▽ More

    Submitted 10 January, 2024; originally announced January 2024.

  6. arXiv:2401.00273  [pdf, ps, other

    eess.AS cs.CL

    Investigating Zero-Shot Generalizability on Mandarin-English Code-Switched ASR and Speech-to-text Translation of Recent Foundation Models with Self-Supervision and Weak Supervision

    Authors: Chih-Kai Yang, Kuan-Po Huang, Ke-Han Lu, Chun-Yi Kuan, Chi-Yuan Hsiao, Hung-yi Lee

    Abstract: This work evaluated several cutting-edge large-scale foundation models based on self-supervision or weak supervision, including SeamlessM4T, SeamlessM4T v2, and Whisper-large-v3, on three code-switched corpora. We found that self-supervised models can achieve performances close to the supervised model, indicating the effectiveness of multilingual self-supervised pre-training. We also observed that… ▽ More

    Submitted 30 December, 2023; originally announced January 2024.

    Comments: Submitted to ICASSP 2024 Self-supervision in Audio, Speech and Beyond workshop

  7. arXiv:2309.09510  [pdf, ps, other

    eess.AS cs.LG cs.SD

    Dynamic-SUPERB: Towards A Dynamic, Collaborative, and Comprehensive Instruction-Tuning Benchmark for Speech

    Authors: Chien-yu Huang, Ke-Han Lu, Shih-Heng Wang, Chi-Yuan Hsiao, Chun-Yi Kuan, Haibin Wu, Siddhant Arora, Kai-Wei Chang, Jiatong Shi, Yifan Peng, Roshan Sharma, Shinji Watanabe, Bhiksha Ramakrishnan, Shady Shehata, Hung-yi Lee

    Abstract: Text language models have shown remarkable zero-shot capability in generalizing to unseen tasks when provided with well-formulated instructions. However, existing studies in speech processing primarily focus on limited or specific tasks. Moreover, the lack of standardized benchmarks hinders a fair comparison across different approaches. Thus, we present Dynamic-SUPERB, a benchmark designed for bui… ▽ More

    Submitted 22 March, 2024; v1 submitted 18 September, 2023; originally announced September 2023.

    Comments: To appear in the proceedings of ICASSP 2024

  8. arXiv:2212.14801  [pdf, other

    cs.CV

    ExReg: Wide-range Photo Exposure Correction via a Multi-dimensional Regressor with Attention

    Authors: Tzu-Hao Chiang, Hao-Chien Hsueh, Ching-Chun Hsiao, Ching-Chun Huang

    Abstract: Photo exposure correction is widely investigated, but fewer studies focus on correcting under and over-exposed images simultaneously. Three issues remain open to handle and correct under and over-exposed images in a unified way. First, a locally-adaptive exposure adjustment may be more flexible instead of learning a global mapping. Second, it is an ill-posed problem to determine the suitable expos… ▽ More

    Submitted 14 December, 2022; originally announced December 2022.

    Comments: 12 pages, 8 figures

  9. arXiv:2108.01866  [pdf, other

    cs.CV

    Specialize and Fuse: Pyramidal Output Representation for Semantic Segmentation

    Authors: Chi-Wei Hsiao, Cheng Sun, Hwann-Tzong Chen, Min Sun

    Abstract: We present a novel pyramidal output representation to ensure parsimony with our "specialize and fuse" process for semantic segmentation. A pyramidal "output" representation consists of coarse-to-fine levels, where each level is "specialize" in a different class distribution (e.g., more stuff than things classes at coarser levels). Two types of pyramidal outputs (i.e., unity and semantic pyramid) a… ▽ More

    Submitted 19 August, 2021; v1 submitted 4 August, 2021; originally announced August 2021.

    Comments: Update presentation

  10. arXiv:2106.14166  [pdf, other

    cs.CV

    Indoor Panorama Planar 3D Reconstruction via Divide and Conquer

    Authors: Cheng Sun, Chi-Wei Hsiao, Ning-Hsu Wang, Min Sun, Hwann-Tzong Chen

    Abstract: Indoor panorama typically consists of human-made structures parallel or perpendicular to gravity. We leverage this phenomenon to approximate the scene in a 360-degree image with (H)orizontal-planes and (V)ertical-planes. To this end, we propose an effective divide-and-conquer strategy that divides pixels based on their plane orientation estimation; then, the succeeding instance segmentation module… ▽ More

    Submitted 9 September, 2021; v1 submitted 27 June, 2021; originally announced June 2021.

    Comments: Code at https://github.com/sunset1995/PanoPlane360. Video at https://www.youtube.com/watch?v=2uvP0V1oGRo

  11. arXiv:2010.15158  [pdf, other

    cs.CV cs.AI

    CNN Profiler on Polar Coordinate Images for Tropical Cyclone Structure Analysis

    Authors: Boyo Chen, Buo-Fu Chen, Chun-Min Hsiao

    Abstract: Convolutional neural networks (CNN) have achieved great success in analyzing tropical cyclones (TC) with satellite images in several tasks, such as TC intensity estimation. In contrast, TC structure, which is conventionally described by a few parameters estimated subjectively by meteorology specialists, is still hard to be profiled objectively and routinely. This study applies CNN on satellite ima… ▽ More

    Submitted 28 October, 2020; originally announced October 2020.

    Comments: Submitted to AAAI2021

  12. arXiv:1905.12571  [pdf, other

    cs.CV

    Flat2Layout: Flat Representation for Estimating Layout of General Room Types

    Authors: Chi-Wei Hsiao, Cheng Sun, Min Sun, Hwann-Tzong Chen

    Abstract: This paper proposes a new approach, Flat2Layout, for estimating general indoor room layout from a single-view RGB image whereas existing methods can only produce layout topologies captured from the box-shaped room. The proposed flat representation encodes the layout information into row vectors which are treated as the training target of the deep model. A dynamic programming based postprocessing i… ▽ More

    Submitted 29 May, 2019; originally announced May 2019.

  13. arXiv:1901.03861  [pdf, other

    cs.CV

    HorizonNet: Learning Room Layout with 1D Representation and Pano Stretch Data Augmentation

    Authors: Cheng Sun, Chi-Wei Hsiao, Min Sun, Hwann-Tzong Chen

    Abstract: We present a new approach to the problem of estimating the 3D room layout from a single panoramic image. We represent room layout as three 1D vectors that encode, at each image column, the boundary positions of floor-wall and ceiling-wall, and the existence of wall-wall boundary. The proposed network, HorizonNet, trained for predicting 1D layout, outperforms previous state-of-the-art approaches. T… ▽ More

    Submitted 6 April, 2019; v1 submitted 12 January, 2019; originally announced January 2019.

    Comments: CVPR 2019

  14. arXiv:1808.09198  [pdf, other

    cs.MM cs.IR cs.SI

    Representation Learning for Image-based Music Recommendation

    Authors: Chih-Chun Hsia, Kwei-Herng Lai, Yian Chen, Chuan-Ju Wang, Ming-Feng Tsai

    Abstract: Image perception is one of the most direct ways to provide contextual information about a user concerning his/her surrounding environment; hence images are a suitable proxy for contextual recommendation. We propose a novel representation learning framework for image-based music recommendation that bridges the heterogeneity gap between music and image data; the proposed method is a key component fo… ▽ More

    Submitted 29 August, 2018; v1 submitted 28 August, 2018; originally announced August 2018.

    Comments: 2 pages, LBRS@RecSys'18

  15. arXiv:1106.1853  [pdf

    cs.AI

    Intelligent decision: towards interpreting the Pe Algorithm

    Authors: Ching-an Hsiao, Xinchun Tian

    Abstract: The human intelligence lies in the algorithm, the nature of algorithm lies in the classification, and the classification is equal to outlier detection. A lot of algorithms have been proposed to detect outliers, meanwhile a lot of definitions. Unsatisfying point is that definitions seem vague, which makes the solution an ad hoc one. We analyzed the nature of outliers, and give two clear definitions… ▽ More

    Submitted 22 August, 2011; v1 submitted 9 June, 2011; originally announced June 2011.

    Comments: 23pages, 12 figures, 7 tables

  16. arXiv:0909.1709  [pdf

    cs.OH

    How does certainty enter into the mind?

    Authors: Ching-an Hsiao

    Abstract: Any problem is concerned with the mind, but what do minds make a decision on? Here we show that there are three conditions for the mind to make a certain answer. We found that some difficulties in physics and mathematics are in fact introduced by infinity, which can not be rightly expressed by minds. Based on this point, we suggest a general observation system, where we use region (a type of inf… ▽ More

    Submitted 23 March, 2010; v1 submitted 9 September, 2009; originally announced September 2009.

    Comments: 7 pages, 1 figure, revised references

  17. arXiv:0907.5155  [pdf

    cs.AI

    On Classification from Outlier View

    Authors: C. A. Hsiao

    Abstract: Classification is the basis of cognition. Unlike other solutions, this study approaches it from the view of outliers. We present an expanding algorithm to detect outliers in univariate datasets, together with the underlying foundation. The expanding algorithm runs in a holistic way, making it a rather robust solution. Synthetic and real data experiments show its power. Furthermore, an application… ▽ More

    Submitted 2 January, 2012; v1 submitted 29 July, 2009; originally announced July 2009.

    Comments: Conclusion renewed; IAENG International Journal of Computer Science, Volume 37, Issue 4, Nov, 2010

  18. arXiv:0802.3072  [pdf

    cs.OH

    Enhanced Sensing Characteristics in MEMS-based Formaldehyde Gas Sensor

    Authors: Yu-Hsiang Wang, C. -C. Hsiao, Chia-Yen Lee, R. -H. Ma, Po-Cheng Chou

    Abstract: This study has successfully demonstrated a novel self-heating formaldehyde gas sensor based on a thin film of NiO sensing layer. A new fabrication process has been developed in which the Pt micro heater and electrodes are deposited directly on the substrate and the NiO thin film is deposited above on the micro heater to serve as sensing layer. Pt electrodes are formed below the sensing layer to… ▽ More

    Submitted 21 February, 2008; originally announced February 2008.

    Comments: Submitted on behalf of EDA Publishing Association (http://irevues.inist.fr/EDA-Publishing)

    Journal ref: Dans Symposium on Design, Test, Integration and Packaging of MEMS/MOEMS - DTIP 2007, Stresa, lago Maggiore : Italie (2007)