Skip to main content

Showing 1–50 of 51 results for author: Tseng, W

  1. arXiv:2404.09385  [pdf, other

    eess.AS cs.CL eess.SP

    A Large-Scale Evaluation of Speech Foundation Models

    Authors: Shu-wen Yang, Heng-Jui Chang, Zili Huang, Andy T. Liu, Cheng-I Lai, Haibin Wu, Jiatong Shi, Xuankai Chang, Hsiang-Sheng Tsai, Wen-Chin Huang, Tzu-hsun Feng, Po-Han Chi, Yist Y. Lin, Yung-Sung Chuang, Tzu-Hsien Huang, Wei-Cheng Tseng, Kushal Lakhotia, Shang-Wen Li, Abdelrahman Mohamed, Shinji Watanabe, Hung-yi Lee

    Abstract: The foundation model paradigm leverages a shared foundation model to achieve state-of-the-art (SOTA) performance for various tasks, requiring minimal downstream-specific modeling and data annotation. This approach has proven crucial in the field of Natural Language Processing (NLP). However, the speech processing community lacks a similar setup to explore the paradigm systematically. In this work,… ▽ More

    Submitted 29 May, 2024; v1 submitted 14 April, 2024; originally announced April 2024.

    Comments: The extended journal version for SUPERB and SUPERB-SG. Published in IEEE/ACM TASLP. The Arxiv version is preferred

  2. arXiv:2404.05525  [pdf, other

    astro-ph.EP

    ALMA Spectroscopy of Europa: A Search for Active Plumes

    Authors: M. A. Cordiner, A. E. Thelen, I. -L. Lai, W. -L. Tseng, C. A. Nixon, Y. -J. Kuan, G. L. Villanueva, L. Paganini, S. B. Charnley, K. D. Retherford

    Abstract: The subsurface ocean of Europa is a high priority target in the search for extraterrestrial life, but direct investigations are hindered by the presence of a thick, exterior ice shell. Here we present spectral line and continuum maps of Europa obtained over four epochs in May-June 2021 using the Atacama Large Millimeter/submillimeter Array (ALMA), to search for molecular emission from atmospheric… ▽ More

    Submitted 8 April, 2024; originally announced April 2024.

    Comments: Submitted to IAU Symposium 383 conference proceedings --- Astrochemistry VIII: From the First Galaxies to the Formation of Habitable Worlds

  3. Understanding Physical Breakdowns in Virtual Reality

    Authors: Wen-Jie Tseng

    Abstract: Virtual Reality (VR) moves away from well-controlled laboratory environments into public and personal spaces. As users are visually disconnected from the physical environment, interacting in an uncontrolled space frequently leads to collisions and raises safety concerns. In my thesis, I investigate this phenomenon which I define as the physical breakdown in VR. The goal is to understand the reason… ▽ More

    Submitted 20 March, 2024; originally announced April 2024.

    Comments: 5 pages, 4 figures, CHI EA '23, Doctoral Consortium

    Journal ref: (CHI EA 2023) 1-5

  4. arXiv:2403.17847  [pdf, other

    cs.LG cs.AI

    Climate Downscaling: A Deep-Learning Based Super-resolution Model of Precipitation Data with Attention Block and Skip Connections

    Authors: Chia-Hao Chiang, Zheng-Han Huang, Liwen Liu, Hsin-Chien Liang, Yi-Chi Wang, Wan-Ling Tseng, Chao Wang, Che-Ta Chen, Ko-Chih Wang

    Abstract: Human activities accelerate consumption of fossil fuels and produce greenhouse gases, resulting in urgent issues today: global warming and the climate change. These indirectly cause severe natural disasters, plenty of lives suffering and huge losses of agricultural properties. To mitigate impacts on our lands, scientists are developing renewable, reusable, and clean energies and climatologists are… ▽ More

    Submitted 26 March, 2024; originally announced March 2024.

  5. arXiv:2403.13970  [pdf

    astro-ph.EP physics.space-ph

    Mass supply from Io to Jupiter's magnetosphere

    Authors: L. Roth, A. Blöcker, K. de Kleer, D. Goldstein, E. Lellouch, J. Saur, C. Schmidt, D. F. Strobel, C. Tao, F. Tsuchiya, V. Dols, H. Huybrighs, A. Mura, J. R. Szalay, S. V. Badman, I. de Pater, A. -C. Dott, M. Kagitani, L. Klaiber, R. Koga, A. McEwen, Z. Milby, K. D. Retherford, S. Schlegel, N. Thomas , et al. (2 additional authors not shown)

    Abstract: Since the Voyager mission flybys in 1979, we have known the moon Io to be extremely volcanically active as well as to be the main source of plasma in the vast magnetosphere of Jupiter. Material lost from Io forms neutral clouds, the Io plasma torus and ultimately the extended plasma sheet. This material is supplied from the upper atmosphere and atmospheric loss is likely driven by plasma-interacti… ▽ More

    Submitted 20 March, 2024; originally announced March 2024.

  6. arXiv:2312.01820  [pdf, other

    cond-mat.mes-hall cond-mat.mtrl-sci

    Electrically tunable flat bands with layer-resolved charge distribution in twisted monolayer-bilayer graphene

    Authors: Wei-En Tseng, Mei-Yin Chou

    Abstract: At a small twist angle, exotic electronic properties emerge in twisted monolayer-bilayer graphene (aAB), including electrically switchable magnetic order and correlated insulating states. These fascinating many-body phenomena manifest when the low-energy bands feature a narrow band width. In this study, we examine the electronic structure of aAB using first-principles calculations combined with an… ▽ More

    Submitted 4 December, 2023; originally announced December 2023.

  7. arXiv:2311.18320  [pdf, other

    cond-mat.mes-hall cond-mat.mtrl-sci

    BN-embedded monolayer graphene with tunable electronic and topological properties

    Authors: Chih-Piao Chuu, Wei-En Tseng, Kuan-Hung Liu, Ching-Ming Wei, Mei-Yin Chou

    Abstract: Finding an effective and controllable way to create a sizable energy gap in graphene-based systems has been a challenging topic of intensive research. We propose that the hybrid of boron nitride and graphene (h-BNC) at low BN doping serves as an ideal platform for band-gap engineering and valleytronic applications. We report a systematic first-principles study of the atomic configurations and band… ▽ More

    Submitted 30 November, 2023; originally announced November 2023.

  8. arXiv:2311.17344  [pdf, other

    astro-ph.EP

    The composition of Saturn's rings

    Authors: Kelly E. Miller, Gianrico Filacchione, Jeffrey Cuzzi, Philip D. Nicholson, Matthew M. Hedman, Kevin Baillie, Robert E. Johnson, Wei-Ling Tseng, Paul R. Estrada, J. Hunter Waite, Mauro Ciarniello, Cécile Ferrari, Zhimeng Zhang, Amanda Hendrix, Julianne I. Moses

    Abstract: The origin and evolution of Saturn's rings is critical to understanding the Saturnian system as a whole. Here, we discuss the physical and chemical composition of the rings, as a foundation for evolutionary models described in subsequent chapters. We review the physical characteristics of the main rings, and summarize current constraints on their chemical composition. Radial trends are observed in… ▽ More

    Submitted 28 November, 2023; originally announced November 2023.

    Comments: Submitted to SSR for publication in the collection "New Vision of the Saturnian System in the Context of a Highly Dissipative Saturn"

  9. arXiv:2311.15582  [pdf, other

    cs.SD cs.LG eess.AS

    Lightly Weighted Automatic Audio Parameter Extraction for the Quality Assessment of Consensus Auditory-Perceptual Evaluation of Voice

    Authors: Yi-Heng Lin, Wen-Hsuan Tseng, Li-Chin Chen, Ching-Ting Tan, Yu Tsao

    Abstract: The Consensus Auditory-Perceptual Evaluation of Voice is a widely employed tool in clinical voice quality assessment that is significant for streaming communication among clinical professionals and benchmarking for the determination of further treatment. Currently, because the assessment relies on experienced clinicians, it tends to be inconsistent, and thus, difficult to standardize. To address t… ▽ More

    Submitted 27 November, 2023; originally announced November 2023.

    Comments: Published in IEEE 42th International Conference on Consumer Electronics (ICCE 2024)

  10. arXiv:2311.04237  [pdf, ps, other

    quant-ph cs.LG math.OC stat.ML

    Online Learning Quantum States with the Logarithmic Loss via VB-FTRL

    Authors: Wei-Fu Tseng, Kai-Chun Chen, Zi-Hong Xiao, Yen-Huan Li

    Abstract: Online learning quantum states with the logarithmic loss (LL-OLQS) is a quantum generalization of online portfolio selection, a classic open problem in the field of online learning for over three decades. The problem also emerges in designing randomized optimization algorithms for maximum-likelihood quantum state tomography. Recently, Jezequel et al. (arXiv:2209.13932) proposed the VB-FTRL algorit… ▽ More

    Submitted 6 November, 2023; originally announced November 2023.

    Comments: 20 pages

  11. arXiv:2309.07114  [pdf, other

    astro-ph.SR astro-ph.EP

    Monitoring H$α$ Emission from the Wide-orbit Brown-dwarf Companion FU Tau B

    Authors: Ya-Lin Wu, Yu-Chi Cheng, Li-Ching Huang, Brendan Bowler, Laird Close, Wei-Ling Tseng, Ning Chen, Da-Wei Chen

    Abstract: Monitoring mass accretion onto substellar objects provides insights into the geometry of the accretion flows. We use the Lulin One-meter Telescope to monitor H$α$ emission from FU Tau B, a $\sim$19 $M_{\rm Jup}$ brown-dwarf companion at 5.7" (719 au) from the host star, for six consecutive nights. This is the longest continuous H$α$ monitoring for a substellar companion near the deuterium-burning… ▽ More

    Submitted 13 September, 2023; originally announced September 2023.

    Comments: Published in AJ

  12. Memory Manipulations in Extended Reality

    Authors: Elise Bonnail, Eric Lecolinet, Wen-Jie Tseng, Samuel Huron, Mark Mcgill, Jan Gugenheimer

    Abstract: Human memory has notable limitations (e.g., forgetting) which have necessitated a variety of memory aids (e.g., calendars). As we grow closer to mass adoption of everyday Extended Reality (XR), which is frequently leveraging perceptual limitations (e.g., redirected walking), it becomes pertinent to consider how XR could leverage memory limitations (forgetting, distorting, persistence) to induce me… ▽ More

    Submitted 5 April, 2023; originally announced April 2023.

    Journal ref: In Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems (CHI '23), Apr 2023, Hamburg, Germany

  13. arXiv:2303.12861  [pdf, other

    eess.IV cs.LG eess.SP physics.bio-ph

    Parallel Diffusion Model-based Sparse-view Cone-beam Breast CT

    Authors: Wenjun Xia, Hsin Wu Tseng, Chuang Niu, Wenxiang Cong, Xiaohua Zhang, Shaohua Liu, Ruola Ning, Srinivasan Vedantham, Ge Wang

    Abstract: Breast cancer is the most prevalent cancer among women worldwide, and early detection is crucial for reducing its mortality rate and improving quality of life. Dedicated breast computed tomography (CT) scanners offer better image quality than mammography and tomosynthesis in general but at higher radiation dose. To enable breast CT for cancer screening, the challenge is to minimize the radiation d… ▽ More

    Submitted 28 January, 2024; v1 submitted 22 March, 2023; originally announced March 2023.

  14. arXiv:2303.12379  [pdf, other

    cs.CV

    VMCML: Video and Music Matching via Cross-Modality Lifting

    Authors: Yi-Shan Lee, Wei-Cheng Tseng, Fu-En Wang, Min Sun

    Abstract: We propose a content-based system for matching video and background music. The system aims to address the challenges in music recommendation for new users or new music give short-form videos. To this end, we propose a cross-modal framework VMCML that finds a shared embedding space between video and music representations. To ensure the embedding space can be effectively shared by both representatio… ▽ More

    Submitted 22 March, 2023; originally announced March 2023.

  15. arXiv:2303.00733  [pdf, other

    eess.AS cs.AI cs.CL cs.LG cs.SD

    SpeechPrompt v2: Prompt Tuning for Speech Classification Tasks

    Authors: Kai-Wei Chang, Yu-Kai Wang, Hua Shen, Iu-thing Kang, Wei-Cheng Tseng, Shang-Wen Li, Hung-yi Lee

    Abstract: Prompt tuning is a technology that tunes a small set of parameters to steer a pre-trained language model (LM) to directly generate the output for downstream tasks. Recently, prompt tuning has demonstrated its storage and computation efficiency in both natural language processing (NLP) and speech processing fields. These advantages have also revealed prompt tuning as a candidate approach to serving… ▽ More

    Submitted 1 March, 2023; originally announced March 2023.

    Comments: Project website: https://ga642381.github.io/SpeechPrompt

  16. arXiv:2302.12757  [pdf, other

    eess.AS cs.CL cs.SD

    Ensemble knowledge distillation of self-supervised speech models

    Authors: Kuan-Po Huang, Tzu-hsun Feng, Yu-Kuan Fu, Tsu-Yuan Hsu, Po-Chieh Yen, Wei-Cheng Tseng, Kai-Wei Chang, Hung-yi Lee

    Abstract: Distilled self-supervised models have shown competitive performance and efficiency in recent years. However, there is a lack of experience in jointly distilling multiple self-supervised speech models. In our work, we performed Ensemble Knowledge Distillation (EKD) on various self-supervised speech models such as HuBERT, RobustHuBERT, and WavLM. We tried two different aggregation techniques, layerw… ▽ More

    Submitted 24 February, 2023; originally announced February 2023.

    Comments: Accepted by ICASSP 2023

  17. FingerMapper: Mapping Finger Motions onto Virtual Arms to Enable Safe Virtual Reality Interaction in Confined Spaces

    Authors: Wen-Jie Tseng, Samuel Huron, Eric Lecolinet, Jan Gugenheimer

    Abstract: Whole-body movements enhance the presence and enjoyment of Virtual Reality (VR) experiences. However, using large gestures is often uncomfortable and impossible in confined spaces (e.g., public transport). We introduce FingerMapper, mapping small-scale finger motions onto virtual arms and hands to enable whole-body virtual movements in VR. In a first target selection study (n=13) comparing FingerM… ▽ More

    Submitted 23 February, 2023; originally announced February 2023.

    Comments: 14 pages, 15 figures

  18. arXiv:2204.07052  [pdf, other

    cs.CV

    CroCo: Cross-Modal Contrastive learning for localization of Earth Observation data

    Authors: Wei-Hsin Tseng, Hoàng-Ân Lê, Alexandre Boulch, Sébastien Lefèvre, Dirk Tiede

    Abstract: It is of interest to localize a ground-based LiDAR point cloud on remote sensing imagery. In this work, we tackle a subtask of this problem, i.e. to map a digital elevation model (DEM) rasterized from aerial LiDAR point cloud on the aerial imagery. We proposed a contrastive learning-based method that trains on DEM and high-resolution optical imagery and experiment the framework on different data s… ▽ More

    Submitted 14 April, 2022; originally announced April 2022.

    Comments: Accepted for publication in the ISPRS Annals of the Photogrammetry, Remote Sensing and Spatial Information Sciences (online from July 2022)

  19. arXiv:2204.03219  [pdf, other

    eess.AS cs.LG cs.SD

    DDOS: A MOS Prediction Framework utilizing Domain Adaptive Pre-training and Distribution of Opinion Scores

    Authors: Wei-Cheng Tseng, Wei-Tsung Kao, Hung-yi Lee

    Abstract: Mean opinion score (MOS) is a typical subjective evaluation metric for speech synthesis systems. Since collecting MOS is time-consuming, it would be desirable if there are accurate MOS prediction models for automatic evaluation. In this work, we propose DDOS, a novel MOS prediction model. DDOS utilizes domain adaptive pre-training to further pre-train self-supervised learning models on synthetic s… ▽ More

    Submitted 15 August, 2022; v1 submitted 7 April, 2022; originally announced April 2022.

    Comments: Accepted to Interspeech 2022. Code will be available in the future

  20. arXiv:2203.16773  [pdf, other

    eess.AS cs.CL cs.LG cs.SD

    SpeechPrompt: An Exploration of Prompt Tuning on Generative Spoken Language Model for Speech Processing Tasks

    Authors: Kai-Wei Chang, Wei-Cheng Tseng, Shang-Wen Li, Hung-yi Lee

    Abstract: Speech representations learned from Self-supervised learning (SSL) models can benefit various speech processing tasks. However, utilizing SSL representations usually requires fine-tuning the pre-trained models or designing task-specific downstream models and loss functions, causing much memory usage and human labor. Recently, prompting in Natural Language Processing (NLP) has been found to be an e… ▽ More

    Submitted 10 July, 2022; v1 submitted 30 March, 2022; originally announced March 2022.

    Comments: Accepted to be published in the Proceedings of Interspeech 2022

  21. arXiv:2203.10168  [pdf, other

    cs.RO

    Boreas: A Multi-Season Autonomous Driving Dataset

    Authors: Keenan Burnett, David J. Yoon, Yuchen Wu, Andrew Zou Li, Haowei Zhang, Shichen Lu, Jingxing Qian, Wei-Kang Tseng, Andrew Lambert, Keith Y. K. Leung, Angela P. Schoellig, Timothy D. Barfoot

    Abstract: The Boreas dataset was collected by driving a repeated route over the course of one year, resulting in stark seasonal variations and adverse weather conditions such as rain and falling snow. In total, the Boreas dataset includes over 350km of driving data featuring a 128-channel Velodyne Alpha Prime lidar, a 360$^\circ$ Navtech CIR304-H scanning radar, a 5MP FLIR Blackfly S camera, and centimetre-… ▽ More

    Submitted 26 January, 2023; v1 submitted 18 March, 2022; originally announced March 2022.

    Comments: Accepted in IJRR as a data paper

  22. The Dark Side of Perceptual Manipulations in Virtual Reality

    Authors: Wen-Jie Tseng, Elise Bonnail, Mark McGill, Mohamed Khamis, Eric Lecolinet, Samuel Huron, Jan Gugenheimer

    Abstract: "Virtual-Physical Perceptual Manipulations" (VPPMs) such as redirected walking and haptics expand the user's capacity to interact with Virtual Reality (VR) beyond what would ordinarily physically be possible. VPPMs leverage knowledge of the limits of human perception to effect changes in the user's physical movements, becoming able to (perceptibly and imperceptibly) nudge their physical actions to… ▽ More

    Submitted 26 February, 2022; originally announced February 2022.

    Comments: 15 pages, 7 figures

  23. arXiv:2202.11849  [pdf, other

    astro-ph.EP astro-ph.GA

    A SUBLIME 3D Model for Cometary Coma Emission: the Hypervolatile-Rich Comet C/2016 R2 (PanSTARRS)

    Authors: M. A. Cordiner, I. M. Coulson, E. Garcia-Berrios, C. Qi, F. Lique, M. Zoltowski, M. de Val-Borro, Y. -J. Kuan, W. -H. Ip, S. Mairs, N. X. Roth, S. B. Charnley, S. N. Milam, W. -L Tseng, Y. -L Chuang

    Abstract: The coma of comet C/2016 R2 (PanSTARRS) is one of the most chemically peculiar ever observed, in particular due to its extremely high CO/H2O and N2+/H2O ratios}, and unusual trace volatile abundances. However, the complex shape of its CO emission lines, as well as uncertainties in the coma structure and excitation, has lead to ambiguities in the total CO production rate. We performed high resoluti… ▽ More

    Submitted 23 February, 2022; originally announced February 2022.

    Comments: Accepted for publication in ApJ

  24. arXiv:2202.00181  [pdf, other

    cs.CV cs.CG cs.RO

    CLA-NeRF: Category-Level Articulated Neural Radiance Field

    Authors: Wei-Cheng Tseng, Hung-Ju Liao, Lin Yen-Chen, Min Sun

    Abstract: We propose CLA-NeRF -- a Category-Level Articulated Neural Radiance Field that can perform view synthesis, part segmentation, and articulated pose estimation. CLA-NeRF is trained at the object category level using no CAD models and no depth, but a set of RGB images with ground truth camera poses and part segments. During inference, it only takes a few RGB views (i.e., few-shot) of an unseen 3D obj… ▽ More

    Submitted 3 March, 2022; v1 submitted 31 January, 2022; originally announced February 2022.

    Comments: accepted by ICRA 2022

  25. arXiv:2112.07222  [pdf, other

    cs.LG cs.AI cs.MA

    Meta-CPR: Generalize to Unseen Large Number of Agents with Communication Pattern Recognition Module

    Authors: Wei-Cheng Tseng, Wei Wei, Da-Cheng Juan, Min Sun

    Abstract: Designing an effective communication mechanism among agents in reinforcement learning has been a challenging task, especially for real-world applications. The number of agents can grow or an environment sometimes needs to interact with a changing number of agents in real-world scenarios. To this end, a multi-agent framework needs to handle various scenarios of agents, in terms of both scales and d… ▽ More

    Submitted 31 January, 2022; v1 submitted 14 December, 2021; originally announced December 2021.

  26. arXiv:2112.00344  [pdf, other

    q-bio.QM cs.AI cs.LG q-bio.BM

    Leveraging Sequence Embedding and Convolutional Neural Network for Protein Function Prediction

    Authors: Wei-Cheng Tseng, Po-Han Chi, Jia-Hua Wu, Min Sun

    Abstract: The capability of accurate prediction of protein functions and properties is essential in the biotechnology industry, e.g. drug development and artificial protein synthesis, etc. The main challenges of protein function prediction are the large label space and the lack of labeled training data. Our method leverages unsupervised sequence embedding and the success of deep convolutional neural network… ▽ More

    Submitted 1 December, 2021; originally announced December 2021.

    Comments: Published in NeurIPS 2018 Machine Learning for Molecules and Materials Workshop

  27. arXiv:2111.13532  [pdf

    astro-ph.EP

    The 3D Direct Simulation Monte Carlo Study of Europa Gas Plume

    Authors: Wei-Ling Tseng, Ian-Lin Lai, Wing-Huen Ip, Hsiang-Wen Hsu, Jong-Shinn Wu

    Abstract: Europa has been spotted to have water outgassing activities by the space and ground-based telescopes as well as reanalysis of the Galileo data (Roth et al. 2014; Sparks et al. 2016, 2017; Paganini et al. 2020; Jia et al. 2018; Arnold et al. 2019). However, these observations only provided limited information about plume dynamics, which is critical in understanding the eruption mechanism and prepar… ▽ More

    Submitted 29 March, 2022; v1 submitted 26 November, 2021; originally announced November 2021.

    Comments: This paper has been submitted to Universe in Feb 2022, and it is during minor revision

  28. arXiv:2111.05113  [pdf, other

    cs.CR cs.LG cs.SD eess.AS

    Membership Inference Attacks Against Self-supervised Speech Models

    Authors: Wei-Cheng Tseng, Wei-Tsung Kao, Hung-yi Lee

    Abstract: Recently, adapting the idea of self-supervised learning (SSL) on continuous speech has started gaining attention. SSL models pre-trained on a huge amount of unlabeled audio can generate general-purpose representations that benefit a wide variety of speech processing tasks. Despite their ubiquitous deployment, however, the potential privacy risks of these models have not been well investigated. In… ▽ More

    Submitted 15 August, 2022; v1 submitted 9 November, 2021; originally announced November 2021.

    Comments: Accepted to Interspeech 2022. Code will be available in the future

  29. arXiv:2105.12182  [pdf, other

    cs.RO

    Self-Calibration of the Offset Between GPS and Semantic Map Frames for Robust Localization

    Authors: Wei-Kang Tseng, Angela P. Schoellig, Timothy D. Barfoot

    Abstract: In self-driving, standalone GPS is generally considered to have insufficient positioning accuracy to stay in lane. Instead, many turn to LIDAR localization, but this comes at the expense of building LIDAR maps that can be costly to maintain. Another possibility is to use semantic cues such as lane lines and traffic lights to achieve localization, but these are usually not continuously visible. Thi… ▽ More

    Submitted 30 June, 2021; v1 submitted 25 May, 2021; originally announced May 2021.

    Comments: Accepted for publication in CRV 2021; corrected reference 4

  30. arXiv:2105.01051  [pdf, ps, other

    cs.CL cs.SD eess.AS

    SUPERB: Speech processing Universal PERformance Benchmark

    Authors: Shu-wen Yang, Po-Han Chi, Yung-Sung Chuang, Cheng-I Jeff Lai, Kushal Lakhotia, Yist Y. Lin, Andy T. Liu, Jiatong Shi, Xuankai Chang, Guan-Ting Lin, Tzu-Hsien Huang, Wei-Cheng Tseng, Ko-tik Lee, Da-Rong Liu, Zili Huang, Shuyan Dong, Shang-Wen Li, Shinji Watanabe, Abdelrahman Mohamed, Hung-yi Lee

    Abstract: Self-supervised learning (SSL) has proven vital for advancing research in natural language processing (NLP) and computer vision (CV). The paradigm pretrains a shared model on large volumes of unlabeled data and achieves state-of-the-art (SOTA) for various tasks with minimal adaptation. However, the speech processing community lacks a similar setup to systematically explore the paradigm. To bridge… ▽ More

    Submitted 15 October, 2021; v1 submitted 3 May, 2021; originally announced May 2021.

    Comments: To appear in Interspeech 2021

  31. arXiv:2104.03017  [pdf, other

    eess.AS cs.LG cs.SD

    Utilizing Self-supervised Representations for MOS Prediction

    Authors: Wei-Cheng Tseng, Chien-yu Huang, Wei-Tsung Kao, Yist Y. Lin, Hung-yi Lee

    Abstract: Speech quality assessment has been a critical issue in speech processing for decades. Existing automatic evaluations usually require clean references or parallel ground truth data, which is infeasible when the amount of data soars. Subjective tests, on the other hand, do not need any additional clean or parallel data and correlates better to human perception. However, such a test is expensive and… ▽ More

    Submitted 20 September, 2021; v1 submitted 7 April, 2021; originally announced April 2021.

    Comments: In Proceedings of Interspeech 2021. We acknowledge the support of AWS Machine Learning Research Awards program. Source code available at https://github.com/s3prl/s3prl/tree/master/s3prl/downstream/mos_prediction

  32. arXiv:2103.02957  [pdf, other

    cs.LG cs.AI

    Toward Robust Long Range Policy Transfer

    Authors: Wei-Cheng Tseng, Jin-Siang Lin, Yao-Min Feng, Min Sun

    Abstract: Humans can master a new task within a few trials by drawing upon skills acquired through prior experience. To mimic this capability, hierarchical models combining primitive policies learned from prior tasks have been proposed. However, these methods fall short comparing to the human's range of transferability. We propose a method, which leverages the hierarchical structure to train the combination… ▽ More

    Submitted 4 March, 2021; originally announced March 2021.

    Comments: Accepted by AAAI 2021

  33. arXiv:2011.02882  [pdf

    cs.SD cs.CL eess.AS

    Query Expansion System for the VoxCeleb Speaker Recognition Challenge 2020

    Authors: Yu-Sen Cheng, Chun-Liang Shih, Tien-Hong Lo, Wen-Ting Tseng, Berlin Chen

    Abstract: In this report, we describe our submission to the VoxCeleb Speaker Recognition Challenge (VoxSRC) 2020. Two approaches are adopted. One is to apply query expansion on speaker verification, which shows significant progress compared to baseline in the study. Another is to use Kaldi extract x-vector and to combine its Probabilistic Linear Discriminant Analysis (PLDA) score with ResNet score.

    Submitted 4 November, 2020; originally announced November 2020.

  34. arXiv:2010.14049  [pdf

    cs.AI cs.IR

    Effective FAQ Retrieval and Question Matching With Unsupervised Knowledge Injection

    Authors: Wen-Ting Tseng, Tien-Hong Lo, Yung-Chang Hsu, Berlin Chen

    Abstract: Frequently asked question (FAQ) retrieval, with the purpose of providing information on frequent questions or concerns, has far-reaching applications in many areas, where a collection of question-answer (Q-A) pairs compiled a priori can be employed to retrieve an appropriate answer in response to a user\u2019s query that is likely to reoccur frequently. To this end, predominant approaches to FAQ r… ▽ More

    Submitted 27 October, 2020; originally announced October 2020.

  35. arXiv:2007.15767  [pdf, other

    astro-ph.IM astro-ph.EP

    The Saturn Ring Skimmer Mission Concept: The next step to explore Saturn's rings, atmosphere, interior, and inner magnetosphere

    Authors: Matthew S. Tiscareno, Mar Vaquero, Matthew M. Hedman, Hao Cao, Paul R. Estrada, Andrew P. Ingersoll, Kelly E. Miller, Marzia Parisi, David. H. Atkinson, Shawn M. Brooks, Jeffrey N. Cuzzi, James Fuller, Amanda R. Hendrix, Robert E. Johnson, Tommi Koskinen, William S. Kurth, Jonathan I. Lunine, Philip D. Nicholson, Carol S. Paty, Rebecca Schindhelm, Mark R. Showalter, Linda J. Spilker, Nathan J. Strange, Wendy Tseng

    Abstract: The innovative Saturn Ring Skimmer mission concept enables a wide range of investigations that address fundamental questions about Saturn and its rings, as well as giant planets and astrophysical disk systems in general. This mission would provide new insights into the dynamical processes that operate in astrophysical disk systems by observing individual particles in Saturn's rings for the first t… ▽ More

    Submitted 16 September, 2020; v1 submitted 30 July, 2020; originally announced July 2020.

    Comments: White paper submitted to the Planetary Science and Astrobiology Decadal Survey (submission #420)

  36. arXiv:2007.11784  [pdf

    eess.IV cs.CV cs.LG

    Deep Learning Based Segmentation of Various Brain Lesions for Radiosurgery

    Authors: Siang-Ruei Wu, Hao-Yun Chang, Florence T Su, Heng-Chun Liao, Wanju Tseng, Chun-Chih Liao, Feipei Lai, Feng-Ming Hsu, Furen Xiao

    Abstract: Semantic segmentation of medical images with deep learning models is rapidly developed. In this study, we benchmarked state-of-the-art deep learning segmentation algorithms on our clinical stereotactic radiosurgery dataset, demonstrating the strengths and weaknesses of these algorithms in a fairly practical scenario. In particular, we compared the model performances with respect to their sampling… ▽ More

    Submitted 22 July, 2020; originally announced July 2020.

  37. arXiv:2005.05007  [pdf

    cond-mat.mtrl-sci cond-mat.mes-hall

    Direct growth of mm-size twisted bilayer graphene by plasma-enhanced chemical vapor deposition

    Authors: Yen-Chun Chen, Wei-Hsiang Lin, Wei-Shiuan Tseng, Chien-Chang Chen, George. R. Rossman, Chii-Dong Chen, Yu-Shu Wu, Nai-Chang Yeh

    Abstract: Plasma enhanced chemical vapor deposition (PECVD) techniques have been shown to be an efficient method to achieve single-step synthesis of high-quality monolayer graphene (MLG) without the need of active heating. Here we report PECVD-growth of single-crystalline hexagonal bilayer graphene (BLG) flakes and mm-size BLG films with the interlayer twist angle controlled by the growth parameters. The tw… ▽ More

    Submitted 11 May, 2020; originally announced May 2020.

    Comments: Manuscript (39 pages, 10 figures) and Supplementary Information (11 pages, 6 figures). Published in Carbon

    Journal ref: Carbon 156, 212-224 (2020)

  38. arXiv:1807.00424  [pdf, other

    physics.atom-ph physics.ins-det quant-ph

    Oscillating magnetic field effects in high precision metrology

    Authors: H. C. J. Gan, G. Maslennikov, K. W. Tseng, T. R. Tan, R. Kaewuam, K. J. Arnold, D. Matsukevich, M. D. Barrett

    Abstract: We examine a range of effects arising from ac magnetic fields in high precision metrology. These results are directly relevant to high precision measurements, and accuracy assessments for state-of-the-art optical clocks. Strategies to characterize these effects are discussed and a simple technique to accurately determine trap-induced ac magnetic fields in a linear Paul trap is demonstrated using… ▽ More

    Submitted 7 July, 2018; v1 submitted 1 July, 2018; originally announced July 2018.

    Comments: 10 pages, 6 figures

    Journal ref: Phys. Rev. A 98, 032514 (2018)

  39. arXiv:1801.07411  [pdf

    cs.AI

    Comparison Training for Computer Chinese Chess

    Authors: Wen-Jie Tseng, Jr-Chang Chen, I-Chen Wu, Tinghan Wei

    Abstract: This paper describes the application of comparison training (CT) for automatic feature weight tuning, with the final objective of improving the evaluation functions used in Chinese chess programs. First, we propose an n-tuple network to extract features, since n-tuple networks require very little expert knowledge through its large numbers of features, while simulta-neously allowing easy access. Se… ▽ More

    Submitted 23 January, 2018; originally announced January 2018.

    Comments: Submitted to IEEE Transaction on Games

  40. arXiv:1705.07970  [pdf

    physics.app-ph

    Atomic-scale Structural and Chemical Characterization of Hexagonal Boron Nitride Layers Synthesized at the Wafer-Scale with Monolayer Thickness Control

    Authors: Wei-Hsiang Lin, Victor W. Brar, Deep Jariwala, Michelle C. Sherrott, Wei-Shiuan Tseng, Chih-I Wu, Nai-Chang Yeh, Harry A. Atwater

    Abstract: Hexagonal boron nitride (h-BN) is a promising two-dimensional insulator with a large band gap and low density of charged impurities that is isostructural and isoelectronic with graphene. Here we report the chemical and atomic-scale structure of CVD-grown wafer-scale (~25 cm2) h-BN sheets ranging in thickness from 1-20 monolayers. Atomic-scale images of h-BN on Au and graphene/Au substrates obtaine… ▽ More

    Submitted 22 May, 2017; originally announced May 2017.

    Comments: 26 pages, 5 figures

  41. arXiv:1611.02621  [pdf

    physics.space-ph astro-ph.EP

    Nanograin densities outside Saturn's A-ring

    Authors: Robert E Johnson, Wei-Lin Tseng, Meredith K Elrod, Ann M Persoon

    Abstract: The observed disparity between the radial dependence of the ion and electron densities measured by the Cassini plasma and radio science instruments are used to show that the region between the outer edge of Saturn's main rings and its tenuous G-ring is permeated with small charged grains (nanograins). These grains emanate from the edge of the A-ring and from the tenuous F-ring and G-ring. This is… ▽ More

    Submitted 8 November, 2016; originally announced November 2016.

    Comments: 8 pages, 1 figure

  42. arXiv:1609.04206  [pdf, ps, other

    cond-mat.supr-con cond-mat.str-el

    Optical spectroscopy study of charge density wave order in Sr$_{3}$Rh$_{4}$Sn$_{13}$ and (Sr$_{0.5}$Ca$_{0.5}$)$_{3}$Rh$_{4}$Sn$_{13}$

    Authors: W. J. Ban, H. P. Wang, C. W. Tseng, C. N. Kuo, C. S. Lue, N. L. Wang

    Abstract: We perform optical spectroscopy measurement across the charge density wave (CDW) phase transitions on single-crystal samples of Sr$_{3}$Rh$_{4}$Sn$_{13}$ and (Sr$_{0.5}$Ca$_{0.5}$)$_{3}$Rh$_{4}$Sn$_{13}$. Formation of CDW energy gap was clearly observed for both single-crystal samples when they undergo the phase transitions. The existence of a Drude component in $σ_1(ω)$ below \TCDW indicates that… ▽ More

    Submitted 9 January, 2017; v1 submitted 14 September, 2016; originally announced September 2016.

    Journal ref: Sci. China-Phys. Mech. Astron. 60, 047011 (2017)

  43. Mn-doping induced ferromagnetism and enhanced superconductivity in Bi_4-x Mn_x O_4 S_3 (0.075 < = x < = 0.15)

    Authors: Zhenjie Feng, Xunqing Yin, Yiming Cao, Xianglian Peng, Tian Gao, Chuan Yu, Jingzhe Chen, Baojuan Kang, Bo Lu, Juan Guo, Qing Li, Wei-Shiuan Tseng, Zhongquan Ma, Chao Jing, Shixun Cao, Jincang Zhang, N. -C. Yeh

    Abstract: We demonstrate that Mn-doping in the layered sulfides Bi_4O_4S_3 leads to stable Bi_4-x Mn_x O_4 S_3 compounds that exhibit both long-range ferromagnetism and enhanced superconductivity for 0.075 < = x < = 0.15, with a possible record superconducting transition temperature (T_c) = 15 K among all BiS_2-based superconductors. We conjecture that the coexistence of superconductivity and ferromagnetism… ▽ More

    Submitted 15 August, 2016; originally announced August 2016.

    Comments: 11 pages, 10 figures. Accepted for publication in Physical Review B

  44. arXiv:1312.7301  [pdf

    cond-mat.mes-hall

    Central role of domain wall depinning for perpendicular magnetization switching driven by spin torque from the spin Hall effect

    Authors: O. J. Lee, L. Q. Liu, C. F. Pai, H. W. Tseng, Y. Li, D. C. Ralph, R. A. Buhrman

    Abstract: We study deterministic magnetic reversal of a perpendicularly magnetized Co layer in a Co/MgO/Ta nano-square driven by spin Hall torque from an in-plane current flowing in an underlying Pt layer. The rate-limiting step of the switching process is domain-wall (DW) depinning by spin Hall torque via a thermally-assisted mechanism that eventually produces full reversal by domain expansion. An in-plane… ▽ More

    Submitted 27 December, 2013; originally announced December 2013.

  45. Seasonal and radial trends in Saturn's thermal plasma between the main rings and enceladus

    Authors: Meredith K. Elrod, Wei-Ling Tseng, Adam K. Woodson, Robert E. Johnson

    Abstract: A goal of Cassini's extended mission has been to examine the seasonal variations of Saturn's magnetosphere, moons, and rings. Recently we showed that the magnetospheric plasma between the main rings and Enceladus exhibited a time dependence that we attributed to a seasonally variable source of oxygen from the main rings (Elrod et al., 2012). Such a temporal variation was subsequently seen in the e… ▽ More

    Submitted 14 December, 2013; originally announced December 2013.

  46. arXiv:1311.2234  [pdf, other

    stat.ML cs.LG math.ST

    FuSSO: Functional Shrinkage and Selection Operator

    Authors: Junier B. Oliva, Barnabas Poczos, Timothy Verstynen, Aarti Singh, Jeff Schneider, Fang-Cheng Yeh, Wen-Yih Tseng

    Abstract: We present the FuSSO, a functional analogue to the LASSO, that efficiently finds a sparse set of functional input covariates to regress a real-valued response against. The FuSSO does so in a semi-parametric fashion, making no parametric assumptions about the nature of input functional covariates and assuming a linear form to the mapping of functional covariates to the response. We provide a statis… ▽ More

    Submitted 8 March, 2014; v1 submitted 9 November, 2013; originally announced November 2013.

  47. The Atomic Hydrogen Cloud in the Saturnian System

    Authors: W. -L. Tseng, R. E. Johnson, W. -H. Ip

    Abstract: The Voyager flyby observations revealed that a very broad doughnut shaped distribution of the hydrogen atoms existed in the Saturnian magnetosphere. Recent Cassini observations confirmed the local-time asymmetry but also showed the hydrogen cloud density increases with decreasing distance to Saturn. The origin of the atomic hydrogen cloud has been debated ever since. Therefore, we have carried out… ▽ More

    Submitted 13 February, 2013; originally announced February 2013.

    Comments: This paper has been submitted to P&SS

  48. arXiv:1208.1711  [pdf

    cond-mat.mtrl-sci cond-mat.mes-hall

    Spin transfer torque devices utilizing the giant spin Hall effect of tungsten

    Authors: Chi-Feng Pai, Luqiao Liu, Y. Li, H. W. Tseng, D. C. Ralph, R. A. Buhrman

    Abstract: We report a giant spin Hall effect (SHE) in β-W thin films. Using spin torque induced ferromagnetic resonance with a β-W/CoFeB bilayer microstrip we determine the spin Hall angle to be |θ|=0.30\pm0.02, large enough for an in-plane current to efficiently reverse the orientation of an in-plane magnetized CoFeB free layer of a nanoscale magnetic tunnel junction adjacent to a thin β-W layer. From swit… ▽ More

    Submitted 8 August, 2012; originally announced August 2012.

  49. arXiv:1203.2875  [pdf

    cond-mat.mtrl-sci cond-mat.mes-hall

    Spin torque switching with the giant spin Hall effect of tantalum

    Authors: Luqiao Liu, Chi-Feng Pai, Y. Li, H. W. Tseng, D. C. Ralph, R. A. Buhrman

    Abstract: We report a giant spin Hall effect (SHE) in β-Ta that generates spin currents intense enough to induce efficient spin-transfer-torque switching of ferromagnets, thereby providing a new approach for controlling magnetic devices that can be superior to existing technologies. We quantify this SHE by three independent methods and demonstrate spin-torque (ST) switching of both out-of-plane and in-plane… ▽ More

    Submitted 13 March, 2012; originally announced March 2012.

  50. Modeling the Seasonal Variability of the Plasma Environment in Saturn's Magnetosphere between Main Rings and Mimas

    Authors: W. -L. Tseng, R. E. Johnson, M. K. Elrod

    Abstract: The detection of O2+ and O+ ions over Saturn's main rings by the Cassini INMS and CAPS instruments at Saturn orbit insertion (SOI) in 2004 confirmed the existence of the ring atmosphere and ionosphere. The source mechanism was suggested to be primarily photolytic decomposition of water ice producing neutral O2 and H2 (Johnson et al., 2006). Therefore, we predicted that there would be seasonal vari… ▽ More

    Submitted 22 December, 2011; originally announced December 2011.

    Comments: This is submitted to P&SS